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A Note on the Nature of Science 



In this twenty-first century, a typical science education 
often leaves the philosophical underpinnings of sci- 
ence unstated, or relies on oversimplified definitions. As 
you contemplate a career in science, it may be useful to 
consider once again the terms science, scientist, and 
scientific method. 

Science is both a way of thinking about the natural 
world and the sum of the information and theory that re- 
sult from such thinking. The power and success of sci- 
ence flow directly from its reliance on ideas that can be 
tested: information on natural phenomena that can be 
observed, measured, and reproduced and theories that 
have predictive value. The progress of science rests on a 
foundational assumption that is often unstated but cru- 
cial to the enterprise: that the laws governing forces and 
phenomena existing in the universe are not subject to 
change. The Nobel laureate Jacques Monod referred to 
this underlying assumption as the "postulate of objectiv- 
ity." The natural world can therefore be understood by 
applying a process of inquiry — the scientific method. 
Science could not succeed in a universe that played 
tricks on us. Other than the postulate of objectivity, sci- 
ence makes no inviolate assumptions about the natural 
world. A useful scientific idea is one that (1) has been or 
can be reproducibly substantiated and (2) can be used 
to accurately predict new phenomena. 

Scientific ideas take many forms. The terms that sci- 
entists use to describe these forms have meanings quite 
different from those applied by nonscientists. A hypoth- 
esis is an idea or assumption that provides a reasonable 
and testable explanation for one or more observations, 
but it may lack extensive experimental substantiation. A 
scientific theory is much more than a hunch. It is an 
idea that has been substantiated to some extent and 
provides an explanation for a body of experimental ob- 
servations. A theory can be tested and built upon and is 
thus a basis for further advance and innovation. When a 
scientific theory has been repeatedly tested and vali- 
dated on many fronts, it can be accepted as a fact. 

In one important sense, what constitutes science or 
a scientific idea is defined by whether or not it is pub- 
lished in the scientific literature after peer review by 
other working scientists. About 16,000 peer-reviewed 
scientific journals worldwide publish some 1.4 million 
articles each year, a continuing rich harvest of informa- 
tion that is the birthright of every human being. 

Scientists are individuals who rigorously apply the 
scientific method to understand the natural world. 
Merely having an advanced degree in a scientific disci- 
pline does not make one a scientist, nor does the lack of 
such a degree prevent one from making important sci- 
entific contributions. A scientist must be willing to chal- 
lenge any idea when new findings demand it. The ideas 
that a scientist accepts must be based on measurable, 



reproducible observations, and the scientist must report 
these observations with complete honesty. 

The scientific method is actually a collection of 
paths, all of which may lead to scientific discovery. In the 
hypothesis and experiment path, a scientist poses a hy- 
pothesis, then subjects it to experimental test. Many of 
the processes that biochemists work with every day were 
discovered in this manner. The DNA structure elucidated 
by James Watson and Francis Crick led to the hypothesis 
that base pairing is the basis for information transfer in 
polynucleotide synthesis. This hypothesis helped inspire 
the discovery of DNA and RNA polymerases. 

Watson and Crick produced their DNA structure 
through a process of model building and calcula- 
tion. No actual experiments were involved, although 
the model building and calculations used data col- 
lected by other scientists. Many adventurous scientists 
have applied the process of exploration and observa- 
tion as a path to discovery- Historical voyages of dis- 
covery (Charles Darwin's 1831 voyage on H.M.S. 
Beagle among them) helped to map the planet, catalog 
its living occupants, and change the way we view the 
world. Modern scientists follow a similar path when 
they explore the ocean depths or launch probes to 
other planets. An analog of hypothesis and experiment 
is hypothesis and deduction. Crick reasoned that 
there must be an adaptor molecule that facilitated 
translation of the information in messenger RNA into 
protein. This adaptor hypothesis led to the discovery of 
transfer RNA by Mahlon Hoagland and Paul Zamecnik. 

Not all paths to discovery involve planning. Serendip- 
ity often plays a role. The discovery of penicillin by 
Alexander Fleming in 1928, and of RNA catalysts by 
Thomas Cech in the early 1980s, were both chance discov- 
eries, albeit by scientists well prepared to exploit them. 
Inspiration can also lead to important advances. The poly- 
merase chain reaction (PCR) , now a central part of biotech- 
nology, was developed by Kary Mullis after a flash of 
inspiration during a road trip in northern California in 1983. 

These many paths to scientific discovery can seem 
quite different, but they have some important things 
in common. They are focused on the natural world. 
They rely on reproducible observation and/or experi- 
ment All of the ideas, insights, and experimental facts 
that arise from these endeavors can be tested and 
reproduced by scientists anywhere in the world. All can 
be used by other scientists to build new hypotheses and 
make new discoveries. All lead to information that is 
properly included in the realm of science. Understand- 
ing our universe requires hard work. At the same time, 
no human endeavor is more exciting and potentially re- 
warding than trying, and occasionally succeeding, to un- 
derstand some part of the natural world. 

H 
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The first edition of Principles of Biochemistry, written 
by Albert Lehninger twenty-five years ago, has served as 
the starting point and the model for our four subsequent 
editions. Over that quarter-century, the world of biochem- 
istry has changed enormously. Twenty-five years ago, not a 
single genome had been sequenced, not a single membrane 
protein had been solved by crystallography, and not a sin- 
gle knockout mouse existed. Ribozymes had just been dis- 
covered, PCR technology introduced, and archaea 
recognized as members of a kingdom separate from bac- 
teria. Now, new genomic sequences are announced weekly, 
new protein structures even more frequently, and re- 
searchers have engineered thousands of different knock- 
out mice, with enormous promise for advances in basic 
biochemistry, physiology, and medicine. This fifth edition 
contains the photographs of 31 Nobel laureates who have 
received their prizes for Chemistry or for Physiology or Med- 
icine since that first edition of Principles of Biochemistry. 

One major challenge of each edition has been to re- 
flect the torrent of new information without making the 
book overwhelming for students having their first en- 
counter with biochemistry. This has required much care- 
ful sifting aimed at emphasizing principles while still 
conveying the excitement of current research and its 
promise for the future. The cover of this new edition ex- 
emplifies this excitement and promise: in the x-ray struc- 
ture of RNA polymerase, we see DNA, RNA, and protein 
in their informational roles, in atomic dimensions, caught 
in the central act of information transfer. 



We are at the threshold of a new molecular physiol- 
ogy in which processes such as membrane excitation, 
secretion, hormone action, vision, gustation, olfaction, 
respiration, muscle contraction, and cell movements will 
be explicable in molecular terms and will become acces- 
sible to genetic dissection and pharmacological manipu- 
lation. Knowledge of the molecular structures of the 
highly organized membrane complexes of oxidative 
phosphorylation and photophosphorylation, for exam- 
ple, will certainly bring deepened insight into those 
processes, so central to life. (These developments make 
us wish we were young again, just beginning our careers 
in biochemical research and teaching. Our book is not 
the only thing that has acquired a touch of silver over 
the years!) 

In the past two decades, we have striven always to 
maintain the qualities that made the original Lehninger 
text a classic — clear writing, careful explanations of diffi- 
cult concepts, and communicating to students the ways 
in which biochemistry is understood and practiced today. 
We have written together for twenty years and taught to- 
gether for almost twenty-five. Our thousands of students 
at the University of Wisconsin-Madison over those years 
have been an endless source of ideas about how to pres- 
ent biochemistry more clearly; they have enlightened and 
inspired us. We hope that this twenty-fifth anniversary 
edition will enlighten and inspire current students of bio- 
chemistry everywhere, and perhaps lead some of them to 
love biochemistry as we do. 



Major Recent Advances in Biochemistry 

Every chapter has been thoroughly revised and up- 
dated to include the most important advances in bio- 
chemistry including: 

■ Concepts of proteomes and proteomics, 

introduced earlier in the book (Chapter 1) 

■ New discussion of amyloid diseases in the 

context of protein folding (Chapter 4) 

■ New section on pharmaceuticals developed from 
an understanding of enzyme mechanism, using 
penicillin and HIV protease inhibitors as examples 
(Chapter 6) 

m New discussion of sugar analogs as drugs that 
target viral neuraminidase (Chapter 7) 

■ New material on green fluorescent protein 

(Chapter 9) 

■ New section on lipidomics (Chapter 10) 

■ New descriptions of volatile lipids used as signals 

[v»] 



by plants, and of bird feather pigments derived 
from colored lipids in plant foods (Chapter 10) 

■ Expanded and updated section on lipid rafts and 
caveolae to include new material on membrane 
curvature and the proteins that influence it, and 
introducing amphitropic proteins and annular 
lipids (Chapter 11) 

m New section on the emerging role of ribulose 
5-phosphate as a central regulator of glycolysis 
and gluconeogenesis (Chapter 15) 

New Box 16-1, Moonlighting Enzymes: Proteins 
with More Than One Job 

■ New section on the role of transcription factors 
(PPARs) in regulation of lipid catabolism 
(Chapter 17) 

■ Revised and updated section on fatty acid 
synthase, including new structural information 
on FAS I (Chapter 21) 
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Updated coverage of 
the nitrogen cycle, 
including new Box 
22-1, Unusual Life 
Styles of the Obscure 
but Abundant, 
discussing anammox 
bacteria (Chapter 22) 

New Box 24-2 , 
Epigenetics, Nucleosome 
Structure, and Histone 
Variants describing the 
role of histone 
modification and 
nucleosome deposition 
in the transmission of 
epigenetic information in heredity 

New information on the initiation of replication 
and the dynamics at the replication fork, 
introducing AAA+ ATPases and their functions 
in replication and other aspects of DNA 
metabolism (Chapter 25) 

New section on the expanded understanding of 
the roles of RNA in cells (Chapter 26) 




FIGURE 21-3 The structure of fatty acid synthase type I systems. 



New information on the roles of RNA 
in protein biosynthesis 

(Chapter 27) 

New section on riboswitches 

(Chapter 28) 

New Box 28-1, Of Fins, Wings, Beaks, and Things, 
describing the connections between evolution 
and development 



Biochemical Methods 

An appreciation of biochemistry often 
requires an understanding of how bio- 
chemical information is obtained. Some 
of the new methods or updates described 
in this edition are: 

■ Circular dichroism (Chapter 4) 

■ Measurement of glycated 
hemoglobin as an indicator of 
average blood glucose concentration, 
over days, in persons with diabetes 
(Chapter 7) 

■ Use of MALDI-MS in determination of 
oligosaccharide structure (Chapter 7) 

■ Forensic DNA analysis, a major update 
covering modern STR analysis (Chap- 
ter 9) 

■ More on microarrays (Chapter 9) 

■ Use of tags for protein analysis and 
purification (Chapter 9) 

■ PET combined with 

CT scans to pinpoint cancer 
(Chapter 14) 

■ Chromatin immunoprecipitation and ChlP- 
experiments (Chapter 24) 
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FIGURE 9-12 The use of tagged proteins in protein purifi- 
cation. The use of a GST tag is illustrated, (a) Glu- 
tathione-5-transferase (GST) is a small enzyme (depicted 
here by the purple icon) that binds glutathione (a gluta- 
mate residue to which a Cys-Gly dipeptide is attached at 
the carboxyl carbon of the Glu side chain, hence the ab- 
breviation GSH). (b) The GST tag is fused to the car- 
boxyl terminus of the target protein by genetic 
engineering The tagged protein is expressed in host 
cells, and is present in the crude extract when the cells 
are lysed. The extract is subjected to chromatography 
on a column containing a medium with immobilized 
glutathione. The GST-tagged protein binds to the glu- 
tathione, retarding its migration through the column, 
while the other proteins wash through rapidly. The 
tagged protein is subsequently eluted from the column 
with a solution containing elevated salt concentration or 
free glutathione. 
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■ Development of bacterial strains with altered ge 
netic codes, for site-specific insertion of novel 
amino acids into proteins (Chapter 27) 
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Medically Relevant Examples 

I This icon is used throughout the book to denote 
mm material of special medical interest. As teachers ; 
our goal is for students to learn biochemistry and to 
understand its relevance to a healthier life and a 
healthier planet. We have included many new exam- 
ples that relate biochemistry to medicine and to health 
issues in general. Some of the medical applications new 
to this edition are: 

■ The role of polyunsaturated fatty acids and trans 
fatty acids in cardiovascular disease (Chapter 10) 

■ G protein-coupled receptors (GCPRs) and the 
range of diseases for which drugs targeted to 
GPCRs are being used or developed (Chapter 12) 

■ G proteins, the regulation of GTPase activity, 

and the medical consequences of defective G protein 
function (Chapter 12), including new Box 12-2, 
G Proteins: Binary Switches in Health and Disease 

■ Box 12-5, Development of Protein Kinase Inhibitors 
for Cancer Treatment 

■ Box 14-1 , High Rate of Glycolysis in Tumors Suggests 
Targets for Chemotherapy and Facilitates Diagnosis 



■ Box 15-3, Genetic Mutations That Lead to Rare 
Forms of Diabetes 

■ Mutations in citric acid cycle enzymes that lead to 
cancer (Chapter 16) 

■ Pernicious anemia and associated problems in strict 
vegetarians (Chapter 18) 

■ Updated information on cyclooxygenase inhibitors 
(pain relievers Vioxx, Celebrex, Bextra) 
(Chapter 21) 

■ HMG-CoA reductase (Chapter 21) and Box 21-3, 
The Lipid Hypothesis and the Development of 
Statins 

■ Box 24-1, Curing Disease by Inhibiting 
Topoisomerases, describing the use of 
topoisomerase inhibitors in the treatment 
of bacterial infections and cancer, including 
material on ciprofloxacin (the antibiotic effective 
for anthrax) 



Special Theme: Understanding Metabolism through Obesity and Diabetes 



Obesity and its medical consequences — cardiovascu- 
lar disease and diabetes — are fast becoming epidemic 
in the industrialized world, and we include new mate- 
rial on the biochemical connections between obesity 
and health throughout this edition. Our focus on dia- 
betes provides an integrating theme throughout the 
chapters on metabolism and its control, and this will, 
we hope, inspire some students to find solutions for 
this disease. Some of the sections and boxes that 
highlight the interplay of metabolism, obesity, and 
diabetes are: 

■ Untreated Diabetes Produces Life-Threatening Aci 
dosis (Chapter 2) 

■ Box 7-1, Blood Glucose Measurements in the 
Diagnosis and Treatment of Diabetes, introducing 
hemoglobin glycation and AGEs and 

their role in the pathology of advanced diabetes 

■ Box 11-2, Defective Glucose and Water Transport 
in Two Forms of Diabetes 

■ Glucose Uptake Is Deficient in Type 1 Diabetes Mel 
litus (Chapter 14) 

■ Ketone Bodies Are Overproduced in Diabetes and 
during Starvation (Chapter 17) 

■ Some Mutations in Mitochondrial Genomes Cause 
Disease (Chapter 19) 

■ Diabetes Can Result from Defects in the Mitochon 
dria of Pancreatic /? Cells (Chapter 19) 
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FIGURE 23-42 

■ Adipose Tissue Generates Glycerol 3-phosphate 
by Glyceroneogenesis (Chapter 21) 

■ Diabetes Mellitus Arises from Defects in Insulin 
Production or Action (Chapter 23) 

■ Section 23.4, Obesity and the Regulation of Body 
Mass, discusses the role of adiponectin and insulin 
sensitivity and type 2 diabetes 

■ Section 23.5, Obesity, the Metabolic Syndrome, and 
Type 2 Diabetes, includes a discussion of managing 
type 2 diabetes with exercise, diet, and medication 
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Advances in Teaching Biochemistry 

Revising this textbook is never just an updating exercise. At least as much time 
is spent reexamining how the core topics of biochemistry are presented. We 
have revised each chapter with an eye to helping students learn and master the 
fundamentals of biochemistry. Students encountering biochemistry for the first 
time often have difficulty with two key aspects of the course: approaching quan- 
titative problems and drawing on what they learned in organic chemistry to help 
them understand biochemistry. Those same students must also learn a com- 
plex language, with conventions that are often unstated. We have made some 
major changes in the book to help students cope with all these challenges: 
new problem-solving tools, a focus on organic chemistry foundations, and 
highlighted key conventions. 



New Problem-Solving Tools 

■ New in-text Worked Examples help students improve their 
quantitative problem-solving skills, taking them through some of the 
most difficult equations. 

■ More than 100 new end-of-chapter problems give students 
further opportunity to practice what they have learned. 

■ New Data Analysis Problems (one at the end of each chapter), con 
tributed by Brian White of the University of Massachusetts-Boston, en 
courage students to synthesize what they have learned and apply their 
knowledge to the interpretation of data from the literature. 

Focus on Organic Chemistry Foundations 

■ New Section 13.2, Chemical logic and common biochemical 
reactions, discusses the common biochemical reaction types that 
underlie all metabolic reactions. 

■ Chemical logic is reinforced in the discussions of _ 

central metabolic pathways. 

■ Mechanism figures feature step-by-step 
descriptions to help students understand the 
reaction process. 

■ In the presentation of reaction mechanisms, 

we consistently use a set of conventions 
introduced and explained in detail with the 
first enzyme mechanism encountered 
(chymotrypsin, pp. 208-209). Some of 
the new problems focus on chemical 
mechanisms and reinforce mechanistic themes. 



Wtm WORKED EXAMPLE 11-3 Energetics of Pumping 
by Symport 

„ , [glucose] in , 

Calculate the maximum — : — ratio that can be 

lglucoseJ out 

achieved by the plasma membrane Na + -glucose sym- 
porter of an epithelial cell, when [Na + ] in is 12 mM, 
[Na + ] out is 145 mM, the membrane potential is -50 mV 
(inside negative), and the temperature is 37 °C 

Solution: Using Equation 11-4 (p 396), we can calcu- 
late the energy inherent in an electrochemical Na + 
gradient— that is, the cost of moving one Na + ion up 
this gradient: 



AG t = RT In 



[Na] out 
[Na] in 



h Z3 b$ 



We then substitute standard values fori?, T, and J, and 
the given values for [Na + ] (expressed as molar concen- 
trations), + 1 for Z (because Na + has a positive 
charge), and 0 050 V for Aij>. Note that the membrane 
potential is -50 mV (inside negative), so the change in 
potential when an ion moves from inside to outside is 
50 mV. 



AG t = (8 315 J/mol-K)(310K)ln 



1.45 X 10" 



1.2 X 1(T 2 
+ 1(96,500 J/V-mo])(0 050 V) 

= 112 kJ/mol 

This AG t is the potential energy per mole of Na + in the 
Na + gradient that is available to pump glucose. Given 
that two Na + ions pass down their electrochemical gra- 
dient and into the cell for each glucose carried in by 
symport, the energy available to pump 1 mol of glucose 
is 2 x 11.2 kJ/mol = 22 4 kJ/mol We can now calculate 
the concentration ratio of glucose that can be achieved 
by this pump (from Equation 11-3, p 396): 



AG t = RT In 



[glucoaeljn 



[glucose] out 

Rearranging, then substituting the values of AG t , R, and 
T, gives 



In 



[glucoseJin 
[glucose] ou t 



[glucose] in 
[glucose] ou 



= R,T = 



22.4 kJ/mol 



(8.315 J/mol-K)(310K) 



= 5.94 x ltt :i 



WORKED EXAMPLE 11-3 



Glyceraldehyde 
3 -phosphate 



NAD + HCOH 



Glyceraldehyde 
3 -phosphate 
dehydrogenase 



ill i -5-i ■ 

HCOH 



Key Conventions 

In this edition, many of the conventions that are so 
necessary for understanding each biochemical topic 
and the biochemical literature are broken out of the 
text and highlighted. These Key Conventions FIGURE 14-7 
include clear statements of many assumptions and 
conventions that students are often expected to assimilate without 
being told (for example, peptide sequences are written from amino- 
to carboxyl-terminal end, left to right; nucleotide sequences are 
written from 5' to 3' end, left to right). 



Formation of enzyme- 
substrate complex. The 
active-site Cys has a 
reduced jtK a (55 instead 
of 8) when NAD + is bound, 
and is in the more reactive, 
thiolate form 



The covalent thioeater 
linkage between the 
substrate and enzyme 
undergoes phosphor olysis 
(attack by P;) releasing 
the second product, 
1 , 3 -bisphosphoglycera te 



The NADH product leavea the 
active site and is replaced by 
another molecule of NAD + . 



A covalent thiohemiacetal 
linkage forms between the 
substrate and the -S~ 
group of the Cys residue. 



CH,opor 

\ I 

\HCOH 



! 

Cys 



The enzyme -substrate 
intermediate is oxidized by 
the NAD + bound to the 
active site. 



CH 2 OPOf 

HCOH 

I 

c=o 



KEY CONVENTION: When an amino acid sequence of a 
peptide, polypeptide, or protein is displayed, the amino- 
terminal end is placed on the left, the carboxyl-terrriinal 
end on the right. The sequence is read left to right, be- 
ginning with the amino -terminal end ■ 
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Media and Supplements 

A full package of media resources and supplements pro- 
vides instructors and students with innovative tools to 
support a variety of teaching and learning approaches. 
All these resources are fully integrated with the style 
and goals of the fifth edition textbook. 



eBook 



This online version of the textbook 
combines the contents of the printed 
book, electronic study tools, and a 
full complement of student media 
specifically created to support the 
text. The eBook also provides useful 
material for instructors. 



eBook study tools include instant navigation to any 
section or page of the book, bookmarks, highlighting, 
note-taking, instant search for any term, pop-up key- 
term definitions, and a spoken glossary. 

The text-specific student media, fully integrated 
throughout the eBook, include animated enzyme 
mechanisms, animated biochemical techniques, 
problem-solving videos, molecular structure 
tutorials in Jmol, Protein Data Bank IDs in Jmol, liv- 
ing graphs, and online quizzes (each described un 
der "Additional Student Media" below). 

Instructor features include the ability to add 
notes or files to any page and to share these 
notes with students. Notes may include text, Web 
links, animations, or photos. Instructors can also 
assign the entire text or a custom version of the 
eBook. 



Additional Instructor Media 

Instructors are provided with a comprehensive set 
of teaching tools, each developed to support the text, 
lecture presentations, and individual teaching styles. All 
instructor media are available for download on the 
book Web site (www.whfreeman.com/lehninger5e) 
and on the Instructor Resource CD/DVD (ISBN 
1-4292-1912-2). These media tools include: 

■ Fully optimized JPEG files of every figure, photo, 
and table in the text, with enhanced color, higher 
resolution, and enlarged fonts. The files have been 
reviewed by course instructors and tested in a 
large lecture hall to ensure maximum clarity 

and visibility. The JPEGs are also offered in 
separate files and in PowerPoint® format for each 
chapter. 

■ The 150 most popular images in the textbook are 
available in an Overhead Transparency Set (ISBN 



1-4292-1911-4), fully optimized for maximum visibil- 
ity in the lecture hall. 

■ Animated Enzyme Mechanisms and Animated 
Biochemical Techniques are available in Flash 
files and preloaded into PowerPoint, in both PC and 
Macintosh formats, for lecture presentation. (See 
list of animation topics on the inside front cover.) 

■ A list of Protein Data Bank IDs for the structures 
in the text is provided, arranged by figure number. A 
new feature in this edition is an index to all struc- 
tures in the Jmol interactive Web browser applet. 

■ Living Graphs illustrate key equations from the 
textbook, showing the graphic results of changing 
parameters. 

■ A comprehensive Test Bank in PDF and editable 
Word formats includes 150 multiple-choice and 
short-answer problems per chapter, rated by level 
of difficulty. 



Additional Student Media 

Students are provided with media designed to enhance 
their understanding of biochemical principles and im- 
prove their problem-solving ability. All student media, 
along with the PDB Structures and Living Graphs, are 
also in the eBook, and many are available on the book 
Web site (www.whfreeman.com/lehninger5e). The 
student media include: 

■ New Problem-Solving 

Videos, created by 
Scott Ensign of Utah State 
University provide 24/7 
online problem-solving help 
to students. Through a 
two-part approach, each 
10-minute video covers a 
key textbook problem repre- 
senting a topic that students traditionally struggle to 
master. Dr. Ensign first describes a proven problem- 
solving strategy and then applies the strategy to the 
problem at hand in clear, concise steps. Students can 
easily pause, rewind, and review any steps as they 
wish until they firmly grasp not just the solution but 
also the reasoning behind it. Working through the 
problems in this way is designed to make students 
better and more confident at applying key strategies 
as they solve other textbook and exam problems. 

Student versions of the Animated Enzyme 
Mechanisms and Animated Biochemical 
Techniques help students understand key 
mechanisms and techniques at their own pace. 
For a complete list of animation topics, see the 
inside front cover. 
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■ Molecular Structure Tutorials, using the Jmol- 
Web browser applet, allow students to explore in 
more depth the molecular structures included in 
the textbook, including: 

Protein Architecture 
Bacteriorhodopsin 
Lac Repressor 
Nucleotides 
MHC Molecules 
Trimeric G Proteins 
Oxygen-Binding Proteins 
Restriction Endonucleases 
Hammerhead Ribozyme 

■ Online Quizzes include approximately 20 chal- 
lenging multiple-choice questions for each chapter, 
with automatic grading and text references and 
feedback on all answers. 

The Absolute, Ultimate Guide to Lehninger Principles of 
Biochemistry, Fifth Edition, Study Guide and Solutions 
Manual, by Marcy Osgood (University of New Mexico 
School of Medicine) and Karen Ocorr (University of 
California, San Diego); 1-4292-1241-1 

The Absolute, Ultimate Guide combines an innovative 
study guide with a reliable solutions manual (providing 
extended solutions to end-of-chapter problems) in one 
convenient volume. Thoroughly class-tested, the Study 
Guide includes for each chapter: 

■ Major Concepts: a roadmap through the chapter 

■ What to Review: questions that recap key points 
from previous chapters 



Discussion Questions: provided for 
each section; designed for individual 
review, study groups, or classroom 
discussion 

A Self-Test: "Do you know the terms?"; 
crossword puzzles; multiple-choice, 
fact-driven questions; and questions 
that ask students to apply their new 
knowledge in new directions — plus 
answers! 
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With the cell, biology discovered its atom ... To characterize life, it 
was henceforth essential to study the cell and analyze its structure: to 
single out the common denominators necessary for the life of every 
cell; alternatively, to identify differences associated with the perfor- 
mance of special functions. 

—francois Jacob, La logique du vivant; une histoire de I'heredite 
(The Logic of Life: A History of Heredity), 1970 
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About fifteen billion years ago, the universe arose as 
a cataclysmic eruption of hot, energy-rich sub- 
atomic particles. Within seconds, the simplest 
elements (hydrogen and helium) were formed. As the 
universe expanded and cooled, material condensed 
under the influence of gravity to form stars. Some stars 
became enormous and then exploded as supernovae, 
releasing the energy needed to fuse simpler atomic 
nuclei into the more complex elements. Thus were pro- 
duced, over billions of years, Earth itself and the 
chemical elements found on Earth today. About four bil- 
lion years ago, life arose — simple microorganisms with 
the ability to extract energy from chemical compounds 
and, later, from sunlight, which they used to make a vast 
array of more complex biomolecules from the simple 
elements and compounds on the Earth's surface. 

Biochemistry asks how the remarkable properties of 
living organisms arise from the thousands of different 
biomolecules. When these molecules are isolated and 
examined individually, they conform to all the physical 
and chemical laws that describe the behavior of inani- 
mate matter — as do all the processes occurring in living 
organisms. The study of biochemistry shows how the 
collections of inanimate molecules that constitute living 
organisms interact to maintain and perpetuate life 
animated solely by the physical and chemical laws that 
govern the nonliving universe. 

Yet organisms possess extraordinary attributes, 
properties that distinguish them from other collections 



of matter. What are these distinguishing features of 
living organisms? 

A high degree of chemical complexity and mi- 
croscopic organization. Thousands of different 
molecules make up a cell's intricate internal struc- 
tures (Fig. 1-la). These include very long polymers, 
each with its characteristic sequence of subunits, its 
unique three-dimensional structure, and its highly 
specific selection of binding partners in the cell. 

Systems for extracting, transforming, and us- 
ing energy from the environment (Fig. 1- lb) , en- 
abling organisms to build and maintain their intricate 
structures and to do mechanical, chemical, osmotic, 
and electrical work. This counteracts the tendency of 
all matter to decay toward a more disordered state, 
to come to equilibrium with its surroundings. 

Defined functions for each of an organism's 
components and regulated interactions among 
them. This is true not only of macroscopic struc- 
tures, such as leaves and stems or hearts and lungs, 
but also of microscopic intracellular structures 
and individual chemical compounds. The interplay 
among the chemical components of a living organism 
is dynamic; changes in one component cause coordi- 
nating or compensating changes in another, with the 
whole ensemble displaying a character beyond that 
of its individual parts. The collection of molecules 
carries out a program, the end result of which is re- 
production of the program and self-perpetuation of 
that collection of molecules — in short, life. 

Mechanisms for sensing and responding to al- 
terations in their surroundings, constantly ad- 
justing to these changes by adapting their internal 
chemistry or their location in the environment. 

A capacity for precise self-replication and 
self-assembly (Fig. 1-1 c). A single bacterial cell 
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FIGURE 1-1 Some characteristics of living matter, (a) Microscopic 
complexity and organization are apparent in this colorized thin sec- 
tion of vertebrate muscle tissue, viewed with the electron microscope. 

(b) A prairie falcon acquires nutrients by consuming a smaller bird. 

(c) Biological reproduction occurs with near-perfect fidelity. 

placed in a sterile nutrient medium can give rise to 
a billion identical "daughter" cells in 24 hours. Each 
cell contains thousands of different molecules, 
some extremely complex; yet each bacterium is a 
faithful copy of the original, its construction di- 
rected entirely from information contained in the 
genetic material of the original cell. 

A capacity to change over time by gradual evo- 
lution. Organisms change their inherited life 
strategies, in very small steps, to survive in new cir- 
cumstances. The result of eons of evolution is an 
enormous diversity of life forms, superficially very 
different (Fig. 1-2) but fundamentally related 
through their shared ancestry. This fundamental 
unity of living organisms is reflected at the molecu- 
lar level in the similarity of gene sequences and 
protein structures. 

Despite these common properties, and the funda- 
mental unity of life they reveal, it is difficult to make 
generalizations about living organisms. Earth has an 
enormous diversity of organisms. The range of habitats, 
from hot springs to Arctic tundra, from animal intestines 
to college dormitories, is matched by a correspondingly 
wide range of specific biochemical adaptations, achieved 



within a common chemical framework. For the sake of 
clarity, in this book we sometimes risk certain general- 
izations, which, though not perfect, remain useful; we 
also frequently point out the exceptions to these gener- 
alizations, which can prove illuminating. 

Biochemistry describes in molecular terms the 
structures, mechanisms, and chemical processes shared 
by all organisms and provides organizing principles that 
underlie life in all its diverse forms, principles we refer 
to collectively as the molecular logic of life. Although 
biochemistry provides important insights and practical 
applications in medicine, agriculture, nutrition, and 
industry, its ultimate concern is with the wonder of 
life itself. 

In this introductory chapter we give an overview 
of the cellular, chemical, physical, and genetic back- 
grounds to biochemistry and the overarching principle 
of evolution — the development over generations of 
the properties of living cells. As you read through the 
book, you may find it helpful to refer back to this chap- 
ter at intervals to refresh your memory of this back- 
ground material. 

1.1 Cellular Foundations 

The unity and diversity of organisms become apparent 
even at the cellular level. The smallest organisms consist 
of single cells and are microscopic. Larger, multicellular 
organisms contain many different types of cells, which 
vary in size, shape, and specialized function. Despite 




FIGURE 1-2 Diverse living organisms share common chemical features. 

Birds, beasts, plants, and soil microorganisms share with humans the 
same basic structural units (cells) and the same kinds of macromolecules 
(DNA, RNA, proteins) made up of the same kinds of monomeric subunits 
(nucleotides, amino acids). They utilize the same pathways for synthesis 
of cellular components, share the same genetic code, and derive from the 
same evolutionary ancestors. Shown here is a detail from "The Garden of 
Eden/' by Jan van Kessel the Younger (1626-1679). 
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these obvious differences, all cells of the simplest and 
most complex organisms share certain fundamental 
properties, which can be seen at the biochemical level. 

Ceils Are the Structural and Fooctional Units of 
All Living Organisms 

Cells of all kinds share certain structural features 
(Fig, 1-3). The plasma membrane defines the pe- 
riphery of the cell, separating its contents from the sur- 
roundings. It is composed of lipid and protein molecules 
that form a thin, tough, pliable, hydrophobic barrier 
around the cell. The membrane is a barrier to the free 
passage of inorganic ions and most other charged or po- 
lar compounds. Transport proteins in the plasma mem- 
brane allow the passage of certain ions and molecules; 
receptor proteins transmit signals into the cell; and 
membrane enzymes participate in some reaction path- 



Nucleus (eukaryotes) 
or nucleoid (bacteria, archaea) 

Contains genetic material-DNA and 
associated proteins. Nucleus is 
membrane-enclosed. 

Plasma membrane 

Tough, flexible lipid bilayer. 
Selectively permeable to 
polar substances. Includes 
membrane proteins that 
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FIGURE 1 - 3 The universal features of living cells. All cells have a nu- 
cleus or nucleoid, a plasma membrane, and cytoplasm. The cytosol is 
defined as that portion of the cytoplasm that remains in the supernatant 
after gentle breakage of the plasma membrane and centrifugation of 
the resulting extract at 1 50,000 g for 1 hour. 



ways. Because the individual lipids and proteins of the 
plasma membrane are not covalently linked, the entire 
structure is remarkably flexible, allowing changes in the 
shape and size of the cell. As a cell grows, newly made 
lipid and protein molecules are inserted into its plasma 
membrane; cell division produces two cells, each with 
its own membrane. This growth and cell division (fis- 
sion) occurs without loss of membrane integrity. 

The internal volume enclosed by the plasma mem- 
brane, the cytoplasm (Fig. 1-3), is composed of an aque- 
ous solution, the cytosol, and a variety of suspended 
particles with specific functions. The cytosol is a highly 
concentrated solution containing enzymes and the RNA 
molecules that encode them; the components (amino 
acids and nucleotides) from which these macromolecules 
are assembled; hundreds of small organic molecules 
called metabolites, intermediates in biosynthetic and 
degradative pathways; coenzymes, compounds essential 
to many enzyme-catalyzed reactions; inorganic ions; and 
such supramolecular structures as ribosomes, the sites 
of protein synthesis, and proteasomes, which degrade 
proteins no longer needed by the cell. 

All cells have, for at least some part of their life, 
either a nucleus or a nucleoid, in which the genome — 
the complete set of genes, composed of DNA — is stored 
and replicated. The nucleoid, in bacteria and archaea, is 
not separated from the cytoplasm by a membrane; the 
nucleus, in eukaryotes, consists of nuclear material en- 
closed within a double membrane, the nuclear envelope. 
Cells with nuclear envelopes make up the large group 
Eukarya (Greek eu, "true," and karyon, "nucleus"). 
Microorganisms without nuclear envelopes, formerly 
grouped together as prokaryotes (Greek pro, "be- 
fore"), are now recognized as comprising two very dis- 
tinct groups, Bacteria and Archaea, described below. 

Cellular Dimensions Are Limited by Diffusion 

Most cells are microscopic, invisible to the unaided eye. 
Animal and plant cells are typically 5 to 100 /xm in diam- 
eter, and many unicellular microorganisms are only 1 to 
2 fim long (see the inside back cover for information on 
units and their abbreviations). What limits the dimen- 
sions of a cell? The lower limit is probably set by the 
minimum number of each type of biomolecule required 
by the cell. The smallest cells, certain bacteria known as 
mycoplasmas, are 300 nm in diameter and have a vol- 
ume of about 10~ 14 mL. A single bacterial ribosome is 
about 20 nm in its longest dimension, so a few ribosomes 
take up a substantial fraction of the volume in a my- 
coplasmal cell. 

The upper limit of cell size is probably set by the 
rate of diffusion of solute molecules in aqueous systems. 
For example, a bacterial cell that depends on oxygen- 
consuming reactions for energy production must obtain 
molecular oxygen by diffusion from the surrounding 
medium through its plasma membrane. The cell is so 
small, and the ratio of its surface area to its volume is so 
large, that every part of its cytoplasm is easily reached 
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FIGURE 1-4 Phylogeny of the three domains of life. Phylogenetic rela- 
tionships are often illustrated by a "family tree" of this type. The basis for 
this tree is the similarity in nucleotide sequences of the ribosomal RNAs 
of each group; the more similar the sequence, the closer the location of 
the branches, with the distance between branches representing the de- 
gree of difference between two sequences. Phylogenetic trees can also 



Diplomonads 



be constructed from similarities across species of the amino acid se- 
quences of a single protein. For example, sequences of the protein 
GroEL (a bacterial protein that assists in protein folding) were compared 
to generate the tree in Figure 3-32. The tree in Figure 3-33 is a "con- 
sensus" tree, which uses several comparisons such as these to make the 
best estimates of evolutionary relatedness of a group of organisms. 



by 0 2 diffusing into the cell. With increasing cell size, 
however, surface-to-volume ratio decreases, until me- 
tabolism consumes 0 2 faster than diffusion can supply 
it. Metabolism that requires 0 2 thus becomes impossible 
as cell size increases beyond a certain point, placing a 
theoretical upper limit on the size of cells. 

There Are Three Distinct Domains of Life 

All living organisms fall into one of three large groups 
(domains) that define three branches of evolution from 



a common progenitor (Fig. 1-4). Two large groups of 
single-celled microorganisms can be distinguished on 
genetic and biochemical grounds: Bacteria and 
Archaea. Bacteria inhabit soils, surface waters, and 
the tissues of other living or decaying organisms. Many of 
the Archaea, recognized as a distinct domain by Carl 
Woese in the 1980s, inhabit extreme environments — salt 
lakes, hot springs, highly acidic bogs, and the ocean 
depths. The available evidence suggests that the Archaea 
and Bacteria diverged early in evolution. All eukaryotic 
organisms, which make up the third domain, Eukarya, 
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FIGURE 1-5 Or ganisms can be classified according to their source of energy (sunlight or oxidizable chemical 
compounds) and their source of carbon for the synthesis of cellular material. 
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evolved from the same branch that gave rise to the Ar- 
chaea; eukaryotes are therefore more closely related to 
archaea than to bacteria. 

Within the domains of Archaea and Bacteria are 
subgroups distinguished by their habitats. In aerobic 
habitats with a plentiful supply of oxygen, some resident 
organisms derive energy from the transfer of electrons 
from fuel molecules to oxygen. Other environments are 
anaerobic, virtually devoid of oxygen, and microorgan- 
isms adapted to these environments obtain energy by 
transferring electrons to nitrate (forming N 2 ), sulfate 
(forming H 2 S), or C0 2 (forming CH 4 ). Many organisms 
that have evolved in anaerobic environments are obli- 
gate anaerobes: they die when exposed to oxygen. Oth- 
ers are facultative anaerobes, able to live with or 
without oxygen. 

We can classify organisms according to how they ob- 
tain the energy and carbon they need for synthesizing 
cellular material (as summarized in Fig. 1-5). There are 
two broad categories based on energy sources: pho- 
totrophs (Greek trophe, "nourishment") trap and use 
sunlight, and chemotrophs derive their energy from 
oxidation of a chemical fuel. Some chemotrophs, the 
lithotrophs, oxidize inorganic fuels — HS~ to S° (ele- 
mental sulfur), S° to SO^, N0 2 " to N0 3 ~, or Fe 2+ to Fe 3+ , 
for example. Organotropic oxidize a wide array of or- 
ganic compounds available in their surroundings. Pho- 
totrophs and chemotrophs may also be divided into 
those that can obtain all needed carbon from C0 2 (au- 
totrophs) and those that require organic nutrients 
(heterotrophs). 

Escherichia coli Is the Most-Studied Bacterium 

Bacterial cells share certain common structural features, 
but also show group-specific specializations (Fig. 1-6). 
E. coli is a usually harmless inhabitant of the human 
intestinal tract. The E. coli cell is about 2 /xm long and 
a little less than 1 ^tm in diameter. It has a protective 
outer membrane and an inner plasma membrane that 
encloses the cytoplasm and the nucleoid. Between the 
inner and outer membranes is a thin but strong layer of 
a polymer (peptidoglycan) that gives the cell its shape 
and rigidity. The plasma membrane and the layers out- 
side it constitute the cell envelope. We should note 
here that in archaea, rigidity is conferred by a different 
type of polymer (pseudopeptidoglycan). The plasma 
membranes of bacteria consist of a thin bilayer of lipid 



FIGURE 1 — 6 Common structural features of bacterial cells. Because of 
differences in cell envelope structure, some bacteria (gram-positive 
bacteria) retain Gram's stain (introduced by Hans Christian Gram in 
1882), and others (gram-negative bacteria) do not. E. coli is gram- 
negative. Cyanobacteria are distinguished by their extensive internal 
membrane system, which is the site of photosynthetic pigments. Al- 
though the cell envelopes of archaea and gram-positive bacteria look 
similar under the electron microscope, the structures of the membrane 
lipids and the polysaccharides are distinctly different (see Fig. 10-12). 



molecules penetrated by proteins. Archaeal membranes 
have a similar architecture, but the lipids are strikingly 
different from those of bacteria (see Fig. 10-12). 



Ribosomes Bacterial ribosomes are smaller than 
eukaryotic ribosomes, but serve the same function — 
protein synthesis from an RNA message. 
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The cytoplasm of E. coli contains about 15,000 ri- 
bosomes, various numbers (10 to thousands) of copies 
of each of 1,000 or so different enzymes, perhaps 1,000 
organic compounds of molecular weight less than 1,000 



(metabolites and cofactors), and a variety of inorganic 
ions. The nucleoid contains a single, circular molecule of 
DNA, and the cytoplasm (like that of most bacteria) 
contains one or more smaller, circular segments of DNA 
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FIGURE 1-7 Eukaryotic cell structure. Schematic illustrations of two 
major types of eukaryotic cell: (a) a representative animal cell and 
(b) a representative plant cell. Plant cells are usually 1 0 to 1 00 ju,m in 
diameter — larger than animal cells, which typically range from 5 to 30 fim. 



(b) Plant cell 

Structures labeled in red are unique to either animal or plant cells. 
Eukaryotic microorganisms (such as protists and fungi) have structures 
similar to those in plant and animal cells, but many also contain spe- 
cialized organelles not illustrated here. 
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called plasmids. In nature, some plasmids confer resis- 
tance to toxins and antibiotics in the environment. In 
the laboratory, these DNA segments are especially 
amenable to experimental manipulation and are powerful 
tools for genetic engineering (see Chapter 9). 

Most bacteria (including E. colz) exist as individual 
cells, but cells of some bacterial species (the myxobac- 
teria, for example) show simple social behavior, forming 
many-celled aggregates. 

Eukaryotic Cells Have a Variety of Membranous Organelles, 
Which Can Be Isolated for Study 

Typical eukaryotic cells (Fig. 1-7) are much larger than 
bacteria — commonly 5 to 100 fim in diameter, with cell 
volumes a thousand to a million times larger than those 



of bacteria. The distinguishing characteristics of eukary- 
otes are the nucleus and a variety of membrane-enclosed 
organelles with specific functions: mitochondria, endo- 
plasmic reticulum, Golgi complexes, peroxisomes, and 
lysosomes. In addition to these, plant cells also contain 
vacuoles and chloroplasts (Fig. 1-7). Also present in the 
cytoplasm of many cells are granules or droplets con- 
taining stored nutrients such as starch and fat. 

In a major advance in biochemistry, Albert Claude, 
Christian de Duve, and George Palade developed meth- 
ods for separating organelles from the cytosol and from 
each other — an essential step in investigating their 
structures and functions. In a typical cell fractionation 
(Fig. 1-8), cells or tissues in solution are gently dis- 
rupted by physical shear. This treatment ruptures the 
plasma membrane but leaves most of the organelles 
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FIGURE 1-8 Subcellular fractionation of tissue. A tissue such as liver 
is first mechanically homogenized to break cells and disperse their 
contents in an aqueous buffer. The sucrose medium has an osmotic 
pressure similar to that in organelles, thus balancing diffusion of water 
into and out of the organelles, which would swell and burst in a solu- 
tion of lower osmolarity (see Fig. 2-12). (a) The large and small parti- 
cles in the suspension can be separated by centrifugation at different 
speeds, or (b) particles of different density can be separated by isopyc- 
nic centrifugation. In isopyenic centrifugation, a centrifuge tube is 
filled with a solution, the density of which increases from top to bot- 
tom; a solute such as sucrose is dissolved at different concentrations to 
produce the density gradient. When a mixture of organelles is layered 
on top of the density gradient and the tube is centrifuged at high speed, 
individual organelles sediment until their buoyant density exactly 
matches that in the gradient. Each layer can be collected separately. 
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intact. The homogenate is then centrifuged; organelles 
such as nuclei, mitochondria, and lysosomes differ in 
size and therefore sediment at different rates. 

Differential centrifugation results in a rough frac- 
tionation of the cytoplasmic contents, which may be 
further purified by isopycnic ("same density") centrifu- 
gation. In this procedure, organelles of different buoy- 
ant densities (the result of different ratios of lipid and 
protein in each type of organelle) are separated by cen- 
trifugation through a column of solvent with graded 
density. By carefully removing material from each re- 
gion of the gradient and observing it with a microscope, 
the biochemist can establish the sedimentation position 
of each organelle and obtain purified organelles for fur- 
ther study. For example, these methods were used to es- 
tablish that lysosomes contain degradative enzymes, 
mitochondria contain oxidative enzymes, and chloro- 
plasts contain photosynthetic pigments. The isolation of 
an organelle enriched in a certain enzyme is often the 
first step in the purification of that enzyme. 

The Cytoplasm Is Organized by the Cytoskeleton and Is 
Highly Dynamic 

Fluorescence microscopy reveals several types of protein 
filaments crisscrossing the eukaryotic cell, forming 
an interlocking three-dimensional meshwork, the cyto- 




FIGURE 1-9 The three types of cytoskeletal filaments: actin filaments, 
microtubules, and intermediate filaments. Cellular structures can be 
labeled with an antibody (that recognizes a characteristic protein) co- 
valently attached to a fluorescent compound. The stained structures are 
visible when the cell is viewed with a fluorescence microscope, (a) En- 
dothelial cells from the bovine pulmonary artery. Bundles of actin fila- 
ments called "stress fibers" are stained red; microtubules, radiating 



skeleton. There are three general types of cytoplasmic 
filaments — actin filaments, microtubules, and interme- 
diate filaments (Fig. 1-9) — differing in width (from 
about 6 to 22 nm), composition, and specific function. 
All types provide structure and organization to the cyto- 
plasm and shape to the cell. Actin filaments and micro- 
tubules also help to produce the motion of organelles or 
of the whole cell. 

Each type of cytoskeletal component is composed 
of simple protein subunits that associate noncovalently 
to form filaments of uniform thickness. These filaments 
are not permanent structures; they undergo constant 
disassembly into their protein subunits and reassembly 
into filaments. Their locations in cells are not rigidly 
fixed but may change dramatically with mitosis, cytoki- 
nesis, amoeboid motion, or changes in cell shape. The 
assembly, disassembly, and location of all types of fila- 
ments are regulated by other proteins, which serve to 
link or bundle the filaments or to move cytoplasmic 
organelles along the filaments. 

The picture that emerges from this brief survey of 
eukaryotic cell structure is of a cell with a meshwork of 
structural fibers and a complex system of membrane- 
enclosed compartments (Fig. 1-7). The filaments disas- 
semble and then reassemble elsewhere. Membranous 
vesicles bud from one organelle and fuse with another. 
Organelles move through the cytoplasm along protein 




(b) 



from the cell center, are stained green; and chromosomes (in the nu- 
cleus) are stained blue, (b) A newt lung cell undergoing mitosis. Micro- 
tubules (green), attached to structures called kinetochores (yellow) on 
the condensed chromosomes (blue), pull the chromosomes to opposite 
poles, or centrosomes (magenta), of the cell. Intermediate filaments, 
made of keratin (red), maintain the structure of the cell. 
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filaments, their motion powered by energy-dependent 
motor proteins. The endomembrane system segre- 
gates specific metabolic processes and provides sur- 
faces on which certain enzyme-catalyzed reactions 
occur. Exocytosis and endocytosis, mechanisms of 
transport (out of and into cells, respectively) that in- 
volve membrane fusion and fission, provide paths be- 
tween the cytoplasm and surrounding medium, allowing 
for secretion of substances produced in the cell and 
uptake of extracellular materials. 

(a) Some of the amino acids of proteins 



. ? 

H 3 N-C- 



00" 
H 
CH 3 
Alanine 




COO" 
+ I 
H 3 N-(^-H 

CH 2 OH 
Serine 



H,N— C— H 




? oo- 

H 3 N-(^-H 
<j!H 2 
COO" 
Aspartate 



COO" 

+ I 
H3N-C-H 

CH 2 
SH 

Cysteine 



Histidine 



(b) The components of nucleic acids 



0 

II 



o 



HINT ^CH HN^ v c 



XH 3 



O 



H 

Uracil 



,CH 



or n 

H 

Thymine 



NH 2 

N^^CH 

I II 
^C. .CH 

H 

Cytosine 



I CH 
H 



? 



hn 

I II CH 

H 2 N^ C% N^ C ^N 
i H 



Adenine Guanine 
Nitrogenous bases 



Although complex, this organization of the cyto- 
plasm is far from random. The motion and positioning of 
organelles and cytoskeletal elements are under tight 
regulation, and at certain stages in its life, a eukaryotic 
cell undergoes dramatic, finely orchestrated reorganiza- 
tions, such as the events of mitosis. The interactions be- 
tween the cytoskeleton and organelles are noncovalent, 
reversible, and subject to regulation in response to 
various intracellular and extracellular signals. 

Cells Build Supramolecular Structures 

Macromolecules and their monomeric subunits differ 
greatly in size (Fig. 1-10). An alanine molecule is less 
than 0.5 nm long. A molecule of hemoglobin, the oxygen- 
carrying protein of erythrocytes (red blood cells), con- 
sists of nearly 600 amino acid subunits in four long chains, 
folded into globular shapes and associated in a structure 
5.5 nm in diameter. In turn, proteins are much smaller 
than ribosomes (about 20 nm in diameter), which are in 
turn much smaller than organelles such as mitochondria, 

FIGURE 1-10 The organic compounds from which most cellular ma- 
terials are constructed: the ABCs of biochemistry. Shown here are 
(a) six of the 20 amino acids from which all proteins are built (the side 
chains are shaded pink); (b) the five nitrogenous bases, two five- 
carbon sugars, and phosphate ion from which ail nucleic acids are 
built; (c) five components of membrane lipids; and (d) D-giucose, the 
simple sugar from which most carbohydrates are derived. Note that 
phosphate is a component of both nucleic acids and membrane lipids. 

(c) Some components of lipids 



O" 

I 

HO-P-OH 
O 

Phosphate 



HOCHs^O H 
OH OH 



HOCH^O H 



H 



H 



H 


I 


OH 


H 



OH 



a-D-Ribose 2-Deoxy-a-D-ribose 
Five-carbon sugars 



COCT 


COO" 


1 

CH 2 


1 

CH 2 

1 


CH 2 


CH 2 


1 

CH 2 


CH 2 

1 


CH 2 


CH 2 


1 

CH 2 


1 

CH 2 


1 

CH 2 


1 

CH 2 


1 

! ch 2 


CH 2 


! CH 


CH 2 


II 

CH 


1 

CH 2 


1 

1 CH 2 


CH 2 
1 


■ 1 
CH 2 


CH 2 
1 


1 

CH 2 


CH 2 


1 

CH 2 


1 

CH 2 
1 


1 

CH 2 


CH 2 


1 

CH 2 


CH 3 


1 

CH 2 


Palmitate 


CH 3 




Oleate 





CHoOH 

I 

CHOH 

I 

CH 2 OH 
Glycerol 



CH 3 

+ 1 

CH 3 — N— CH 2 CH 2 OH 

CH 3 
Choline 



(d) The parent sugar 





CH2OH 








4 




1 










JjJ^H 


HO 








H 


OH 


a 


-D-Glucose 



[10] The Foundations of Biochemistry 



Level 4: 
The cell 
and its organelles 



Level 3: Level 2: Level 1: 

Supramolecular Macromolecules Monomeric units 
complexes 

DNA Nucleotides ^ 




Cell wall 



FIGURE 1-11 Structural hierarchy in the molecular organization of 
cells. The nucleus of this plant cell is an organelle containing several 
types of supramolecular complexes, including chromatin. Chromatin 

typically 1 ,000 run in diameter. It is a long jump from sim- 
ple biomolecules to cellular structures that can be seen 
with the light microscope. Figure 1-11 illustrates the 
structural hierarchy in cellular organization. 

The monomeric subunits of proteins, nucleic acids, 
and polysaccharides are joined by covalent bonds. In 
supramolecular complexes, however, macromolecules 
are held together by noncovalent interactions — much 
weaker, individually, than covalent bonds. Among these 
noncovalent interactions are hydrogen bonds (between 
polar groups), ionic interactions (between charged 
groups), hydrophobic interactions (among nonpolar 
groups in aqueous solution), and van der Waals interac- 
tions (London forces) — all of which have energies much 
smaller than those of covalent bonds. These noncova- 
lent interactions are described in Chapter 2. The large 
numbers of weak interactions between macromolecules 
in supramolecular complexes stabilize these assemblies, 
producing their unique structures. 

In Vitro Studies May Overlook Important Interactions 
among Molecules 

One approach to understanding a biological process is to 
study purified molecules in vitro ("in glass" — in the test 
tube), without interference from other molecules pres- 
ent in the intact cell— that is, in vivo ("in the living"). Al- 
though this approach has been remarkably revealing, we 
must keep in mind that the inside of a cell is quite differ- 
ent from the inside of a test tube. The "interfering" com- 
ponents eliminated by purification may be critical to the 
biological function or regulation of the molecule purified. 
For example, in vitro studies of pure enzymes are com- 
monly done at very low enzyme concentrations in thor- 
oughly stirred aqueous solutions. In the cell, an enzyme 



consists of two types of macromolecules, DNA and many different 
proteins, each made up of simple subunits. 



is dissolved or suspended in a gel-like cytosol with thou- 
sands of other proteins, some of which bind to that en- 
zyme and influence its activity. Some enzymes are 
components of multienzyme complexes in which reac- 
tants are channeled from one enzyme to another, never 
entering the bulk solvent. Diffusion is hindered in the 
gel-like cytosol, and the cytosolic composition varies 
throughout the cell. In short, a given molecule may be- 
have quite differently in the cell and in vitro. A central 
challenge of biochemistry is to understand the influences 
of cellular organization and macromolecular associations 
on the function of individual enzymes and other biomol- 
ecules — to understand function in vivo as well as in vitro. 

SUMMARY 1.1 Cellular Foundations 

■ All cells are bounded by a plasma membrane; 
have a cytosol containing metabolites, coenzymes, 
inorganic ions, and enzymes; and have a set of 
genes contained within a nucleoid (bacteria and 
archaea) or nucleus (eukaryotes). 

■ Phototrophs use sunlight to do work; chemotrophs 
oxidize fuels, passing electrons to good electron 
acceptors: inorganic compounds, organic 
compounds, or molecular oxygen. 

■ Bacterial and archaeal cells contain cytosol, a 
nucleoid, and plasmids. Eukaryotic cells have a 
nucleus and are multicompartmented, with certain 
processes segregated in specific organelles; 
organelles can be separated and studied in isolation. 

■ Cytoskeletal proteins assemble into long filaments 
that give cells shape and rigidity and serve as rails 
along which cellular organelles move throughout 
the cell. 
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FIGURE 1-12 Elements essential to animal life 
and health. Bulk elements (shaded orange) are 
structural components of cells and tissues and are 
required in the diet in gram quantities daily. For 
trace elements (shaded bright yellow), the re- 
quirements are much smaller: for humans, a few 
milligrams per day of Fe, Cu, and Zn, even less of 
the others. The elemental requirements for plants 
and microorganisms are similar to those shown 
here; the ways in which they acquire these ele- 
ments vary. 



Supramolecular complexes are held together by 
noncovalent interactions and form a hierarchy of 
structures, some visible with the light microscope. 
When individual molecules are removed from these 
complexes to be studied in vitro, interactions 
important in the living cell may be lost. 



1.2 Chemical Foundations 

Biochemistry aims to explain biological form and function 
in chemical terms. By the late eighteenth century, chemists 
had concluded that the composition of living matter is 
strikingly different from that of the inanimate world. An- 
toine-Laurent Lavoisier (1743-1794) noted the relative 
chemical simplicity of the "mineral world" and contrasted 
it with the complexity of the "plant and animal worlds"; the 
latter, he knew, were composed of compounds rich in the el- 
ements carbon, oxygen, nitrogen, and phosphorus. 

During the first half of the twentieth century, paral- 
lel biochemical investigations of glucose breakdown in 
yeast and in animal muscle cells revealed remarkable 
chemical similarities in these two apparently very differ- 
ent cell types; the breakdown of glucose in yeast and 
muscle cells involved the same 10 chemical intermedi- 
ates, and the same 10 enzymes. Subsequent studies of 
many other biochemical processes in many different or- 
ganisms have confirmed the generality of this observa- 
tion, neatly summarized in 1954 by Jacques Monod: 
"What is true of E. coli is true of the elephant." The cur- 
rent understanding that all organisms share a common 



evolutionary origin is based in part on this observed uni- 
versality of chemical intermediates and transformations, 
often termed "biochemical unity." 

Fewer than 30 of the more than 90 naturally occur- 
ring chemical elements are essential to organisms. Most 
of the elements in living matter have relatively low atomic 
numbers; only two have atomic numbers above that of se- 
lenium, 34 (Fig. 1-12). The four most abundant ele- 
ments in living organisms, in terms of percentage of total 
number of atoms, are hydrogen, oxygen, nitrogen, and 
carbon, which together make up more than 99% of the 
mass of most cells. They are the lightest elements capable 
of efficiently forming one, two, three, and four bonds, re- 
spectively; in general, the lightest elements form the 
strongest bonds. The trace elements (Fig. 1-12) repre- 
sent a miniscule fraction of the weight of the human body, 
but all are essential to life, usually because they are es- 
sential to the function of specific proteins, including 
many enzymes. The oxygen-transporting capacity of the 
hemoglobin molecule, for example, is absolutely depend- 
ent on four iron ions that make up only 0.3% of its mass. 

Biomolecules Are Compounds of Carbon with a Variety of 
Functional Groups 

The chemistry of living organisms is organized around 
carbon, which accounts for more than half the dry 
weight of cells. Carbon can form single bonds with hy- 
drogen atoms, and both single and double bonds with 
oxygen and nitrogen atoms (Fig. 1-13). Of greatest sig- 
nificance in biology is the ability of carbon atoms to form 
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FIGURE 1-13 Versatility of carbon 
bonding. Carbon can form covalent 
single, double, and triple bonds (all 
bonds in red), particularly with other 
carbon atoms. Triple bonds are rare 
in biomolecules. 
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FIGURE 1-14 Geometry of carbon bonding, (a) Carbon atoms have a 
characteristic tetrahedral arrangement of their four single bonds, 
(b) Carbon-carbon single bonds have freedom of rotation, as shown 




for the compound ethane (CH 3 — CH 3 ). (c) Double bonds are shorter 
and do not allow free rotation. The two doubly bonded carbons and 
the atoms designated A, B, X, and Y all lie in the same rigid plane. 
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FIGURE 1-15 Some common functional groups of biomolecules. In 

this figure and throughout the book, we use R to represent "any sub- 
stituent." It may be as simple as a hydrogen atom, but typically it is a 



Phosphoanhydride R 




Mixed anhydride R— C— P— OH 

(carboxylic acid and 
phosphoric acid; 
also called acyl 
phosphate) 

carbon-containing group. When two or more substituents are shown in 
a molecule, we designate them R 1 , R 2 , and so forth. 
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very stable single bonds with up to four other carbon 
atoms. Two carbon atoms also can share two (or three) 
electron pairs, thus forming double (or triple) bonds. 

The four single bonds that can be formed by a car- 
bon atom project from the nucleus to the four apices of 
a tetrahedron (Fig. 1- 14), with an angle of about 109.5° 
between any two bonds and an average bond length of 
0.154 nm. There is free rotation around each single 
bond, unless very large or highly charged groups are at- 
tached to both carbon atoms, in which case rotation may 
be restricted. A double bond is shorter (about 0.134 nm) 
and rigid and allows only limited rotation about its axis. 

Covalently linked carbon atoms in biomolecules can 
form linear chains, branched chains, and cyclic struc- 
tures. It seems likely that the bonding versatility of car- 
bon, with itself and with other elements, was a major 
factor in the selection of carbon compounds for the mo- 
lecular machinery of cells during the origin and evolu- 
tion of living organisms. No other chemical element can 
form molecules of such widely different sizes, shapes, 
and composition. 

Most biomolecules can be regarded as derivatives of 
hydrocarbons, with hydrogen atoms replaced by a vari- 
ety of functional groups that confer specific chemical 
properties on the molecule, forming various families of 
organic compounds. Typical of these are alcohols, which 
have one or more hydroxyl groups; amines, with amino 
groups; aldehydes and ketones, with carbonyl groups; 
and carboxylic acids, with carboxyl groups (Fig. 1-15). 
Many biomolecules are polyfunctional, containing two 
or more types of functional groups (Fig. 1-16), each 
with its own chemical characteristics and reactions. The 



chemical "personality" of a compound is determined by 
the chemistry of its functional groups and their disposi- 
tion in three-dimensional space. 

Cells Contain a Universal Set of Small Molecules 

Dissolved in the aqueous phase (cytosol) of all cells is a 
collection of perhaps a thousand different small organic 
molecules (M r —100 to —500), the central metabolites 
in the major pathways occurring in nearly every cell — 
the metabolites and pathways that have been conserved 
throughout the course of evolution. (See Box 1-1 for an 
explanation of the various ways of referring to molecular 
weight.) This collection of molecules includes the com- 
mon amino acids, nucleotides, sugars and their phos- 
phorylated derivatives, and mono-, di-, and tricarboxylic 
acids. The molecules are polar or charged, water solu- 
ble, and present in micromolar to millimolar concentra- 
tions. They are trapped in the cell because the plasma 
membrane is impermeable to them — although specific 
membrane transporters can catalyze the movement of 
some molecules into and out of the cell or between com- 
partments in eukaryotic cells. The universal occurrence 
of the same set of compounds in living cells reflects the 
evolutionary conservation of metabolic pathways that 
developed in the earliest cells. 

There are other small biomolecules, specific to 
certain types of cells or organisms. For example, vascu- 
lar plants contain, in addition to the universal set, 
small molecules called secondary metabolites, which 
play roles specific to plant life. These metabolites in- 
clude compounds that give plants their characteristic 
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FIGURE 1-16 Several common functional groups in a single biomolecule. 

Acetyl-coenzyme A (often abbreviated as acetyl-CoA) is a carrier of acetyl 
groups in some enzymatic reactions. The functional groups are screened in 
the structural formula. As we will see in Chapter 2, several of these func- 



tional groups can exist in protonated or unprotonated forms, depending 
on the pH. In the space-filling model, N is blue, C is black, P is orange, 
O is red, and H is white. The yellow atom at the left is the sulfur of the crit- 
ical thioester bond between the acetyl moiety and coenzyme A. 
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Molecular Weight, Molecular Mass, 



There are two common (and equivalent) ways to de- 
scribe molecular mass; both are used in this text. The 
first is molecular weight, or relative molecular mass, 
denoted Af r . The molecular weight of a substance is de- 
fined as the ratio of the mass of a molecule of that sub- 
stance to one-twelfth the mass of carbon-12 ( 12 C). Since 
M r is a ratio, it is dimensionless — it has no associated 
units. The second is molecular mass, denoted m. This 
is simply the mass of one molecule, or the molar mass di- 
vided by Avogadro's number. The molecular mass, m } is 
expressed in daltons (abbreviated Da). One dalton is 
equivalent to one-twelfth the mass of carbon-12; a kilo- 
dalton (kDa) is 1,000 daltons; a megadalton (MDa) is 
1 million daltons. 



and Their Correct Unit 




Consider, for example, a molecule with a mass 1,000 
times that of water. We can say of this molecule either 
M T = 18,000 or m = 18,000 daltons. We can also de- 
scribe it as an "18 kDa molecule." However, the expres- 
sion M T - 18,000 daltons is incorrect. 

Another convenient unit for describing the mass of 
a single atom or molecule is the atomic mass unit (for- 
merly amu, now commonly denoted u). One atomic 
mass unit (1 u) is defined as one-twelfth the mass of an 
atom of carbon-12. Since the experimentally measured 
mass of an atom of carbon-12 is 1.9926 X 10~ 23 g, 1 u = 
1.6606 X 10~ 24 g. The atomic mass unit is convenient 
for describing the mass of a peak observed by mass 
spectrometry (see Box 3-2). 



scents, and compounds such as morphine, quinine, nico- 
tine, and caffeine that are valued for their physiolog- 
ical effects on humans but used for other purposes 
by plants. 

The entire collection of small molecules in a given 
cell has been called that cell's metabolome, in parallel 
with the term "genome" (defined earlier and expanded 
on in Section 1.5). 

Macromolecules Are the Major Constituents of Cells 

Many biological molecules are macromolecules, poly- 
mers with molecular weights above —5,000 that are as- 
sembled from relatively simple precursors. Shorter 
polymers are called oligomers (Greek oligos, "few"). 
Proteins, nucleic acids, and polysaccharides are macro- 
molecules composed of monomers with molecular 
weights of 500 or less. Synthesis of macromolecules is a 
major energy-consuming activity of cells. Macromole- 
cules themselves may be further assembled into 
supramolecular complexes, forming functional units 



TABLE 1-1 



Molecular Components of an £ coli Cell 



Percentage of 
total weight 
of cell 



Approximate 
number of 
different 
molecular 



Water 


70 


1 


Proteins 


15 


3,000 


Nucleic acids 






DNA 


1 


1-4 


RNA 


6 


>3,000 


Polysaccharides 


3 


10 


Lipids 


2 


20 


Monomeric subunits 






and intermediates 


2 


500 


Inorganic ions 


1 


20 



such as ribosomes. Table 1-1 shows the major classes 
of biomolecules in anE. coli cell. 

Proteins, long polymers of amino acids, constitute 
the largest fraction (besides water) of a cell. Some pro- 
teins have catalytic activity and function as enzymes; 
others serve as structural elements, signal receptors, or 
transporters that carry specific substances into or out of 
cells. Proteins are perhaps the most versatile of all bio- 
molecules; a catalog of their many functions would be 
very long. The sum of all the proteins functioning in a 
given cell is the cell's proteome. The nucleic acids, 
DNA and RNA, are polymers of nucleotides. They store 
and transmit genetic information, and some RNA mole- 
cules have structural and catalytic roles in supramolec- 
ular complexes. The polysaccharides, polymers of 
simple sugars such as glucose, have three major func- 
tions: as energy-rich fuel stores, as rigid structural com- 
ponents of cell walls (in plants and bacteria), and as 
extracellular recognition elements that bind to proteins 
on other cells. Shorter polymers of sugars (oligosaccha- 
rides) attached to proteins or lipids at the cell surface 
serve as specific cellular signals. The lipids, water- 
insoluble hydrocarbon derivatives, serve as structural 
components of membranes, energy-rich fuel stores, pig- 
ments, and intracellular signals. 

Proteins, polynucleotides, and polysaccharides have 
large numbers of monomeric subunits and thus high 
molecular weights — in the range of 5,000 to more than 
1 million for proteins, up to several billion for nucleic 
acids, and in the millions for polysaccharides such as 
starch. Individual lipid molecules are much smaller (M r 
750 to 1,500) and are not classified as macromolecules. 
But they can associate noncovalently into very large 
structures. Cellular membranes are built of enormous 
noncovalent aggregates of lipid and protein molecules. 

Given their characteristic information-rich subunit 
sequences, proteins and nucleic acids are often referred 
to as informational macromolecules. Some oligosac- 
charides, as noted above, also serve as informational 
molecules. 
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Three-Dimensional Structure Is Described by Configuration 
and Conformation 

The covalent bonds and functional groups of a biomole- 
cule are, of course, central to its function, but so also is 
the arrangement of the molecule's constituent atoms in 
three-dimensional space — its stereochemistry. A carbon- 
containing compound commonly exists as stereoiso- 
mers, molecules with the same chemical bonds but 
different configuration, the fixed spatial arrangement 
of atoms. Interactions between biomolecules are invari- 
ably stereospecific, requiring specific configurations in 
the interacting molecules. 

Figure 1-17 shows three ways to illustrate the 
stereochemistry, or configuration, of simple molecules. 
The perspective diagram specifies stereochemistry un- 
ambiguously, but bond angles and center-to-center bond 
lengths are better represented with ball-and-stick mod- 
els. In space-filling models, the radius of each "atom" is 
proportional to its van der Waals radius, and the con- 
tours of the model define the space occupied by the mol- 
ecule (the volume of space from which atoms of other 
molecules are excluded). 

Configuration is conferred by the presence of either 
(1) double bonds, around which there is no freedom of 
rotation, or (2) chiral centers, around which substituent 
groups are arranged in a specific orientation. The identi- 
fying characteristic of stereoisomers is that they cannot 
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FIGURE 1-17 Representations of molecules. Three ways to represent 
the structure of the amino acid alanine (shown here in the ionic form 
found at neutral pH). (a) Structural formula in perspective form: a solid 
wedge ) represents a bond in which the atom at the wide end pro- 
jects out of the plane of the paper, toward the reader; a dashed wedge 

( ) represents a bond extending behind the plane of the paper. 

(b) Ball-and-stick model, showing relative bond lengths and the bond 
angles, (c) Space-filling model, in which each atom is shown with its 
correct relative van der Waals radius. 



be interconverted without temporarily breaking one or 
more covalent bonds. Figure l-18a shows the configu- 
rations of maleic acid and its isomer, fumaric acid. These 
compounds are geometric isomers, or cis-trmns iso- 
mers; they differ in the arrangement of their substituent 
groups with respect to the nonrotating double bond 
(Latin cis, "on this side" — groups on the same side of 
the double bond; trans, "across" — groups on opposite 
sides). Maleic acid (maleate at the neutral pH of cyto- 
plasm) is the cis isomer and fumaric acid (fumarate) the 

FIGURE 1-18 Configurations of geometric isomers, (a) Isomers such 
as maleic acid (maleate at pH 7) and fumaric acid (fumarate) cannot be 
interconverted without breaking covalent bonds, which requires tim 
input of much more energy than the average kinetic energy of mole- 
cules at physiological temperatures, (b) In the vertebrate retina, the ini- 
tial event in light detection is the absorption of visible light by 
1 1-c/s-retinal. The energy of the absorbed light (about 250 kj/mol) con- 
verts 1 1-c/s-retinal to all-trans-retinal, triggering electrical changes ift 
the retinal cell that lead to a nerve impulse. (Note that the hydrogen 
atoms are omitted from the ball-and-stick models of the retinals.) 
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FIGURE 1-19 Molecular asymmetry: chiral and achiral molecules, (a) 

When a carbon atom has four different substituent groups (A, B, X, Y), 
they can be arranged in two ways that represent nonsuperposable mir- 
ror images of each other (enantiomers). This asymmetric carbon atom is 
called a chiral atom or chiral center (b) When a tetrahedral carbon has 




only three dissimilar groups (i.e., the same group occurs twice), only 
one configuration is possible and the molecule is symmetric, or achiral. 
In this case the molecule is superposable on its mirror image: the mol- 
ecule on the left can be rotated counterclockwise (when looking down 
the vertical bond from A to C) to create the molecule in the mirror. 



trans isomer; each is a well-defined compound that can 
be separated from the other, and each has its own 
unique chemical properties. A binding site (on an en- 
zyme, for example) that is complementary to one of 
these molecules would not be complementary to the 
other, which explains why the two compounds have dis- 
tinct biological roles despite their similar chemistry. 

In the second type of stereoisomer, four different 
substituents bonded to a tetrahedral carbon atom may 
be arranged in two different ways in space — that is, have 
two configurations (Fig. 1-19)— yielding two stereoiso- 
mers with similar or identical chemical properties but 
differing in certain physical and biological properties. A 
carbon atom with four different substituents is said to 



be asymmetric, and asymmetric carbons are called chi- 
ral centers (Greek chiros, "hand"; some stereoisomers 
are related structurally as the right hand is to the left). 
A molecule with only one chiral carbon can have two 
stereoisomers; when two or more (n) chiral carbons are 
present, there can be 2 n stereoisomers. Stereoisomers 
that are mirror images of each other are called enan- 
tiomers (Fig. 1-19). Pairs of stereoisomers that are not 
mirror images of each other are called diastereomers 
(Fig. 1-20). 

As Louis Pasteur first observed in 1848 (Box 1-2), 
enantiomers have nearly identical chemical properties 
but differ in a characteristic physical property: their in- 
teraction with plane-polarized light. In separate solutions, 
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FIGURE 1-20 Two types of stereoisomers. There are four different 2,3- stick model, which has been rotated to allow the reader to view all the 
disubstituted butanes (n = 2 asymmetric carbons, hence 2 n = 4 stereoiso- groups. Two pairs of stereoisomers are mirror images of each other, or 
mers). Each is shown in a box as a perspective formula and a ball-and- enantiomers. Other pairs are not mirror images; these are diastereomers. 
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BOX 1-2 Louis Pasteur and Optical Activity: Ir 



Louis Pasteur 
1822-1895 



Louis Pasteur encountered the phenomenon 
of optical activity in 1843 ; during his in- 
vestigation of the crystalline sediment that 
accumulated in wine casks (a form of tar- 
taric acid called paratartaric acid — also 
called racemic acid, from Latin racemus, 
"bunch of grapes"). He used fine forceps to 
separate two types of crystals identical in 
shape but mirror images of each other. Both 
types proved to have all the chemical prop- 
erties of tartaric acid, but in solution one 
type rotated plane-polarized light to the left 
(levorotatory), the other to the right (dex- 
trorotatory). Pasteur later described the 
experiment and its interpretation: 

In isomeric bodies, the elements and the propor- 
tions in which they are combined are the same, only 
the arrangement of the atoms is different ... We 
know, on the one hand, that the molecular arrange- 
ments of the two tartaric acids are asymmetric, and, 
on the other hand, that these arrangements are ab- 
solutely identical, excepting that they exhibit asym- 
metry in opposite directions. Are the atoms of the 
dextro acid grouped in the form of a right-handed 
spiral, or are they placed at the apex of an irregular 
tetrahedron, or are they disposed according to this 
or that asymmetric arrangement? We do not know* 




Now we do know. X-ray crystallographic 
studies in 1951 confirmed that the levorota- 
tory and dextrorotatory forms of tartaric 
acid are mirror images of each other at the 
molecular level and established the absolute 
configuration of each (Fig. 1). The same 
approach has been used to demonstrate that 
although the amino acid alanine has two 
stereoisomeric forms (designated d and l), 
alanine in proteins exists exclusively in one 
form (the l isomer; see Chapter 3). 
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FIGURE 1 Pasteur separated crystals of two stereoisomers of tartaric 
acid and showed that solutions of the separated forms rotated plane- 
polarized light to the same extent but in opposite directions. These dex- 
trorotatory and levorotatory forms were later shown to be the {R,R) 
and (5,5) isomers represented here. The RS system of nomenclature is 
explained in the text. 

*From Pasteur's lecture to the Societe Chimique de Paris in 1883, 
quoted in DuBos, R. (1 976) Louis Pasteur: Free Lance of Science, 
p. 95, Charles Scribner's Sons, New York. 



two enantiomers rotate the plane of plane-polarized light 
in opposite directions, but an equimolar solution of the 
two enantiomers (a racemic mixture) shows no optical 
rotation. Compounds without chiral centers do not rotate 
the plane of plane-polarized light. 

KEY CONVENTION: Given the importance of stereochem- 
istry in reactions between biomolecules (see below), 
biochemists must name and represent the structure of 
each biomolecule so that its stereochemistry is unam- 
biguous. For compounds with more than one chiral cen- 
ter, the most useful system of nomenclature is the RS 
system. In this system, each group attached to a chiral 
carbon is assigned a priority. The priorities of some 
common substituents are 



-OCH, > —OH > 



-NH 2 > — COOH > 
— CHO > — CH 2 OH > 



-CH, > — H 



For naming in the RS system, the chiral atom is viewed 
with the group of lowest priority (4 in the following dia- 
gram) pointing away from the viewer. If the priority of 
the other three groups (1 to 3) decreases in clockwise 
order, the configuration is (R) (Latin rectus, "right"); if 
counterclockwise, the configuration is (5) (Latin sinis- 
ter, "left"). In this way each chiral carbon is designated 



either (R) or (5), and the inclusion of these designa- 
tions in the name of the compound provides an unam- 
biguous description of the stereochemistry at each 
chiral center. 





Clockwise 
(R) 



Counterclockwise 

(S) 



Another naming system for stereoisomers, the d and L 
system, is described in Chapter 3. A molecule with a 
single chiral center (the two isomers of glyceralde- 
hyde, for example) can be named unambiguously by 
either system. ■ 
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CH 2 OH 
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Distinct from configuration is molecular confor- 
mation, the spatial arrangement of substituent groups 
that, without breaking any bonds, are free to assume 
different positions in space because of the freedom of 
rotation about single bonds. In the simple hydrocar- 
bon ethane, for example, there is nearly complete 
freedom of rotation around the C — C bond. Many dif- 
ferent, interconvertible conformations of ethane are 
possible, depending on the degree of rotation (Fig. 
1-21). Two conformations are of special interest: the 
staggered, which is more stable than all others and 
thus predominates, and the eclipsed, which is least 
stable. We cannot isolate either of these conforma- 
tional forms, because they are freely interconvert- 
ible. However, when one or more of the hydrogen 
atoms on each carbon is replaced by a functional 
group that is either very large or electrically charged, 
freedom of rotation around the C — C bond is hin- 
dered. This limits the number of stable conforma- 
tions of the ethane derivative. 

Interactions between Biomolecules Are Stereospecific 

When biomolecules interact, the "fit" between them must 
be stereochemically correct. The three-dimensional struc- 
ture of biomolecules large and small — the combination of 
configuration and conformation — is of the utmost impor- 
tance in their biological interactions: reactant with its 
enzyme, hormone with its receptor on a cell surface, anti- 
gen with its specific antibody, for example (Fig. 1-22). 
The study of biomolecular stereochemistry, with precise 
physical methods, is an important part of modern research 
on cell structure and biochemical function. 
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FIGURE 1-21 Conformations. Many conformations of ethane are 
possible because of freedom of rotation around the C — C bond. In 
the ball-and-stick model, when the front carbon atom (as viewed by 
the reader) with its three attached hydrogens is rotated relative to the 
rear carbon atom, the potential energy of the molecule rises to a 
maximum in the fully eclipsed conformation (torsion angle 0°, 120°, 
etc.), then falls to a minimum in the fully staggered conformation 
(torsion angle 60°, 180°, etc), Because the energy differences are 
small enough to allow rapid interconversion of the two forms (mil- 
lions of times per second), the eclipsed and staggered forms cannot 
be separately isolated. 



In living organisms, chiral molecules are usually pres- 
ent in only one of their chiral forms. For example, the 
amino acids in proteins occur only as their l isomers; glu- 
cose occurs only as its d isomer. (The conventions for 
naming stereoisomers of the amino acids are described in 
Chapter 3; those for sugars, in Chapter 7; the RS system, 
described above, is the most useful for some biomole- 
cules.) In contrast, when a compound with an asymmetric 
carbon atom is chemically synthesized in the laboratory, 
the reaction usually produces all possible chiral forms: a 
mixture of the d and l forms, for example. Living cells pro- 
duce only one chiral form of a biomolecule because the en- 
zymes that synthesize that molecule are also chiral. 

Stereospecificity, the ability to distinguish between 
stereoisomers, is a property of enzymes and other pro- 
teins and a characteristic feature of the molecular logic 
of living cells. If the binding site on a protein is comple- 
mentary to one isomer of a chiral compound, it will not 
be complementary to the other isomer, for the same rea- 
son that a left glove does not fit a right hand. Some 
striking examples of the ability of biological systems to 
distinguish stereoisomers are shown in Figure 1-23. 

The common classes of chemical reactions encoun- 
tered in biochemistry are described in Chapter 13, as an 
introduction to the reactions of metabolism. 

SUMMARY 1.2 Chemical Foundations 

■ Because of its bonding versatility, carbon can 
produce a broad array of carbon-carbon skeletons 
with a variety of functional groups; these groups 
give biomolecules their biological and chemical 
personalities. 
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FIGURE 1 22 Complementary fit between a macromolecule and a 

small molecule. A segment of RNA from a regulatory region, known as 
TAR, of the human immunodeficiency virus genome (gray) with a 
bound argininamide molecule (colored); the argininamide is used to 
represent an amino acid residue of a protein that binds to the TAR re- 
gion. Argininamide fits into a pocket on the RNA surface and is held in 
this orientation by several noncovalent interactions with the RNA. This 
representation of the RNA molecule is produced with software that can 
calculate the shape of the outer surface of a macromolecule, defined 
either by the van der Waals radii of all the atoms in the molecule or by 
the "solvent exclusion volume/' the volume a water molecule cannot 
penetrate. 
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FIGURE 1-23 Stereoisomers have different 
effects in humans, (a) Two stereoisomers of 
carvone: (K)-carvone (isolated from spearmint 
oil) has the characteristic fragrance of 
spearmint; (5)-carvone (from caraway seed 
oil) smells like caraway, (b) Aspartame, the ar- 
tificial sweetener sold under the trade name 
NutraSweet, is easily distinguishable by taste 
receptors from its bitter-tasting stereoisomer, 
although the two differ only in the configura- 
tion at one of the two chiral carbon atoms, 
(c) The antidepressant medication citalopram 
(trade name Celexa), a selective serotonin re- 
uptake inhibitor, is a racemic mixture of these 
two steroisomers, but only (5)-citalopram has 
the therapeutic effect. A stereochemical^ 
pure preparation of (5)-citalopram (escitalo- 
pram oxalate) is sold under the trade name 
Lexapro. As you might predict, the effective 
dose of Lexapro is one-half the effective dose 
of Celexa. 
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A nearly universal set of several hundred small 
molecules is found in living cells; the interconversions 
of these molecules in the central metabolic 
pathways have been conserved in evolution. 

Proteins and nucleic acids are linear polymers of 
simple monomelic subunits; their sequences contain 
the information that gives each molecule its three- 
dimensional structure and its biological functions. 

Molecular configuration can be changed only by 
breaking covalent bonds. For a carbon atom with 
four different substituents (a chiral carbon), the 
substituent groups can be arranged in two different 
ways, generating stereoisomers with distinct 
properties. Only one stereoisomer is biologically 
active. Molecular conformation is the position of 
atoms in space that can be changed by rotation about 
single bonds, without breaking covalent bonds. 

Interactions between biological molecules are 
almost invariably stereospecific: they require a 
precise complementary match between the 
interacting molecules. 
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Living cells and organisms must perform work to stay 
alive and to reproduce themselves. The synthetic reac- 
tions that occur within cells, like the synthetic 
processes in any factory, require the input of energy. 
Energy is also consumed in the motion of a bacterium 
or an Olympic sprinter, in the flashing of a firefly or the 
electrical discharge of an eel. And the storage and ex- 
pression of information require energy, without which 
structures rich in information inevitably become disor- 
dered and meaningless. 

In the course of evolution, cells have developed 
highly efficient mechanisms for coupling the energy 
obtained from sunlight or fuels to the many energy- 
consuming processes they must carry out. One goal 
of biochemistry is to understand, in quantitative and 
chemical terms, the means by which energy is ex- 
tracted, channeled, and consumed in living cells. We can 
consider cellular energy conversions — like all other 
energy conversions — in the context of the laws of ther- 
modynamics. 
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Living Organisms Exist in a Dynamic Steady State, Never at 
Equilibrium with Their Surroundings 

The molecules and ions contained within a living organ- 
ism differ in kind and in concentration from those in the 
organism's surroundings. A Paramecium in a pond, a 
shark in the ocean, a bacterium in the soil, an apple tree 
in an orchard — all are different in composition from 
their surroundings and, once they have reached matu- 
rity, maintain a more or less constant composition in the 
face of constantly changing surroundings. 

Although the characteristic composition of an or- 
ganism changes little through time, the population of 
molecules within the organism is far from static. Small 
molecules, macromolecules, and supramolecular com- 
plexes are continuously synthesized and broken down in 
chemical reactions that involve a constant flux of mass 
and energy through the system. The hemoglobin mole- 
cules carrying oxygen from your lungs to your brain at 
this moment were synthesized within the past month; 
by next month they will have been degraded and en- 
tirely replaced by new hemoglobin molecules. The glu- 
cose you ingested with your most recent meal is now 
circulating in your bloodstream; before the day is over 
these particular glucose molecules will have been con- 
verted into something else — carbon dioxide or fat, per- 
haps — and will have been replaced with a fresh supply 
of glucose, so that your blood glucose concentration is 
more or less constant over the whole day. The amounts 
of hemoglobin and glucose in the blood remain nearly 
constant because the rate of synthesis or intake of each 
just balances the rate of its breakdown, consumption, or 
conversion into some other product. The constancy of 
concentration is the result of a dynamic steady state, a 
steady state that is far from equilibrium. Maintaining 
this steady state requires the constant investment of en- 
ergy; when a cell can no longer generate energy, it dies 
and begins to decay toward equilibrium with its sur- 
roundings. We consider below exactly what is meant by 
"steady state" and "equilibrium." 

Organisms Transform Energy and Matter from 
Their Surroundings 

For chemical reactions occurring in solution, we can de- 
fine a system as all the constituent reactants and prod- 
ucts, the solvent that contains them, and the immediate 
atmosphere — in short, everything within a denned re- 
gion of space. The system and its surroundings together 
constitute the universe. If the system exchanges nei- 
ther matter nor energy with its surroundings, it is said to 
be isolated. If the system exchanges energy but not 
matter with its surroundings, it is a closed system; if it 
exchanges both energy and matter with its surround- 
ings, it is an open system. 

A living organism is an open system; it exchanges 
both matter and energy with its surroundings. Organ- 
isms derive energy from their surroundings in two ways: 
(1) they take up chemical fuels (such as glucose) from 



the environment and extract energy by oxidizing them 
(see Box 1-3, Case 2); or (2) they absorb energy from 
sunlight. 

The first law of thermodynamics describes the prin- 
ciple of the conservation of energy: in any physical or 
chemical change, the total amount of energy in the 
universe remains constant, although the form of the 
energy may change. Cells are consummate transduc- 
ers of energy, capable of interconverting chemical, elec- 
tromagnetic, mechanical, and osmotic energy with great 
efficiency (Fig. 1-24). 
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FIGURE 1-24 Some energy interconversions in living organisms. 

During metabolic energy transductions, the randomness of the system 
plus surroundings (expressed quantitatively as entropy) increases as the 
potential energy of complex nutrient molecules decreases, (a) Living 
organisms extract energy from their surroundings; (b) convert some of 
it into useful forms of energy to produce work; (c) return some energy 
to the surroundings as heat; and (d) release end-product molecules 
that are less well organized than the starting fuel, increasing the en- 
tropy of the universe. One effect of all these transformations is (e) in- 
creased order {decreased randomness) in the system in the form of 
complex macromolecules. We return to a quantitative treatment of en- 
tropy in Chapter 1 3. 
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BOX 1 -3 Entropy: The Advantages of Being Disorganized 



The term "entropy," which literally means "a change 
within," was first used in 1851 by Rudolf Clausius, one of 
the formulators of the second law of thermodynamics. A 
rigorous quantitative definition of entropy involves sta- 
tistical and probability considerations. However, its na- 
ture can be illustrated qualitatively by three simple 
examples, each demonstrating one aspect of entropy. 
The key descriptors of entropy are randomness and 
disorder, manifested in different ways. 

Case 1: The Teakettle and the Randomization of Heat 

We know that steam generated from boiling water can do 
useful work. But suppose we turn off the burner under a 
teakettle full of water at 100 °C (the "system") in the 
kitchen (the "surroundings") and allow the teakettle to 
cool. As it cools, no work is done, but heat passes from the 
teakettle to the surroundings, raising the temperature of 
the surroundings (the kitchen) by an infinitesimally small 
amount until complete equilibrium is attained. At this 
point all parts of the teakettle and the kitchen are at pre- 
cisely the same temperature. The free energy that was 
once concentrated in the teakettle of hot water at 100 °C, 
potentially capable of doing work, has disappeared. Its 
equivalent in heat energy is still present in the teakettle + 
kitchen (i.e., the "universe") but has become completely 
randomized throughout. This energy is no longer avail- 
able to do work because there is no temperature differen- 
tial within the kitchen. Moreover, the increase in entropy 
of the kitchen (the surroundings) is irreversible. We know 
from everyday experience that heat never spontaneously 
passes back from the kitchen into the teakettle to raise 
the temperature of the water to 100 °C again. 

Case 2: The Oxidation of Glucose 

Entropy is a state not only of energy but of matter. Aer- 
obic (heterotrophic) organisms extract free energy from 
glucose obtained from their surroundings by oxidizing 
the glucose with 0 2 , also obtained from the surround- 
ings. The end products of this oxidative metabolism, 
C0 2 and H 2 0, are returned to the surroundings. In this 
process the surroundings undergo an increase in en- 
tropy, whereas the organism itself remains in a steady 
state and undergoes no change in its internal order. Al- 
though some entropy arises from the dissipation of heat, 
entropy also arises from another kind of disorder, illus- 
trated by the equation for the oxidation of glucose: 

C 6 Hi 2 0 6 + 60 2 -»6C0 2 + 6H 2 0 

We can represent this schematically as 



7 molecules 



12 molecules 




Glucose 
(a solid) 




The atoms contained in 1 molecule of glucose plus 6 
molecules of oxygen, a total of 7 molecules, are more 
randomly dispersed by the oxidation reaction and are 
now present in a total of 12 molecules (6C0 2 + 6H 2 0). 

Whenever a chemical reaction results in an increase 
in the number of molecules — or when a solid substance 
is converted into liquid or gaseous products, which al- 
low more freedom of molecular movement than solids — 
molecular disorder, and thus entropy, increases. 

Case 3: Information and Entropy 

The following short passage from Julius Caesar, Act IV, 
Scene 3, is spoken by Brutus, when he realizes that he 
must face Mark Antony's army. It is an information-rich 
nonrandom arrangement of 125 letters of the English 
alphabet: 

There is a tide in the affairs of men, 
Which, taken at the flood, leads on to fortune; 
Omitted, all the voyage of their life 
Is bound in shallows and in miseries. 

In addition to what this passage says overtly, it has many 
hidden meanings. It not only reflects a complex se- 
quence of events in the play, it also echoes the play's 
ideas on conflict, ambition, and the demands of leader- 
ship. Permeated with Shakespeare's understanding of 
human nature, it is very rich in information. 

However, if the 125 letters making up this quotation 
were allowed to fall into a completely random, chaotic 
pattern, as shown in the following box, they would have 
no meaning whatsoever. 
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In this form the 125 letters contain little or no informa- 
tion, but they are very rich in entropy. Such considera- 
tions have led to the conclusion that information is a 
form of energy; information has been called "negative 
entropy." In fact, the branch of mathematics called in- 
formation theory, which is basic to the programming 
logic of computers, is closely related to thermodynamic 
theory. Living organisms are highly ordered, nonrandom 
structures, immensely rich in information and thus 
entropy-poor. 
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The Flow of Electrons Provides Energy for Organisms 

Nearly all living organisms derive their energy, directly 
or indirectly, from the radiant energy of sunlight. The 
light-driven splitting of water during photosynthesis 
releases its electrons for the reduction of C0 2 and the 
release of 0 2 into the atmosphere: 

light 

6C0 2 + 6H 2 0 > C 6 H 12 0 6 + 60 2 

(light-driven reduction of C0 2 ) 

Nonphotosynthetic cells and organisms obtain the en- 
ergy they need by oxidizing the energy-rich products of 
photosynthesis, then passing the electrons thus ac- 
quired to atmospheric 0 2 to form water, C0 2 , and other 
end products, which are recycled in the environment: 

C 6 Hi 2 0 6 + 0 2 ► 6C0 2 + 6H 2 0 + energy 

(energy-yielding oxidation of glucose) 

Thus autotrophs and heterotrophs participate in global 
cycles of 0 2 and C0 2 , driven ultimately by sunlight, mak- 
ing these two large groups of organisms interdependent. 
Virtually all energy transductions in cells can be traced 
to this flow of electrons from one molecule to another, in 
a "downhill" flow from higher to lower electrochemical 
potential; as such, this is formally analogous to the flow 
of electrons in a battery-driven electric circuit. All these 
reactions involved in electron flow are oxidation- 
reduction reactions: one reactant is oxidized (loses 
electrons) as another is reduced (gains electrons). 

Creating and Maintaining Order Requires Work and Energy 

As we've noted, DNA, RNA, and proteins are informa- 
tional macromolecules; the precise sequence of their 
monomelic subunits contains information, just as the 
letters in this sentence do. In addition to using chemical 
energy to form the covalent bonds between these sub- 
units, the cell must invest energy to order the subunits 
in their correct sequence. It is extremely improbable 
that amino acids in a mixture would spontaneously con- 
dense into a single type of protein, with a unique se- 
quence. This would represent increased order in a 
population of molecules; but according to the second 
law of thermodynamics, the tendency in nature is to- 
ward ever-greater disorder in the universe: the total en- 
tropy of the universe is continually increasing. To 
bring about the synthesis of macromolecules from their 
monomelic units, free energy must be supplied to the 
system (in this case, the cell). 

KEY CONVENTION: The randomness or disorder of the 
components of a chemical system is expressed as en- 
tropy, S (Box 1-3). Any change in randomness of the 
system is expressed as entropy change, AS, which by 
convention has a positive value when randomness in- 
creases. J. Willard Gibbs, who developed the theory of 



energy changes during chemical reactions, showed 
that the free-energy content, G y of any closed system 




can be defined in terms of three 
quantities: enthalpy, H, re- 
flecting the number and kinds 
of bonds; entropy, S; and the 
absolute temperature, T (in 
Kelvin). The definition of free 
energy is G = H - TS. When a 
chemical reaction occurs at con- 
stant temperature, the free- 
energy change, AG, is deter- 
mined by the enthalpy change, 



j. Williard Gibbs, AH, reflecting the kinds and 

i 839-1 903 numbers of chemical bonds and 

noncovalent interactions broken and formed, and the 
entropy change, AS, describing the change in the sys- 
tem's randomness: 

AG = AH - TAS 

where, by definition, AH is negative for a reaction that 
releases heat, and AS is positive for a reaction that in- 
creases the system's randomness. ■ 

A process tends to occur spontaneously only if AG is 
negative (if free energy is released in the process). Yet 
cell function depends largely on molecules, such as pro- 
teins and nucleic acids, for which the free energy of for- 
mation is positive: the molecules are less stable and more 
highly ordered than a mixture of their monomelic com- 
ponents. To carry out these thermodynamically unfavor- 
able, energy-requiring (endergonic) reactions, cells 
couple them to other reactions that liberate free energy 
(exergonic reactions), so that the overall process is ex- 
ergonic: the sum of the free-energy changes is negative. 

The usual source of free energy in coupled biologi- 
cal reactions is the energy released by breakage of phos- 
phoanhydride bonds such as those in adenosine 
triphosphate (ATP; Fig. 1-25) and guanosine triphos- 
phate (GTP). Here, each® represents a phosphoryl 
group: 

Amino acids -» protein AGi is positive (endergonic) 

ATP -> AMP + (PHD AG 2 is negative (exergonic) 

[or ATP — » ADP + (g)] 

When these reactions are coupled, the sum of AG X and 
AG 2 is negative — the overall process is exergonic. By 
this coupling strategy, cells are able to synthesize and 
maintain the information-rich polymers essential to life. 

Energy Coupling Links Reactions in Biology 

The central issue in bioenergetics (the study of energy 
transformations in living systems) is the means by which 
energy from fuel metabolism or light capture is coupled 
to a cell's energy-requiring reactions. In thinking about 
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FIGURE 1-25 Adenosine triphosphate (ATP) provides 
energy. Here, each ® represents a phosphoryl group. 
The removal of the terminal phosphoryl group 
(shaded pink) of ATP, by breakage of a phosphoanhy- 
dride bond to generate adenosine diphosphate (ADP) 
and inorganic phosphate ion (HPO4"), is highly ex- 
ergonic, and this reaction is coupled to many ender- 
gonic reactions in the cell (as in the example in 
Fig. 1 —26b). ATP also provides energy for many cellu- 
lar processes by undergoing cleavage that releases 
the two terminal phosphates as inorganic pyrophos- 
phate (H 2 P 2 07~), often abbreviated PPj. 



O— P— O— P— OH + (P)— Adenosine (Adenosine monophosphate, AMP) 
O O 
Inorganic pyrophosphate (PPj) 

energy coupling, it is useful to consider a simple me- 
chanical example, as shown in Figure l-2tia. An object 
at the top of an inclined plane has a certain amount of po- 
tential energy as a result of its elevation. It tends to slide 
down the plane, losing its potential energy of position as 
it approaches the ground. When an appropriate string- 
and-pulley device couples the falling object to another, 
smaller object, the spontaneous downward motion of the 
larger can lift the smaller, accomplishing a certain 
amount of work. The amount of energy available to do 
work is the free-energy change, AG, this is always 
somewhat less than the theoretical amount of energy re- 
leased, because some energy is dissipated as the heat of 
friction. The greater the elevation of the larger object, 
the greater the energy released (AG) as the object slides 
downward and the greater the amount of work that can 
be accomplished. The larger object can lift the smaller 
only because, at the outset, the larger object was far 
from its equilibrium position: it had at some earlier 
point been elevated above the ground, in a process that 
itself required the input of energy. 

How does this apply in chemical reactions? In 
closed systems, chemical reactions proceed sponta- 



neously until equilibrium is reached. When a system is 
at equilibrium, the rate of product formation exactly 
equals the rate at which product is converted to reac- 
tant. Thus there is no net change in the concentration of 
reactants and products. The energy change as the sys- 
tem moves from its initial state to equilibrium, with no 
changes in temperature or pressure, is given by the free- 
energy change, AG. The magnitude of AG depends on 
the particular chemical reaction and on how far from 
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FIGURE 1-26 Energy coupling in mechanical and chemical processes. 

(a) The downward motion of an object releases potential energy that 
can do mechanical work. The potential energy made available by 
spontaneous downward motion, an exergonic process (pink), can be 
coupled to the endergonic upward movement of another object (blue). 

(b) In reaction 1, the formation of glucose 6-phosphate from glucose 
and inorganic phosphate (P,) yields a product of higher energy than the 
two reactants. For this endergonic reaction, AC is positive. In reaction 2, 
the exergonic breakdown of adenosine triphosphate (ATP) has a large, 
negative free-energy change (AC 2 ). The third reaction is the sum of re- 
actions 1 and 2, and the free-energy change, AC 3 , is the arithmetic 
sum of ACt and AC 2 . Because AC 3 is negative, the overall reaction is 
exergonic and proceeds spontaneously. 
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equilibrium the system is initially. Each compound 
involved in a chemical reaction contains a certain 
amount of potential energy ; related to the kind and num- 
ber of its bonds. In reactions that occur spontaneously, 
the products have less free energy than the reactants, 
thus the reaction releases free energy, which is then 
available to do work. Such reactions are exergonic; the 
decline in free energy from reactants to products is ex- 
pressed as a negative value. Endergonic reactions re- 
quire an input of energy, and their AG values are 
positive. As in mechanical processes, only part of the en- 
ergy released in exergonic chemical reactions can be 
used to accomplish work. In living systems some energy 
is dissipated as heat or lost to increasing entropy. 

In biological organisms, just as in the mechanical 
example in Figure l-26a, an exergonic reaction can be 
coupled to an endergonic reaction to drive otherwise 
unfavorable reactions. Figure 1-2 6b (a type of graph 
called a reaction coordinate diagram) illustrates this 
principle for the conversion of glucose to glucose 
6-phosphate, the first step in the pathway for oxida- 
tion of glucose. The simplest way to produce glucose 
6-phosphate would be: 

Reaction 1: Glucose + Pi > glucose 6-phosphate 

(endergonic; AGi is positive) 

(Here, Pi is an abbreviation for inorganic phosphate, 
HPC>4~. Don't be concerned about the structure of these 
compounds now; we describe them in detail later in the 
book.) This reaction does not occur spontaneously; AG : 
is positive. A second, very exergonic reaction can occur 
in all cells: 

Reaction 2: ATP > ADP + P d 

(exergonic; AG 2 is negative) 

These two chemical reactions share a common interme- 
diate, P i? which is consumed in reaction 1 and produced 
in reaction 2. The two reactions can therefore be cou- 
pled in the form of a third reaction, which we can write 
as the sum of reactions 1 and 2, with the common inter- 
mediate, P b omitted from both sides of the equation: 

Reaction 3: Glucose + ATP > 

glucose 6-phosphate + ADP 

Because more energy is released in reaction 2 than is 
consumed in reaction 1, the free-energy change for re- 
action 3, AG 3 , is negative, and the synthesis of glucose 
6-phosphate can therefore occur by reaction 3. 

The coupling of exergonic and endergonic reactions 
through a shared intermediate is central to the energy 
exchanges in living systems. As we shall see, reactions 
that break down ATP (such as reaction 2 in Fig. l-26b) 
release energy that drives many endergonic processes 
in cells. ATP breakdown in cells is exergonic because all 
living cells maintain a concentration of ATP far 
above its equilibrium concentration. It is this disequi- 
librium that allows ATP to serve as the major carrier of 
chemical energy in all cells. 



K n and AG ° Are Measures of a Reaction's Tendency to 
Proceed Spontaneously 

The tendency of a chemical reaction to go to completion 
can be expressed as an equilibrium constant. For the re- 
action in which a moles of A react with b moles of B to 
give c moles of C and d moles of D, 

aA + bB ► cC + dD 

the equilibrium constant, K m , is given by 

_ lC] c eq [D]* q 
eq ~ [A]| q [B] h eq 

where [A] eq is the concentration of A, [B] eq the concen- 
tration of B, and so on, when the system has reached 
equilibrium. A large value of iT eq means the reaction 
tends to proceed until the reactants are almost com- 
pletely converted into the products. 

Gibbs showed that AG (the actual free-energy 
change) for any chemical reaction is a function of the 
standard free-energy change, \G° — a constant that 
is characteristic of each specific reaction — and a term 
that expresses the initial concentrations of reactants 
and products: 

AG = AG 0 + RT\n ^ d-D 
[A]f [B]i 

where [A]i is the initial concentration of A, and so forth; 
R is the gas constant; and Tis the absolute temperature. 

AG is a measure of the distance of a system from its 
equilibrium position. When a reaction has reached equi- 
librium, no driving force remains and it can do no work: 
AG = 0. For this special case, [A]j = [A] eq , and so on, for 
all reactants and products, and 

m _ ^ p& 

[A]f [Btf [A]| q [B]* q 

Substituting 0 for AG and# eq for [C]f [D]f/[A]f [B]j> in 
Equation 1-1, we obtain the relationship 

AG 0 = -RTlnK eci 

from which we see that AG° is simply a second way (be- 
sides K eq ) of expressing the driving force on a reaction. 
Because K eq is experimentally measurable, we have a 
way of determining AG°, the thermodynamic constant 
characteristic of each reaction. 

The units of AG° and AG are joules per mole (or 
calories per mole) . When if eq > > 1 , AG 0 is large and neg- 
ative; when K eq « 1, AG° is large and positive. From a 
table of experimentally determined values of either K eq 
or AG°, we can see at a glance which reactions tend to go 
to completion and which do not. 

One caution about the interpretation of AG°: ther- 
modynamic constants such as this show where the final 
equilibrium for a reaction lies but tell us nothing about 
how fast that equilibrium will be achieved. The rates of 
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reactions are governed by the parameters of kinetics, a 
topic we consider in detail in Chapter 6. 

Enzymes Promote Sequences of Chemical Reactions 

All biological macromolecules are much less thermody- 
namically stable than their monomeric sub units, yet 
they are kinetically stable: their uncatalyzed break- 
down occurs so slowly (over years rather than seconds) 
that, on a time scale that matters for the organism, 
these molecules are stable. Virtually every chemical re- 
action in a cell occurs at a significant rate only because 
of the presence of enzymes — biocatalysts that, like all 
other catalysts, greatly enhance the rate of specific 
chemical reactions without being consumed in the 
process. 

The path from reactant (s) to product (s) almost in- 
variably involves an energy barrier, called the activation 
barrier (Fig. 1-27), that must be surmounted for any re- 
action to proceed. The breaking of existing bonds and 
formation of new ones generally requires, first, a distor- 
tion of the existing bonds to create a transition state of 
higher free energy than either reactant or product. The 
highest point in the reaction coordinate diagram repre- 
sents the transition state, and the difference in energy 
between the reactant in its ground state and in its transi- 
tion state is the activation energy, AG*. An enzyme 
catalyzes a reaction by providing a more comfortable fit 
for the transition state: a surface that complements the 
transition state in stereochemistry, polarity, and charge. 
The binding of enzyme to the transition state is exer- 
gonic, and the energy released by this binding reduces 
the activation energy for the reaction and greatly in- 
creases the reaction rate. 

A further contribution to catalysis occurs when two 
or more reactants bind to the enzyme's surface close to 
each other and with stereospecific orientations that fa- 
vor the reaction. This increases by orders of magnitude 
the probability of productive collisions between reac- 
tants. As a result of these factors and several others, dis- 
cussed in Chapter 6, enzyme-catalyzed reactions 
commonly proceed at rates greater than 10 12 times 
faster than the uncatalyzed reactions. (That is a million 
million times faster!) 

Cellular catalysts are, with a few notable excep- 
tions, proteins. (Some RNA molecules have enzymatic 
activity, as discussed in Chapters 26 and 27.) Again with 
a few exceptions, each enzyme catalyzes a specific reac- 
tion, and each reaction in a cell is catalyzed by a differ- 
ent enzyme. Thousands of different enzymes are 
therefore required by each cell. The multiplicity of en- 
zymes, their specificity (the ability to discriminate be- 
tween reactants), and their susceptibility to regulation 
give cells the capacity to lower activation barriers selec- 
tively. This selectivity is crucial for the effective regula- 
tion of cellular processes. By allowing specific reactions 
to proceed at significant rates at particular times, en- 
zymes determine how matter and energy are channeled 
into cellular activities. 
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FIGURE 1-27 Energy changes during a chemical reaction. An activa- 
tion barrier, representing the transition state (see Chapter 6), must be 
overcome in the conversion of reactants (A) into products (B), even 
though the products are more stable than the reactants, as indicated by 
a large, negative free-energy change (AC). The energy required to over- 
come the activation barrier is the activation energy (AC*). Enzymes 
catalyze reactions by lowering the activation barrier. They bind the 
transition-state intermediates tightly, and the binding energy of this in- 
teraction effectively reduces the activation energy from AC* uncat (blue 
curve) to AG* cat (red curve). (Note that activation energy is not related 
to free-energy change, AC ) 

The thousands of enzyme-catalyzed chemical reac- 
tions in cells are functionally organized into many se- 
quences of consecutive reactions, called pathways, in 
which the product of one reaction becomes the reactant 
in the next. Some pathways degrade organic nutrients 
into simple end products in order to extract chemical 
energy and convert it into a form useful to the cell; to- 
gether these degradative, free-energy-yielding reactions 
are designated catabolism. The energy released by 
catabolic reactions drives the synthesis of ATP. As a re- 
sult, the cellular concentration of ATP is far above its 
equilibrium concentration, so that AG for ATP break- 
down is large and negative. Similarly, metabolism results 
in the production of the reduced electron carriers 
NADH and NADPH, both of which can donate electrons 
in processes that generate ATP or drive reductive steps 
in biosynthetic pathways. 

Other pathways start with small precursor mole- 
cules and convert them to progressively larger and more 
complex molecules, including proteins and nucleic 
acids. Such synthetic pathways, which invariably re- 
quire the input of energy, are collectively designated an- 
abolism. The overall network of enzyme-catalyzed 
pathways constitutes cellular metabolism. ATP (and 
the energetically equivalent nucleoside triphosphates 
cytidine triphosphate (CTP), uridine triphosphate 
(UTP), and guanosine triphosphate (GTP)) is the 
connecting link between the catabolic and anabolic 
components of this network (shown schematically in 
Fig. 1-28). The pathways of enzyme-catalyzed reac- 
tions that act on the main constituents of cells — 
proteins, fats, sugars, and nucleic acids — are virtually 
identical in all living organisms. 
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FIGURE !~2S The central roles of ATP and NAD(P)H in metabolism. 

ATP is the shared chemical intermediate linking energy-releasing and 
energy-consuming cellular processes Its role in the cell is analogous 
to that of money in an economy: it is "earned/produced" in exergonic 
reactions and "spent/consumed" in endergonic ones. NAD(P)H (nicoti- 
namide adenine dinucleotide (phosphate)) is an electron-carrying co- 
factor that collects electrons from oxidative reactions and then donates 
them in a wide variety of reduction reactions in biosynthesis. Present in 
relatively low concentrations, these cofactors essential to anabolic re- 
actions must be constantly regenerated by catabolic reactions. 



Metabolism Is Regulated to Achieve 
Balance and Economy 

Not only do living cells simultaneously synthesize thou- 
sands of different kinds of carbohydrate, fat, protein, 
and nucleic acid molecules and their simpler subunits, 
but they do so in the precise proportions required by the 
cell under any given circumstance. For example, during 
rapid cell growth the precursors of proteins and nucleic 
acids must be made in large quantities, whereas in non- 
growing cells the requirement for these precursors is 
much reduced. Key enzymes in each metabolic pathway 
are regulated so that each type of precursor molecule is 



produced in a quantity appropriate to the current re- 
quirements of the cell. 

Consider the pathway in E. coli that leads to the 
synthesis of the amino acid isoleucine, a constituent of 
proteins. The pathway has five steps catalyzed by five 
different enzymes (A through F represent the interme- 
diates in the pathway): 



® 



-> c 



D 



Threonine 



> F 

Isoleucine 



If a cell begins to produce more isoleucine than it needs 
for protein synthesis, the unused isoleucine accumu- 
lates and the increased concentration inhibits the cat- 
alytic activity of the first enzyme in the pathway, 
immediately slowing the production of isoleucine. Such 
feedback inhibition keeps the production and utiliza- 
tion of each metabolic intermediate in balance. 
(Throughout the book, we will use ® to indicate inhibi- 
tion of an enzymatic reaction.) 

Although the concept of discrete pathways is an im- 
portant tool for organizing our understanding of metab- 
olism, it is an oversimplification. There are thousands of 
metabolic intermediates in a cell, many of which are part 
of more than one pathway. Metabolism would be better 
represented as a web of interconnected and interde- 
pendent pathways. A change in the concentration of any 
one metabolite would start a ripple effect, influencing 
the flow of materials through other pathways. The task 
of understanding these complex interactions among 
intermediates and pathways in quantitative terms is 
daunting, but the new emphasis on systems biology, 
discussed in Chapter 15, has begun to offer important 
insights into the overall regulation of metabolism. 

Cells also regulate the synthesis of their own cata- 
lysts, the enzymes, in response to increased or dimin- 
ished need for a metabolic product; this is the substance 
of Chapter 28. The expression of genes (the translation 
from information in DNA to active protein in the cell) and 
synthesis of enzymes are other layers of metabolic con- 
trol in the cell. All layers must be taken into account when 
describing the overall control of cellular metabolism. 

SUMMARY 1.3 Physical Foundations 

« Living cells are open systems, exchanging matter 
and energy with their surroundings, extracting and 
channeling energy to maintain themselves in a 
dynamic steady state distant from equilibrium. 
Energy is obtained from sunlight or fuels by 
converting the energy from electron flow into 
the chemical bonds of ATP. 

m The tendency for a chemical reaction to proceed 
toward equilibrium can be expressed as the 
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free-energy change, AG, which has two components: 
enthalpy change, AH, and entropy change, AS. 
These variables are related by the equation AG = 
AH- TAS. 

■ When AG of a reaction is negative , the reaction is 
exergonic and tends to go toward completion; when 
AG is positive, the reaction is endergonic and tends 
to go in the reverse direction. When two reactions 
can be summed to yield a third reaction, the AG for 
this overall reaction is the sum of the AGs of the 
two separate reactions. 

■ The reactions converting ATP to and ADP or to 

AMP and PPi are highly exergonic (large negative 
AG) . Many endergonic cellular reactions are driven 
by coupling them, through a common intermediate, 
to these highly exergonic reactions. 

■ The standard free-energy change for a reaction, 
AG°, is a physical constant that is related to the 
equilibrium constant by the equation AG° = 
-RT In iT eq . 

■ Most cellular reactions proceed at useful rates only 
because enzymes are present to catalyze them. 
Enzymes act in part by stabilizing the transition state, 
reducing the activation energy, AG*, and increasing 
the reaction rate by many orders of magnitude. The 
catalytic activity of enzymes in cells is regulated. 

■ Metabolism is the sum of many interconnected 
reaction sequences that interconvert cellular 
metabolites. Each sequence is regulated to provide 
what the cell needs at a given time and to expend 
energy only when necessary. 



1,4 Genetic Foundations 

Perhaps the most remarkable property of living cells and 
organisms is their ability to reproduce themselves for 
countless generations with nearly perfect fidelity. This 
continuity of inherited traits implies constancy, over mil- 
lions of years, in the structure of the molecules that con- 
tain the genetic information. Very few historical records 
of civilization, even those etched in copper or carved in 
stone (Fig. 1-29), have survived for a thousand years. 
But there is good evidence that the genetic instructions 
in living organisms have remained nearly unchanged 
over very much longer periods; many bacteria have 
nearly the same size, shape, and internal structure and 
contain the same kinds of precursor molecules and en- 
zymes as bacteria that lived nearly four billion years ago. 
This continuity of structure and composition is the result 
of continuity in the structure of the genetic material. 

Among the seminal discoveries in biology in the 
twentieth century were the chemical nature and the 
three-dimensional structure of the genetic material, de- 
oxyribonucleic acid, DNA. The sequence of the 
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FIGURE 1-29 Two ancient scripts, (a) The Prism of Sennacherib, in- 
scribed in about 700 bce, describes in characters of the Assyrian lan- 
guage some historical events during the reign of King Sennacherib. The 
Prism contains about 20,000 characters, weighs about 50 kg, and has 
survived almost intact for about 2,700 years, (b) The single DNA mole- 
cule of the bacterium E. coli, leaking out of a disrupted cell, is hundreds 
of times longer than the cell itself and contains all the encoded infor- 
mation necessary to specify the cell's structure and functions. The bac- 
terial DNA contains about 4.6 million characters (nucleotides), weighs 
less than 10" 10 g, and has undergone only relatively minor changes 
during the past several million years. (The yellow spots and dark specks 
in this colorized electron micrograph are artifacts of the preparation.) 

monomeric subunits, the nucleotides (strictly, deoxyri- 
bonucleotides, as discussed below), in this linear poly- 
mer encodes the instructions for forming all other 
cellular components and provides a template for the 
production of identical DNA molecules to be distributed 
to progeny when a cell divides. The perpetuation of a bi- 
ological species requires that its genetic information be 
maintained in a stable form, expressed accurately in the 
form of gene products, and reproduced with a minimum 
of errors. Effective storage, expression, and reproduc- 
tion of the genetic message define individual species, 
distinguish them from one another, and assure their 
continuity over successive generations. 

Genetic Continuity Is Vested in Single DNA Molecules 

DNA is a long, thin, organic polymer, the rare molecule 
that is constructed on the atomic scale in one dimen- 
sion (width) and the human scale in another (length: a 
molecule of DNA can be many centimeters long). A hu- 
man sperm or egg, carrying the accumulated hereditary 
information of billions of years of evolution, transmits 
this inheritance in the form of DNA molecules, in which 
the linear sequence of covalently linked nucleotide sub- 
units encodes the genetic message. 

Usually when we describe the properties of a chem- 
ical species, we describe the average behavior of a very 
large number of identical molecules. While it is difficult 
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to predict the behavior of any single molecule in a col- 
lection of, say, a picomole (about 6 X 10 11 molecules) of 
a compound, the average behavior of the molecules is 
predictable because so many molecules enter into the 
average. Cellular DNA is a remarkable exception. The 
DNA that is the entire genetic material of E. coli is a 
single molecule containing 4.64 million nucleotide 
pairs. That single molecule must be replicated perfectly 
in every detail if an E. coli cell is to give rise to identical 
progeny by cell division; there is no room for averaging 
in this process! The same is true for all cells. A human 
sperm brings to the egg that it fertilizes just one mole- 
cule of DNA in each of its 23 different chromosomes, to 
combine with just one DNA molecule in each correspon- 
ding chromosome in the egg. The result of this union is 
very highly predictable: an embryo with all of its -25,000 
genes, constructed of 3 billion nucleotide pairs, intact. 
An amazing chemical feat! 

WORKED EXAMPLE 1-1 Fidelity of DNA Replication 

Calculate the number of times the DNA of a modern E. 
coli cell has been copied accurately since its earliest 
bacterial precursor cell arose about 3.5 billion years ago. 
Assume for simplicity that over this time period E. coli 
has undergone, on average, one cell division every 12 
hours (this is an overestimate for modern bacteria, but 
probably an underestimate for ancient bacteria). 

Solution: 

(1 generation/12 hr)(24 hr/d)(365 d/yr)(3.5 X 10 9 yr) 

= 2.6 X 10 12 generations. 



A single page of this book contains about 5,000 
characters, so the entire book contains about 5 million 
characters. The chromosome of E. coli also contains 
about 5 million characters (base pairs). If you made a 
hand-written copy of this book and then passed on the 
copy to a classmate to copy by hand, and this copy were 
then copied by a third classmate, and so on, how closely 
would each successive copy of the book resemble the 
original? Now, imagine the textbook that would result 
from hand-copying this one a few trillion times! 

The Structure of DNA Allows for Its Replication and Repair 
with Near-Perfect Fidelity 

The capacity of living cells to preserve their genetic ma- 
terial and to duplicate it for the next generation results 
from the structural complementarity between the two 
strands of the DNA molecule (Fig. 1-30). The basic 
unit of DNA is a linear polymer of four different 
monomeric subunits, deoxyribonucleotides, arranged 
in a precise linear sequence. It is this linear sequence 
that encodes the genetic information. Two of these poly- 
meric strands are twisted about each other to form the 
DNA double helix, in which each deoxyribonucleotide 
in one strand pairs specifically with a complementary 



deoxyribonucleotide in the opposite strand. Before a 
cell divides, the two DNA strands separate and each 
serves as a template for the synthesis of a new, comple- 
mentary strand, generating two identical double-helical 
molecules, one for each daughter cell. If, at any time, 
one strand is damaged, continuity of information is as- 
sured by the information present in the other strand, 
which can act as a template for repair of the damage. 
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FIGURE 1-30 Complementarity between the two strands of DNA. 

DNA is a linear polymer of covalently joined deoxyribonucleotides, of 
four types: deoxyadenylate (A), deoxyguanylate (G), deoxycytidylate 
(C), and deoxythymidylate (T). Each nucleotide, with its unique three- 
dimensional structure, can associate very specifically but noncova- 
lently with one other nucleotide in the complementary chain: A always 
associates with T, and G with C. Thus, in the double-stranded DNA 
molecule, the entire sequence of nucleotides in one strand is comple- 
mentary to the sequence in the other. The two strands, held together by 
hydrogen bonds (represented here by vertical light blue lines) between 
each pair of complementary nucleotides, twist about each other to 
form the DNA double helix. In DNA replication, the two strands (blue) 
separate and two new strands (pink) are synthesized, each with a se- 
quence complementary to one of the original strands. The result is two 
double-helical molecules, each identical to the original DNA. 



The Linear Sequence in DNA Encodes Proteins with Three- 
Dimensional Structures 

The information in DNA is encoded in its linear (one- 
dimensional) sequence of deoxyribonucleotide subunits, 
but the expression of this information results in a three-di- 
mensional cell. This change from one to three dimensions 
occurs in two phases. A linear sequence of deoxyribonu- 
cleotides in DNA codes (through an intermediary, RNA) 
for the production of a protein with a corresponding linear 
sequence of amino acids (Fig. 1-31). The protein folds 
into a particular three-dimensional shape, determined by 
its amino acid sequence and stabilized primarily by nonco- 
valent interactions. Although the final shape of the folded 
protein is dictated by its amino acid sequence, the folding 
is aided by "molecular chaperones" (see Fig. 4-29). The 
precise three-dimensional structure, or native confor- 
mation, of the protein is crucial to its function. 

Once in its native conformation, a protein may asso- 
ciate noncovalently with other macromolecules (other 
proteins, nucleic acids, or lipids) to form supramolecular 
complexes such as chromosomes, ribosomes, and mem- 
branes. The individual molecules of these complexes 
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FIGURE 1-31 DNA to RNA to protein to enzyme (hexokinase). The lin- 
ear sequence of deoxyribonucleotides in the DNA (the gene) that en- 
codes the protein hexokinase is first transcribed into a ribonucleic acid 
(RNA) molecule with the complementary ribonucleotide sequence. The 
RNA sequence (messenger RNA) is then translated into the linear pro- 
tein chain of hexokinase, which folds into its native three-dimensional 
shape, most likely aided by molecular chaperones. Once in its native 
form, hexokinase acquires its catalytic activity: it can catalyze the phos- 
phorylation of glucose, using ATP as the phosphoryl group donor. 
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have specific, high-affinity binding sites for each other, 
and within the cell they spontaneously self-assemble into 
functional complexes. 

Although protein sequences carry all necessary in- 
formation for achieving their native conformation, accu- 
rate folding and self-assembly also require the right 
cellular environment — pH, ionic strength, metal ion 
concentrations, and so forth. Thus the DNA sequence 
alone is not enough to dictate the formation of a cell. 

SUMMARY 1.4 Genetic Foundations 

■ Genetic information is encoded in the linear 
sequence of four types of deoxyribonucleotides 
in DNA. 

■ The double-helical DNA molecule contains an 
internal template for its own replication and repair. 

■ The linear sequence of amino acids in a protein, 
which is encoded in the DNA of the gene for that 
protein, produces a protein's unique three- 
dimensional structure — a process also dependent 
on environmental conditions. 

■ Individual macromolecules with specific affinity 
for other macromolecules self-assemble into 
supramolecular complexes. 

1.5 Evolutionary Foundations 

Nothing in biology makes sense except in the light of 
evolution. 

— Theodosius Dobzhansky, The American Biology Teacher, 

March 1973 

Great progress in biochemistry and molecular biology 
in recent decades has amply confirmed the validity of 
Dobzhansky's striking generalization. The remarkable 
similarity of metabolic pathways and gene sequences 
across the phyla argues strongly that all modern organ- 
isms are derived from a common evolutionary progeni- 
tor by a series of small changes (mutations), each of 
which conferred a selective advantage to some organism 
in some ecological niche. 

Changes in the Hereditary Instructions Allow Evolution 

Despite the near-perfect fidelity of genetic replication, 
infrequent, unrepaired mistakes in the DNA replication 
process lead to changes in the nucleotide sequence of 
DNA, producing a genetic mutation (Fig. 1-32) and 
changing the instructions for a cellular component. In- 
correctly repaired damage to one of the DNA strands 
has the same effect. Mutations in the DNA handed down 
to offspring — that is, mutations carried in the reproduc- 
tive cells — may be harmful or even lethal to the new or- 
ganism or cell; they may, for example, cause the 
synthesis of a defective enzyme that is not able to cat- 
alyze an essential metabolic reaction. Occasionally, 
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FIGURE 1-32 Gene duplication and mutation: one path to 
generate new enzymatic activities. In this example, the 
single hexokinase gene in a hypothetical organism might 
occasionally by accident, be copied twice during DNA 
replication, such that the organism has two full copies of 
the gene, one of which is superfluous. Over many genera- 
tions, as the DNA with two hexokinase genes is repeatedly 
duplicated, rare mistakes occur, leading to changes in the 
nucleotide sequence of the superfluous gene and thus of 
the protein that it encodes. In a few very rare cases, the 
protein produced from this mutant gene is altered so that it 
can bind a new substrate — galactose in our hypothetical 
case. The cell containing the mutant gene has acquired a 
new capability (metabolism of galactose), which may allow 
it to survive in an ecological niche that provides galactose 
but not glucose. If no gene duplication precedes mutation, 
the original function of the gene product is lost. 



however, a mutation better equips an organism or cell 
to survive in its environment. The mutant enzyme might 
have acquired a slightly different specificity, for exam- 
ple, so that it is now able to use some compound that the 
cell was previously unable to metabolize. If a population 
of cells were to find itself in an environment where that 
compound was the only or the most abundant available 
source of fuel, the mutant cell would have a selective ad- 
vantage over the other, unmutated (wild-type) cells in 
the population. The mutant cell and its progeny would 
survive and prosper in the new environment, whereas 
wild-type cells would starve and be eliminated. This is 
what Darwin meant by "survival of the fittest under se- 
lective pressure" — the process of natural selection. 

Occasionally, a second copy of a whole gene is intro- 
duced into the chromosome as a result of defective 
replication of the chromosome. The second copy is su- 
perfluous, and mutations in this gene will not be delete- 
rious; it becomes a means by which the cell may evolve, 
by producing a new gene with a new function while re- 
taining the original gene and gene function. Seen in this 
light, the DNA molecules of modern organisms are his- 
torical documents, records of the long journey from the 
earliest cells to modern organisms. The historical ac- 
counts in DNA are not complete, however; in the course 
of evolution, many mutations must have been erased or 
written over. But DNA molecules are the best source of 
biological history that we have. The frequency of errors 
in DNA replication represents a balance between too 
many errors, which would yield nonviable daughter 



cells, and too few, which would prevent the genetic vari- 
ation that allows survival of mutant cells in new ecologi- 
cal niches. 

Several billion years of adaptive selection have re- 
fined cellular systems to take maximum advantage of 
the chemical and physical properties of available raw 
materials. Chance genetic variations in individuals in a 
population, combined with natural selection, have re- 
sulted in the evolution of today's enormous variety of or- 
ganisms, each adapted to its particular ecological niche. 

Biomolecules First Arose by Chemical Evolution 

In our account thus far we have passed over the first chap- 
ter of the story of evolution: the appearance of the first liv- 
ing cell. Apart from their occurrence in living organisms, 
organic compounds, including the basic biomolecules 
such as amino acids and carbohydrates, are found in only 
trace amounts in the Earth's crust, the sea, and the at- 
mosphere. How did the first living organisms acquire their 
characteristic organic building blocks? According to one 
hypothesis, these compounds were created by the effects 
of powerful atmospheric forces — ultraviolet irradiation, 
lightning, or volcanic eruptions — on the gases in the pre- 
biotic Earth's atmosphere, and on inorganic solutes in su- 
perheated thermal vents deep in the ocean. 

This hypothesis was tested in a classic experiment 
on the abiotic (nonbiological) origin of organic biomole- 
cules carried out in 1953 by Stanley Miller in the labora- 
tory of Harold Urey. Miller subjected gaseous mixtures 
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FIGURE 1-33 Abiotic production of biomolecules. Spark-discharge 
apparatus of the type used by Miller and Urey in experiments demon- 
strating abiotic formation of organic compounds under primitive at- 
mospheric conditions. After subjection of the gaseous contents of the 
system to electrical sparks, products were collected by condensation. 
Biomolecules such as amino acids were among the products. 

such as those presumed to exist on the prebiotic Earth, 
including NH 3) CH 4 , H 2 0, and H 2) to electrical sparks 
produced across a pair of electrodes (to simulate light- 
ning) for periods of a week or more, then analyzed the 
contents of the closed reaction vessel (Fig. 1-33). The 
gas phase of the resulting mixture contained CO and 
C0 2 , as well as the starting materials. The water phase 
contained a variety of organic compounds, including 
some amino acids, hydroxy acids, aldehydes, and hydro- 
gen cyanide (HCN). This experiment established the 
possibility of abiotic production of biomolecules in rela- 
tively short times under relatively mild conditions. 

More refined laboratory experiments have provided 
good evidence that many of the chemical components of 
living cells, including polypeptides and RNA-like mole- 
cules, can form under these conditions. Polymers of 
RNA can act as catalysts in biologically significant reac- 
tions (see Chapters 26 and 27), and RNA probably 
played a crucial role in prebiotic evolution, both as cata- 
lyst and as information repository. 



enzymes catalyze the replication and repair of nucleic 
acids. The mutual dependence of these two classes of 
biomolecules brings up the perplexing question: which 
came first, DNA or protein? 

The answer may be that they appeared about the 
same time, and RNA preceded them both. The discov- 
ery that RNA molecules can act as catalysts in their 
own formation suggests that RNA or a similar mole- 
cule may have been the first gene and the first cata- 
lyst. According to this scenario (Fig. 1-34), one of 
the earliest stages of biological evolution was the 
chance formation, in the primordial soup, of an RNA 
molecule that could catalyze the formation of other 
RNA molecules of the same sequence — a self-replicat- 
ing, self-perpetuating RNA. The concentration of a 
self-replicating RNA molecule would increase expo- 
nentially, as one molecule formed two, two formed 
four, and so on. The fidelity of self-replication was pre- 
sumably less than perfect, so the process would gen- 
erate variants of the RNA, some of which might be 
even better able to self-replicate. In the competition 
for nucleotides, the most efficient of the self-replicat- 
ing sequences would win, and less efficient replicators 
would fade from the population. 



Creation of prebiotic soup, including nucleotides, 
from components of Earth's primitive atmosphere 



Production of short RNA molecules 
with random sequences 



Selective replication of self-duplicating 
catalytic RNA segments 



Synthesis of specific peptides, 
catalyzed by RNA 



Increasing role of peptides in RNA replication; 
coevolution of RNA and protein 



Primitive translation system develops, 
with RNA genome and RNA-protein catalysts 



Genomic RNA begins to be copied into DNA 



RNA or Related Precursors May Have Been the First Genes 
and Catalysts 

In modern organisms, nucleic acids encode the genetic 
information that specifies the structure of enzymes, and 



DN 



DNA genome, translated on RNA-protein compl< 
( riboBome) with RNA and protein catalysts 




FIGURE 1-34 A possible "RNA world" scenario. 
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The division of function between DNA (genetic in- 
formation storage) and protein (catalysis) was, accord- 
ing to the "RNA world" hypothesis, a later development. 
New variants of self-replicating RNA molecules devel- 
oped, with the additional ability to catalyze the conden- 
sation of amino acids into peptides. Occasionally, the 
peptide (s) thus formed would reinforce the self- 
replicating ability of the RNA, and the pair — RNA mole- 
cule and helping peptide — could undergo further 
modifications in sequence, generating increasingly effi- 
cient self-replicating systems. The remarkable discovery 
that, in the protein-synthesizing machinery of modern 
cells (ribosomes), RNA molecules, not proteins, cat- 
alyze the formation of peptide bonds is consistent with 
the RNA world hypothesis. 

Some time after the evolution of this primitive 
protein-synthesizing system, there was a further devel- 
opment: DNA molecules with sequences complemen- 
tary to the self-replicating RNA molecules took over the 
function of conserving the "genetic" information, and 
RNA molecules evolved to play roles in protein synthe- 
sis. (We explain in Chapter 8 why DNA is a more stable 
molecule than RNA and thus a better repository of in- 
heritable information.) Proteins proved to be versatile 
catalysts and, over time, took over most of that function. 
Lipidlike compounds in the primordial soup formed rel- 
atively impermeable layers around self-replicating col- 
lections of molecules. The concentration of proteins and 
nucleic acids within these lipid enclosures favored the 
molecular interactions required in self-replication. 

Biological Evolution Began More Than Three and a Half 
Billion Years Ago 

Earth was formed about 4.6 billion years ago, and the 
first evidence of life dates to more than 3.5 billion years 
ago. In 1996, scientists working in Greenland found 
chemical evidence of life ("fossil molecules") from as far 
back as 3.85 billion years ago, forms of carbon embed- 
ded in rock that seem to have a distinctly biological ori- 
gin. Somewhere on Earth during its first billion years the 
first simple organism arose, capable of replicating its 
own structure from a template (RNA?) that was the first 
genetic material. Because the terrestrial atmosphere at 
the dawn of life was nearly devoid of oxygen, and be- 
cause there were few microorganisms to scavenge or- 
ganic compounds formed by natural processes, these 
compounds were relatively stable. Given this stability 
and eons of time, the improbable became inevitable: the 
organic compounds were incorporated into evolving 
cells to produce increasingly effective self-reproducing 
catalysts. The process of biological evolution had begun. 

The First Cell Probably Used Inorganic Fuels 

The earliest cells arose in a reducing atmosphere (there 
was no oxygen) and probably obtained energy from in- 
organic fuels, such as ferrous sulfide and ferrous car- 



bonate, both abundant on the early Earth. For example, 
the reaction 

FeS + H 2 S^FeS 2 + H 2 

yields enough energy to drive the synthesis of ATP or 
similar compounds. The organic compounds they re- 
quired may have arisen by the nonbiological actions of 
lightning or of heat from volcanoes or thermal vents in 
the sea on components of the early atmosphere: CO, 
C0 2 , N 2 , NH 3 , CH 4 , and such. An alternative source of 
organic compounds has been proposed: extraterrestrial 
space. In 2006, the Stardust space mission brought back 
tiny particles of dust from the tail of a comet; the dust 
contained a variety of organic compounds. 

Early unicellular organisms gradually acquired the 
ability to derive energy from compounds in their envi- 
ronment and to use that energy to synthesize more of 
their own precursor molecules, thereby becoming less 
dependent on outside sources. A very significant evolu- 
tionary event was the development of pigments capable 
of capturing the energy of light from the sun, which 
could be used to reduce, or "fix," C0 2 to form more com- 
plex, organic compounds. The original electron donor 
for these photosynthetic processes was probably H 2 S, 
yielding elemental sulfur or sulfate (SO|~) as the by- 
product; later cells developed the enzymatic capacity to 
use H 2 0 as the electron donor in photosynthetic reac- 
tions, eliminating 0 2 as waste. Cyanobacteria are the 
modern descendants of these early photosynthetic 
oxygen-producers . 

Because the atmosphere of Earth in the earliest 
stages of biological evolution was nearly devoid of 
oxygen, the earliest cells were anaerobic. Under these 
conditions, chemotrophs could oxidize organic com- 
pounds to C0 2 by passing electrons not to 0 2 but to 
acceptors such as S04~, in this case yielding H 2 S as 
the product. With the rise of 0 2 -producing photosyn- 
thetic bacteria, the atmosphere became progressively 
richer in oxygen — a powerful oxidant and deadly poi- 
son to anaerobes. Responding to the evolutionary 
pressure of what Lynn Margulis and Dorion Sagan 
have called the "oxygen holocaust," some lineages of 
microorganisms gave rise to aerobes that obtained en- 
ergy by passing electrons from fuel molecules to oxy- 
gen. Because the transfer of electrons from organic 
molecules to 0 2 releases a great deal of energy, aero- 
bic organisms had an energetic advantage over their 
anaerobic counterparts when both competed in an 
environment containing oxygen. This advantage 
translated into the predominance of aerobic organ- 
isms in 0 2 -rich environments. 

Modern bacteria and archaea inhabit almost every 
ecological niche in the biosphere, and there are organ- 
isms capable of using virtually every type of organic com- 
pound as a source of carbon and energy. Photosynthetic 
microbes in both fresh and marine waters trap solar en- 
ergy and use it to generate carbohydrates and all other 
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cell constituents, which are in turn used as food by other 
forms of life. The process of evolution continues — and, in 
rapidly reproducing bacterial cells, on a time scale that 
allows us to witness it in the laboratory. One approach to- 
ward producing a "protocell" in the laboratory involves 
determining the minimum number of genes necessary 
for life by examining the genomes of simple bacteria. The 
smallest known genome for a free-living bacterium is 
that of Mycobacterium genitalium, which comprises 
580,000 base pairs encoding 483 genes. 

Eukaryotic Ceils Evolved from Simpler Precursors in 
Several Stages 

Starting about 1.5 billion years ago, the fossil record 
begins to show evidence of larger and more complex or- 
ganisms, probably the earliest eukaryotic cells (Fig. 1-35). 
Details of the evolutionary path from non-nucleated to 
nucleated cells cannot be deduced from the fossil record 
alone, but morphological and biochemical comparisons 
of modern organisms have suggested a sequence of 
events consistent with the fossil evidence. 

Three major changes must have occurred. First, as 
cells acquired more DNA, the mechanisms required to 
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FIGURE 1-35 Landmarks in the evolution of life on Earth. 



fold it compactly into discrete complexes with specific 
proteins and to divide it equally between daughter cells 
at cell division became more elaborate. Specialized pro- 
teins were required to stabilize folded DNA and to pull 
the resulting DNA-protein complexes (chromosomes') 
apart during cell division. Second, as cells became 
larger, a system of intracellular membranes developed, 
including a double membrane surrounding the DNA. 
This membrane segregated the nuclear process of RNA 
synthesis on a DNA template from the cytoplasmic 
process of protein synthesis on ribosomes. Finally, early 
eukaryotic cells, which were incapable of photosynthe- 
sis or aerobic metabolism, enveloped aerobic bacteria 
or photosynthetic bacteria to form endosymbiotic as- 
sociations that eventually became permanent (Fig. 
1-36). Some aerobic bacteria evolved into the mito- 
chondria of modern eukaryotes, and some photosyn- 
thetic cyanobacteria became the plastids, such as the 
chloroplasts of green algae, the likely ancestors of mod- 
ern plant cells. 

At some later stage of evolution, unicellular organ- 
isms found it advantageous to cluster together, thereby 
acquiring greater motility, efficiency, or reproductive 
success than their free-living single-celled competitors. 
Further evolution of such clustered organisms led to 
permanent associations among individual cells and 
eventually to specialization within the colony — to cellu- 
lar differentiation. 

The advantages of cellular specialization led to the 
evolution of increasingly complex and highly differenti- 
ated organisms, in which some cells carried out the 
sensory functions, others the digestive, photosynthetic, 
or reproductive functions, and so forth. Many modern 
multicellular organisms contain hundreds of different 
cell types, each specialized for a function that supports 
the entire organism. Fundamental mechanisms that 
evolved early have been further refined and embel- 
lished through evolution. The same basic structures and 
mechanisms that underlie the beating motion of cilia in 
Paramecium and of flagella in Chlamydomonas are 
employed by the highly differentiated vertebrate sperm 
cell, for example. 



Molecular Anatomy Reveals Evolutionary 

Relationships 

Biochemists now have an enormously rich, ever increas- 
ing treasury of information on the molecular anatomy of 
cells that they can use to analyze evolutionary relation- 
ships and refine evolutionary theory. The sequence of 
the genome, the complete genetic endowment of an or- 
ganism, has been determined for hundreds of bacteria 
and more than 40 archaea and for growing numbers of 
eukaryotic microorganisms, including Saccharomyces 
cerevisiae and Plasmodium sp.; plants, including Ara- 
bidopsis thaliana and rice; and multicellular animals, 
including Caenorhabditis elegans (a roundworm), 
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Symbiotic system 
can now carry out 
aerobic catabolism. 
Some bacterial genes 
move to the nucleus, 
and the bacterial 
endosymbionts become 
mitochondria. 




No nphotosynthetic 
eukaryote 




Ancestral anaerobic 
eukaryote 




Chli>rnp!a3t 



Aerobic bacterium 



Aerobic metabolism is 
efficient because fuel 
is oxidized to CO2. 



Photosynthetic 
cyanobacterium 

Light energy is used 
to synthesize 
biomolecules from CO2 . 



Engulfed 
cyanobacterium 
becomes an 
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FIGURE 1-36 Evolution of eukaryotes through endosymbiosis. The 

earliest eukaryote, an anaerobe, acquired endosymbiotic purple bac- 
teria (yellow), which carried with them their capacity for aerobic ca- 
tabolism and became, over time, mitochondria. When photosynthetic 



cyanobacteria (green) subsequently became endosymbionts of some 
aerobic eukaryotes, these cells became the photosynthetic precursors 
of modern green algae and plants. 



Drosophila melanogaster (the fruit fly), mouse, rat, 
dog, chimpanzee, and Homo sapiens (Table 1-2). With 
such sequences in hand, detailed and quantitative com- 
parisons among species can provide deep insight into the 
evolutionary process. Thus far, the molecular phylogeny 
derived from gene sequences is consistent with, but in 
many cases more precise than, the classical phylogeny 
based on macroscopic structures. Although organisms 
have continuously diverged at the level of gross anatomy, 
at the molecular level the basic unity of life is readily ap- 
parent; molecular structures and mechanisms are re- 
markably similar from the simplest to the most complex 
organisms. These similarities are most easily seen at the 
level of sequences, either the DNA sequences that en- 
code proteins or the protein sequences themselves. 

When two genes share readily detectable sequence 
similarities (nucleotide sequence in DNA or amino acid 
sequence in the proteins they encode), their sequences 
are said to be homologous and the proteins they encode 
are homologs If two homologous genes occur in the 
same species, they are said to be paralogous and their 
protein products are paralogs. Paralogous genes are 
presumed to have been derived by gene duplication fol- 
lowed by gradual changes in the sequences of both 
copies. Typically, paralogous proteins are similar not 
only in sequence but also in three-dimensional struc- 
ture, although they commonly have acquired different 
functions during their evolution. 



Two homologous genes (or proteins) found in dif- 
ferent species are said to be orthologous, and their pro- 
tein products are orthologs. Orthologs are commonly 
found to have the same function in both organisms, and 
when a newly sequenced gene in one species is found to 
be strongly orthologous with a gene in another, this gene 
is presumed to encode a protein with the same function 
in both species. By this means, the function of gene 
products can be deduced from the genomic sequence, 
without any biochemical characterization of the gene 
product. An annotated genome includes, in addition 
to the DNA sequence itself, a description of the likely 
function of each gene product, deduced from compar- 
isons with other genomic sequences and established 
protein functions. Sometimes, by identifying the path- 
ways (sets of enzymes) encoded in a genome, we can 
deduce from the genomic sequence alone the organism's 
metabolic capabilities. 

The sequence differences between homologous 
genes may be taken as a rough measure of the degree to 
which the two species have diverged during evolution — 
of how long ago their common evolutionary precursor 
gave rise to two lines with different evolutionary fates. 
The larger the number of sequence differences, the 
earlier the divergence in evolutionary history. One can 
construct a phylogeny (family tree) in which the evolu- 
tionary distance between any two species is represented 
by their proximity on the tree (Fig. l^r is an example). 
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A Few of the 


Many Organisms Whose Genomes Have Been Comple 




Organism 


Genome size 


Number of 




(nucleotide pairs) 


genes 


RiiVlfidiciil in tprpst 


Mycoplasma genitalium 


5.8 X 10 5 


4.8 X 10 2 


Smallest true organism 


Treponema pallidum 


1.1 x 10 6 


1.0 X 10 3 


Causes syphilis 


Borrelia burgdorferi 


9.1 x 10 5 


8.5 X 10 2 


Causes Lyme disease 


Helicobacter pylori 


1.7 X 10 6 


1.6 X 10 3 


Causes gastric ulcers 


Methanococcus jannaschii 


1.7 X 10 6 


1.7 X 10 3 


Archaean; grows at 85 °C! 


Haemophilus influenzae 


1.8 X 10 6 


1.6 X 10 3 


Causes bacterial influenza 


Archaeoglobusfulgidus* 


2.2 X 10 6 


2.4 X 10 3 


High-temperature methanogen 


Synechocystis sp. 


3.6 X 10 6 


3.2 X 10 3 


Cyanobacterium 


Bacillus subtilis 


4.2 X 10 6 


4.1 X 10 3 


Common soil bacterium 


Escherichia coli 


4.6 X 10 6 


4.4 X 10 3 


Some strains are human pathogens 


Saccharomyces cerevisiae 


1.2 X 10 7 


5.9 X 10 3 


Unicellular eukaryote 


Plasmodium falciparum 


2.3 X 10 7 


5.3 X 10 3 


Causes human malaria 


Caenorhabditis elegans 


1.0 X 10 s 


2.3 X 10 4 


Multicellular roundworm 


Anopheles gambiae 


2.3 X 10 8 


1.3 x 10 4 


Malaria vector 


Arabidopsis thaliana 


1.2 X 10 8 


3.2 X 10 4 


Model plant 


Oryza saliva 


3.9 x 10 8 


3.8 X 10 4 


Rice 


Drosophila melanogaster 


1.2 X 10 8 


2.0 X 10 4 


Laboratory fly ("fruit fly") 


Mus musculus domesticus 


2.6 X 10 9 


2.7 X 10 4 


Laboratory mouse 


Pan troglodytes 


3.1 X 10 9 


4.9 X 10 4 


Chimpanzee 


Homo sapiens 


3.1 X 10 9 


2.9 X 10 4 


Human 



Source: RefSeq page for each organism atwww.ncbi.nlm.nih.gov/genomes. 



In the course of evolution, new structures, processes, 
or regulatory mechanisms are acquired, reflections of the 
changing genomes of the evolving organisms. The 
genome of a simple eukaryote such as yeast should have 
genes related to formation of the nuclear membrane, 
genes not present in bacteria or archaea. The genome of 
an insect should contain genes that encode proteins in- 
volved in specifying insects' characteristic segmented 
body plan, genes not present in yeast. The genomes of all 
vertebrate animals should share genes that specify the 
development of a spinal column, and those of mammals 
should have unique genes necessary for the development 
of the placenta, a characteristic of mammals — and so on. 
Comparisons of the whole genomes of species in each 
phylum are leading to the identification of genes critical 
to fundamental evolutionary changes in body plan and 
development. 

Functional Genomics Shows the Allocations of Genes to 
Specific Cellular Processes 

When the sequence of a genome is fully detennined and 
each gene is assigned a function, molecular geneticists 
can group genes according to the processes (DNA syn- 
thesis, protein synthesis, generation of ATP, and so 
forth) in which they function and thus find what fraction 
of the genome is allocated to each of a cell's activities. 



The largest category of genes in E. coli, A. thaliana, 
and H. sapiens consists of genes of (as yet) unknown 
function, which make up more than 40% of the genes in 
each species. The transporters that move ions and small 
molecules across plasma membranes take up a signifi- 
cant proportion of the genes in all three species, more in 
the bacterium and plant than in the mammal (10% of 
the -4,400 genes of E. coli, -8% of the -32,000 genes of 
A. thaliana, and —4% of the —29,000 genes of H sapi- 
ens^}. Genes that encode the proteins and RNA required 
for protein synthesis make up 3% to 4% of the E. coli 
genome, but in the more complex cells of A. thaliana, 
more genes are needed for targeting proteins to their 
final location in the cell than are needed to synthesize 
those proteins (about 6% and 2% of the genome, re- 
spectively). In general, the more complex the organism, 
the greater the proportion of its genome that encodes 
genes involved in the regulation of cellular processes 
and the smaller the proportion dedicated to the basic 
processes themselves, such as ATP generation and pro- 
tein synthesis. 

Genomic Comparisons Have Increasing Importance in 
Human Biology and Medicine 

The genomes of chimpanzees and humans are 
99.9% identical, yet the differences between the 




36 The Foundations of Biochemistry 



two species are vast. The relatively few differences in 
genetic endowment must explain the possession of lan- 
guage by humans, the extraordinary athleticism of 
chimpanzees, and myriad other differences. Genomic 
comparison is allowing researchers to identify candidate 
genes linked to divergences in the developmental pro- 
grams of humans and the other primates and to the 
emergence of complex functions such as language. The 
picture will become clearer only as more primate 
genomes become available for comparison with the hu- 
man genome. 

Similarly, the differences in genetic endowment 
among humans are vanishingly small compared with the 
differences between humans and chimpanzees, yet these 
differences account for the variety among us — including 
differences in health and in susceptibility to chronic dis- 
eases. We have much to learn about the variability in se- 
quence among humans, and the availability of genomic 
information will almost certainly transform medical diag- 
nosis and treatment. We may expect that for some ge- 
netic diseases, palliatives will be replaced by cures; and 
that for disease susceptibilities associated with particu- 
lar genetic markers, forewarning and perhaps increased 
preventive measures will prevail. Today's "medical his- 
tory" may be replaced by a "medical forecast." ■ 

SUMMARY 1.5 Evolutionary Foundations 

■ Occasional inheritable mutations yield organisms 
that are better suited for survival in an ecological 
niche and progeny that are preferentially selected. 
This process of mutation and selection is the basis 
for the Darwinian evolution that led from the first 
cell to all modern organisms and explains the 
fundamental similarity of all living organisms. 

■ Life originated about 3.5 billion years ago, most 
likely with the formation of a membrane-enclosed 
compartment containing a self -replicating RNA 
molecule. The components for the first cell may 
have been produced near thermal vents at the 
bottom of the sea or by the action of lightning and 
high temperature on simple atmospheric molecules 
such as C0 2 and NH 3 . 

■ The catalytic and genetic roles played by the early 
RNA genome were, over time, taken over by 
proteins and DNA, respectively. 

■ Eukaryotic cells acquired the capacity for 
photosynthesis and oxidative phosphorylation from 
endosymbiotic bacteria. In multicellular organisms, 
differentiated cell types specialize in one or more 
of the functions essential to the organism's 
survival. 

■ Knowledge of the complete genomic nucleotide 
sequences of organisms from different branches 
of the phylogenetic tree provides insights into 
evolution and offers great opportunities in human 
medicine. 



Key Terms 



All terms are defined in the glossary. 



enthalpy, H 22 
free-energy change, AG 22 
endergonic reaction 22 
exergonic reaction 22 
equilibrium 23 
standard free-energy 

change, AG 0 24 
activation energy, AG* 
catabolism 25 
anabolism 25 
metabolism 25 
systems biology 
mutation 29 



25 



26 



metabolite 3 
nucleus 3 
genome 3 
eukaryote 3 
prokaryote 3 
bacteria 4 
archaea 4 
cytoskeleton 8 
stereoisomers 15 
configuration 1 5 
chiral center 16 
conformation 18 
entropy, S 22 
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molecular evolution. 

Gesteland, R.R, Atkins, J.F., & Cech, T.R. (eds). (2006) TheRNA 
World, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. 

A collection of stimulating reviews on a wide range of topics 
related to the RNA world scenario, 

Lazcano, A. & Miller, S.L. (1996) The origin and early evolution 
of life: prebiotic chemistry, the pre-RNA world, and time. Cell 85, 
793-798 

Brief review of developments in studies of the origin of life: 
primitive atmospheres, submarine vents, autotrophic versus 
heterotrophic origin, the RNA and pre-RNA worlds, and the time 
required for life to arise 

Margulis, L. (1996) Archaeal-eubacterial mergers in the origin of 
Eukarya: phylogenetic classification of life. Proc. Natl. Acad. Sci. 
USA 93, 1071-1076. 

The arguments for dividing all living creatures into five 
kingdoms: Monera, Protoctista, Fungi, Animalia, Plantae. (Compare 
Woese et al., 1990, below.) 

Margulis, L., Gould, S.J., Schwartz, K.V., & Margulis, A.R. 

(1998) Five Kingdoms: An Illustrated Guide to the Phyla of Life 
on Earth, 3rd edn, W. H. Freeman and Company, New York. 

Description of all major groups of organisms, beautifully 
illustrated with electron micrographs and drawings. 

Mayr, E. (1997) This Is Biology: The Science of the Living World, 
Belknap Press, Cambridge, MA. 

A history of the development of science, with special emphasis 
on Darwinian evolution, by an eminent Darwin scholar. 

Miller, S.L. (1987) Which organic compounds could have occurred on 
the prebiotic earth? Cold Spring Harb Symp. Quant. Biol. 52, 17-27. 

Summary of laboratory experiments on chemical evolution, by 
the person who did the original Miller-Urey experiment. 

Woese, C.R. (2002) On the evolution of cells. Proc Natl Acad Sci. 
USA 99, 8742-8747 
Short, clear review. 

Woese, C.R. (2004) A new biology for a new century. Microbiol. 
Mol Biol Rev 68, 173-186. 

Development of current thinking about cellular evolution by one 
of the seminal thinkers in the field. 

Woese, C.R., Kandler, O., & Wheelis, M.L. (1990) Towards a 
natural system of organisms: proposal for the domains Archaea, 
Bacteria, and Eucarya. Proc, Natl, Acad Sci USA 87, 4576^579. 

The arguments for dividing all living creatures into three 
domains. (Compare Margulis, 1996, above.) 

Problems 

Some problems related to the contents of the chapter follow. 
(In solving end-of-chapter problems, you may wish to refer to 
the tables on the inside of the back cover.) Each problem has 
a title for easy reference and discussion. For all numerical 
problems, keep in mind that answers should be expressed with 
the correct number of significant figures. Brief solutions are 
provided in Appendix B; expanded solutions are published in 
the Absolute Ultimate Study Guide to Accompany Princi- 
ples of Biochemistry. 

1. The Size of Cells and Their Components 

(a) If you were to magnify a cell 10,000 fold (typical of the 
magnification achieved using an electron microscope), how 
big would it appear? Assume you are viewing a "typical" eu- 
karyotic cell with a cellular diameter of 50 fxm. 
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(b) If this cell were a muscle cell (myocyte), how many 
molecules of actin could it hold? (Assume the cell is spherical 
and no other cellular components are present; actin molecules 
are spherical, with a diameter of 3.6 nm. The volume of a 
sphere is 4/3 7rr 3 .) 

(c) If this were a liver cell (hepatocyte) of the same di- 
mensions, how many mitochondria could it hold? (Assume the 
cell is spherical; no other cellular components are present; and 
the mitochondria are spherical, with a diameter of 1.5 ^tm.) 

(d) Glucose is the major energy-yielding nutrient for most 
cells. Assuming a cellular concentration of 1 mM (that is, 1 mil- 
limole/L), calculate how many molecules of glucose would be 
present in our hypothetical (and spherical) eukaryotic cell. 
(Avogadro's number, the number of molecules in 1 mol of a 
nonionized substance, is 6.02 X 10 23 .) 

(e) Hexokinase is an important enzyme in the metabolism 
of glucose. If the concentration of hexokinase in our eukary- 
otic cell is 20 juM, how many glucose molecules are present per 
hexokinase molecule? 

2. Components of E. coli E. coli cells are rod-shaped, 
about 2 jum long and 0.8 fxm in diameter. The volume of a 
cylinder is irr^h, where h is the height of the cylinder. 

(a) If the average density of E. coli (mostly water) is 
1.1 x 10 3 g/L, what is the mass of a single cell? 

(b) E. coli has a protective cell envelope 10 nm thick. 
What percentage of the total volume of the bacterium does the 
cell envelope occupy? 

(c) E. coli is capable of growing and multiplying rapidly 
because it contains some 15,000 spherical ribosomes (diame- 
ter 18 nm), which carry out protein synthesis. What percent- 
age of the cell volume do the ribosomes occupy? 

3. Genetic Information in E. coli DNA The genetic infor- 
mation contained in DNA consists of a linear sequence of cod- 
ing units, known as codons. Each codon is a specific sequence 
of three deoxyribonucleotides (three deoxyribonucleotide 
pairs in double-stranded DNA), and each codon codes for a 
single amino acid unit in a protein. The molecular weight of an 
E. coli DNA molecule is about 3.1 X 10 9 g/mol. The average 
molecular weight of a nucleotide pair is 660 g/mol, and each 
nucleotide pair contributes 0.34 nm to the length of DNA. 

(a) Calculate the length of anE. coli DNA molecule. Com- 
pare the length of the DNA molecule with the cell dimensions 
(see Problem 2). How does the DNA molecule fit into the cell? 

(b) Assume that the average protein in E. coli consists of 
a chain of 400 amino acids. What is the maximum number of 
proteins that can be coded by an E. coli DNA molecule? 

4. The High Rate of Bacterial Metabolism Bacterial cells 
have a much higher rate of metabolism than animal cells. Un- 
der ideal conditions some bacteria double in size and divide 
every 20 min, whereas most animal cells under rapid growth 
conditions require 24 hours. The high rate of bacterial metab- 
olism requires a high ratio of surface area to cell volume. 

(a) Why does surface-to-volume ratio affect the maximum 
rate of metabolism? 

(b) Calculate the surface-to-volume ratio for the spherical 
bacterium Neisseria gonorrhoeae (diameter 0.5 jttm), respon- 



sible for the disease gonorrhea. Compare it with the surface-to- 
volume ratio for a globular amoeba, a large eukaryotic cell (di- 
ameter 150 /Am). The surface area of a sphere is 4irr 2 . 

5. Fast Axonal Transport Neurons have long thin processes 
called axons, structures specialized for conducting signals 
throughout the organism's nervous system. Some axonal 
processes can be as long as 2 m — for example, the axons that 
originate in your spinal cord and terminate in the muscles of 
your toes. Small membrane-enclosed vesicles carrying materi- 
als essential to axonal function move along microtubules of the 
cytoskeleton, from the cell body to the tips of the axons. If the 
average velocity of a vesicle is 1 fim/s, how long does it take a 
vesicle to move from a cell body in the spinal cord to the ax- 
onal tip in the toes? 

6. Is Synthetic Vitamin C as Good as the Natural Vita- 
min? A claim put forth by some purveyors of health foods is 
that vitamins obtained from natural sources are more healthful 
than those obtained by chemical synthesis. For example, pure 
L-ascorbic acid (vitamin C) extracted from rose hips is better 
than pure L-ascorbic acid manufactured in a chemical plant. 
Are the vitamins from the two sources different? Can the body 
distinguish a vitamin's source? 

7. Identification of Functional Groups Figures 1-15 and 
1-16 show some common functional groups of biomolecules. 
Because the properties and biological activities of biomolecules 
are largely determined by their functional groups, it is impor- 
tant to be able to identify them. In each of the compounds be- 
low, circle and identify by name each functional group. 
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Pantothenate, 
a vitamin 
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Phosphoenolpyruvate, 
an intermediate in 
glucose metabolism 

(c) 
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8. Drug Activity and Stereochemistry The quanti- 
tative differences in biological activity between the two 
enantiomers of a compound are sometimes quite large. For 
example, the d isomer of the drug isoproterenol, used to treat 
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mild asthma, is 50 to 80 times more effective as a bronchodilator 
than the l isomer. Identify the chiral center in isoproterenol. Why 
do the two enantiomers have such radically different bioactivity? 



HO 



OH H H 

f ^C-CH 2 -N-C-CH 3 
/ = H CH 3 



HO 

Isoproterenol 

9. Separating Biomolecules In studying a particular bio- 
molecule (a protein, nucleic acid, carbohydrate, or lipid) in the 
laboratory, the biochemist first needs to separate it from other 
biomolecules in the sample— that is, to purify it Specific pu- 
rification techniques are described later in the text. However, 
by looking at the monomeric subunits of a biomolecule, you 
should have some ideas about the characteristics of the mole- 
cule that would allow you to separate it from other molecules. 
For example, how would you separate (a) amino acids from 
fatty acids and (b) nucleotides from glucose? 

10. Silicon-Based Life? Silicon is in the same group of the 
periodic table as carbon and, like carbon, can form up to four 
single bonds. Many science fiction stories have been based on 
the premise of silicon-based life. Is this realistic? What charac- 
teristics of silicon make it less well adapted than carbon as the 
central organizing element for life? To answer this question, 
consider what you have learned about carbon's bonding versa- 
tility, and refer to a beginning inorganic chemistry textbook for 
silicon's bonding properties. 

11. Drug Action and Shape of Molecules Some 
years ago two drug companies marketed a drug under 
the trade names Dexedrine and Benzedrine. The structure of 
the drug is shown below. 

aH 
CH^C-CHs 
NH^ 

The physical properties (C, H, and N analysis, melting point, 
solubility, etc.) of Dexedrine and Benzedrine were identical. The 
recommended oral dosage of Dexedrine (which is still available) 
was 5 mg/day, but the recommended dosage of Benzedrine (no 
longer available) was twice that. Apparently it required consid- 
erably more Benzedrine than Dexedrine to yield the same phys- 
iological response. Explain this apparent contradiction. 

12. Components of Complex Biomolecules Figure 1-10 
shows the major components of complex biomolecules. For 
each of the three important biomolecules below (shown in their 
ionized forms at physiological pH), identify the constituents. 

(a) Guanosine triphosphate (GTP), an energy-rich nu- 
cleotide that serves as a precursor to RNA: 
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(b) Methionine enkephalin, the brain's own opiate: 
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(c) Phosphatidylcholine, a component of many mem- 
branes: 



CH 3 

CH 3 -N-CH 2 -CH 2 - 
CH 3 



? 

-O-P-O-CH, H H 

A 1 11 
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13. Determination of the Structure of a Biomolecule 

An unknown substance, X, was isolated from rabbit muscle. Its 
structure was determined from the following observations and 
experiments. Qualitative analysis showed that X was com- 
posed entirely of C, H, and 0, A weighed sample of X was com- 
pletely oxidized, and the H 2 0 and C0 2 produced were 
measured; this quantitative analysis revealed that X contained 
40.00% C, 6.71% H, and 53.29% 0 by weight. The molecular 
mass of X, determined by mass spectrometry, was 90.00 u 
(atomic mass units; see Box 1-1). Infrared spectroscopy 
showed that X contained one double bond. X dissolved readily 
in water to give an acidic solution; the solution demonstrated 
optical activity when tested in a polarimeter. 

(a) Determine the empirical and molecular formula of X. 

(b) Draw the possible structures of X that fit the molecu- 
lar formula and contain one double bond. Consider only linear 
or branched structures and disregard cyclic structures. Note 
that oxygen makes very poor bonds to itself, 

(c) What is the structural significance of the observed op- 
tical activity? Which structures in (b) are consistent with the 
observation? 

(d) What is the structural significance of the observation 
that a solution of X was acidic? Which structures in (b) are 
consistent with the observation? 

(e) What is the structure of X? Is more than one structure 
consistent with all the data? 
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14. Sweet-Tasting Molecules Many compounds taste sweet 
to humans. Sweet taste results when a molecule binds to the 
sweet receptor, one type of taste receptor, on the surface of 
certain tongue cells. The stronger the binding, the lower 
the concentration required to saturate the receptor and the 
sweeter a given concentration of that substance tastes. The 
standard free-energy change, AG°, of the binding reaction 
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between a sweet molecule and a sweet receptor can be mea- 
sured in kilojoules or kilocalories per mole. 

Sweet taste can be quantified in units of "molar relative 
sweetness" (MRS), a measure that compares the sweetness of 
a substance to the sweetness of sucrose. For example, saccha- 
rin has an MRS of 161; this means that saccharin is 161 times 
sweeter than sucrose. In practical terms, this is measured by 
asking human subjects to compare the sweetness of solutions 
containing different concentrations of each compound. 
Sucrose and saccharin taste equally sweet when sucrose is at a 
concentration 161 times higher than that of saccharin. 

(a) What is the relationship between MRS and the AG 0 of 
the binding reaction? Specifically, would a more negative AG° 
correspond to a higher or lower MRS? Explain your reasoning. 

Shown below are the structures of 10 compounds, all of 
which taste sweet to humans. The MRS and AG° for binding to 
the sweet receptor are given for each substance. 

H OH 

OH V 

Deoxysucrose 
MRS = 0.95 
AG 0 = -6,67 kcal/mol 

H p 

HO Vii^A^H ]^04)H] 
H OH L 



Sucrose 
MRS = 1 
AG° = -6.71 kcal/mol 




D-Tryptophan Saccharin Aspartame 

MRS = 21 MRS = 161 MRS = 172 

AG° = -8.5 kcal/mol AG° = -9.7 kcal/mol AG 0 = -9.7 kcal/mol 



0 




6-Chloro-D-tryptophan Alitame 

MRS = 906 MRS - 1,937 

AG 0 = -10.7 kcal/mol AG° = -11.1 kcal/mol 




CH 3 



Neotame 
MRS = 11,057 
AG°= -12.1 kcal/mol 




Tetrabromosucrose Sucronic acid 

MRS = 13,012 MRS = 200,000 

AG° = -12.2 kcal/mol AG 0 = -13.8 kcal/mol 



Morini, Bassoli, and Temussi (2005) used computer-based 
methods (often referred to as "in silico" methods) to model 
the binding of sweet molecules to the sweet receptor. 

(b) Why is it useful to have a computer model to predict 
the sweetness of molecules, instead of a human- or animal- 
based taste assay? 

In earlier work, Schallenberger and Acree (1967) had 
suggested that all sweet molecules include an "AH-B" struc- 
tural group, in which "A and B are electronegative atoms 
separated by a distance of greater than 2.5 A [0.25 run] but less 
than 4 A [0.4 nm]. H is a hydrogen atom attached to one of the 
electronegative atoms by a covalent bond" (p. 481). 

(c) Given that the length of a "typical" single bond is about 
0 15 nm, identify the AH-B group(s) in each of the molecules 
shown above. 

(d) Based on your findings from (c) , give two objections to 
the statement that "molecules containing an AH-B structure 
will taste sweet." 

(e) For two of the molecules shown above, the AH-B 
model can be used to explain the difference in MRS and AG°. 
Which two molecules are these, and how would you use them 
to support the AH-B model? 

(f) Several of the molecules have closely related struc- 
tures but very different MRS and AG° values. Give two such 
examples, and use these to argue that the AH-B model is un- 
able to explain the observed differences in sweetness. 

In their computer-modeling study, Morini and coauthors 
used the three-dimensional structure of the sweet receptor 
and a molecular dynamics modeling program called GRAMM 
to predict the AG° of binding of sweet molecules to the sweet 
receptor. First, they "trained" their model — that is, they re- 
fined the parameters so that the AG 0 values predicted by the 
model matched the known AG° values for one set of sweet mol- 
ecules (the "training set"). They then "tested" the model by 
asking it to predict the AG 0 values for a new set of molecules 
(the "test set"), 

(g) Why did Morini and colleagues need to test their 
model against a different set of molecules from the set it was 
trained on? 

(h) The researchers found that the predicted AG° values 
for the test set differed from the actual values by, on average, 
1.3 kcal/mol. Using the values given with the structures above, 
estimate the resulting error in MRS values. 
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PART I 



STRUCTURE AND 
C ATA LYS I S 



Biochemistry is nothing less than the chemistry of 
life, and, yes, life can be investigated, analyzed, 
and understood. To begin, every student of bio- 
chemistry needs both a language and some fundamen- 
tals; these are provided in Part I. 

The chapters of Part I are devoted to the structure 
and function of the major classes of cellular constituents: 
water (Chapter 2), amino acids and proteins (Chapters 3 
through 6), sugars and polysaccharides (Chapter 7), nu- 
cleotides and nucleic acids (Chapter 8), fatty acids and 
lipids (Chapter 10), and, finally, membranes and mem- 
brane signaling proteins (Chapters 11 and 12). We sup- 
plement this discourse on molecules with information 
about the technologies used to study them. Techniques 
sections are woven in throughout the text, and one chap- 
ter (Chapter 9) is devoted entirely to biotechnologies 
associated with cloning, genomics, and proteomics. 

We begin, in Chapter 2, with water, because its prop- 
erties affect the structure and function of all other cellu- 
lar constituents. For each class of organic molecules, we 
first consider the covalent chemistry of the monomeric 
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units (amino acids, monosaccharides, nucleotides, and 
fatty acids) and then describe the structure of the macro- 
molecules and supramolecular complexes derived from 
them. An overarching theme is that the polymeric macro- 
molecules in living systems, though large, are highly 
ordered chemical entities, with specific sequences of 
monomeric subunits giving rise to discrete structures and 
functions. This fundamental theme can be broken down 
into three interrelated principles: (1) the unique struc- 
ture of each macromolecule determines its function; 

(2) noncovalent interactions play a critical role in the 
structure and thus the function of macromolecules; and 

(3) the monomeric subunits in polymeric macromole- 
cules occur in specific sequences, representing a form of 
information on which the ordered living state depends. 

The relationship between structure and function 
is especially evident in proteins, which exhibit an 
extraordinary diversity of functions. One particular 
polymeric sequence of amino acids produces a strong, 
fibrous structure found in hair and wool; another pro- 
duces a protein that transports oxygen in the blood; a 

H 
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third binds other proteins and catalyzes the cleavage of 
the bonds between their amino acids. Similarly, the 
special functions of polysaccharides, nucleic acids, and 
lipids can be understood as resulting directly from 
their chemical structure, with their characteristic 
monomeric subunits precisely linked to form func- 
tional polymers. Sugars linked together become energy 
stores, structural fibers, and points of specific molecu- 
lar recognition; nucleotides strung together in DNA or 
RNA provide the blueprint for an entire organism; and 
aggregated lipids form membranes. Chapter 12 unifies 
the discussion of biomolecule function, describing how 
specific signaling systems regulate the activities of 
biomolecules — within a cell, within an organ, and 
among organs — to keep an organism in homeostasis. 

As we move from monomeric units to larger and 
larger polymers, the chemical focus shifts from covalent 
bonds to noncovalent interactions. Covalent bonds, at 
the monomeric and macromolecular level, place con- 
straints on the shapes assumed by large biomolecules. It 
is the numerous noncovalent interactions, however, that 
dictate the stable, native conformations of large mole- 
cules while permitting the flexibility necessary for their 



biological function. As we shall see, noncovalent interac- 
tions are essential to the catalytic power of enzymes, the 
critical interaction of complementary base pairs in 
nucleic acids, and the arrangement and properties of 
lipids in membranes. The principle that sequences of 
monomeric subunits are rich in information emerges 
most fully in the discussion of nucleic acids (Chapter 8). 
However, proteins and some short polymers of sugars 
(oligosaccharides) are also information-rich molecules. 
The amino acid sequence is a form of information that 
directs the folding of the protein into its unique three- 
dimensional structure, and ultimately determines the 
function of the protein. Some oligosaccharides also have 
unique sequences and three-dimensional structures 
that are recognized by other macromolecules. 

Each class of molecules has a similar structural hier- 
archy: subunits of fixed structure are connected by bonds 
of limited flexibility to form macromolecules with three- 
dimensional structures determined by noncovalent inter- 
actions. These macromolecules then interact to form the 
supramolecular structures and organelles that allow a cell 
to carry out its many metabolic functions. Together, the 
molecules described in Part I are the stuff of life. 



I believe that as the methods of structural chemistry are further applied 
to physiological problems, it will be found that the significance of the 
hydrogen bond for physiology is greater than that of any other single 
structural feature. 

— Linus Pauling, The Nature of the Chemical Bond, 1939 
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Water is the most abundant substance in living 
systems, making up 70% or more of the weight 
. of most organisms. The first living organisms on 
Earth doubtless arose in an aqueous environment, and 
the course of evolution has been shaped by the proper- 
ties of the aqueous medium in which life began. 

This chapter begins with descriptions of the physical 
and chemical properties of water, to which all aspects of 
cell structure and function are adapted. The attractive 
forces between water molecules and the slight tendency of 
water to ionize are of crucial importance to the structure 
and function of biomolecules. We review the topic of ion- 
ization in terms of equilibrium constants, pH, and titration 
curves, and consider how aqueous solutions of weak acids 
or bases and their salts act as buffers against pH changes in 
biological systems. The water molecule and its ionization 
products, H + and OH~ , profoundly influence the structure, 
self-assembly, and properties of all cellular components, in- 
cluding proteins, nucleic acids, and lipids. The noncovalent 
interactions responsible for the strength and specificity of 
"recognition" among biomolecules are decisively influ- 
enced by the solvent properties of water, including its abil- 
ity to form hydrogen bonds with itself and with solutes. 

2.1 Weak Interactions in Aqueous Systems 

Hydrogen bonds between water molecules provide the 
cohesive forces that make water a liquid at room tem- 



perature and favor the extreme ordering of molecules 
that is typical of crystalline water (ice). Polar biomole- 
cules dissolve readily in water because they can replace 
water-water interactions with more energetically favor- 
able water-solute interactions. In contrast, nonpolar 
biomolecules interfere with water-water interactions 
but are unable to form water-solute interactions — 
consequently, nonpolar molecules are poorly soluble in 
water. In aqueous solutions, nonpolar molecules tend to 
cluster together. Hydrogen bonds and ionic, hydropho- 
bic (Greek, "water-fearing"), and van der Waals interac- 
tions are individually weak, but collectively they have a 
very significant influence on the three-dimensional 
structures of proteins, nucleic acids, polysaccharides, 
and membrane lipids. 

Hydrogen Bonding Gives Water Its Unusual Properties 

Water has a higher melting point, boiling point, and 
heat of vaporization than most other common solvents 
(Table 2-1). These unusual properties are a conse- 
quence of attractions between adjacent water molecules 
that give liquid water great internal cohesion. A look at 
the electron structure of the H 2 0 molecule reveals the 
cause of these intermolecular attractions. 

Each hydrogen atom of a water molecule shares an 
electron pair with the central oxygen atom. The geome- 
try of the molecule is dictated by the shapes of the outer 
electron orbitals of the oxygen atom, which are similar 
to the sp 3 bonding orbitals of carbon (see Fig. 1-14). 
These orbitals describe a rough tetrahedron, with a hy- 
drogen atom at each of two corners and unshared elec- 
tron pairs at the other two corners (Fig. 2-la). The 
H— 0— H bond angle is 104.5°, slightly less than the 
109.5° of a perfect tetrahedron because of crowding by 
the nonbonding orbitals of the oxygen atom. 

The oxygen nucleus attracts electrons more strongly 
than does the hydrogen nucleus (a proton); that is, oxy- 
gen is more electronegative. This means that the shared 

[43] 
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Melting Point, Boiling Point, and Heat of Vaporization of Some Common Solvents 



TABLE 2-1 





Melting point (°C) 


Boiling point (°C) 


Heat of vaporizatii 


Water 


0 


100 


2,260 


Methanol (CH 3 OH) 


-98 


65 


1,100 


Ethanol (CH 3 CH 2 OH) 


-117 


78 


854 


Propanol (CH3CH2CH2OH) 


-127 


97 


687 


Butanol (CH 3 (CH 2 ) 2 CH 2 OH) 


-90 


117 


590 


Acetone (CH 3 COCH 3 ) 


-95 


56 


523 


Hexane (CH 3 (CH 2 ) 4 CH 3 ) 


-98 


69 


423 


Benzene (C 6 H 6 ) 


6 


80 


394 


Butane (CH 3 (CH 2 ) 2 CH 3 ) 


-135 


-0.5 


381 


Chloroform (CHC1 3 ) 


-63 


61 


247 



*The heat energy required to convert 1.0 g of a liquid at its boiling point and at atmospheric pressure into its gaseous state at the same temperature. 
It is a direct measure of the energy required to overcome attractive forces between molecules in the liquid phase. 



electrons are more often in the vicinity of the oxygen 
atom than of the hydrogen. The result of this unequal 
electron sharing is two electric dipoles in the water mol- 
ecule, one along each of the H — 0 bonds; each hydrogen 
bears a partial positive charge (5 + ), and the oxygen 
atom bears a partial negative charge equal in magnitude 
to the sum of the two partial positives (23"). As a result, 
there is an electrostatic attraction between the oxygen 
atom of one water molecule and the hydrogen of another 
(Fig. 2-lb), called a hydrogen bond. Throughout this 
book, we represent hydrogen bonds with three parallel 
blue lines, as in Figure 2-lb. 

Hydrogen bonds are relatively weak. Those in liquid 
water have a bond dissociation energy (the energy re- 




(a) (b) 



FIGURE 2-1 Structure of the water molecule, (a) The dipolar nature of 
the H 2 0 molecule is shown in a ball-and-stick model; the dashed lines 
represent the nonbonding orbitals. There is a nearly tetrahedral 
arrangement of the outer-shell electron pairs around the oxygen atom; 
the two hydrogen atoms have localized partial positive charges 
and the oxygen atom has a partial negative charge (8~). (b) Two H 2 0 
molecules joined by a hydrogen bond (designated here, and through- 
out this book, by three blue lines) between the oxygen atom of the up- 
per molecule and a hydrogen atom of the lower one. Hydrogen bonds 
are longer and weaker than covalent O — H bonds. 



quired to break a bond) of about 23 kJ/mol, compared 
with 470 kJ/mol for the covalent O — H bond in water or 
348 kJ/mol for a covalent C — C bond. The hydrogen bond 
is about 10% covalent, due to overlaps in the bonding or- 
bitals, and about 90% electrostatic. At room tempera- 
ture, the thermal energy of an aqueous solution (the 
kinetic energy of motion of the individual atoms and 
molecules) is of the same order of magnitude as that re- 
quired to break hydrogen bonds. When water is heated, 
the increase in temperature reflects the faster motion of 
individual water molecules. At any given time, most of 
the molecules in liquid water are hydrogen bonded, but 
the lifetime of each hydrogen bond is just 1 to 20 pi- 
coseconds (1 ps = 10~ 12 s); when one hydrogen bond 
breaks, another hydrogen bond forms, with the same 
partner or a new one, within 0.1 ps. The apt phrase "flick- 
ering clusters" has been applied to the short-lived groups 
of water molecules interlinked by hydrogen bonds in 
liquid water. The sum of all the hydrogen bonds between 
H 2 0 molecules confers great internal cohesion on liquid 
water. Extended networks of hydrogen-bonded water 
molecules also form bridges between solutes (proteins 
and nucleic acids, for example) that allow the larger 
molecules to interact with each other over distances of 
several nanometers without physically touching. 

The nearly tetrahedral arrangement of the orbitals 
about the oxygen atom (Fig. 2-la) allows each water 
molecule to form hydrogen bonds with as many as four 
neighboring water molecules. In liquid water at room 
temperature and atmospheric pressure, however, water 
molecules are disorganized and in continuous motion, so 
that each molecule forms hydrogen bonds with an aver- 
age of only 3.4 other molecules. In ice, on the other hand, 
each water molecule is fixed in space and forms hydro- 
gen bonds with a full complement of four other water 
molecules to yield a regular lattice structure (Fig. 2-2). 
Breaking a sufficient proportion of hydrogen bonds to 
destabilize the crystal lattice of ice requires much thermal 
energy, which accounts for the relatively high melting 
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FIGURE 2-2 Hydrogen bonding in ice. In ice, each water molecule 
forms four hydrogen bonds, the maximum possible for a water mole- 
cule, creating a regular crystal lattice. By contrast, in liquid water at 
room temperature and atmospheric pressure, each water molecule 
hydrogen-bonds with an average of 3.4 other water molecules. This 
crystal lattice structure makes ice less dense than liquid water, and thus 
ice floats on liquid water. 

point of water (Table 2-1). When ice melts or water 
evaporates, heat is taken up by the system: 

H 2 0 (solid) H 2 0 (liquid) AH = +5.9 kJ/mol 

H 2 0 (liquid) -> H 2 0 (gas) AH = +44.0 kJ/mol 

During melting or evaporation, the entropy of the 
aqueous system increases as more highly ordered arrays 
of water molecules relax into the less orderly hydrogen- 
bonded arrays in liquid water or into the wholly disor- 
dered gaseous state. At room temperature, both the 
melting of ice and the evaporation of water occur spon- 
taneously; the tendency of the water molecules to asso- 
ciate through hydrogen bonds is outweighed by the 
energetic push toward randomness. Recall that the free- 
energy change (AG) must have a negative value for a 
process to occur spontaneously: AG = AH - T AS, 
where AG represents the driving force, AH the enthalpy 
change from making and breaking bonds, and AS the 
change in randomness. Because AH is positive for melt- 
ing and evaporation, it is clearly the increase in entropy 
(AS) that makes AG negative and drives these changes. 

Water Forms Hydrogen Bonds with Polar Solutes 

Hydrogen bonds are not unique to water. They readily 
form between an electronegative atom (the hydrogen 
acceptor, usually oxygen or nitrogen) and a hydrogen 
atom covalently bonded to another electronegative 
atom (the hydrogen donor) in the same or another mol- 
ecule (Fig. 2-3). Hydrogen atoms covalently bonded 



Hydrogen 
acceptor 

Hydrogen 
donor 



\> 


% / 
N 


V ' 




H 


H 


H 


H 

1 


1 

0 

1 


1 

0 

1 


1 

0 

1 


A 



V V 



H 
I 

N 



H 

I 

N 



FIGURE 2-3 Common hydrogen bonds in biological systems. The hy- 
drogen acceptor is usually oxygen or nitrogen; the hydrogen donor is 
another electronegative atom. 

to carbon atoms do not participate in hydrogen bond- 
ing, because carbon is only slightly more electronega- 
tive than hydrogen and thus the C— H bond is only very 
weakly polar. The distinction explains why butanol 
(CH 3 (CH2)2CH 2 OH) has a relatively high boiling point 
of 117 °C, whereas butane (CH 3 (CH 2 ) 2 CH 3 ) has a boil- 
ing point of only -0.5 °C. Butanol has a polar hydroxyl 
group and thus can form intermolecular hydrogen 
bonds. Uncharged but polar biomolecules such as sug- 
ars dissolve readily in water because of the stabilizing 
effect of hydrogen bonds between the hydroxyl groups 
or carbonyl oxygen of the sugar and the polar water 
molecules. Alcohols, aldehydes, ketones, and com- 
pounds containing N— H bonds all form hydrogen 
bonds with water molecules (Fig. 2-4) and tend to be 
soluble in water. 



Between the 
hydroxyl group 
of an alcohol 
and water 



O 

I 

H 



H H 



Between the 
carbonyl group 
of a ketone 
and water 

R 2 

I 

C 

n q 

H 

I 

0 



Between peptide 
groups in 
polypeptides 



H 



I 



v/ 

1 o 

H 

I 

c c 
I II 

R O 



Between 
complementary 
bases of DNA 

H 

i i 

I 

¥ H 

H i nh 



Thymine 



1 1 



Adenine 



N-CH 

R 

FIGURE 2-4 Some biologically important hydrogen bonds. 
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FIGURE 2-5 Directionality of the hydrogen bond. The attraction be- 
tween the partial electric charges (see Fig. 2-1 ) is greatest when the three 
atoms involved in the bond (in this case O, H, and O) lie in a straight line. 
When the hydrogen-bonded moieties are structurally constrained (when 
they are parts of a single protein molecule, for example), this ideal geom- 
etry may not be possible and the resulting hydrogen bond is weaker. 

Hydrogen bonds are strongest when the bonded 
molecules are oriented to maximize electrostatic inter- 
action, which occurs when the hydrogen atom and the 
two atoms that share it are in a straight line — that is, 
when the acceptor atom is in line with the covalent bond 
between the donor atom and H (Fig. 2-5), putting the 
positive charge of the hydrogen ion directly between the 
two partial negative charges. Hydrogen bonds are thus 
highly directional and capable of holding two hydrogen- 
bonded molecules or groups in a specific geometric 
arrangement. As we shall see later, this property of 
hydrogen bonds confers very precise three-dimensional 
structures on protein and nucleic acid molecules, which 
have many intramolecular hydrogen bonds. 

Water Interacts Electrostatically with Charged Solutes 

Water is a polar solvent. It readily dissolves most bio- 
molecules, which are generally charged or polar com- 
pounds (Table 2-2); compounds that dissolve easily in 
water are hydrophilic (Greek, "water-loving"). In con- 



trast, nonpolar solvents such as chloroform and benzene 
are poor solvents for polar biomolecules but easily dis- 
solve those that are hydrophobic — nonpolar molecules 
such as lipids and waxes. 

Water dissolves salts such as NaCl by hydrating and 
stabilizing the Na + and Cl~ ions, weakening the electro- 
static interactions between them and thus counteract- 
ing their tendency to associate in a crystalline lattice 
(Fig. 2-6). The same factors apply to charged biomole- 
cules, compounds with functional groups such as 
ionized carboxylic acids (— COO~), protonated amines 
(— NHj), and phosphate esters or anhydrides. Water 
readily dissolves such compounds by replacing solute- 
solute hydrogen bonds with solute-water hydrogen 
bonds, thus screening the electrostatic interactions be- 
tween solute molecules. 

Water is effective in screening the electrostatic in- 
teractions between dissolved ions because it has a high 
dielectric constant, a physical property that reflects the 
number of dipoles in a solvent. The strength, or force 
(F), of ionic interactions in a solution depends on the 
magnitude of the charges (Q), the distance between the 
charged groups (r), and the dielectric constant (e, 
which is dimensionless) of the solvent in which the in- 
teractions occur: 



F = 



Q1Q2 



For water at 25 °C, e is 78.5, and for the very nonpolar 
solvent benzene, e is 4.6. Thus, ionic interactions be- 
tween dissolved ions are much stronger in less polar en- 
vironments. The dependence on r 2 is such that ionic 
attractions or repulsions operate only over short 
distances — in the range of 10 to 40 nm (depending on 
the electrolyte concentration) when the solvent is water. 
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FIGURE 2-6 Water as solvent. Water dissolves many 
crystalline salts by hydrating their component ions. The 
NaCI crystal lattice is disrupted as water molecules 
cluster about the CI" and Na + ions. The ionic charges 
are partially neutralized, and the electrostatic attrac- 
tions necessary for lattice formation are weakened. 



Entropy Increases as Crystalline Substances Dissolve 

As a salt such as NaCI dissolves, the Na + and Cl~ ions 
leaving the crystal lattice acquire far greater freedom of 
motion (Fig. 2-6). The resulting increase in entropy 
(randomness) of the system is largely responsible for 
the ease of dissolving salts such as NaCI in water. In 
thermodynamic terms, formation of the solution occurs 
with a favorable free-energy change: AG = AH - T AS, 
where AH has a small positive value and T AS a large 
positive value; thus AG is negative. 

Nonpolar Gases Are Poorly Soluble in Water 

The molecules of the biologically important gases C0 2 , 
0 2 , and N 2 are nonpolar. In 0 2 and N 2 , electrons are 
shared equally by both atoms. In C0 2 , each C=0 bond 
is polar, but the two dipoles are oppositely directed and 
cancel each other (Table 2-3) . The movement of mole- 
cules from the disordered gas phase into aqueous solu- 
tion constrains their motion and the motion of water 
molecules and therefore represents a decrease in en- 
tropy. The nonpolar nature of these gases and the de- 



crease in entropy when they enter solution combine to 
make them very poorly soluble in water (Table 2-3). 
Some organisms have water-soluble "carrier proteins" 
(hemoglobin and myoglobin, for example) that facili- 
tate the transport of 0 2 . Carbon dioxide forms carbonic 
acid (H 2 C0 3 ) in aqueous solution and is transported as 
the HCO3 (bicarbonate) ion, either free— bicarbonate 
is very soluble in water (-100 g/L at 25 °C)— or bound 
to hemoglobin. Three other gases, NH 3 , NO, and H 2 S, 
also have biological roles in some organisms; these 
gases are polar, dissolve readily in water, and ionize in 
aqueous solution. 

Nonpolar Compounds Force Energetically Unfavorable 
Changes in the Structure of Water 

When water is mixed with benzene or hexane, two 
phases form; neither liquid is soluble in the other. Non- 
polar compounds such as benzene and hexane are 
hydrophobic — they are unable to undergo energeti- 
cally favorable interactions with water molecules, and 
they interfere with the hydrogen bonding among water 



TABLE 2-3 



Gas 



Solubilities of Some Gases in Wbler 



Structure* 



Polarity 



Solubility 

in water (g^L) t 



Nitrogen 
Oxygen 
Carbon dioxide 

Ammonia 

Hydrogen sulfide 



N=N 
0=0 

8- 8- 

o=c=o 

H ? H 

H H I 
S t 



Nonpolar 
Nonpolar 
Nonpolar 

Polar 

Polar 



0.018 (40 °C) 
0.035 (50 °C) 
0.97 (45 °C) 

900 (10 °C) 

1,860 (40 °C) 



*The arrows represent electric dipoles; there is a partial negative charge (S ) at the head of the arrow, a partial positive charge (5 + ; not 
shown here) at the tail. 

f Note that polar molecules dissolve far better even at low temperatures than do nonpolar molecules at relatively high temperatures. 
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molecules. All molecules or ions in aqueous solution in- 
terfere with the hydrogen bonding of some water mole- 
cules in their immediate vicinity, but polar or charged 
solutes (such as NaCl) compensate for lost water-water 
hydrogen bonds by forming new solute-water interac- 
tions. The net change in enthalpy (AH) for dissolving 
these solutes is generally small. Hydrophobic solutes, 
however, offer no such compensation, and their addition 
to water may therefore result in a small gain of enthalpy; 
the breaking of hydrogen bonds between water mole- 
cules takes up energy from the system, requiring the 
input of energy from the surroundings. In addition to 
requiring this input of energy, dissolving hydrophobic 
compounds in water produces a measurable decrease in 
entropy. Water molecules in the immediate vicinity of a 
nonpolar solute are constrained in their possible orien- 
tations as they form a highly ordered cagelike shell 
around each solute molecule. These water molecules 
are not as highly oriented as those in clathrates, crys- 
talline compounds of nonpolar solutes and water, but 
the effect is the same in both cases: the ordering of water 
molecules reduces entropy. The number of ordered 
water molecules, and therefore the magnitude of the en- 
tropy decrease, is proportional to the surface area of the 
hydrophobic solute enclosed within the cage of water 
molecules. The free-energy change for dissolving a non- 
polar solute in water is thus unfavorable: AG = AH - 
T AS, where AH has a positive value, AS has a negative 
value, and AG is positive. 



Hydrophilic 
"head group" 



Amphipathic compounds contain regions that are 
polar (or charged) and regions that are nonpolar 
(Table 2-2). When an amphipathic compound is mixed 
with water, the polar, hydrophilic region interacts favor- 
ably with the solvent and tends to dissolve, but the non- 
polar, hydrophobic region tends to avoid contact with 
the water (Fig. 2-7a). The nonpolar regions of the 
molecules cluster together to present the smallest 
hydrophobic area to the aqueous solvent, and the polar 
regions are arranged to maximize their interaction with 
the solvent (Fig. 2-7b). These stable structures of am- 
phipathic compounds in water, called micelles, may 
contain hundreds or thousands of molecules. The forces 
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(a) 

FIGURE 2-7 Amphipathic compounds in aqueous solution, (a) Long- 
chain fatty acids have very hydrophobic alkyl chains, each of which is 
surrounded by a layer of highly ordered water molecules, (b) By clus- 
tering together in micelles, the fatty acid molecules expose the small- 
est possible hydrophobic surface area to the water, and fewer water 
molecules are required in the shell of ordered water. The energy gained 
by freeing immobilized water molecules stabilizes the micelle. 
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that hold the nonpolar regions of the molecules together 
are called hydrophobic interactions. The strength of 
hydrophobic interactions is not due to any intrinsic at- 
traction between nonpolar moieties. Rather, it results 
from the system's achieving greatest thermodynamic 
stability by minimizing the number of ordered water 
molecules required to surround hydrophobic portions of 
the solute molecules. 

Many biomolecules are amphipathic; proteins, pig- 
ments, certain vitamins, and the sterols and phospho- 
lipids of membranes all have both polar and nonpolar 
surface regions. Structures composed of these mole- 
cules are stabilized by hydrophobic interactions among 
the nonpolar regions. Hydrophobic interactions among 
lipids, and between lipids and proteins, are the most im- 




Enzyme-substrate interaction 
stabilized by hydrogen-bonding, 
ionic, and hydrophobic interactions 



FIGURE 2-8 Release of ordered water favors formation of an enzyme- 
substrate complex. While separate, both enzyme and substrate force 
neighboring water molecules into an ordered shell. Binding of sub- 
strate to enzyme releases some of the ordered water, and the resulting 
increase in entropy provides a thermodynamic push toward formation 
of the enzyme-substrate complex (see p 1 92). 
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portant determinants of structure in biological mem- 
branes. Hydrophobic interactions between nonpolar 
amino acids also stabilize the three-dimensional struc- 
tures of proteins. 

Hydrogen bonding between water and polar solutes 
also causes an ordering of water molecules, but the en- 
ergetic effect is less significant than with nonpolar 
solutes. Part of the driving force for binding of a polar 
substrate (reactant) to the complementary polar sur- 
face of an enzyme is the entropy increase as the enzyme 
displaces ordered water from the substrate, and as the 
substrate displaces ordered water from the enzyme sur- 
face (Fig. 2-8). 

van der Waals interactions Are Weak 
Interatomic Attractions 

When two uncharged atoms are brought very close to- 
gether, their surrounding electron clouds influence each 
other. Random variations in the positions of the elec- 
trons around one nucleus may create a transient electric 
dipole, which induces a transient, opposite electric di- 
pole in the nearby atom. The two dipoles weakly attract 
each other, bringing the two nuclei closer. These weak 
attractions are called van der Waals interactions 
(also known as London forces). As the two nuclei 
draw closer together, their electron clouds begin to 
repel each other. At the point where the net attraction is 
maximal, the nuclei are said to be in van der Waals con- 
tact. Each atom has a characteristic van der Waals 
radius, a measure of how close that atom will allow 
another to approach (Table 2-4). In the "space-filling" 
molecular models shown throughout this book, the 
atoms are depicted in sizes proportional to their van der 
Waals radii. 



van der Waals Radii and Covalent 
(Single-Bond) Radii of Some Elements 



Element 


van der Waals 
radius (nm) 


Covalent radius for 
single bond (nm) 


H 


0.11 


0.030 


0 


0.15 


0.066 


N 


0.15 


0.070 


C 


0.17 


0.077 


S 


0.18 


0.104 


P 


0.19 


0.110 


I 


0.21 


0.133 



Sources: For van der Waals radii, Chauvin, R. (1992) Explicit periodic trend of van der 
Waals radii. J. Phys. Chem. 96, 9194-9197. For covalent radii, Pauling, L (1960) 
Nature of the Chemical Bond, 3rd edn, Cornell University Press, Ithaca, NY. 
Note: van der Waals radii describe the space-filling dimensions of atoms. When two 
atoms are joined covalently, the atomic radii at the point of bonding are less than the 
van der Waals radii, because the joined atoms are pulled together by the shared elec- 
tron pair. The distance between nuclei in a van der Waals interaction or a covalent bond 
is about equal to the sum of the van der Waals or covalent radii, respectively, for the two 
atoms. Thus the length of a carbon-carbon single bond is about 0.077 nm + 0.077 nm = 
0 154 nm. 



^50^ Water 



Weak Interactions Are Crucial to Macromolecular 
Structure and Function 

The noncovalent interactions we have described — 
hydrogen bonds and ionic, hydrophobic, and van der Waals 
interactions (Table 2-5)— are much weaker than covalent 
bonds. An input of about 350 kJ of energy is required to 
break a mole (6 X 10 23 ) of C— C single bonds ; and about 
410 kJ to break a mole of C— H bonds, but as little as 4 kJ 
is sufficient to disrupt a mole of typical van der Waals in- 
teractions. Hydrophobic interactions are also much weaker 
than covalent bonds, although they are substantially 
strengthened by a highly polar solvent (a concentrated salt 
solution, for example). Ionic interactions and hydrogen 
bonds are variable in strength, depending on the polarity of 
the solvent and the alignment of the hydrogen-bonded 
atoms, but they are always significantly weaker than cova- 
lent bonds. In aqueous solvent at 25 °C, the available ther- 
mal energy can be of the same order of magnitude as the 
strength of these weak interactions, and the interaction be- 
tween solute and solvent (water) molecules is nearly 
as favorable as solute-solute interactions. Consequently, 
hydrogen bonds and ionic, hydrophobic, and van der Waals 
interactions are continually forming and breaking. 

Although these four types of interactions are indi- 
vidually weak relative to covalent bonds, the cumulative 
effect of many such interactions can be very significant. 
For example, the noncovalent binding of an enzyme to 
its substrate may involve several hydrogen bonds and 
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one or more ionic interactions, as well as hydrophobic 
and van der Waals interactions. The formation of each of 
these weak bonds contributes to a net decrease in the 
free energy of the system. We can calculate the stability 
of a noncovalent interaction, such as that of a small mol- 
ecule hydrogen-bonded to its macromolecular partner, 
from the binding energy. Stability, as measured by the 
equilibrium constant (see below) of the binding reac- 
tion, varies exponentially with binding energy. The dis- 
sociation of two biomolecules (such as an enzyme and 
its bound substrate) that are associated noncovalently 
through multiple weak interactions requires all these in- 
teractions to be disrupted at the same time. Because the 
interactions fluctuate randomly, such simultaneous dis- 
ruptions are very unlikely. The molecular stability be- 
stowed by 5 or 20 weak interactions is therefore much 
greater than would be expected intuitively from a 
simple summation of small binding energies. 

Macromolecules such as proteins, DNA, and RNA 
contain so many sites of potential hydrogen bonding or 
ionic, van der Waals, or hydrophobic interactions that the 
cumulative effect of the many small binding forces can 
be enormous. For macromolecules, the most stable (that 
is, the native) structure is usually that in which weak 
interactions are maximized. The folding of a single 
polypeptide or polynucleotide chain into its three- 
dimensional shape is determined by this principle. The 
binding of an antigen to a specific antibody depends on 
the cumulative effects of many weak interactions. As 
noted earlier, the energy released when an enzyme binds 
noncovalently to its substrate is the main source of the 
enzyme's catalytic power. The binding of a hormone or a 
neurotransmitter to its cellular receptor protein is the 
result of multiple weak interactions. One consequence of 
the large size of enzymes and receptors (relative to their 
substrates or ligands) is that their extensive surfaces 
provide many opportunities for weak interactions. At the 
molecular level, the complementarity between interact- 
ing biomolecules reflects the complementarity and weak 
interactions between polar, charged, and hydrophobic 
groups on the surfaces of the molecules. 

When the structure of a protein such as hemoglobin 
(Fig. 2-9) is determined by x-ray crystallography 
(see Box 4-5, p. 132), water molecules are often found 
to be bound so tightly that they are part of the crystal 
structure; the same is true for water in crystals of RNA 
or DNA. These bound water molecules, which can also 
be detected in aqueous solutions by nuclear magnetic 
resonance, have distinctly different properties from 
those of the "bulk" water of the solvent. They are, for ex- 
ample, not osmotically active (see below). For many 
proteins, tightly bound water molecules are essential to 
their function. In a reaction central to the process of 
photosynthesis, for example, light drives protons across 
a biological membrane as electrons flow through a series 
of electron-carrying proteins (see Fig. 19-60). One of 
these proteins, cytochrome/, has a chain of five bound 
water molecules (Fig. 2-10) that may provide a path 
for protons to move through the membrane by a process 
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(a) (b) 



FIGURE 2-9 Water binding in hemoglobin. (PDB ID 1 A3 N) The crystal 
structure of hemoglobin, shown (a) with bound water molecules (red 
spheres) and (b) without the water molecules. The water molecules are 
so firmly bound to the protein that they affect the x-ray diffraction pat- 
tern as though they were fixed parts of the crystal. The two a subunits 
of hemoglobin are shown in gray the two (3 subunits in blue. Each sub- 
unit has a bound heme group (red stick structure), visible only in the 
p subunits in this view. The structure and function of hemoglobin are 
discussed in detail in Chapter 5. 



known as "proton hopping" (described below). Another 
such light-driven proton pump, bacteriorhodopsin, al- 
most certainly uses a chain of precisely oriented bound 
water molecules in the transmembrane movement of 
protons (see Fig. 19-67). 




— Fe 



H— N— H 



FIGURE 2-10 Water chain in cytochrome f. Water is bound in a proton 
channel of the membrane protein cytochrome f, which is part of 
the energy-trapping machinery of photosynthesis in chloroplasts (see 
Fig. 19-64). Five water molecules are hydrogen-bonded to each other 
and to functional groups of the protein: the peptide backbone atoms of 
valine, proline, arginine, and alanine residues, and the side chains of 
three asparagine and two glutamine residues. The protein has a bound 
heme (see Fig. 5-1), its iron ion facilitating electron flow during pho- 
tosynthesis. Electron flow is coupled to the movement of protons 
across the membrane, which probably involves "proton hopping" (see 
Fig. 2-1 3) through this chain of bound water molecules. 



Solutes Affect the Colligative Properties of 
Aqueous Solutions 

Solutes of all kinds alter certain physical properties 
of the solvent, water: its vapor pressure, boiling 
point, melting point (freezing point), and osmotic 
pressure. These are called colligative properties 
(colligative meaning "tied together"), because the ef- 
fect of solutes on all four properties has the same ba- 
sis: the concentration of water is lower in solutions 
than in pure water. The effect of solute concentration 
on the colligative properties of water is independent 
of the chemical properties of the solute; it depends 
only on the number of solute particles (molecules, 
ions) in a given amount of water. A compound such 
as NaCl, which dissociates in solution, has an effect 
on osmotic pressure, for example, that is twice that 
of an equal number of moles of a nondissociating 
solute such as glucose. RoshanKetab 021-66950639 

Water molecules tend to move from a region of 
higher water concentration to one of lower water con- 
centration, in accordance with the tendency in nature 
for a system to become disordered. When two different 
aqueous solutions are separated by a semipermeable 
membrane (one that allows the passage of water but not 
solute molecules), water molecules diffusing from the 
region of higher water concentration to the region of 
lower water concentration produce osmotic pressure 
(Fig. 2-11). This pressure, II, measured as the force 



Force (II) 

Pure Nonpermeant resists osmosis 

water solute dissolved 

j in water > r 




Semipermeable 
membrane 

FIGURE 2-11 Osmosis and the measurement of osmotic pressure. 

(a) The initial state. The tube contains an aqueous solution, the beaker 
contains pure water, and the semipermeable membrane allows the pas- 
sage of water but not solute. Water flows from the beaker into the tube 
to equalize its concentration across the membrane, (b) The final state. 
Water has moved into the solution of the nonpermeant compound, di- 
luting it and raising the column of water within the tube. At equilibrium, 
the force of gravity operating on the solution in the tube exactly balances 
the tendency of water to move into the tube, where its concentration is 
lower, (c) Osmotic pressure (II) is measured as the force that must be ap- 
plied to return the solution in the tube to the level of that in the beaker. 
This force is proportional to the height, h, of the column in (b). 
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necessary to resist water movement (Fig. 2-1 lc), is ap- 
proximated by the van't Hoff equation: 

n - icRT 

in which R is the gas constant and T is the absolute tem- 
perature. The term ic is the osmolarity of the solution, 
the product of the van't Hoff factor % which is a measure 
of the extent to which the solute dissociates into two or 
more ionic species, and the solute's molar concentra- 
tion c. In dilute NaCl solutions, the solute completely 
dissociates into Na + and CI", doubling the number of 
solute particles, and thus i = 2. For all nonionizing 
solutes, % = 1. For solutions of several (n) solutes, n is 
the sum of the contributions of each species: 

n = RTdiCx + i 2 c 2 + + i n c n ) 

Osmosis, water movement across a semiperme- 
able membrane driven by differences in osmotic pres- 
sure, is an important factor in the life of most cells. 
Plasma membranes are more permeable to water than 
to most other small molecules, ions, and macromole- 
cules. This permeability is due largely to protein chan- 
nels (aquaporins; see Fig. 11-46) in the membrane 
that selectively permit the passage of water. Solutions 
of osmolarity equal to that of a cell's cytosol are said to 
be isotonic relative to that cell. Surrounded by an iso- 
tonic solution, a cell neither gains nor loses water 
(Fig. 2-12). In a hypertonic solution, one with 
higher osmolarity than that of the cytosol, the cell 
shrinks as water moves out. In a hypotonic solution, 
one with a lower osmolarity than the cytosol, the cell 
swells as water enters. In their natural environments, 
cells generally contain higher concentrations of bio- 
molecules and ions than their surroundings, so os- 
motic pressure tends to drive water into cells. If not 
somehow counterbalanced, this inward movement of 
water would distend the plasma membrane and even- 
tually cause bursting of the cell (osmotic lysis). 

Several mechanisms have evolved to prevent this 
catastrophe. In bacteria and plants, the plasma mem- 
brane is surrounded by a nonexpandable cell wall of 
sufficient rigidity and strength to resist osmotic pres- 
sure and prevent osmotic lysis. Certain freshwater 
protists that live in a highly hypotonic medium have an 
organelle (contractile vacuole) that pumps water out 
of the cell. In multicellular animals, blood plasma and 
interstitial fluid (the extracellular fluid of tissues) are 
maintained at an osmolarity close to that of the cy- 
tosol. The high concentration of albumin and other 
proteins in blood plasma contributes to its osmolarity. 
Cells also actively pump out Na + and other ions into 
the interstitial fluid to stay in osmotic balance with 
their surroundings. 

Because the effect of solutes on osmolarity depends 
on the number of dissolved particles, not their mass, 
macromolecules (proteins, nucleic acids, polysaccha- 
rides) have far less effect on the osmolarity of a solution 
than would an equal mass of their monomeric compo- 
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(b) Cell in hypertonic (c) Cell in hypotonic 

solution; water moves out solution; water moves in, 

and cell shrinks. creating outward pressure; 

cell swells, may eventually 

burst. 

FIGURE 2-12 Effect of extracellular osmolarity on water movement 
across a plasma membrane. When a cell in osmotic balance with its 
surrounding medium — that is, a cell in (a) an isotonic medium — is trans- 
ferred into (b) a hypertonic solution or (c) a hypotonic solution, water 
moves across the plasma membrane in the direction that tends to 
equalize osmolarity outside and inside the cell. 

nents. For example, a gram of a polysaccharide com- 
posed of 1,000 glucose units has the same effect on os- 
molarity as a milligram of glucose. Storing fuel as 
polysaccharides (starch or glycogen) rather than as glu- 
cose or other simple sugars avoids an enormous in- 
crease in osmotic pressure in the storage cell. 

Plants use osmotic pressure to achieve mechanical 
rigidity. The very high solute concentration in the plant 
cell vacuole draws water into the cell (Fig. 2-12), but 
the nonexpandable cell wall prevents swelling; instead, 
the pressure exerted against the cell wall (turgor pres- 
sure) increases, stiffening the cell, the tissue, and the 
plant body. When the lettuce in your salad wilts, it is be- 
cause loss of water has reduced turgor pressure. Osmo- 
sis also has consequences for laboratory protocols. 
Mitochondria, chloroplasts, and lysosomes, for exam- 
ple, are enclosed by semipermeable membranes. In iso- 
lating these organelles from broken cells, biochemists 
must perform the fractionations in isotonic solutions 
(see Fig. 1-8) to prevent excessive entry of water into 
the organelles and the swelling and bursting that would 
follow. Buffers used in cellular fractionations commonly 
contain sufficient concentrations of sucrose or some 
other inert solute to protect the organelles from os- 
motic lysis. 
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WORKED EXAMPLE 2-1 Osmotic Strength of an Organelle I 

Suppose the major solutes in intact lysosomes are KC1 (—0.1 m) and NaCl (—0.03 m). 
When isolating lysosomes, what concentration of sucrose is required in the extract- 
ing solution at room temperature (25 °C) to prevent swelling and lysis? 

Solution: We want to find a concentration of sucrose that gives an osmotic strength 
equal to that produced by the KC1 and NaCl in the lysosomes. The equation for cal- 
culating osmotic strength (the van't Hoff equation) is 

n = RTdtfi + i 2 c 2 + 13C3 + + i n c n ) 
where R is the gas constant 8.315 J/mol • K, T is the absolute temperature (Kelvin), 
Ci, c 2 , and c 3 are the molar concentrations of each solute, and i 1} i 2i and i 3 are the 
numbers of particles each solute yields in solution (i = 2 for KC1 and NaCl). 
The osmotic strength of the lysosomal contents is 

1 Mysosome 

- i?T(i K ClCKCl + ^NaClCNaCl) 

= i?r[(2)(0.03 mol/L) + (2) (0.1 mol/L)] = RT(0.26 mol/L) 

Because the solute concentrations are only accurate to one significant figure, this 
becomes n lysosome = RT(0.3 mol/L). 

The osmotic strength of a sucrose solution is given by 

f^sucrose -^-^(^sucrose ^sucrose) 

In this case, i sucr0 se = 1> because sucrose does not ionize. Thus, 

flsucrose ^^(^sucrose) 

The osmotic strength of the lysosomal contents equals that of the sucrose solution 
when 

Hsucrose fflysosome 
RT(c sucrose ) = ^(0.3 moVL) 
^sucrose = 0.3 Hiol/L 

So the required concentration of sucrose (FW 342) is (0.3 mol/L) (342 g/mol) = 102.6 
g/L. Or, when significant figures are taken into account, c sucrose = 0.1 kg/L. 

■ WORKED EXAMPLE 2-2 Osmotic Strength of an Organelle II 

Suppose we decided to use a solution of a polysaccharide, say glycogen (see p. 246), 
to balance the osmotic strength of the lysosomes (described in Worked Example 2-1). 
Assuming a linear polymer of 100 glucose units, calculate the amount of this polymer 
needed to achieve the same osmotic strength as the sucrose solution in Worked 
Example 2-1. The M r of the glucose polymer is —18,000, and, like sucrose, it does 
not ionize in solution. 

Solution: As derived in Worked Example 2-1, 

n sucrose = #r(0.3 moVL) 

Similarly, 

Uglycogen -^^(^glycogen ^glycogen) -^^(^glycogen) 

For a glycogen solution with the same osmotic strength as the sucrose solution, 

Uglycogen H sucrose 

#T(c g i ycogen ) = RT(0.S mol/L) 

Cgiycogen = 0.3 mol/L = (0.3mol/L)(18,000 g/mol) = 5.4 kg/L 

Or, when significant figures are taken into account, c glycogen = 5 kg/L, an absurdly 
high concentration. 

As we'll see later (p. 246), cells of liver and muscle store carbohydrate not as low 
molecular weight sugars such as glucose or sucrose but as the high molecular weight 
polymer glycogen. This allows the cell to contain a large mass of glycogen with a 
minimal effect on the osmolarity of the cytosol. 
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SUMMARY 2.1 Weak Interactions in 
Aqueous Systems 

■ The very different electronegativities of H and 0 
make water a highly polar molecule, capable of 
forming hydrogen bonds with itself and with solutes. 
Hydrogen bonds are fleeting, primarily electrostatic, 
and weaker than covalent bonds. Water is a good 
solvent for polar (hydrophilic) solutes, with which it 
forms hydrogen bonds, and for charged solutes, 
with which it interacts electrostatically. 

■ Nonpolar (hydrophobic) compounds dissolve 
poorly in water; they cannot hydrogen-bond with 
the solvent, and their presence forces an 
energetically unfavorable ordering of water 
molecules at their hydrophobic surfaces. To 
minimize the surface exposed to water, nonpolar 
compounds such as lipids form aggregates 
(micelles) in which the hydrophobic moieties are 
sequestered in the interior, associating through 
hydrophobic interactions, and only the more polar 
moieties interact with water. 

■ Weak, noncovalent interactions, in large numbers, 
decisively influence the folding of macromolecules 
such as proteins and nucleic acids. The most stable 
macromolecular conformations are those in which 
hydrogen bonding is maximized within the molecule 
and between the molecule and the solvent, and in 
which hydrophobic moieties cluster in the interior 
of the molecule away from the aqueous solvent. 

■ The physical properties of aqueous solutions are 
strongly influenced by the concentrations of 
solutes. When two aqueous compartments are 
separated by a semipermeable membrane (such as 
the plasma membrane separating a cell from its 
surroundings), water moves across that membrane 
to equalize the osmolarity in the two compart- 
ments. This tendency for water to move across a 
semipermeable membrane is the osmotic pressure. 

2.2 Ionization of Water, Weak Acids, and 
Weak Bases 

Although many of the solvent properties of water can be 
explained in terms of the uncharged H 2 0 molecule, the 
small degree of ionization of water to hydrogen ions 
(H + ) and hydroxide ions (OH") must also be taken into 
account. Like all reversible reactions, the ionization of 
water can be described by an equilibrium constant. 
When weak acids are dissolved in water, they contribute 
H + by ionizing; weak bases consume H + by becoming 
protonated. These processes are also governed by equi- 
librium constants. The total hydrogen ion concentration 
from all sources is experimentally measurable and is ex- 
pressed as the pH of the solution. To predict the state of 
ionization of solutes in water, we must take into account 
the relevant equilibrium constants for each ionization 



reaction. We therefore turn now to a brief discussion of 
the ionization of water and of weak acids and bases dis- 
solved in water. 

Pure Water Is Slightly Ionized 

Water molecules have a slight tendency to undergo re- 
versible ionization to yield a hydrogen ion (a proton) 
and a hydroxide ion, giving the equilibrium 



H 2 0 



H + + OH" 



(2-1) 



Although we commonly show the dissociation product 
of water as H + , free protons do not exist in solution; hy- 
drogen ions formed in water are immediately hydrated 
to hydronium ions (H 3 0 + ). Hydrogen bonding be- 
tween water molecules makes the hydration of dissoci- 
ating protons virtually instantaneous: 



H-0 1 1 H— O 



\ 



H- 



H 



H 



-O— H + OH" 

I 

H 



The ionization of water can be measured by its elec- 
trical conductivity; pure water carries electrical current 
as H 3 0 + migrates toward the cathode and OH~ toward 
the anode. The movement of hydronium and hydroxide 
ions in the electric field is extremely fast compared with 
that of other ions such as Na + , K + , and Cl~. This high 
ionic mobility results from the kind of "proton hopping" 
shown in Figure 2-13. No individual proton moves very 
far through the bulk solution, but a series of proton hops 



Hydronium ion gives up a proton 
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H 



\ 



Proton hop 
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H-C> 



-H 
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Water accepts proton and 
becomes a hydronium ion 

FIGURE 2-13 Proton hopping. Short "hops" of protons between a series 
of hydrogen-bonded water molecules result in an extremely rapid net 
movement of a proton over a long distance. As a hydronium ion (upper 
left) gives up a proton, a water molecule some distance away (lower right) 
acquires one, becoming a hydronium ion. Proton hopping is much faster 
than true diffusion and explains the remarkably high ionic mobility of H + 
ions compared with other monovalent cations such as Na + and K + . 
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between hydrogen-bonded water molecules causes the 
net movement of a proton over a long distance in a re- 
markably short time. As a result of the high ionic mo- 
bility of H + (and of OH~, which also moves rapidly by 
proton hopping, but in the opposite direction), acid- 
base reactions in aqueous solutions are exceptionally 
fast. As noted above, proton hopping very likely also 
plays a role in biological proton-transfer reactions 
(Fig. 2-10; see also Fig. 19-67). 

Because reversible ionization is crucial to the role of 
water in cellular function, we must have a means of ex- 
pressing the extent of ionization of water in quantitative 
terms. A brief review of some properties of reversible 
chemical reactions shows how this can be done. 

The position of equilibrium of any chemical reaction 
is given by its equilibrium constant, Jir eq (sometimes 
expressed simply as if). For the generalized reaction 



A + B 



C + D 



(2-2) 



an equilibrium constant can be defined in terms of the 
concentrations of reactants (A and B) and products (C 
and D) at equilibrium: 



[C] eq [D] eq 
[A] eq [B] eq 



Strictly speaking, the concentration terms should be the 
activities, or effective concentrations in nonideal solu- 
tions, of each species. Except in very accurate work, 
however, the equilibrium constant may be approximated 
by measuring the concentrations at equilibrium. For rea- 
sons beyond the scope of this discussion, equilibrium con- 
stants are dimensionless. Nonetheless, we have generally 
retained the concentration units (m) in the equilibrium ex- 
pressions used in this book to remind you that molarity is 
the unit of concentration used in calculating if eq . 

The equilibrium constant is fixed and characteristic 
for any given chemical reaction at a specified tempera- 
ture. It defines the composition of the final equilibrium 
mixture, regardless of the starting amounts of reactants 
and products. Conversely, we can calculate the equilib- 
rium constant for a given reaction at a given tempera- 
ture if the equilibrium concentrations of all its reactants 
and products are known. As we showed in Chapter 1 
(p. 24), the standard free-energy change (AG 0 ) is di- 
rectly related to ln^ eq . 

The Ionization of Water Is Expressed by 
an Equilibrium Constant 

The degree of ionization of water at equilibrium (Eqn 2-1) 
is small; at 25 °C only about two of every 10 9 molecules in 
pure water are ionized at any instant. The equilibrium 
constant for the reversible ionization of water is 



K eq - 



[H + ][OH-] 
[H 2 0] 



(2-3) 



In pure water at 25 °C, the concentration of water is 
55.5 m — grams of H 2 0 in 1 L divided by its gram molecular 



weight: (1,000 g/L)/(18.015 g/mol)— and is essentially 
constant in relation to the very low concentrations of H + 
and OH", namely, 1 X 10" 7 m. Accordingly, we can sub- 
stitute 55.5 M in the equilibrium constant expression 
(Eqn 2-3) to yield 

= [H + ][OH-] 
eq [55.5 M] 

On rearranging, this becomes 

(55.5 M)CK eq ) = [H + ] [OH - ] = X w (2-4) 

where K w designates the product (55.5 M)(X eq ), the ion 
product of water at 25 °C. 

The value for K m , determined by electrical- 
conductivity measurements of pure water, is 1.8 X 
10" 16 m at 25 °C. Substituting this value for K e(i in 
Equation 2-4 gives the value of the ion product of water: 

K w = [H + ][OH _ ] = (55.5M)(1.8 x HT 16 m) 



Thus the product [H + ][OH~] in aqueous solutions at 
25 °C always equals 1 X 10~ 14 m 2 . When there are ex- 
actly equal concentrations of H + and OH~, as in pure 
water, the solution is said to be at neutral pH. At this 
pH, the concentration of H + and OH~ can be calculated 
from the ion product of water as follows: 

K w = [H + ][Oir] = [H + ] 2 = [OH"] 2 



Solving for [H + ] gives 



[H + ] = Vtf w = Vl x KT 14 m 2 
[H + ] = [OH - ] = 10" 7 M 

As the ion product of water is constant, whenever [H + ] 
is greater than 1 X 10~ 7 m, [OH~] must be less than 1 X 
10~ 7 m, and vice versa. When [H + ] is very high, as in a 
solution of hydrochloric acid, [OH~] must be very low. 
From the ion product of water we can calculate [H + ] if 
we know [OH~], and vice versa. 



Hi WORKED EXAMPLE 2-3 Calculation of [H + ] 

What is the concentration of H + in a solution of 0.1 m 
NaOH? 

Solution: We begin with the equation for the ion product 
of water: 

# w = [H + ][om 

With [OH"] = 0.1 M, solving for [H + ] gives 



[H + ] = 



lxlO^W 1(T 14 m* 



[OH~] 

10" 13 M 



0.1 M 



10 -1 M 



^56^ Water 



WORKED EXAMPLE 2-4 Calculation of [OH"] 

What is the concentration of OH~ in a solution with an 
H + concentration of 1.3 X 10~ 4 m? 

Solution: We begin with the equation for the ion product 
of water: 

K w = [H + ][OH"] 
With [H + ] = 1.3 X 10" 4 m, solving for [OH - ] gives 
K w 



[OH] = 



[H + ] 0.00013 M 
7.7 x 10" n M 



1.3 x 10" 4 m 



In all calculations be sure to round your answer to the 
correct number of significant figures, as here. 



The pH Scale Designates the H + and OH Concentrations 

The ion product of water, K w , is the basis for the pH 
scale (Table 2-6). It is a convenient means of designat- 
ing the concentration of H + (and thus of OH") in any 
aqueous solution in the range between 1.0 M H + and 
1.0 M OH~. The term pH is denned by the expression 



pH = log 



[H + ] 



-log [H + ] 



The symbol p denotes "negative logarithm of." For 
a precisely neutral solution at 25 °C, in which the 
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FIGURE 2-14 The pH of some aqueous fluids. 



concentration of hydrogen ions is 1.0 X 10 7 m, the pH 
can be calculated as follows: 



1 TABLE 2-6 


The pH Scale 






[H + ] (M) 


PH 


[OH"] ( M ) 


pOH* 


10° (1) 
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10 -14 


14 


10" 1 
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IO" 13 


13 


io- 2 
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io- 12 


12 


1(T 3 
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10" 11 


11 


icr 4 
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10" 10 


10 


10" 5 
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io- 9 
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10" 6 


6 


icr 8 
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icr 7 
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1(T 7 
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10" 8 
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10" 6 
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IO" 9 
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10" 5 
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1Q -io 


10 


10" 4 
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10" 11 


11 


10" 3 


3 


IO" 12 


12 


icr 2 


2 


IO" 13 


13 


10" 1 


1 


1Q -14 


14 


10° (1) 


0 



pH = log 



1.0 X 10" 



= 7.0 



*The expression pOH is sometimes used to describe the basicity, or OH" concentra- 
tion, of a solution; pOH is defined by the expression pOH = -log [0H"] f which is 
analogous to the expression for pH. Note that in all cases, pH + pOH = 14. 



Note that the concentration of H + must be expressed in 
molar (m) terms. 

The value of 7 for the pH of a precisely neutral solu- 
tion is not an arbitrarily chosen figure; it is derived from 
the absolute value of the ion product of water at 25 °C, 
which by convenient coincidence is a round number. So- 
lutions having a pH greater than 7 are alkaline or basic; 
the concentration of OH~ is greater than that of H + . 
Conversely, solutions having a pH less than 7 are acidic. 

Keep in mind that the pH scale is logarithmic , not 
arithmetic. To say that two solutions differ in pH by 1 pH 
unit means that one solution has ten times the H + con- 
centration of the other, but it does not tell us the absolute 
magnitude of the difference. Figure 2-14 gives the pH 
values of some common aqueous fluids. A cola drink (pH 
3.0) or red wine (pH 3.7) has an H + concentration ap- 
proximately 10,000 times that of blood (pH 7.4). 

The pH of an aqueous solution can be approxi- 
mately measured with various indicator dyes, including 
litmus, phenolphthalein, and phenol red, which undergo 
color changes as a proton dissociates from the dye mol- 
ecule. Accurate determinations of pH in the chemical or 
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clinical laboratory are made with a glass electrode that is 
selectively sensitive to H + concentration but insensitive 
to Na + , K + , and other cations. In a pH meter, the signal 
from the glass electrode placed in a test solution is am- 
plified and compared with the signal generated by a so- 
lution of accurately known pH. 

I Measurement of pH is one of the most important 
Bfl and frequently used procedures in biochemistry. 
The pH affects the structure and activity of biological 
macromolecules; for example, the catalytic activity of 
enzymes is strongly dependent on pH (see Fig. 2-21). 
Measurements of the pH of blood and urine are com- 
monly used in medical diagnoses. The pH of the blood 
plasma of people with severe, uncontrolled diabetes, for 
example, is often below the normal value of 7.4; this 
condition is called acidosis (described in more detail 
below). In certain other diseases the pH of the blood is 
higher than normal, a condition known as alkalosis. 
Extreme acidosis or alkalosis can be life-threatening. ■ 

Weak Acids and Bases Have Characteristic Acid 
Dissociation Constants 

Hydrochloric, sulfuric, and nitric acids, commonly called 
strong acids, are completely ionized in dilute aqueous 
solutions; the strong bases NaOH and KOH are also 



completely ionized. Of more interest to biochemists is 
the behavior of weak acids and bases — those not com- 
pletely ionized when dissolved in water. These are ubiq- 
uitous in biological systems and play important roles in 
metabolism and its regulation. The behavior of aqueous 
solutions of weak acids and bases is best understood if 
we first define some terms. 

Acids may be defined as proton donors and bases as 
proton acceptors. A proton donor and its corresponding 
proton acceptor make up a conjugate acid-base pair 
(Fig. 2-15). Acetic acid (CH 3 COOH), a proton donor, 
and the acetate anion (CH 3 COO"), the corresponding 
proton acceptor, constitute a conjugate acid-base pair, 
related by the reversible reaction 

CH3COOH CH3COO- + H + 

Each acid has a characteristic tendency to lose its 
proton in an aqueous solution. The stronger the acid, 
the greater its tendency to lose its proton. The tendency 
of any acid (HA) to lose a proton and form its conjugate 
base (A~) is defined by the equilibrium constant (K eq ) 
for the reversible reaction 

HA ^ H + + A" 

for which 



Monoprotic acids 

Acetic acid 

(K a = 1.74 x 10-5 M ) 



Ammonium ion 

«a = 5.62 X IO-^m) 

Diprotic acids 

Carbonic acid 

(K a = 1.70 X 10~ 4 m); 

Bicarbonate 

(1^ = 6.31 X 10- n M) 

Glycine, carboxyl 
(i£a = 4.57 X 10-3 m); 
Glycine, amino 
(K a = 2.51 X IO-^m) 



Triprotic acids 

Phosphoric acid 
(K a = 1.25 X 10" 3 m); 
Dihydrogen phosphate 
(K a = 1.38 x 10-7 m); 
Monohydrogen phosphate 
(tf a = 3.98 x 10" 13 m) 
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FIGURE 2-15 Conjugate acid-base pairs consist of a proton donor and a 

proton acceptor. Some compounds, such as acetic acid and ammonium 
ion, are monoprotic; they can give up only one proton. Others are diprotic 
(carbonic acid and glycine) or triprotic (phosphoric acid). The dissociation 



reactions for each pair are shown where they occur along a pH gradient. 
The equilibrium or dissociation constant (K a ) and its negative logarithm, the 
p/( a , are shown for each reaction. *For an explanation of apparent discrep- 
ancies in pK a values for carbonic acid (H 2 C0 3 ), see p. 63. 
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Equilibrium constants for ionization reactions are usually 
called ionization constants or acid dissociation con- 
stants, often designated K a . The dissociation constants 
of some acids are given in Figure 2-15. Stronger acids, 
such as phosphoric and carbonic acids, have larger ion- 
ization constants; weaker acids, such as monohydrogen 
phosphate (HPOf ~), have smaller ionization constants. 

Also included in Figure 2-15 are values of p^T a , 
which is analogous to pH and is defined by the equation 



pK a = log 



-logtf a 



The stronger the tendency to dissociate a proton, the 
stronger is the acid and the lower its pK a . As we shall 
now see, the pK a of any weak acid can be determined 
quite easily. 

Titration Curves Reveal the p/f a of Weak Acids 

Titration is used to determine the amount of an acid in a 
given solution. A measured volume of the acid is titrated 
with a solution of a strong base, usually sodium hydrox- 
ide (NaOH), of known concentration. The NaOH is 
added in small increments until the acid is consumed 
(neutralized), as determined with an indicator dye or a 
pH meter. The concentration of the acid in the original 
solution can be calculated from the volume and concen- 
tration of NaOH added. 

A plot of pH against the amount of NaOH added (a 
titration curve) reveals the pK a of the weak acid. Con- 
sider the titration of a 0.1 m solution of acetic acid with 
0.1 m NaOH at 25 °C (Fig, 2-16). Two reversible equi- 
libria are involved in the process (here, for simplicity, 
acetic acid is denoted HAc): 



H 2 0 ; 
HAc 



H + + OH" 
• H + + Ac" 



(2-5) 
(2-6) 



The equilibria must simultaneously conform to their char- 
acteristic equilibrium constants, which are, respectively, 



[H + ][OH"] 



[H + ][Ac~] 
[HAc] 



1.74 x 10 5 M 



(2-7) 



(2-8) 



At the beginning of the titration, before any NaOH is 
added, the acetic acid is already slightly ionized, to an 
extent that can be calculated from its ionization con- 
stant (Eqn 2-8). 

As NaOH is gradually introduced, the added OH~ 
combines with the free H + in the solution to form H 2 0, 
to an extent that satisfies the equilibrium relationship in 
Equation 2-7. As free H + is removed, HAc dissociates 
further to satisfy its own equilibrium constant (Eqn 2-8). 
The net result as the titration proceeds is that more and 
more HAc ionizes, forming Ac~, as the NaOH is 
added. At the midpoint of the titration, at which exactly 
0.5 equivalent of NaOH has been added, one-half of the 
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FIGURE 2-16 The titration curve of acetic acid. After addition of each in- 
crement of NaOH to the acetic acid solution, the pH of the mixture is 
measured This value is plotted against the amount of NaOH added, ex- 
pressed as a fraction of the total NaOH required to convert all the acetic 
acid (CH3COOH) to its deprotonated form, acetate (CH 3 COO"). The 
points so obtained yield the titration curve. Shown in the boxes are the 
predominant ionic forms at the points designated. At the midpoint of 
the titration, the concentrations of the proton donor and proton accep- 
tor are equal, and the pH is numerically equal to the p/C a . The shaded 
zone is the useful region of buffering power, generally between 10% 
and 90% titration of the weak acid. 

original acetic acid has undergone dissociation, so that 
the concentration of the proton donor, [HAc], now 
equals that of the proton acceptor, [Ac - ]. At this mid- 
point a very important relationship holds: the pH of the 
equimolar solution of acetic acid and acetate is exactly 
equal to the pK a of acetic acid (piT a = 4.76; Figs 2-15, 
2-16). The basis for this relationship, which holds for all 
weak acids, will soon become clear. 

As the titration is continued by adding further in- 
crements of NaOH, the remaining nondissociated acetic 
acid is gradually converted into acetate. The end point 
of the titration occurs at about pH 7.0: all the acetic acid 
has lost its protons to OH~, to form H 2 0 and acetate. 
Throughout the titration the two equilibria (Eqns 2-5, 
2-6) coexist, each always conforming to its equilibrium 
constant. 

Figure 2-17 compares the titration curves of three 
weak acids with very different ionization constants: 
acetic acid (pK a = 4.76); dihydrogen phosphate, H2PO4 
(pK a = 6.86); and ammonium ion, NH4 (pK a = 9.25). 
Although the titration curves of these acids have the 
same shape, they are displaced along the pH axis be- 
cause the three acids have different strengths. Acetic 
acid, with the highest K a (lowest pK a ) of the three, is the 
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FIGURE 2-17 Comparison of the titration curves of three weak acids. 

Shown here are the titration curves forCH 3 COOH, H 2 PC>4, and NH4 . 
The predominant ionic forms at designated points in the titration are 
given in boxes. The regions of buffering capacity are indicated at the 
right. Conjugate acid-base pairs are effective buffers between approxi- 
mately 10% and 90% neutralization of the proton-donor species. 



strongest of the three weak acids (loses its proton most 
readily); it is already half dissociated at pH 4.76. Dihy- 
drogen phosphate loses a proton less readily, being half 
dissociated at pH 6.86. Ammonium ion is the weakest 
acid of the three and does not become half dissociated 
until pH 9.25. 

The titration curve of a weak acid shows graphically 
that a weak acid and its anion — a conjugate acid-base 
pair — can act as a buffer, which we describe in the next 
section. 



SUMMARY 2.2 



Ionization of Water, Weak 
Acids, and Weak Bases 



Pure water ionizes slightly, forming equal numbers 
of hydrogen ions (hydronium ions, H 3 0 + ) and 
hydroxide ions. The extent of ionization is described 

[H + ][OH"] 

by an equilibrium constant, iT eq = — — , 

LH 2 OJ 

from which the ion product of water, K w , is derived. 
At 25 °C, K w = [H + ][OH~] - (55.5 M)(K eQ ) = 
10" 14 m 2 . 



The pH of an aqueous solution reflects, on a 
logarithmic scale, the concentration of hydrogen ions: 

pH = log-^-= -log[H + ]. 
LH J 

The greater the acidity of a solution, the lower its 
pH. Weak acids partially ionize to release a hydro- 
gen ion, thus lowering the pH of the aqueous solu- 
tion. Weak bases accept a hydrogen ion, increasing 
the pH. The extent of these processes is character- 
istic of each particular weak acid or base and is 
expressed as an acid dissociation constant: 

[H + ][A" 



^eq - 



[HA] 



m The pK a expresses, on a logarithmic scale, the 
relative strength of a weak acid or base: 

pKa = log-^- = -\ogK a . 

The stronger the acid, the lower its pi*T a ; the 
stronger the base, the higher its pK a . The pK a can 
be determined experimentally; it is the pH at the 
midpoint of the titration curve for the acid or base. 

2.3 Buffering against pH Changes in 
Biological Systems 

Almost every biological process is pH-dependent; a small 
change in pH produces a large change in the rate of the 
process. This is true not only for the many reactions in 
which the H + ion is a direct participant, but also for 
those in which there is no apparent role for H + ions. The 
enzymes that catalyze cellular reactions, and many of 
the molecules on which they act, contain ionizable 
groups with characteristic pK a values. The protonated 
amino and carboxyl groups of amino acids and the phos- 
phate groups of nucleotides, for example, function as 
weak acids; their ionic state is determined by the pH of 
the surrounding medium. (When an ionizable group is 
sequestered in the middle of a protein, away from the 
aqueous solvent, its pK a , or apparent piT a , can be signif- 
icantly different from its p7T a in water.) As we noted 
above, ionic interactions are among the forces that sta- 
bilize a protein molecule and allow an enzyme to recog- 
nize and bind its substrate. 

Cells and organisms maintain a specific and constant 
cytosolic pH, usually near pH 7, keeping biomolecules in 
their optimal ionic state. In multicellular organisms, the 
pH of extracellular fluids is also tightly regulated. Con- 
stancy of pH is achieved primarily by biological buffers: 
mixtures of weak acids and their conjugate bases. 

Buffers Are Mixtures of Weak Acids and Their 
Conjugate Bases 

Buffers are aqueous systems that tend to resist 
changes in pH when small amounts of acid (H + ) or base 
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[H + ][Ac-] 
[HAc] 



FIGURE 2-18 The acetic acid-acetate pair as a buffer system. The sys- 
tem is capable of absorbing either H + or OH" through the reversibil- 
ity of the dissociation of acetic acid. The proton donor, acetic acid 
(HAc), contains a reserve of bound H + , which can be released to neu- 
tralize an addition of OH~ to the system, forming H 2 0. This happens 
because the product [H + ][OhT] transiently exceeds K w (1 x 10~ 14 m 2 ). 
The equilibrium quickly adjusts to restore the product to 1 X 10~ 14 m 2 
{at 25 °C), thus transiently reducing the concentration of H + . But now 
the quotient [H + ] [Ac"]/[HAc] is less than K a , so HAc dissociates 
further to restore equilibrium. Similarly, the conjugate base, Ac", can 
react with H + ions added to the system; again, the two ionization re- 
actions simultaneously come to equilibrium. Thus a conjugate acid- 
base pair, such as acetic acid and acetate ion, tends to resist a change 
in pH when small amounts of acid or base are added. Buffering action 
is simply the consequence of two reversible reactions taking place si- 
multaneously and reaching their points of equilibrium as governed by 
their equilibrium constants, K w and K a . 



(OH~) are added. A buffer system consists of a weak 
acid (the proton donor) and its conjugate base (the pro- 
ton acceptor). As an example, a mixture of equal con- 
centrations of acetic acid and acetate ion, found at the 
midpoint of the titration curve in Figure 2-16, is a buffer 
system. Notice that the titration curve of acetic acid has 
a relatively flat zone extending about 1 pH unit on either 
side of its midpoint pH of 4.76. In this zone, a given 
amount of H + or OH" added to the system has much 
less effect on pH than the same amount added outside 
the zone. This relatively flat zone is the buffering re- 
gion of the acetic acid-acetate buffer pair. At the mid- 
point of the buffering region, where the concentration of 
the proton donor (acetic acid) exactly equals that of the 
proton acceptor (acetate), the buffering power of the 
system is maximal; that is, its pH changes least on addi- 
tion of H + or OH". The pH at this point in the titration 
curve of acetic acid is equal to its pJT a . The pH of the ac- 
etate buffer system does change slightly when a small 
amount of H + or OH" is added, but this change is very 
small compared with the pH change that would result if 
the same amount of H + or OH" were added to pure wa- 
ter or to a solution of the salt of a strong acid and strong 
base, such as NaCl, which has no buffering power. 

Buffering results from two reversible reaction equi- 
libria occurring in a solution of nearly equal concentra- 



tions of a proton donor and its conjugate proton accep- 
tor. Figure 2-18 explains how a buffer system works. 
Whenever H + or OH" is added to a buffer, the result is a 
small change in the ratio of the relative concentrations 
of the weak acid and its anion and thus a small change in 
pH. The decrease in concentration of one component of 
the system is balanced exactly by an increase in the 
other. The sum of the buffer components does not 
change, only their ratio. 

Each conjugate acid-base pair has a characteristic 
pH zone in which it is an effective buffer (Fig. 2-17). 
The H2PO4 /HPOf ~ pair has a pK a of 6.86 and thus can 
serve as an effective buffer system between approxi- 
mately pH 5.9 and pH 7.9; the NH4 /NH 3 pair, with a pK a 
of 9.25, can act as a buffer between approximately pH 
8.3 and pH 10.3. 

The Henderson-Hasselbalch Equation Relates pH,ptf a , 
and Buffer Concentration 

The titration curves of acetic acid, H2PO4 , and NH4 (Fig. 
2-17) have nearly identical shapes, suggesting that these 
curves reflect a fundamental law or relationship. This is 
indeed the case. The shape of the titration curve of any 
weak acid is described by the Henderson-Hasselbalch 
equation, which is important for understanding buffer 
action and acid-base balance in the blood and tissues of 
vertebrates. This equation is simply a useful way of re- 
stating the expression for the ionization constant of an 
acid. For the ionization of a weak acid HA, the Hender- 
son-Hasselbalch equation can be derived as follows: 



[H + ][A"] 
[HA] 



First solve for [H~* 



[H + ] - K, 



[HA] 
[A"] 



Then take the negative logarithm of both sides: 



-log [H + ] = -logtf a -log 



[HA] 
[A"] 



Substitute pH for -log [H + ] and pK a for -log K a : 



pH = pK a - log 



[HA] 
[A"] 



Now invert -log [HA]/[A ], which involves changing its 
sign, to obtain the Henderson-Hasselbalch equation: 



pH = pK a + log 



[A~] 
[HA] 



(2-9) 



This equation fits the titration curve of all weak acids 
and enables us to deduce some important quantitative 
relationships. For example, it shows why the pK a of a 
weak acid is equal to the pH of the solution at the mid- 
point of its titration. At that point, [HA] = [A~], and 

pH = pK a + log 1 = p# a + 0 = pK a 
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The Henderson-Hasselbalch equation also allows us to 
(1) calculate piT a , given pH and the molar ratio of proton 
donor and acceptor; (2) calculate pH, given p# a and the 
molar ratio of proton donor and acceptor; and (3) calcu- 
late the molar ratio of proton donor and acceptor, given 
pH and pif a . 

Weak Acids or Bases Buffer Cells and Tissues 
against pH Changes 

The intracellular and extracellular fluids of multicellular 
organisms have a characteristic and nearly constant pH. 
The organism's first line of defense against changes in 
internal pH is provided by buffer systems. The cyto- 
plasm of most cells contains high concentrations of pro- 
teins, which contain many amino acids with functional 
groups that are weak acids or weak bases. For example, 
the side chain of histidine (Fig. 2-19) has a pif a of 6.0; 
proteins containing histidine residues therefore buffer 
effectively near neutral pH, and histidine side chains ex- 
ist in either the protonated or unprotonated form near 
neutral pH. 



WORKED EXAMPLE 2-5 Ionization of Histidine 

Calculate the fraction of histidine that has its imidazole 
side chain protonated at pH 7.3. The piT a values for 
histidine are pK x = 1.82, pK 2 (imidazole) = 6.00, and 
pK 3 = 9.17 (see Fig. 3-12b). 

Solution: The three ionizable groups in histidine have 
sufficiently different pK a values that the first acid 
( — COOH) is completely ionized before the second 
(protonated imidazole) begins to dissociate a proton, and 
the second ionizes completely before the third ( — NH3 ) 
begins to dissociate its proton. (With the Henderson- 
Hasselbalch equation, we can easily show that a weak 
acid goes from 1% ionized at 2 pH units below its pK a to 
99% ionized at 2 pH units above its piT a ; see also 
Fig. 3-12b.) At pH 7.3, the carboxyl group of histidine is 
entirely deprotonated ( — COO~) and the a-amino 
group is fully protonated ( — NHj). We can therefore as- 
sume that at pH 7.3, the only group that is partially dis- 
sociated is the imidazole group, which can be 
protonated (we'll abbreviate as HisH + ) or not (His). 




FIGURE 2-19 Ionization of histidine. The amino acid histidine, a com- 
ponent of proteins, is a weak acid. The pK a of the protonated nitrogen 
of the side chain is 6.0. 



We use the Henderson-Hasselbalch equation: 

[A"] 



pH = pK a + log 



[HA] 



Substituting pK 2 = 6.0 and pH = 7.3: 

[His] 



7.3 = 6.0 + log 



[HisH + ] 



io i [His] 
1.3 = log — 

[HisH ] 

antilog 1.3 = [HlS l, = 2.0 X 10 1 
[HisH ] 

So the fraction of total histidine that is in the protonated 
form HisH + at pH 7.3 is 1/21 (1 part HisH + in a total of 
21 parts histidine in either form), or about 4.8%. 



Nucleotides such as ATP, as well as many low mo- 
lecular weight metabolites, contain ionizable groups 
that can contribute buffering power to the cytoplasm. 
Some highly specialized organelles and extracellular 
compartments have high concentrations of compounds 
that contribute buffering capacity: organic acids buffer 
the vacuoles of plant cells; ammonia buffers urine. 

Two especially important biological buffers are the 
phosphate and bicarbonate systems. The phosphate 
buffer system, which acts in the cytoplasm of all 
cells, consists of H2PO4 as proton donor and HPOf ~ as 
proton acceptor: 

H 2 P04 H + + HPOl" 

The phosphate buffer system is maximally effective at a 
pH close to its pK a of 6.86 (Figs 2-15, 2-17) and thus 
tends to resist pH changes in the range between about 
5.9 and 7.9. It is therefore an effective buffer in biologi- 
cal fluids; in mammals, for example, extracellular fluids 
and most cytoplasmic compartments have a pH in the 
range of 6.9 to 7.4 (see Worked Example 2-6). 

Blood plasma is buffered in part by the bicarbonate 
system, consisting of carbonic acid (H 2 C0 3 ) as proton 
donor and bicarbonate (HCO3) as proton acceptor (Ki 
is the first of several equilibrium constants in the bicar- 
bonate buffering system): 



H 2 C0 3 



=± H + + HCO3 

[H + ][HCQ3] 
[H 2 C0 3 1 



This buffer system is more complex than other conju- 
gate acid-base pairs because one of its components, car- 
bonic acid (H 2 C0 3 ) ; is formed from dissolved (d) carbon 
dioxide and water, in a reversible reaction: 

C0 2 (d) + H 2 0 H 2 C0 3 

[H 2 C0 3 ] 



K 2 



[C0 2 (d)][H 2 0] 



Water 



mm WORKED EXAMPLE 2-6 Phosphate Buffers 

(a) What is the pH of a mixture of 0.042 m NaH 2 P0 4 and 0.058 m Na 2 HP0 4 ? 
Solution: We use the Henderson-Hasselbalch equation, which we'll express here as 

[conjugate base] 



pH = pK a + log - 



[acid] 



In this case, the acid (the species that gives up a proton) is H2PO4 , and the conju- 
gate base (the species that loses a proton) is HPOf ~. Substituting the given concen- 
trations of acid and conjugate base and the pK a (6.86), 

pH = 6.86 + log^jjH) = 6.86 + log 1.38 = 6.86 + 0.14 = 7.0 

We can roughly check this answer. When more conjugate base than acid is 
present, the acid is more than 50% titrated and thus the pH is above the pK a (6.86), 
where the acid is exactly 50% titrated. 

(b) If 1.0 mL of 10.0 N NaOH is added to a liter of the buffer prepared in (a), how 
much will the pH change? 

Solution: A liter of the buffer contains 0.042 mol of NaH 2 P0 4 . Adding 1.0 mL of 10.0 N 
NaOH (0.010 mol) would titrate an equivalent amount (0.010 mol) of NaH 2 P0 4 to 
Na 2 HP0 4 , resulting in 0.032 mol of NaH 2 P0 4 and 0.068 mol of Na 2 HP0 4 . The new pH is 

IT JT 4-1 [HPQ '" ] 
PH = P ** + l0g [i^ 

0.068 

= 6.86 + log^-^ = 6.86 + 0.33 = 7.2 

(c) If 1.0 mL of 10.0 N NaOH is added to a liter of pure water at pH 7.0, what is the 
final pH? Compare this with the answer in (b). 

Solution: The NaOH dissociates completely into Na + and OH", giving [OH - ] = 0.010 
mol/L = 1.0 X 10" 2 m. The pOH is the negative logarithm of [OH"], so pOH = 2.0. 
Given that in all solutions, pH + pOH = 14, the pH of the solution is 12. 

So, an amount of NaOH that increases the pH of water from 7 to 12 increases the 
pH of a buffered solution, as in (b), from 7.0 to just 7.2. Such is the power of buffering! 



Carbon dioxide is a gas under normal conditions, and 
the concentration of dissolved C0 2 is the result of equi- 
libration with C0 2 of the gas (g) phase: 



C0 2 (g) 



C0 2 (d) 



[CO a (d)] 
[C0 2 (g)] 



The pH of a bicarbonate buffer system depends on the con- 
centration of H 2 C0 3 and HCO3 , the proton donor and ac- 
ceptor components. The concentration of H 2 C0 3 in turn 
depends on the concentration of dissolved C0 2j which in 
turn depends on the concentration of C0 2 in the gas phase, 
or the partial pressure of C0 2 , denoted pC0 2 . Thus the 
pH of a bicarbonate buffer exposed to a gas phase is ulti- 
mately determined by the concentration of HCO3 in the 
aqueous phase and by pC0 2 in the gas phase. 

The bicarbonate buffer system is an effective 
physiological buffer near pH 7.4, because the 
H 2 C0 3 of blood plasma is in equilibrium with a large 
reserve capacity of C0 2 (g) in the air space of the lungs. 
As noted above, this buffer system involves three re- 



versible equilibria, in this case between gaseous C0 2 in 
the lungs and bicarbonate (HCO3) in the blood plasma 

(Fig. 2-20). 



H + + HCO; 



Aqueous phase 
(blood in capillaries) 



H 2 0 - 



reaction 1 



reaction 2 

->H 2 0 



CCUd) 



Gas phase 
(lung air space) 



reaction 3 



C0 2 (g) 



FIGURE 2-20 The bicarbonate buffer system. C0 2 in the air space of 
the lungs is in equilibrium with the bicarbonate buffer in the blood 
plasma passing through the lung capillaries. Because the concentra- 
tion of dissolved C0 2 can be adjusted rapidly through changes in the 
rate of breathing, the bicarbonate buffer system of the blood is in near- 
equilibrium with a large potential reservoir of C0 2 . 
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When H + (from the lactic acid produced in muscle 
tissue during vigorous exercise, for example) is added 
to blood as it passes through the tissues, reaction 1 in 
Figure 2-20 proceeds toward a new equilibrium, in 
which [H 2 C0 3 ] is increased. This in turn increases 
[C0 2 (d)] in the blood (reaction 2) and thus increases 
the partial pressure of C0 2 (g) in the air space of the 
lungs (reaction 3); the extra C0 2 is exhaled. Con- 
versely, when the pH of blood is raised (by the NH 3 
produced during protein catabolism, for example), the 
opposite events occur: [H + ] of blood plasma is lowered, 
causing more H 2 C0 3 to dissociate into H + and HC0 3 
and thus more C0 2 (g) from the lungs to dissolve in 
blood plasma. The rate of respiration, or breathing — 
that is, the rate of inhaling and exhaling — can quickly 
adjust these equilibria to keep the blood pH nearly 
constant. The rate of respiration is controlled by the 
brain stem, where detection of an increased blood 
pC0 2 or decreased blood pH triggers deeper and more 
frequent breathing. 

At the pH of blood plasma (7.4) very little H 2 C0 3 is 
present in comparison with HC0 3 , and the addition of a 
small amount of base (NH 3 or OH~) would titrate this 
H 2 C0 3 , exhausting the buffering capacity. The impor- 
tant role for carbonic acid (pif a = 3.57 at 37 °C) in 
buffering blood plasma (~pH 7.4) seems inconsistent 
with our earlier statement that a buffer is most effective 
in the range of 1 pH unit above and below its pK a . The 
explanation for this paradox is the large reservoir of 
C0 2 (d) in blood and its rapid equilibration with H 2 C0 3 : 

C0 2 (d) + H 2 0 H 2 C0 3 

We can define a constant, K h) which is the equilibrium 
constant for the hydration of C0 2 : 



[H 2 CQ 3 ] 
[C0 2 (d)] 



Then, to take the C0 2 (d) reservoir into account, we can 
express [H 2 C0 3 ] as if h [C0 2 (d)] 5 and substitute this ex- 
pression for [H 2 C0 3 ] in the equation for the acid dissoci- 
ation of H 2 C0 3 : 



[H+HHCOg] [H + ][HC0 3 ] 



[H 2 C0 3 



# h [C0 2 (d)] 



Now, the overall equilibrium for dissociation of H 2 C0 3 
can be expressed in these terms: 



"combined 



[H + ][HCQ3] 
[C0 2 (d)] 



We can calculate the value of the new constant, ^combined, 
and the corresponding apparent pif, or piT combined , 
from the experimentally determined values of K h (3.0 X 
1(T 3 m) andiT a (2.7 X 10" 4 m) at 37 °C: 

^combined = (3-0 X lO" 3 M) (2.7 X 10" 4 M) 



In clinical medicine, it is common to refer to C0 2 (d) 
as the conjugate acid and to use the apparent, or com- 
bined, pK a of 6.1 to simplify calculation of pH from 
[C0 2 (d)]. In this convention, 



pH = 6.1 + 



[HCOi 



(0.23 x P C0 2 ) 



where pC0 2 is expressed in kilopascals (kPa; typically, 
pC0 2 is 4.6 to 6.7 kPa) and 0.23 is the corresponding 
solubility coefficient for C0 2 in water; thus the term 
0.23 X pC0 2 - 1.2 kPa. Plasma [HC0 3 ] is normally 
about 24 mM. ■ 

Untreated Diabetes Produces Life-Threatening 
Acidosis 

Human blood plasma normally has a pH between 
7.35 and 7.45, and many of the enzymes that 
function in the blood have evolved to have maximal ac- 
tivity in that pH range. Although many aspects of cell 
structure and function are influenced by pH, the 
catalytic activity of enzymes is especially sensitive. 
Enzymes typically show maximal catalytic activity at a 
characteristic pH, called the pH optimum (Fig. 2-21). 
On either side of this optimum pH, catalytic activity of- 
ten declines sharply. Thus, a small change in pH can 
make a large difference in the rate of some crucial 
enzyme-catalyzed reactions. Biological control of the pH 
of cells and body fluids is therefore of central impor- 
tance in all aspects of metabolism and cellular activities, 
and changes in blood pH have marked physiological con- 
sequences (described with gusto in Box 2-1!). 

In individuals with untreated diabetes, the lack of 
insulin, or insensitivity to insulin (depending on the type 




pK C( 



6.1 



9 10 



FIGURE 2-21 The pH optima of some enzymes. Pepsin is a digestive 
enzyme secreted into gastric juice, which has a pH of -1.5, allowing 
pepsin to act optimally. Trypsin, a digestive enzyme that acts in the 
small intestine, has a pH optimum that matches the neutral pH in the 
lumen of the small intestine. Alkaline phosphatase of bone tissue is a 
hydrolytic enzyme thought to aid in bone mineralization. 
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BOX 2-1 



MEDICINE 



On Being One's Own Rabbit (Don't Try This at Home!) 



This is an account by J.B.S. Haldane of physiological 
experiments on controlling blood pH, from his book Pos- 
sible Worlds (Harper and Brothers, 1928). 

"I wanted to find out what happened to a man when 
one made him more acid or more alkaline . . . One might, of 
course, have tried experiments on a rabbit first, and some 
work had been done along these lines; but it is difficult to 
be sure how a rabbit feels at any time. Indeed, some rab- 
bits make no serious attempt to cooperate with one. 

"... A human colleague and I therefore began ex- 
periments on one another ... My colleague Dr. H.W. 
Davies and I made ourselves alkaline by over-breathing 
and by eating anything up to three ounces of bicarbon- 
ate of soda. We made ourselves acid by sitting in an air- 
tight room with between six and seven per cent of 
carbon dioxide in the air. This makes one breathe as if 
one had just completed a boat-race, and also gives one a 
rather violent headache . . . Two hours was as long as any 
one wanted to stay in the carbon dioxide, even if the gas 
chamber at our disposal had not retained an ineradica- 
ble odour of 'yellow cross gas' from some wartime ex- 
periments, which made one weep gently every time one 
entered it. The most obvious thing to try was drinking 
hydrochloric acid. If one takes it strong it dissolves one's 
teeth and burns one's throat, whereas I wanted to let it 
diffuse gently all through my body. The strongest I ever 
cared to drink was about one part of the commercial 
strong acid in a hundred of water, but a pint of that was 
enough for me, as it irritated my throat and stomach, 
while my calculations showed that I needed a gallon and 
a half to get the effect I wanted ... I argued that if one 
ate ammonium chloride, it would partly break up in the 
body, liberating hydrochloric acid. This proved to be 
correct . . . the liver turns ammonia into a harmless sub- 
stance called urea before it reaches the heart and brain 
on absorption from the gut. The hydrochloric acid is left 



behind and combines with sodium bicarbonate, which 
exists in all the tissues, producing sodium chloride and 
carbon dioxide. I have had this gas produced in me in 
this way at the rate of six quarts an hour (though not for 
an hour on end at that rate) . . . 

"I was quite satisfied to have reproduced in myself 
the type of shortness of breath which occurs in the ter- 
minal stages of kidney disease and diabetes. This had 
long been known to be due to acid poisoning, but in each 
case the acid poisoning is complicated by other chemi- 
cal abnormalities, and it had been rather uncertain 
which of the symptoms were due to the acid as such. 

"The scene now shifts to Heidelberg, where 
Freudenberg and Gyorgy were studying tetany in 
babies ... it occurred to them that it would be well 
worth trying the effect of making the body unusually 
acid. For tetany had occasionally been observed in pa- 
tients who had been treated for other complaints by 
very large doses of sodium bicarbonate, or had lost large 
amounts of hydrochloric acid by constant vomiting; and 
if alkalinity of the tissues will produce tetany, acidity 
may be expected to cure it. Unfortunately, one could 
hardly try to cure a dying baby by shutting it up in a 
room full of carbonic acid, and still less would one give it 
hydrochloric acid to drink; so nothing had come of their 
idea, and they were using lime salts, which are not very 
easily absorbed, and which upset the digestion, but cer- 
tainly benefit many cases of tetany. 

"However, the moment they read my paper on the 
effects of ammonium chloride, they began giving it to 
babies, and were delighted to find that the tetany 
cleared up in a few hours. Since then it has been used 
with effect both in England and America, both on chil- 
dren and adults. It does not remove the cause, but it 
brings the patient into a condition from which he has a 
very fair chance of recovering." 



of diabetes), disrupts the uptake of glucose from blood 
into the tissues and forces the tissues to use stored fatty 
acids as their primary fuel. For reasons we will describe 
in detail later (p. 914), this dependence on fatty acids 
results in the accumulation of high concentrations of 
two carboxylic acids, jS-hydroxybutyric acid and ace- 
toacetic acid (blood plasma level of 90 mg/100 mL, 
compared with <3 mg/100 mL in control (healthy) indi- 
viduals; urinary excretion of 5 g/24 hr, compared with 
<125 mg/24 hr in controls). Dissociation of these acids 
lowers the pH of blood plasma to less than 7.35, causing 
acidosis. Severe acidosis leads to headache, drowsiness, 
nausea, vomiting, and diarrhea, followed by stupor, 
coma, and convulsions, presumably because at the 
lower pH, some enzyme(s) do not function optimally. 
When a patient is found to have high blood glucose, low 
plasma pH, and high levels of /3-hydroxybutyric acid and 



acetoacetic acid in blood and urine, diabetes mellitus is 
the likely diagnosis. 

Other conditions can also produce acidosis. Fast- 
ing and starvation force the use of stored fatty acids as 
fuel, with the same consequences as for diabetes. Very 
heavy exertion, such as a sprint by runners or cyclists, 
leads to temporary accumulation of lactic acid in the 
blood. Kidney failure results in a diminished capacity 
to regulate bicarbonate levels. Lung diseases (such as 
emphysema, pneumonia, and asthma) reduce the ca- 
pacity to dispose of the C0 2 produced by fuel oxidation 
in the tissues, with the resulting accumulation of 
H 2 C0 3 . Acidosis is treated by dealing with the underly- 
ing condition — insulin for people with diabetes; 
steroids or antibiotics for people with lung disease. 
Severe acidosis can be reversed by administering 
bicarbonate solution intravenously. ■ 
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WORKED EXAMPLE 2-7 Treatment of Acidosis with 
Bicarbonate 

Why does intravenous administration of a bicarbonate 
solution raise the plasma pH? 

Solution: The ratio of [HCOg ] to [C0 2 (d)] determines the 
pH of the bicarbonate buffer, according to the equation 



pH = 6.1 + log 



[HCOi 



(0.23 X pC0 2 ) 



If [HCO3] is increased with no change in pC0 2 , the pH 
will rise. 

SUMMARY 2.3 Buffering against pH Changes 
in Biological Systems 

■ A mixture of a weak acid (or base) and its salt 
resists changes in pH caused by the addition of H + 
or OH~. The mixture thus functions as a buffer. 

■ The pH of a solution of a weak acid (or base) and 
its salt is given by the Henderson-Hasselbalch 

[A~] 

equation: pH = pK a + log — — . 

[MAJ 

■ In cells and tissues, phosphate and bicarbonate 
buffer systems maintain intracellular and 
extracellular fluids at their optimum (physiological) 
pH, which is usually close to pH 7. Enzymes 
generally work optimally at this pH. 

■ Medical conditions that lower the pH of blood, 
causing acidosis, or raise it, causing alkalosis, can 
be life threatening. 

2.4 WaterasaReactant 

Water is not just the solvent in which the chemical reac- 
tions of living cells occur; it is very often a direct partic- 
ipant in those reactions. The formation of ATP from 
ADP and inorganic phosphate is an example of a con- 
densation reaction in which the elements of water are 
eliminated (Fig. 2-22). The reverse of this reaction — 
cleavage accompanied by the addition of the elements of 
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FIGURE 2-22 Participation of water in biological reactions. ATP is a 

phosphoanhydride formed by a condensation reaction (loss of the ele- 
ments of water) between ADP and phosphate. R represents adenosine 
monophosphate (AMP). This condensation reaction requires energy 
The hydrolysis of (addition of the elements of water to) ATP to form 
ADP and phosphate releases an equivalent amount of energy. These 
condensation and hydrolysis reactions of ATP are just one example of 
the role of water as a reactant in biological processes. 



water — is a hydrolysis reaction. Hydrolysis reactions 
are also responsible for the enzymatic depolymerization 
of proteins, carbohydrates, and nucleic acids. Hydrolysis 
reactions, catalyzed by enzymes called hydrolases, are 
almost invariably exergonic; by producing two mole- 
cules from one, they lead to an increase in the random- 
ness of the system. The formation of cellular polymers 
from their subunits by simple reversal of hydrolysis 
(that is, by condensation reactions) would be endergonic 
and therefore does not occur. As we shall see, cells cir- 
cumvent this thermodynamic obstacle by coupling en- 
dergonic condensation reactions to exergonic processes, 
such as breakage of the anhydride bond in ATP. 

You are (we hope!) consuming oxygen as you read. 
Water and carbon dioxide are the end products of the 
oxidation of fuels such as glucose. The overall reaction 
can be summarized as 



C 6 H 12 0 6 + 6O2 
Glucose 



6C0 2 + 6H 2 0 



The "metabolic water" formed by oxidation of foods and 
stored fats is actually enough to allow some animals in 
very dry habitats (gerbils, kangaroo rats, camels) to sur- 
vive for extended periods without drinking water. 

The C0 2 produced by glucose oxidation is con- 
verted in erythrocytes to the more soluble HCO3, in a 
reaction catalyzed by the enzyme carbonic anhydrase: 



C0 2 + H 2 0 



HCO," + H + 



In this reaction, water not only is a substrate but also 
functions in proton transfer by forming a network of 
hydrogen-bonded water molecules through which pro- 
ton hopping occurs (Fig. 2-13). 

Green plants and algae use the energy of sunlight to 
split water in the process of photosynthesis: 



2H 2 0 + 2A 



light 



0 2 + 2AH 2 



In this reaction, A is an electron-accepting species, 
which varies with the type of photosynthetic organism, 
and water serves as the electron donor in an oxidation- 
reduction sequence (see Fig. 19-57) that is fundamen- 
tal to all life. 

SUMMARY 2.4 WaterasaReactant 

■ Water is both the solvent in which metabolic 

reactions occur and a reactant in many biochemical 
processes, including hydrolysis, condensation, and 
oxidation-reduction reactions. 

2.5 The Fitness of the Aqueous Environment 
for Living Organisms 

Organisms have effectively adapted to their aqueous en- 
vironment and have evolved means of exploiting the un- 
usual properties of water. The high specific heat of water 
(the heat energy required to raise the temperature 
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of 1 g of water by 1 °C) is useful to cells and organisms 
because it allows water to act as a "heat buffer," keep- 
ing the temperature of an organism relatively constant 
as the temperature of the surroundings fluctuates and 
as heat is generated as a byproduct of metabolism. Fur- 
thermore, some vertebrates exploit the high heat of va- 
porization of water (Table 2-1) by using (thus losing) 
excess body heat to evaporate sweat. The high degree 
of internal cohesion of liquid water, due to hydrogen 
bonding, is exploited by plants as a means of trans- 
porting dissolved nutrients from the roots to the leaves 
during the process of transpiration. Even the density of 
ice, lower than that of liquid water, has important bio- 
logical consequences in the life cycles of aquatic or- 
ganisms. Ponds freeze from the top down, and the 
layer of ice at the top insulates the water below from 
frigid air, preventing the pond (and the organisms in it) 
from freezing solid. Most fundamental to all living or- 
ganisms is the fact that many physical and biological 
properties of cell macromolecules, particularly the pro- 
teins and nucleic acids, derive from their interactions 
with water molecules of the surrounding medium. The 
influence of water on the course of biological evolution 
has been profound and determinative. If life forms 
have evolved elsewhere in the universe, they are un- 
likely to resemble those of Earth unless their extrater- 
restrial origin is also a place in which plentiful liquid 
water is available. 
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Aqueous environments support countless species. Soft corals, sponges, 
bryozoans, and algae compete for space on this reef off the Philippine 
Islands. 
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Problems 

1. Solubility of Ethanol in Water Explain why ethanol 
(CH 3 CH 2 OH) is more soluble in water than is ethane 
(CH3CH3). 

2. Calculation of pH from Hydrogen Ion Concentration 

What is the pH of a solution that has an H + concentration of 

(a) 1.75 X l(T 5 mo]/L; (b) 6.50 X 10" 10 mol/L; (c) 1.0 X 10" 4 
mol/L; (d) 1.50 X 10" 5 mol/L? 

3. Calculation of Hydrogen Ion Concentration from pH 

What is the H + concentration of a solution with pH of (a) 3.82; 

(b) 6.52; (c) 11.11? 

Bg 4. Acidity of Gastric HC1 In a hospital laboratory, a 
KM 10.0 mL sample of gastric juice, obtained several hours 
after a meal, was titrated with 0.1 m NaOH to neutrality; 7.2 mL 
of NaOH was required. The patient's stomach contained no 
ingested food or drink, thus assume that no buffers were pres- 
ent. What was the pH of the gastric juice? 

5. Calculation of the pH of a Strong Acid or Base (a) Write 
out the acid dissociation reaction for hydrochloric acid. 

(b) Calculate the pH of a solution of 5.0 x 10" 4 m HC1. 

(c) Write out the acid dissociation reaction for sodium hydrox- 
ide, (d) Calculate the pH of a solution of 7.0 X 10" 5 m NaOH. 

6. Calculation of pH from Concentration of Strong Acid 

Calculate the pH of a solution prepared by diluting 3.0 mL of 
2.5 m HC1 to a final volume of 100 mL with H 2 0. 

7. Measurement of Acetylcholine Levels by pH Changes 

The concentration of acetylcholine (a neurotransmitter) in a 
sample can be determined from the pH changes that accom- 
pany its hydrolysis. When the sample is incubated with the 
enzyme acetylcholinesterase, acetylcholine is quantitatively 
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converted to choline and acetic acid, which dissociates to yield 
acetate and a hydrogen ion: 

O CH^ 

II I H 2 0 

CH 3 -C-0-CH 2 -CH 2 -N-CH, — ► 



I 



Acetylcholine 



CH, 



HO-CH 2 -CH 2 -N-CH 3 + CH 3 -C-0- 

CH 3 O 
Choline Acetate 



In a typical analysis, 15 mL of an aqueous solution containing 
an unknown amount of acetylcholine had a pH of 7.65. When 
incubated with acetylcholinesterase, the pH of the solution de- 
creased to 6.87. Assuming there was no buffer in the assay 
mixture, determine the number of moles of acetylcholine in 
the 15 mL sample. 

8. Physical Meaning of pK & Which of the following aqueous 
solutions has the lowest pH: 0.1 m HC1; 0.1 M acetic acid (pK a 
= 4.86); 0.1 M formic acid (pK a = 3.75)? 

9. Simulated Vinegar One way to make vinegar (not the 
preferred way) is to prepare a solution of acetic acid, the sole 
acid component of vinegar, at the proper pH (see Fig. 2-14) 
and add appropriate flavoring agents. Acetic acid (M r 60) is a 
liquid at 25 °C, with a density of 1.049 g/mL. Calculate the 
volume that must be added to distilled water to make 1 L of 
simulated vinegar (see Fig. 2-15). 

10. Identifying the Conjugate Base Which is the conju- 
gate base in each of the pairs below? 

(a) RCOOH, RCOO" (c) H 2 POJ, H 3 P0 4 

(b) RNH 2 , RNHj (d) H 2 C0 3 , HCO3 

11. Calculation of the pH of a Mixture of a Weak Acid 
and Its Conjugate Base Calculate the pH of a dilute solution 
that contains a molar ratio of potassium acetate to acetic acid 
(pK a = 4.76) of (a) 2:1; (b) 1:3; (c) 5:1; (d) 1:1; (e) 1:10. 

12. Effect of pH on Solubility The strongly polar, hydrogen- 
bonding properties of water make it an excellent solvent for 
ionic (charged) species. By contrast, nonionized, nonpolar or- 
ganic molecules, such as benzene, are relatively insoluble in 
water. In principle, the aqueous solubility of any organic acid 
or base can be increased by converting the molecules to 
charged species. For example, the solubility of benzoic acid in 
water is low. The addition of sodium bicarbonate to a mixture 
of water and benzoic acid raises the pH and deprotonates 
the benzoic acid to form benzoate ion, which is quite soluble 
in water. 
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r C-OH 




1 



Benzoic add 
pK n - 5 



Benzoate ion 



Are the following compounds more soluble in an aqueous solu- 
tion of 0.1 m NaOH or 0.1 m HC1? (The dissociable protons are 
shown in red.) 



Oil 




I 

H 

Pyridine ion 

(a) 
O 

& 




/3-Naphthol 
pK a ~10 

(b) 



H 



ch 3 )n-c-ch 2 ^J^om 



H 



O— CH, 



iV-Acetyltyrosine methyl ester 
(c) 

13. Treatment of Poison Ivy Rash The components 
of poison ivy and poison oak that produce the charac- 
teristic itchy rash are catechols substituted with long-chain 
alkyl groups. 



OH 




lCH 2 )„-CH3 
pK a = 8 

If you were exposed to poison ivy, which of the treatments be- 
low would you apply to the affected area? Justify your choice. 

(a) Wash the area with cold water. 

(b) Wash the area with dilute vinegar or lemon juice. 

(c) Wash the area with soap and water. 

(d) Wash the area with soap, water, and baking soda 
(sodium bicarbonate). 

14. pH and Drug Absorption Aspirin is a weak acid 
with a pK a of 3.5: 




It is absorbed into the blood through the cells lining the stom- 
ach and the small intestine. Absorption requires passage 
through the plasma membrane, the rate of which is deter- 
mined by the polarity of the molecule: charged and highly po- 
lar molecules pass slowly, whereas neutral hydrophobic ones 
pass rapidly. The pH of the stomach contents is about 1.5, and 
the pH of the contents of the small intestine is about 6. Is more 
aspirin absorbed into the bloodstream from the stomach or 
from the small intestine? Clearly justify your choice. 
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15. Calculation of pH from Molar Concentrations What 
is the pH of a solution containing 0.12 mol/L of NH 4 C1 and 0.03 
mol/L of NaOH (p# a of NH£/NH 3 is 9.25)? 

16. Calculation of pH after Titration of Weak Acid A 

compound has a pK a of 7.4. To 100 mL of a 1.0 m solution of 
this compound at pH 8.0 is added 30 mL of 1.0 m hydrochloric 
acid. What is the pH of the resulting solution? 

17. Properties of a Buffer The amino acid glycine is often 
used as the main ingredient of a buffer in biochemical experi- 
ments. The amino group of glycine, which has a piT a of 9.6, can 
exist either in the protonated form (— NHj ) or as the free 
base ( — NH 2 ), because of the reversible equilibrium 

R— NHJ v R— NH 2 + H + 

(a) In what pH range can glycine be used as an effective 
buffer due to its amino group? 

(b) In a 0.1 m solution of glycine at pH 9.0, what fraction 
of glycine has its amino group in the — NH3 form? 

(c) How much 5 M KOH must be added to 1.0 L of 0.1 M 
glycine at pH 9,0 to bring its pH to exactly 10.0? 

(d) When 99% of the glycine is in its — NHj form, what is 
the numerical relation between the pH of the solution and the 
pK a of the amino group? 

18. Preparation of a Phosphate Buffer What molar ratio 
of HPOf " to H 2 P04 in solution would produce a pH of 7.0? 
Phosphoric acid (H 3 P0 4 ), a triprotic acid, has 3 pK a values: 
2.14, 6.86, and 12.4. Hint: Only one of the p/T a values is rele- 
vant here. 

19. Preparation of Standard Buffer for Calibration of a 
pH Meter The glass electrode used in commercial pH meters 
gives an electrical response proportional to the concentration 
of hydrogen ion. To convert these responses to a pH reading, 
the electrode must be calibrated against standard solutions of 
known H + concentration. Determine the weight in grams of 
sodium dihydrogen phosphate (NaH 2 P0 4 • H 2 0; FW 138) and 
disodium hydrogen phosphate (Na 2 HP0 4 ; FW 142) needed to 
prepare 1 L of a standard buffer at pH 7.00 with a total phos- 
phate concentration of 0.100 m (see Fig. 2-15). See problem 
18 for the pK a values of phosphoric acid. 

20. Calculation of Molar Ratios of Conjugate Base to 
Weak Acid from pH For a weak acid with a p/T a of 6.0, calcu- 
late the ratio of conjugate base to acid at a pH of 5.0. 

21. Preparation of Buffer of Known pH and Strength 

Given 0.10 m solutions of acetic acid (j)K a = 4.76) and sodium 
acetate, describe how you would go about preparing 1.0 L of 
0.10 m acetate buffer of pH 4.00. 

22. Choice of Weak Acid for a Buffer Which of these com- 
pounds would be the best buffer at pH 5.0: formic acid (jpK a = 
3.8), acetic acid (pif a = 4.76), or ethylamine (pK a = 9.0)? 
Briefly justify your answer. 

23. Working with Buffers A buffer contains 0.010 mol of 
lactic acid (pK a = 3.86) and 0.050 mol of sodium lactate per 
liter, (a) Calculate the pH of the buffer, (b) Calculate the 



change in pH when 5 mL of 0.5 M HC1 is added to 1 L of the 
buffer, (c) What pH change would you expect if you added the 
same quantity of HC1 to 1 L of pure water? 

24. Use of Molar Concentrations to Calculate pH What 
is the pH of a solution that contains 0.20 m sodium acetate and 
0.60 M acetic acid (pK a = 4.76)? 

25. Preparation of an Acetate Buffer Calculate the con- 
centrations of acetic acid QpK a = 4.76) and sodium acetate 
necessary to prepare a 0.2 m buffer solution at pH 5.0. 

26. pH of Insect Defensive Secretion You have been ob- 
serving an insect that defends itself from enemies by secreting 
a caustic liquid. Analysis of the liquid shows it to have a total 
concentration of formate plus formic acid (K a = 1.8 X 10~ 4 ) of 
1.45 m; the concentration of formate ion is 0.015 m. What is the 
pH of the secretion? 

27. Calculation of pif a An unknown compound, X, is 
thought to have a carboxyl group with a pK a of 2.0 and another 
ionizable group with a pK a between 5 and 8. When 75 mL of 0.1 
M NaOH is added to 100 mL of a 0.1 m solution of X at pH 2.0, 
the pH increases to 6.72. Calculate the pK a of the second ion- 
izable group of X. 

28. Ionic Forms of Alanine Alanine is a diprotic acid that 
can undergo two dissociation reactions (see Table 3-1 for pK a 
values), (a) Given the structure of the partially protonated 
form (or zwitterion; see Fig. 3-9) below, draw the chemical 
structures of the other two forms of alanine that predominate 
in aqueous solution: the fully protonated form and the fully de- 
protonated form. 

coo- 

+ I 

H 3 N— C-H 

I 

CH 3 
Alanine 

Of the three possible forms of alanine, which would be present 
at the highest concentration in solutions of the following pH: 
(b) 1.0; (c) 6.2; (d) 8.02; (e) 11.9. Explain your answers in 
terms of pH relative to the two $K a values. 

29. Control of Blood pH by Respiratory Rate 

(a) The partial pressure of C0 2 in the lungs can be varied 
rapidly by the rate and depth of breathing. For example, a 
common remedy to alleviate hiccups is to increase the con- 
centration of C0 2 in the lungs. This can be achieved by holding 
one's breath, by very slow and shallow breathing (hypoventila- 
tion), or by breathing in and out of a paper bag. Under such 
conditions, pC0 2 in the air space of the lungs rises above nor- 
mal. Qualitatively explain the effect of these procedures on the 
blood pH. 

(b) A common practice of competitive short-distance 
runners is to breathe rapidly and deeply (hyperventilate) for 
about half a minute to remove C0 2 from their lungs just before 
the race begins. Blood pH may rise to 7.60. Explain why the 
blood pH increases. 

(c) During a short-distance run, the muscles produce a 
large amount of lactic acid (CH 3 CH(OH)COOH; K a = 
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1.38 X 10 4 m) from their glucose stores. In view of this fact, 
why might hyperventilation before a dash be useful? 

30. Calculation of Blood pH from C0 2 and Bicarbonate 
Levels Calculate the pH of a blood plasma sample with a total 
C0 2 concentration of 26.9 mM and bicarbonate concentration 
of 25.6 mM. Recall from page 63 that the relevant pK a of car- 
bonic acid is 6.1. 

31. Effect of Holding One's Breath on Blood pH The pH 

of the extracellular fluid is buffered by the bicarbonate/car- 
bonic acid system. Holding your breath can increase the con- 
centration of C0 2 (g) in the blood. What effect might this have 
on the pH of the extracellular fluid? Explain by showing the 
relevant equilibrium equation(s) for this buffer system. 

Data Analysis Problem 

32. "Switchable" Surfactants Hydrophobic molecules do 
not dissolve well in water. Given that water is a very commonly 
used solvent, this makes certain processes very difficult: wash- 
ing oily food residue off dishes, cleaning up spilled oil, keeping 
the oil and water phases of salad dressings well mixed, and 
carrying out chemical reactions that involve both hydrophobic 
and hydrophilic components. 

Surfactants are a class of amphipathic compounds that 
includes soaps, detergents, and emulsifiers. With the use of 
surfactants, hydrophobic compounds can be suspended in 
aqueous solution by forming micelles (see Fig. 2-7). A micelle 
has a hydrophobic core consisting of the hydrophobic com- 
pound and the hydrophobic "tails" of the surfactant; the hy- 
drophilic "heads" of the surfactant cover the surface of the 
micelle. A suspension of micelles is called an emulsion. The 
more hydrophilic the head group of the surfactant, the more 
powerful it is — that is, the greater its capacity to emulsify hy- 
drophobic material. 

When you use soap to remove grease from dirty dishes, the 
soap forms an emulsion with the grease that is easily removed 
by water through interaction with the hydrophilic head of the 
soap molecules. Likewise, a detergent can be used to emulsify 
spilled oil for removal by water. And emulsifiers in commercial 
salad dressings keep the oil suspended evenly throughout the 
water-based mixture. 

There are some situations in which it would be very useful 
to have a "switchable" surfactant: a molecule that could be re- 
versibly converted between a surfactant and a nonsurfactant. 

(a) Imagine such a "switchable" surfactant existed. How 
would you use it to clean up and then recover the oil from an 
oil spill? 

Liu et al. describe a prototypical switchable surfactant in 
their 2006 article "Switchable Surfactants." The switching is 
based on the following reaction: 



(b) Given that the piT a of a typical amidinium ion is 12.4, 
in which direction (left or right) would you expect the equilib- 
rium of the above reaction to lie? (See Fig. 2-16 for relevant 
pK a values.) Justify your answer. Hint: Remember the reaction 
H 2 0 + C0 2 H 2 C0 3 . 

Liu and colleagues produced a switchable surfactant for 
which R = C 16 H 33 . They do not name the molecule in their ar- 
ticle; for brevity, we'll call it s-surf. 

(c) The amidinium form of s-surf is a powerful surfactant; 
the amidine form is not. Explain this observation. 

Liu and colleagues found that they could switch between 
the two forms of s-surf by changing the gas that they bubbled 
through a solution of the surfactant. They demonstrated this 
switch by measuring the electrical conductivity of the s-surf 
solution; aqueous solutions of ionic compounds have higher 
conductivity than solutions of nonionic compounds. They 
started with a solution of the amidine form of s-surf in water. 
Their results are shown below; dotted lines indicate the 
switch from one gas to another. 

Gas bubbled in: C0 2 Ar C0 2 Ar 
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(d) In which form is the majority of s-surf at point A? At 
point B? 

(e) Why does the electrical conductivity rise from time 0 
to point A? 

(f) Why does the electrical conductivity fall from point A 
to point B? 

(g) Explain how you would use s-surf to clean up and re- 
cover the oil from an oil spill. 

Reference 

Liu, Y., Jessop, P.G., Cunningham, M., Eckert, C.A., & 
Liotta, C.L. (2006) Science 313, 958-960. 
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The word protein that I propose to you ... I would wish to derive from 
proteios, because it appears to be the primitive or principal substance 
of animal nutrition that plants prepare for the herbivores, and which 
the latter then furnish to the carnivores. 

—J. J. Berzelius, letter to G. J. Mulder, 1838 
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Proteins mediate virtually every process that takes 
place in a cell, exhibiting an almost endless diversity 
of functions. To explore the molecular mechanism of 
a biological process, a biochemist almost inevitably studies 
one or more proteins. Proteins are the most abundant bio- 
logical macromolecules, occurring in all cells and all parts 
of cells. Proteins also occur in great variety; thousands of 
different kinds may be found in a single cell. As the ar- 
biters of molecular function, proteins are the most impor- 
tant final products of the information pathways discussed 
in Part III of this book. Proteins are the molecular instru- 
ments through which genetic information is expressed. 

Relatively simple monomeric subunits provide the 
key to the structure of the thousands of different pro- 
teins. All proteins, whether from the most ancient lines 
of bacteria or from the most complex forms of life, are 
constructed from the same ubiquitous set of 20 amino 



acids, covalently linked in characteristic linear se- 
quences. Because each of these amino acids has a side 
chain with distinctive chemical properties, this group of 
20 precursor molecules may be regarded as the alphabet 
in which the language of protein structure is written. 
Proteins are found in a wide range of sizes, from rela- 
tively small peptides with just a few amino acid residues 
to huge polymers with molecular weights in the millions. 

What is most remarkable is that cells can produce 
proteins with strikingly different properties and activities 
by joining the same 20 amino acids in many different com- 
binations and sequences. From these building blocks dif- 
ferent organisms can make such widely diverse products 
as enzymes, hormones, antibodies, transporters, muscle 
fibers, the lens protein of the eye, feathers, spider webs, 
rhinoceros horn, milk proteins, antibiotics, mushroom poi- 
sons, and myriad other substances having distinct biologi- 
cal activities (Fig. 3-1). Among these protein products, 
the enzymes are the most varied and specialized. Virtually 
all cellular reactions are catalyzed by enzymes. 

Protein structure and function are the topics of this 
and the next three chapters. Here, we begin with a de- 
scription of the fundamental chemical properties of 
amino acids, peptides, and proteins. We also consider 
how a biochemist works with proteins. 




FIGURE 3-1 Some functions of proteins, (a) The light produced by fireflies 
is the result of a reaction involving the protein luciferin and ATP, catalyzed 
by the enzyme luciferase (see Box 13-1). (b) Erythrocytes contain large 
amounts of the oxygen-transporting protein hemoglobin, (c) The protein 
keratin, formed by all vertebrates, is the chief structural component of hair, 



scales, horn, wool, nails, and feathers. The black rhinoceros is nearing ex- 
tinction in the wild because of the belief prevalent in some parts of the 
world that a powder derived from its horn has aphrodisiac properties. In 
reality, the chemical properties of powdered rhinoceros horn are no dif- 
ferent from those of powdered bovine hooves or human fingernails. 



Amino Acids, Peptides, and Proteins 



3.1 Amino Acids 

% Protein Architecture-Amino Acids. Proteins are polymers 

of amino acids, with each amino acid residue joined to 
its neighbor by a specific type of covalent bond. (The 
term "residue" reflects the loss of the elements of water 
when one amino acid is joined to another.) Proteins can 
be broken down (hydrolyzed) to their constituent amino 
acids by a variety of methods, and the earliest studies of 
proteins naturally focused on the free amino acids 
derived from them. Twenty different amino acids are 
commonly found in proteins. The first to be discovered 
was asparagine, in 1806. The last of the 20 to be found, 
threonine, was not identified until 1938. All the amino 
acids have trivial or common names, in some cases 
derived from the source from which they were first 
isolated. Asparagine was first found in asparagus, and 
glutamate in wheat gluten; tyrosine was first isolated 
from cheese (its name is derived from the Greek tyros, 
"cheese"); and glycine (Greek glykos, "sweet") was so 
named because of its sweet taste. 

Amino Acids Share Common Structural Features 

All 20 of the common amino acids are a-amino acids. They 
have a carboxyl group and an amino group bonded to the 
same carbon atom (the a carbon) (Fig. 3-2). They differ 
from each other in their side chains, or R groups, which 
vary in structure, size, and electric charge, and which in- 
fluence the solubility of the amino acids in water. In addi- 
tion to these 20 amino acids there are many less common 
ones. Some are residues modified after a protein has 
been synthesized; others are amino acids present in living 
organisms but not as constituents of proteins. The com- 
mon amino acids of proteins have been assigned three- 
letter abbreviations and one-letter symbols (Table 3-1), 
which are used as shorthand to indicate the composition 
and sequence of amino acids polymerized in proteins. 

KEY CONVENTION: The three-letter code is transparent, the 
abbreviations generally consisting of the first three 
letters of the amino acid name. The one-letter code was 
devised by Margaret Oakley Dayhoff (1925-1983), 
considered by many to be the founder of the field of 
bioinformatics. The one-letter code reflects an attempt 
to reduce the size of the data files (in an era of punch- 
card computing) used to describe amino acid sequences. 
It was designed to be easily memorized, and understand- 
ing its origin can help students do just that. For six amino 
acids (CHIMSV), the first letter of the amino acid name is 
unique and thus is used as the symbol. For five others 



COO" 

I 

-C— H 
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FIGURE 3-2 General structure of an amino 
acid. This structure is common to all but one of 
the a-amino acids. (Proline, a cyclic amino acid, 
is the exception.) The R group, or side chain 
(red), attached to the a carbon (blue) is different 
in each amino acid. 




Margaret Oakley Dayhoff 
1925-1983 



(AGLPT), the first letter is not 
unique but is assigned to the 
amino acid that is most com- 
mon in proteins (for example, 
leucine is more common than 
lysine). For another four, the 
letter used is phonetically sug- 
gestive (RFYW: aRginine, Fenyl- 
alanine, tYrosine, tWiptophan). 
The rest were harder to assign. 
Four (DNEQ) were assigned 
letters found within or sug- 
gested by their names (aspar- 
Dic, asparagiNe, glutamEke, Q-tamine). That left lysine. 
Only a few letters were left in the alphabet, and K was cho- 
sen because it was the closest to L. ■ 

For all the common amino acids except glycine, the 
a carbon is bonded to four different groups: a carboxyl 
group, an amino group, an R group, and a hydrogen 
atom (Fig. 3-2; in glycine, the R group is another hydro- 
gen atom). The a-carbon atom is thus a chiral center 
(p. 16). Because of the tetrahedral arrangement of the 
bonding orbitals around the a-carbon atom, the four dif- 
ferent groups can occupy two unique spatial arrange- 
ments, and thus amino acids have two possible 
stereoisomers. Since they are nonsuperposable mirror 
images of each other (Fig. 3-3), the two forms 
represent a class of stereoisomers called enantiomers 

coo- 



CH 3 

(a) L-Alanine 
COO" 

+ = 

H 3 N^Q^H 

CH 3 
L-Alanine 
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(b) 



H,N- 



(c) 
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CH 3 
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COO" 
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COO" 
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CH 3 
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FIGURE 3-3 Stereoisomerism in a-amino acids, (a) The two stereoiso- 
mers of alanine, i_- and D-alanine, are nonsuperposable mirror images of 
each other (enantiomers). (b, c) Two different conventions for showing 
the configurations in space of stereoisomers. In perspective formulas (b) 
the solid wedge-shaped bonds project out of the plane of the paper, the 
dashed bonds behind it. In projection formulas (c) the horizontal bonds 
are assumed to project out of the plane of the paper, the vertical bonds 
behind. However, projection formulas are often used casually and are 
not always intended to portray a specific stereochemical configuration. 
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TABLE 3-1 



Properties and Conventions Associated with the Common Amino Adds Found in Proteins 





















Amino acid 


Abbreviation/ 
symbol 


M r * 


ptfi 
( — COOH) 


( — NH3) 


(R group) 


Pi 


Hydropathy 
index' 


Occurrence in 
proteins (%)* 


Nonpolar, aliphatic 
R groups 
















Glycine 


Gly G 


75 


2.34 


9.60 




5.97 


-0.4 


7.2 


Alanine 


Ala A 


89 


2.34 


9.69 




6.01 


1.8 


7.8 


Proline 


Pro P 


115 


1.99 


10.96 




6.48 


1.6 


5.2 


Valine 


Val V 


117 


2.32 


9.62 




5.97 


4.2 


6.6 


Leucine 


Leu L 


131 


2.36 


9.60 




5.98 


3.8 


9.1 


Isoleucine 


He I 


131 


2,36 


9.68 




6.02 


4.5 


5.3 


Methionine 


Met M 


149 


2.28 


9.21 




5.74 


1.9 


2.3 


Aromatic 


















R groups 


















Phenylalanine 


Phe F 


165 


1.83 


9.13 




5.48 


2.8 


3.9 


Tyrosine 


Tyr Y 


181 


2.20 


9.11 


10.07 


5.66 


-1.3 


3.2 


Tryptophan 


Trp W 


204 


2.38 


9.39 




5.89 


-0.9 


1.4 


Polar, uncharged 
R groups 
















Serine 


Ser S 


105 


2.21 


9.15 




5.68 


-0.8 


6.8 


Threonine 


Thr T 


119 


2.11 


9 62 




5.87 


-0.7 


5.9 


Cysteine § 


Cys C 


121 


1.96 


10.28 


8.18 


5.07 


2.5 


1.9 


Asparagine 


Asn N 


132 


2.02 


8.80 




5.41 


-3.5 


4.3 


Glutamine 


Gin Q 


146 


2.17 


9.13 




5.65 


-3.5 


4.2 


Positively charged 
R groups 
















Lysine 


Lys K 


146 


2.18 


8.95 


10.53 


9.74 


-3.9 


5.9 


Histidine 


His H 


155 


1.82 


9.17 


6.00 


7.59 


-3.2 


2.3 


Arginine 


Arg R 


174 


2.17 


9.04 


12.48 


10.76 


-4.5 


5.1 


Negatively charged 
R groups 
















Aspartate 


Asp D 


133 


1.88 


9 60 


3.65 


2.77 


-3.5 


5.3 


Glutamate 


Glu E 


147 


2.19 


9.67 


4.25 


3.22 


-3.5 


6.3 



*M r values reflect the structures as shown in Figure 3-5. The elements of water (M r 18) are deleted when the amino acid is 
incorporated into a polypeptide. 

fA scale combining hydrophobicity and hydrophilicity of R groups. The values reflect the free energy (AG) of transfer of the amino 
acid side chain from a hydrophobic solvent to water. This transfer is favorable (AG < 0; negative value in the index) for charged 
or polar amino acid side chains, and unfavorable (AG > 0; positive value in the index) for amino acids with nonpolar or more 
hydrophobic side chains. See Chapter 11. From Kyte, J. & Doolittle, R.R (1982) A simple method for displaying the hydropathic 
character of a protein. / Mol. Biol. 157, 105-132. 

^Average occurrence in more than 1,150 proteins. From Doolittle, R.F (1989) Redundancies in protein sequences. In Prediction 
of Protein Structure and the Principles of Protein Conformation (Fasman, G.D., ed ), pp 599-623, Plenum Press, New York. 

§Cysteine is generally classified as polar despite having a positive hydropathy index. This reflects the ability of the sulfhydryl 
group to act as a weak acid and to form a weak hydrogen bond with oxygen or nitrogen 



(see Fig. 1-19). All molecules with a chiral center are 
also optically active — that is, they rotate plane- 
polarized light (see Box 1-2). 

KEY CONVENTION: Two conventions are used to identify 
the carbons in an amino acid — a practice that can be 



confusing. The additional carbons in an R group are 
commonly designated /3, y, S, s, and so forth, proceeding 
out from the a carbon. For most other organic mole- 
cules, carbon atoms are simply numbered from one end, 
giving highest priority (C-l) to the carbon with the sub- 
stituent containing the atom of highest atomic number. 
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Within this latter convention, the carboxyl carbon of an 
amino acid would be C-l and the a carbon would be C-2. 

e <5 7 (3 a 

5 4 3 2 1 

CH 2 — CH 2 — CH 2 — CH 2 -CH— COO- 

+ NH 3 +NH 3 
Lysine 

In some cases, such as amino acids with heterocyclic 
R groups (such as histidine), the Greek lettering system 
is ambiguous and the numbering convention is therefore 
used. For branched amino acid side chains, equivalent 
carbons are given numbers after the Greek letters. 
Leucine thus has 81 and 82 carbons (see the structure in 
Fig. 3-5). ■ 



one-step oxidation. Historically, the similar l and d des- 
ignations were used for levorotatory (rotating plane- 
polarized light to the left) and dextrorotatory (rotating 
light to the right). However, not all L-amino acids are 
levorotatory, and the convention shown in Figure 3-4 
was needed to avoid potential ambiguities about ab- 
solute configuration. By Fischer's convention, l and d 
refer only to the absolute configuration of the four sub- 
stituents around the chiral carbon, not to optical prop- 
erties of the molecule. 

Another system of specifying configuration around 
a chiral center is the RS system, which is used in the 
systematic nomenclature of organic chemistry and de- 
scribes more precisely the configuration of molecules 
with more than one chiral center (see p. 17). 



Special nomenclature has been developed to spec- 
ify the absolute configuration of the four sub- 
stituents of asymmetric carbon atoms. The absolute 
configurations of simple sugars and amino acids are 
specified by the d, l system (Fig. 3-4), based on the 
absolute configuration of the three-carbon sugar glycer- 
aldehyde, a convention proposed by Emil Fischer in 
1891. (Fischer knew what groups surrounded the 
asymmetric carbon of glyceraldehyde but had to guess 
at their absolute configuration; his guess was later con- 
firmed by x-ray diffraction analysis.) For all chiral com- 
pounds, stereoisomers having a configuration related 
to that of L-glyceraldehyde are designated L, and 
stereoisomers related to D-glyceraldehyde are desig- 
nated d. The functional groups of L-alanine are matched 
with those of L-glyceraldehyde by aligning those that 
can be interconverted by simple, one-step chemical re- 
actions. Thus the carboxyl group of L-alanine occupies 
the same position about the chiral carbon as does the 
aldehyde group of L-glyceraldehyde, because an alde- 
hyde is readily converted to a carboxyl group via a 



3 CH 2 OH 
L-Glyceraldehyde 

COO 

H 3 N^C^H 

CH 3 
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H^C^OH 
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FIGURE 3-4 Steric relationship of the stereoisomers of alanine to the 
absolute configuration of l- and D-glyceraldehyde. In these perspec- 
tive formulas, the carbons are lined up vertically, with the chiral atom 
in the center. The carbons in these molecules are numbered beginning 
with the terminal aldehyde or carboxyl carbon (red), 1 to 3 from top to 
bottom as shown. When presented in this way, the R group of the 
amino acid (in this case the methyl group of alanine) is always below 
the a carbon. L-Amino acids are those with the a-amino group on the 
left, and D-amino acids have the a-amino group on the right. 



The Amino Acid Residues in Proteins Are l Stereoisomers 

Nearly all biological compounds with a chiral center oc- 
cur naturally in only one stereoisomeric form, either D or 
l. The amino acid residues in protein molecules are ex- 
clusively l stereoisomers. D-Amino acid residues have 
been found in only a few, generally small peptides, in- 
cluding some peptides of bacterial cell walls and certain 
peptide antibiotics. 

It is remarkable that virtually all amino acid 
residues in proteins are l stereoisomers. When chiral 
compounds are formed by ordinary chemical reac- 
tions, the result is a racemic mixture of d and L iso- 
mers, which are difficult for a chemist to distinguish 
and separate. But to a living system, d and l isomers 
are as different as the right hand and the left. The for- 
mation of stable, repeating substructures in proteins 
(Chapter 4) generally requires that their constituent 
amino acids be of one stereochemical series. Cells are 
able to specifically synthesize the L isomers of amino 
acids because the active sites of enzymes are asym- 
metric, causing the reactions they catalyze to be stereo- 
specific. 

Amino Acids Can Be Classified by R Group 

Knowledge of the chemical properties of the common 
amino acids is central to an understanding of biochem- 
istry. The topic can be simplified by grouping the amino 
acids into five main classes based on the properties of 
their R groups (Table 3-1), in particular, their polarity, 
or tendency to interact with water at biological pH (near 
pH 7.0). The polarity of the R groups varies widely, from 
nonpolar and hydrophobic (water-insoluble) to highly 
polar and hydrophilic (water-soluble). 

The structures of the 20 common amino acids are 
shown in Figure 3-5, and some of their properties are 
listed in Table 3-1. Within each class there are grada- 
tions of polarity, size, and shape of the R groups. 

Nonpolar, Aliphatic R Groups The R groups in this 
class of amino acids are nonpolar and hydrophobic. 
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Nonpolar, aliphatic R groups 




COO" 


COO" 


COO" 


POO " 


+ I 


+ 1 


1 /H 


+ 


HoN — C — H 


H3N-C-H 


+ /< 




H 


1 

CH 3 


H oN CH 0 






CH 






^^ 2 (3 cjh 2 


X \ 
CH 3 CH 3 


Glycine 


Alanine 


Proline 


Valine 




COO" 

+ 1 


COO" 


COO" 




H 3 N— C-H 


+ 1 

H 3 N-C-H 


H 3 N— C— H 




yti 2 


TT /"I pTT 

ri U — Url 3 


1 

CH 9 






CH 2 


CH 2 




CH 3 CH 3 


1 

CH 3 


! 

s 

1 








CH 3 




Leucine 


Isoleucine 


Methionine 



Polar, uncharged R groups 
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Positively charged R groups 
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FIGURE 3-5 The 20 common amino acids of proteins. The structural 
formulas show the state of ionization that would predominate at pH 
7.0. The unshaded portions are those common to all the amino acids; 
the portions shaded in pink are the R groups. Although the R group of 



histidine is shown uncharged, its p/C a (see Table 3-1) is such that a 
small but significant fraction of these groups are positively charged at 
pH 7.0. The protonated form of histidine is shown above the graph in 
Fig. 3-1 2b. 



The side chains of alanine, valine, leucine, and 

isoleucine tend to cluster together within proteins, 
stabilizing protein structure by means of hydrophobic 
interactions. Glycine has the simplest structure. Al- 
though it is most easily grouped with the nonpolar 
amino acids, its very small side chain makes no real 
contribution to hydrophobic interactions. Methio- 
nine, one of the two sulfur-containing amino acids, has 
a nonpolar thioether group in its side chain. Proline 
has an aliphatic side chain with a distinctive cyclic 
structure. The secondary amino (imino) group of pro- 



line residues is held in a rigid conformation that re- 
duces the structural flexibility of polypeptide regions 
containing proline. 

Aromatic R Groups Phenylalanine, tyrosine, and 
tryptophan, with their aromatic side chains, are rela- 
tively nonpolar (hydrophobic). All can participate in hy- 
drophobic interactions. The hydroxyl group of tyrosine 
can form hydrogen bonds, and it is an important func- 
tional group in some enzymes. Tyrosine and tryptophan 
are significantly more polar than phenylalanine, because 
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FIGURE 3-6 Absorption of ultraviolet light by aromatic amino acids. 

Comparison of the light absorption spectra of the aromatic amino acids 
tryptophan and tyrosine at pH 6.0. The amino acids are present in 
equimolar amounts (1 0~ 3 m) under identical conditions. The measured 
absorbance of tryptophan is as much as four times that of tyrosine 
Note that the maximum light absorption for both tryptophan and tyro- 
sine occurs near a wavelength of 280 nm. Light absorption by the third 
aromatic amino acid, phenylalanine (not shown), generally con- 
tributes little to the spectroscopic properties of proteins. 



of the tyrosine hydroxyl group and the nitrogen of the 
tryptophan indole ring. 

Tryptophan and tyrosine, and to a much lesser ex- 
tent phenylalanine, absorb ultraviolet light (Fig. 3-6; 



see also Box 3-1). This accounts for the characteristic 
strong absorbance of light by most proteins at a wave- 
length of 280 nm, a property exploited by researchers in 
the characterization of proteins. 



BOX 3-1 



METHODS 



Absorption of Light by Molecules: The Lambert-Beer Law 



A wide range of biomolecules absorb light at character- 
istic wavelengths, just as tryptophan absorbs light at 280 
nm (see Fig. 3-6). Measurement of light absorption by a 
spectrophotometer is used to detect and identify mole- 
cules and to measure their concentration in solution. 
The fraction of the incident light absorbed by a solution 
at a given wavelength is related to the thickness of the 
absorbing layer (path length) and the concentration of 
the absorbing species (Fig. 1). These two relationships 
are combined into the Lambert-Beer law, 



log 



eel 



where I Q is the intensity of the incident light, / is the in- 
tensity of the transmitted light, the ratio I/I 0 (the inverse 
of the ratio in the equation) is the transmittance, e is the 
molar extinction coefficient (in units of liters per mole- 
centimeter), c is the concentration of the absorbing 



species (in moles per liter), and I is the path length of the 
light-absorbing sample (in centimeters). The Lambert- 
Beer law assumes that the incident light is parallel and 
monochromatic (of a single wavelength) and that the sol- 
vent and solute molecules are randomly oriented. The ex- 
pression log (V-D is called the absorbance, designated A 

It is important to note that each successive millime- 
ter of path length of absorbing solution in a 1.0 cm cell 
absorbs not a constant amount but a constant fraction of 
the light that is incident upon it. However, with an ab- 
sorbing layer of fixed path length, the absorbance, A, is 
directly proportional to the concentration of the ab- 
sorbing solute. 

The molar extinction coefficient varies with the na- 
ture of the absorbing compound, the solvent, and the 
wavelength, and also with pH if the light-absorbing 
species is in equilibrium with an ionization state that has 
different absorbance properties. 
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FIGURE 1 The principal components of a spec- 
trophotometer. A light source emits light along a 
broad spectrum, then the monochromator se- 
lects and transmits light of a particular wave- 
length. The monochromatic light passes through 
the sample in a cuvette of path length / and is 
absorbed by the sample in proportion to the con- 
centration of the absorbing species. The transmit- 
ted light is measured by a detector. 
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FIGURE 3-7 Reversible formation of a disulfide bond by the oxidation of two molecules of cysteine. Disulfide bonds 
between Cys residues stabilize the structures of many proteins. 



Polar, Uncharged R Groups The R groups of these 
amino acids are more soluble in water, or more hy- 
drophilic, than those of the nonpolar amino acids, be- 
cause they contain functional groups that form 
hydrogen bonds with water. This class of amino acids 
includes serine, threonine, cysteine, asparagine, 
and glutamine. The polarity of serine and threonine 
is contributed by their hydroxyl groups; that of cys- 
teine by its sulfhydryl group, which is a weak acid and 
can make weak hydrogen bonds with oxygen or nitro- 
gen; and that of asparagine and glutamine by their 
amide groups. 

Asparagine and glutamine are the amides of two 
other amino acids also found in proteins, aspartate 
and glutamate, respectively, to which asparagine and 
glutamine are easily hydrolyzed by acid or base. Cys- 
teine is readily oxidized to form a covalently linked 
dimeric amino acid called cystine, in which two cys- 
teine molecules or residues are joined by a disulfide 
bond (Fig. 3-7). The disulfide-linked residues are 
strongly hydrophobic (nonpolar). Disulfide bonds 
play a special role in the structures of many proteins 
by forming covalent links between parts of a polypep- 
tide molecule or between two different polypeptide 
chains. 

Positively Charged (Basic) R Groups The most 
hydrophilic R groups are those that are either positively 
or negatively charged. The amino acids in which the R 
groups have significant positive charge at pH 7.0 are ly- 
sine, which has a second primary amino group at the e 
position on its aliphatic chain; arginine, which has a 
positively charged guanidinium group; and histidine, 
which has an aromatic imidazole group. As the only 
common amino acid having an ionizable side chain with 
pK a near neutrality, histidine may be positively charged 
(protonated form) or uncharged at pH 7.0. His residues 
facilitate many enzyme-catalyzed reactions by serving 
as proton donors/acceptors. 

Negatively Charged (Acidic) R Groups The two 

amino acids having R groups with a net negative charge 



at pH 7.0 are aspartate and glutamate, each of which 
has a second carboxyl group. 

Uncommon Amino Acids Also Have Important 
Functions 

In addition to the 20 common amino acids, proteins 
may contain residues created by modification of com- 
mon residues already incorporated into a polypeptide 
(Fig. 3-8a). Among these uncommon amino acids 
are 4-hydroxyproline, a derivative of proline, and 

5- hydroxylysine, derived from lysine. The former is 
found in plant cell wall proteins, and both are found 
in collagen, a fibrous protein of connective tissues. 

6- iV-Methyllysine is a constituent of myosin, a con- 
tractile protein of muscle. Another important uncommon 
amino acid is 7-carboxyglutamate, found in the blood- 
clotting protein prothrombin and in certain other pro- 
teins that bind Ca 2+ as part of their biological function. 
More complex is desmosine, a derivative of four Lys 
residues, which is found in the fibrous protein elastin. 

Selenocysteine is a special case. This rare amino 
acid residue is introduced during protein synthesis 
rather than created through a postsynthetic modifica- 
tion. It contains selenium rather than the sulfur of cys- 
teine. Actually derived from serine, selenocysteine is a 
constituent of just a few known proteins. 

Some amino acid residues in a protein may be 
modified transiently to alter the protein's function. The 
addition of phosphoryl, methyl, acetyl, adenylyl, ADP- 
ribosyl, or other groups to particular amino acid 
residues can increase or decrease a protein's activity 
(Fig. 3-8b). Phosphorylation is a particularly common 
regulatory modification. Covalent modification as a pro- 
tein regulatory strategy is discussed in more detail in 
Chapter 6. 

Some 300 additional amino acids have been found 
in cells. They have a variety of functions but are not all 
constituents of proteins. Ornithine and citrulline 

(Fig. 3-8c) deserve special note because they are key 
intermediates (metabolites) in the biosynthesis of argi- 
nine (Chapter 22) and in the urea cycle (Chapter 18). 
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FIGURE 3-8 Uncommon amino acids, (a) Some uncommon amino 
acids found in proteins. All are derived from common amino acids. Ex- 
tra functional groups added by modification reactions are shown in 
red. Desmosine is formed from four Lys residues (the four carbon back- 
bones are shaded in yellow). Note the use of either numbers or Greek 
letters to identify the carbon atoms in these structures, (b) Reversible 
amino acid modifications involved in regulation of protein activity. 
Phosphorylation is the most common type of regulatory modification, 
(c) Ornithine and citrulline, which are not found in proteins, are inter- 
mediates in the biosynthesis of arginine and in the urea cycle. 
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Amino Acids Can Act as Acids and Bases 

The amino and carboxyl groups of amino acids, along 
with the ionizable R groups of some amino acids, func- 
tion as weak acids and bases. When an amino acid lack- 
ing an ionizable R group is dissolved in water at neutral 



pH, it exists in solution as the dipolar ion, or zwitterion 
(German for "hybrid ion"), which can act as either an 
acid or a base (Fig. 3-9). Substances having this dual 
(acid-base) nature are amphoteric and are often called 
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FIGURE 3-9 Nonionic and zwitterionic forms of amino acids. The 

nonionic form does not occur in significant amounts in aqueous solu- 
tions. The zwitterion predominates at neutral pH. A zwitterion can act 
as either an acid (proton donor) or a base (proton acceptor). 

ampholytes (from "amphoteric electrolytes"). A simple 
monoamino monocarboxylic a-amino acid, such as 
alanine, is a diprotic acid when fully protonated; it has 
two groups, the — COOH group and the — NH 3 group, 
that can yield protons: 
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FIGURE 3-10 Titration of an amino acid. Shown here is the titration 
curve of 0.1 m glycine at 25 °C. The ionic species predominating at key 
points in the titration are shown above the graph. The shaded boxes, 
centered at about p/0, = 2.34 and pK 2 = 9.60, indicate the regions of 
greatest buffering power. Note that 1 equivalent of OH~ = 0.1 m NaOH 
added. 
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Amino Acids Have Characteristic Titration Curves 

Acid-base titration involves the gradual addition or re- 
moval of protons (Chapter 2). Figure 3-10 shows the 
titration curve of the diprotic form of glycine. The two 
ionizable groups of glycine, the carboxyl group and the 
amino group, are titrated with a strong base such as 
NaOH. The plot has two distinct stages, corresponding 
to deprotonation of two different groups on glycine. 
Each of the two stages resembles in shape the titration 
curve of a monoprotic acid, such as acetic acid (see 
Fig. 2-16), and can be analyzed in the same way. At 
very low pH, the predominant ionic species of glycine is 
the fully protonated form, + H 3 N — CH 2 — COOH. At the 
midpoint in the first stage of the titration, in which the 
— COOH group of glycine loses its proton, equimolar 
concentrations of the proton-donor ( + H 3 N — CH 2 — 
COOH) and proton-acceptor ( + H 3 N— CH 2 — COO") 
species are present. At the midpoint of any titration, a 
point of inflection is reached where the pH is equal to 
the pK a of the protonated group being titrated (see 
Fig. 2-17). For glycine, the pH at the midpoint is 2.34, 



thus its — COOH group has a pK a (labeled pK ± in 
Fig. 3-10) of 2.34. (Recall from Chapter 2 that pH and 
pK a are simply convenient notations for proton concen- 
tration and the equilibrium constant for ionization, re- 
spectively. The pZ" a is a measure of the tendency of a 
group to give up a proton, with that tendency decreas- 
ing tenfold as the pK a increases by one unit.) As the 
titration proceeds, another important point is reached 
at pH 5.97. Here there is another point of inflection, at 
which removal of the first proton is essentially complete 
and removal of the second has just begun. At this pH 
glycine is present largely as the dipolar ion (zwitterion) 
+ H 3 N — CH 2 — COO~. We shall return to the significance 
of this inflection point in the titration curve (labeled pi 
in Fig. 3-10) shortly. 

The second stage of the titration corresponds to the 
removal of a proton from the — NHj group of glycine. 
The pH at the midpoint of this stage is 9.60, equal to the 
pK a (labeled pK 2 in Fig. 3-10) for the — NH 3 group. The 
titration is essentially complete at a pH of about 12, at 
which point the predominant form of glycine is H 2 N — 
CH 2 — COO". 

From the titration curve of glycine we can derive 
several important pieces of information. First, it gives a 
quantitative measure of the pK a of each of the two ioniz- 
ing groups: 2.34 for the — COOH group and 9.60 for the 
— NH 3 group. Note that the carboxyl group of glycine is 
over 100 times more acidic (more easily ionized) than 
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FIGURE 3-11 Effect of the chemical environment on pK a . The p/C a 
values for the ionizable groups in glycine are lower than those for sim- 
ple, methyl-substituted amino and carboxyl groups. These downward 
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perturbations of pK a are due to intramolecular interactions. Similar 
effects can be caused by chemical groups that happen to be positioned 
nearby — for example, in the active site of an enzyme. 



the carboxyl group of acetic acid, which, as we saw in 
Chapter 2, has a pK a of 4.76 — about average for a car- 
boxyl group attached to an otherwise unsubstituted 
aliphatic hydrocarbon. The perturbed pK a of glycine is 
caused by repulsion between the departing proton and the 
nearby positively charged amino group on the a-carbon 
atom, as described in Figure 3-11. The opposite 
charges on the resulting zwitterion are stabilizing. Simi- 
larly, the pK a of the amino group in glycine is perturbed 
downward relative to the average pK a of an amino 
group. This effect is due partly to the electronegative 
oxygen atoms in the carboxyl groups, which tend to pull 
electrons toward them, increasing the tendency of the 
amino group to give up a proton. Hence, the a-amino 
group has a pK a that is lower than that of an aliphatic 
amine such as methylamine (Fig. 3-11). In short, the 
pi*T a of any functional group is greatly affected by its 
chemical environment, a phenomenon sometimes ex- 
ploited in the active sites of enzymes to promote exquis- 
itely adapted reaction mechanisms that depend on the 
perturbed pK a values of proton donor/acceptor groups 
of specific residues. 

The second piece of information provided by the 
titration curve of glycine is that this amino acid has two 
regions of buffering power. One of these is the relatively 
flat portion of the curve, extending for approximately 
1 pH unit on either side of the first pJT a of 2.34, indicating 
that glycine is a good buffer near this pH. The other 
buffering zone is centered around pH 9.60. (Note that 
glycine is not a good buffer at the pH of intracellular 
fluid or blood, about 7.4.) Within the buffering ranges of 



glycine, the Henderson-Hasselbalch equation (p. 60) 
can be used to calculate the proportions of proton- 
donor and proton-acceptor species of glycine required 
to make a buffer at a given pH. 

Titration Curves Predict the Electric Charge of Amino Acids 

Another important piece of information derived from the 
titration curve of an amino acid is the relationship be- 
tween its net charge and the pH of the solution. At pH 
5.97, the point of inflection between the two stages in its 
titration curve, glycine is present predominantly as its 
dipolar form, fully ionized but with no net electric charge 
(Fig. 3-10). The characteristic pH at which the net elec- 
tric charge is zero is called the isoelectric point or iso- 
electric pH, designated pi. For glycine, which has no 
ionizable group in its side chain, the isoelectric point is 
simply the arithmetic mean of the two piT a values: 

pi = |(p^ + pK 2 ) = | (2.34 + 9.60) = 5.97 

As is evident in Figure 3-10, glycine has a net negative 
charge at any pH above its pi and will thus move toward 
the positive electrode (the anode) when placed in an 
electric field. At any pH below its pi, glycine has a net 
positive charge and will move toward the negative elec- 
trode (the cathode). The farther the pH of a glycine so- 
lution is from its isoelectric point, the greater the net 
electric charge of the population of glycine molecules. 
At pH 1.0, for example, glycine exists almost entirely as 
the form + H 3 N — CH 2 — COOH with a net positive charge 
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FIGURE 3-12 Titration curves for (a) glutamate and (b) histidine. The pK a of the R group is designated here as pK R 





of 1.0. At pH 2.34, where there is an equal mixture of 
+ H 3 N — CH 2 — COOH and + H 3 N— CH 2 — COO", the av- 
erage or net positive charge is 0.5. The sign and the 
magnitude of the net charge of any amino acid at any pH 
can be predicted in the same way. 

Amino Acids Differ in Their Acid-Base Properties 

The shared properties of many amino acids permit some 
simplifying generalizations about their acid-base behav- 
iors. First, all amino acids with a single a-amino group, a 
single a-carboxyl group, and an R group that does not 
ionize have titration curves resembling that of glycine 
(Fig. 3-10). These amino acids have very similar, al- 
though not identical, pK a values: pK a of the — COOH 
group in the range of 1.8 to 2.4, and pK a of the — NH^ 
group in the range of 8.8 to 11.0 (Table 3-1). The differ- 
ences in these pK a values reflect the effects of the R 
groups. Second, amino acids with an ionizable R group 
have more complex titration curves, with three stages 
corresponding to the three possible ionization steps; 
thus they have three pK a values. The additional stage for 
the titration of the ionizable R group merges to some 
extent with the other two. The titration curves for two 
amino acids of this type, glutamate and histidine, are 
shown in Figure 3-12. The isoelectric points reflect the 
nature of the ionizing R groups present. For example, 
glutamate has a pi of 3.22, considerably lower than that 
of glycine. This is due to the presence of two carboxyl 
groups, which, at the average of their pK a values (3.22), 
contribute a net charge of - 1 that balances the + 1 con- 
tributed by the amino group. Similarly, the pi of histidine, 
with two groups that are positively charged when proto- 
nated, is 7.59 (the average of the pK a values of the amino 
and imidazole groups), much higher than that of glycine. 



Finally, as pointed out earlier, under the general 
condition of free and open exposure to the aqueous en- 
vironment, only histidine has an R group (piT a = 6.0) 
providing significant buffering power near the neutral 
pH usually found in the intracellular and extracellular 
fluids of most animals and bacteria (Table 3-1). 

SUMMARY 3.1 Amino Acids 

■ The 20 amino acids commonly found as residues in 
proteins contain an a-carboxyl group, an a-amino 
group, and a distinctive R group substituted on the 
a -carbon atom. The a-carbon atom of all amino 
acids except glycine is asymmetric, and thus amino 
acids can exist in at least two stereoisomeric forms. 
Only the l stereoisomers, with a configuration 
related to the absolute configuration of the reference 
molecule L-glyceraldehyde, are found in proteins. 

a Other, less common amino acids also occur, either 
as constituents of proteins (through modification 
of common amino acid residues after protein 
synthesis) or as free metabolites. 

■ Amino acids are classified into five types on the 
basis of the polarity and charge (at pH 7) of their 
R groups. 

■ Amino acids vary in their acid-base properties and 
have characteristic titration curves. Monoamino 
monocarboxylic amino acids (with nonionizable 

R groups) are diprotic acids ( + H 3 NCH(R)COOH) at 
low pH and exist in several different ionic forms as 
the pH is increased. Amino acids with ionizable 
R groups have additional ionic species, depending 
on the pH of the medium and the pK a of the R 
group. 
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3.2 Peptides and Proteins 

We now turn to polymers of amino acids, the peptides 
and proteins. Biologically occurring polypeptides range 
in size from small to very large, consisting of two or 
three to thousands of linked amino acid residues. Our 
focus is on the fundamental chemical properties of these 
polymers. 

Peptides Are Chains of Amino Acids 

Two amino acid molecules can be covalently joined 
through a substituted amide linkage, termed a peptide 
bond, to yield a dipeptide. Such a linkage is formed by 
removal of the elements of water (dehydration) from 
the a-carboxyl group of one amino acid and the a-amino 
group of another (Fig. 3-13). Peptide bond formation 
is an example of a condensation reaction, a common 
class of reactions in living cells. Under standard bio- 
chemical conditions, the equilibrium for the reaction 
shown in Figure 3-13 favors the amino acids over the 
dipeptide. To make the reaction thermodynamically 
more favorable, the carboxyl group must be chemically 
modified or activated so that the hydroxyl group can be 
more readily eliminated. A chemical approach to this 
problem is outlined later in this chapter. The biological 
approach to peptide bond formation is a major topic of 
Chapter 27. 

Three amino acids can be joined by two peptide 
bonds to form a tripeptide; similarly, four amino acids 
can be linked to form tetrapeptides, five to form pen- 
tapeptides, and so forth. When a few amino acids are 
joined in this fashion, the structure is called an 
oligopeptide. When many amino acids are joined, the 
product is called a polypeptide. Proteins may have 
thousands of amino acid residues. Although the terms 
"protein" and "polypeptide" are sometimes used inter- 
changeably, molecules referred to as polypeptides gen- 
erally have molecular weights below 10,000, and those 
called proteins have higher molecular weights. 
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FIGURE 3-13 Formation of a peptide bond by condensation. The 

a-amino group of one amino acid (with R 2 group) acts as a nucleophile 
to displace the hydroxyl group of another amino acid (with R 1 group), 
forming a peptide bond (shaded in yellow), Amino groups are good nu- 
cleophiles, but the hydroxyl group is a poor leaving group and is not 
readily displaced. At physiological pH, the reaction shown here does 
not occur to any appreciable extent. 



CH 3 CH 3 
CH 



CH 2 OH H H H CM A H CH, H CH Z 

+ 

H 3 N-C C-N-C-C-N-C-C— N-C-C— N— C—i 

1 U iJ I J 1 I J 



COO" 



H 



0 



H O 



H O 



H 



Ammo- 
terminal end 



Carboxyl- 
terminal end 



FIGURE 3-14 The pentapeptide serylglycyltyrosy lalanyl leucine, 
Ser-Gly-Tyr-Ala-Leu, or SGYAL. Peptides are named beginning with 
the amino-terminal residue, which by convention is placed at the left. 
The peptide bonds are shaded in yellow; the R groups are in red. 

Figure 3-14 shows the structure of a pentapep- 
tide. As already noted, an amino acid unit in a peptide is 
often called a residue (the part left over after losing a 
hydrogen atom from its amino group and the hydroxyl 
moiety from its carboxyl group). In a peptide, the amino 
acid residue at the end with a free a-amino group is the 
amino-terminal (or N- terminal) residue; the residue at 
the other end, which has a free carboxyl group, is the 
carboxyl-terminal (C-terminal) residue. 

KEY CONVENTION: When an amino acid sequence of a 
peptide, polypeptide, or protein is displayed, the amino- 
terminal end is placed on the left, the carboxyl-terminal 
end on the right. The sequence is read left to right, be- 
ginning with the amino-terminal end. ■ 

Although hydrolysis of a peptide bond is an exergonic 
reaction, it occurs only slowly because it has a high acti- 
vation energy (see p. 25). As a result, the peptide bonds 
in proteins are quite stable, with an average half-life (f 1/2 ) 
of about 7 years under most intracellular conditions. 

Peptides Can Be Distinguished by Their 
Ionization Behavior 

Peptides contain only one free a-amino group and one 
free a-carboxyl group, at opposite ends of the chain 
( Fig, 3- 15). These groups ionize as they do in free amino 
acids, although the ionization constants are different be- 
cause an oppositely charged group is no longer linked to 
the a carbon. The a-amino and a-carboxyl groups of all 
nonterminal amino acids are covalently joined in the pep- 
tide bonds, which do not ionize and thus do not con- 
tribute to the total acid-base behavior of peptides. 
However, the R groups of some amino acids can ionize 
(Table 3-1), and in a peptide these contribute to the 
overall acid-base properties of the molecule (Fig. 3-15). 
Thus the acid-base behavior of a peptide can be predicted 
from its free a-amino and a-carboxyl groups as well as the 
nature and number of its ionizable R groups. 

Like free amino acids, peptides have characteristic 
titration curves and a characteristic isoelectric pH (pi) 
at which they do not move in an electric field. These 
properties are exploited in some of the techniques used 
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FIGURE 3-15 Alanylglutamylglycyllysine. This tetrapeptide has one 
free a-amino group, one free a-carboxyl group, and two ionizable R 
groups. The groups ionized at pH 7.0 are in red. 

to separate peptides and proteins, as we shall see later 
in the chapter. It should be emphasized that the pK a 
value for an ionizable R group can change somewhat 
when an amino acid becomes a residue in a peptide. The 
loss of charge in the a-carboxyl and a-amino groups, the 
interactions with other peptide R groups, and other en- 
vironmental factors can affect the pK a . The pK a values 
for R groups listed in Table 3-1 can be a useful guide to 
the pH range in which a given group will ionize, but they 
cannot be strictly applied to peptides. 

Biologically Active Peptides and Polypeptides Occur in a 
Vast Range of Sizes and Compositions 

No generalizations can be made about the molecular 
weights of biologically active peptides and proteins in rela- 




tion to their functions. Naturally occurring peptides range 
in length from two to many thousands of amino acid 
residues. Even the smallest peptides can have biologically 
important effects. Consider the commercially synthesized 
dipeptide L-aspartyl-L-phenylalanine methyl ester, the arti- 
ficial sweetener better known as aspartame or NutraSweet. 
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L-Aspartyl-L-phenylalanine methyl ester 
(aspartame) 



Many small peptides exert their effects at very low 
concentrations. For example, a number of vertebrate 
hormones (Chapter 23) are small peptides. These in- 
clude oxytocin (nine amino acid residues), which is se- 
creted by the posterior pituitary and stimulates uterine 
contractions, and thyrotropin-releasing factor (three 
residues), which is formed in the hypothalamus and 
stimulates the release of another hormone, thyrotropin, 
from the anterior pituitary gland. Some extremely toxic 
mushroom poisons, such as amanitin, are also small pep- 
tides, as are many antibiotics. 

How long are the polypeptide chains in proteins? As 
Table 3-2 shows, lengths vary considerably. Human 
cytochrome c has 104 amino acid residues linked in a 
single chain; bovine chymotrypsinogen has 245 residues. 
At the extreme is titin, a constituent of vertebrate mus- 
cle, which has nearly 27,000 amino acid residues and a 
molecular weight of about 3,000,000. The vast majority 
of naturally occurring proteins are much smaller than 
this, containing fewer than 2,000 amino acid residues. 



TABLE 3-2 




Molecular 


Number of 


Number of 




weight 


residues 


polypeptide chains 


Cytochrome c (human) 


12,400 


104 


1 


Ribonuclease A (bovine pancreas) 


13,700 


124 


1 


Lysozyme (chicken egg white) 


14,300 


129 


1 


Myoglobin (equine heart) 


16,700 


153 


1 


Chymotrypsin (bovine pancreas) 


25,200 


241 


3 


Chymotrypsinogen (bovine) 


25,700 


245 


1 


Hemoglobin (human) 


64,500 


574 


4 


Serum albumin (human) 


66,000 


609 


1 


Hexokinase (yeast) 


107,900 


972 


2 


RNA polymerase (E. colt) 


450,000 


4,158 


5 


Apolipoprotein B (human) 


513,000 


4,536 


1 


Glutamine synthetase (E. colt) 


619,000 


5,628 


12 


Titin (human) 


2,993,000 


26,926 


1 
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Amino Acid Composition of 
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2 
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4 


6 
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4 


9 


Ser 


1 


28 


Thr 


8 


23 


Trp 


1 


8 


Tyr 


4 


4 


Val 


3 


23 


Total 


104 


245 



*ln some common analyses, such as acid hydrolysis, Asp and Asn are not readily dis- 
tinguished from each other and are together designated Asx (or B). Similarly, when Glu 
and Gin cannot be distinguished, they are together designated Glx (or Z). In addition, 
Trp is destroyed by acid hydrolysis. Additional procedures must be employed to obtain 
an accurate assessment of complete amino acid content. 



Some proteins consist of a single polypeptide chain, 
but others, called multisubunit proteins, have two or 
more polypeptides associated noncovalently (Table 3-2). 
The individual polypeptide chains in a multisubunit pro- 
tein may be identical or different. If at least two are 
identical the protein is said to be oligomeric, and the 
identical units (consisting of one or more polypeptide 
chains) are referred to as protomers. Hemoglobin, for 
example, has four polypeptide subunits: two identical a 
chains and two identical jS chains, all four held together 
by noncovalent interactions. Each a subunit is paired in 
an identical way with a /3 subunit within the structure of 
this multisubunit protein, so that hemoglobin can be 
considered either a tetramer of four polypeptide sub- 
units or a dimer of a/3 protomers. 

A few proteins contain two or more polypeptide 
chains linked covalently. For example, the two polypeptide 



chains of insulin are linked by disulfide bonds. In such 
cases, the individual polypeptides are not considered sub- 
units but are commonly referred to simply as chains. 

The amino acid composition of proteins is also 
highly variable. The 20 common amino acids almost 
never occur in equal amounts in a protein. Some amino 
acids may occur only once or not at all in a given type 
of protein; others may occur in large numbers. Table 
3-3 shows the amino acid composition of bovine cy- 
tochrome c and chymotrypsinogen, the inactive pre- 
cursor of the digestive enzyme chymotrypsin. These 
two proteins, with very different functions, also differ 
significantly in the relative numbers of each kind of 
amino acid residue. 

We can calculate the approximate number of amino 
acid residues in a simple protein containing no other 
chemical constituents by dividing its molecular weight 
by 110. Although the average molecular weight of the 
20 common amino acids is about 138, the smaller amino 
acids predominate in most proteins. If we take into ac- 
count the proportions in which the various amino acids 
occur in an average protein (Table 3-1; the averages 
are determined by surveying the amino acid composi- 
tions of thousands of different proteins), the average 
molecular weight of protein amino acids is nearer to 
128. Because a molecule of water (M T 18) is removed to 
create each peptide bond, the average molecular 
weight of an amino acid residue in a protein is about 
128 - 18 = 110. 

Some Proteins Contain Chemical Groups Other Than 
Amino Adds 

Many proteins, for example the enzymes ribonuclease A 
and chymotrypsin, contain only amino acid residues and 
no other chemical constituents; these are considered 
simple proteins. However, some proteins contain perma- 
nently associated chemical components in addition to 
amino acids; these are called conjugated proteins. 
The non-amino acid part of a conjugated protein is usu- 
ally called its prosthetic group. Conjugated proteins 
are classified on the basis of the chemical nature of their 
prosthetic groups (Table 3-4); for example, lipopro- 
teins contain lipids, glycoproteins contain sugar 
groups, and metalloproteins contain a specific metal. 
Some proteins contain more than one prosthetic group. 
Usually the prosthetic group plays an important role in 
the protein's biological function. 

SUMMARY 3.2 Peptides and Proteins 

a Amino acids can be joined covalently through 
peptide bonds to form peptides and proteins. Cells 
generally contain thousands of different proteins, 
each with a different biological activity. 

■ Proteins can be very long polypeptide chains of 100 
to several thousand amino acid residues. However, 
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Conjugated Proteins 










Liass 


group 


Example 


Lipoproteins 


Lipids 


/^-Lipoprotein of blood 


Glycoproteins 


C arb o hy drat e s 


Immunoglobulin G 


Phosphoproteins 


Phosphate groups 


Casein of milk 


Hemoproteins 


Heme (iron porphyrin) 


Hemoglobin 


Flavoproteins 


Flavin nucleotides 


Succinate dehydrogenase 


Metalloproteins 


Iron 


Ferritin 




Zinc 


Alcohol dehydrogenase 




Calcium 


Calmodulin 




Molybdenum 


Dinitrogenase 




Copper 


Plastocyanin 



some naturally occurring peptides have only a few 
amino acid residues. Some proteins are composed 
of several noncovalently associated polypeptide 
chains, called subunits. 

Simple proteins yield only amino acids on 
hydrolysis; conjugated proteins contain in addition 
some other component, such as a metal or organic 
prosthetic group. 

33 Working with Proteins 

Biochemists' understanding of protein structure and 
function has been derived from the study of many indi- 
vidual proteins. To study a protein in detail, the re- 
searcher must be able to separate it from other proteins 
in pure form and must have the techniques to determine 
its properties. The necessary methods come from pro- 
tein chemistry, a discipline as old as biochemistry itself 
and one that retains a central position in biochemical 
research. 

Proteins Can Be Separated and Purified 

A pure preparation is essential before a protein's prop- 
erties and activities can be determined. Given that cells 
contain thousands of different kinds of proteins, how 
can one protein be purified? Classical methods for sepa- 
rating proteins take advantage of properties that vary 
from one protein to the next, including size, charge, and 
binding properties. Some additional modern methods, 
involving DNA cloning and genome sequencing, can sim- 
plify the process of protein purification and are pre- 
sented in Chapter 9. 

The source of a protein is generally tissue or mi- 
crobial cells. The first step in any protein purification 
procedure is to break open these cells, releasing their 
proteins into a solution called a crude extract. If 



necessary, differential centrifugation can be used to 
prepare subcellular fractions or to isolate specific 
organelles (see Fig. 1-8). 

Once the extract or organelle preparation is ready, 
various methods are available for purifying one or more 
of the proteins it contains. Commonly, the extract is 
subjected to treatments that separate the proteins into 
different fractions based on a property such as size or 
charge, a process referred to as fractionation. Early 
fractionation steps in a purification utilize differences in 
protein solubility, which is a complex function of pH, 
temperature, salt concentration, and other factors. The 
solubility of proteins is generally lowered at high salt 
concentrations, an effect called "salting out." The addi- 
tion of certain salts in the right amount can selectively 
precipitate some proteins, while others remain in solu- 
tion. Ammonium sulfate ((NH 4 ) 2 S0 4 ) is particularly ef- 
fective and is often used to salt out proteins. The 
proteins thus precipitated are removed from those re- 
maining in solution by low-speed centrifugation. 

A solution containing the protein of interest usually 
must be further altered before subsequent purification 
steps are possible. For example, dialysis is a procedure 
that separates proteins from small solutes by taking ad- 
vantage of the proteins 1 larger size. The partially purified 
extract is placed in a bag or tube made of a semiperme- 
able membrane. When this is suspended in a much 
larger volume of buffered solution of appropriate ionic 
strength, the membrane allows the exchange of salt and 
buffer but not proteins. Thus dialysis retains large pro- 
teins within the membranous bag or tube while allowing 
the concentration of other solutes in the protein prepa- 
ration to change until they come into equilibrium with 
the solution outside the membrane. Dialysis might be 
used, for example, to remove ammonium sulfate from 
the protein preparation. 

The most powerful methods for fractionating pro- 
teins make use of column chromatography, which 
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FIGURE 3-16 Column chromatography. The standard elements of a 
chromatographic column include a solid, porous material (matrix) sup- 
ported inside a column, generally made of plastic or glass. A solution, 
the mobile phase, flows through the matrix, the stationary phase. The 
solution that passes out of the column at the bottom (the effluent) is 
constantly replaced by solution supplied from a reservoir at the top. 
The protein solution to be separated is layered on top of the column 
and allowed to percolate into the solid matrix. Additional solution is 
added on top. The protein solution forms a band within the mobile 
phase that is initially the depth of the protein solution applied to the 
column. As proteins migrate through the column, they are retarded to 
different degrees by their different interactions with the matrix mate- 
rial. The overall protein band thus widens as it moves through the col- 
umn. Individual types of proteins (such as A, B, and C, shown in blue, 
red, and green) gradually separate from each other, forming bands 
within the broader protein band. Separation improves (i.e., resolution 
increases) as the length of the column increases. However, each indi- 
vidual protein band also broadens with time due to diffusional spread- 
ing, a process that decreases resolution. In this example, protein A is 
well separated from B and C, but diffusional spreading prevents com- 
plete separation of B and C under these conditions. 

takes advantage of differences in protein charge, size, 
binding affinity, and other properties (Fig. 3-16). A 
porous solid material with appropriate chemical proper- 
ties (the stationary phase) is held in a column, and a 



buffered solution (the mobile phase) percolates through 
it. The protein-containing solution, layered on the top of 
the column, percolates through the solid matrix as an 
ever-expanding band within the larger mobile phase. In- 
dividual proteins migrate faster or more slowly through 
the column depending on their properties. 

Ion-exchange chromatography exploits differ- 
ences in the sign and magnitude of the net electric 
charge of proteins at a given pH. The column matrix 
is a synthetic polymer (resin) containing bound 
charged groups; those with bound anionic groups are 
called cation exchangers, and those with bound 
cationic groups are called anion exchangers. The affin- 
ity of each protein for the charged groups on the 
column is affected by the pH (which determines the 
ionization state of the molecule) and the concentra- 
tion of competing free salt ions in the surrounding 
solution. Separation can be optimized by gradually 
changing the pH and/or salt concentration of the mo- 
bile phase so as to create a pH or salt gradient. In 
cation-exchange chromatography (Fig. 3-1 7a), 
the solid matrix has negatively charged groups. In 
the mobile phase, proteins with a net positive charge 
migrate through the matrix more slowly than those 
with a net negative charge, because the migration of 
the former is retarded more by interaction with the 
stationary phase. 

In ion-exchange columns, the expansion of the pro- 
tein band in the mobile phase (the protein solution) is 
caused both by separation of proteins with different 
properties and by diffusional spreading. As the length of 
the column increases, the resolution of two types of 
protein with different net charges generally improves. 
However, the rate at which the protein solution can flow 
through the column usually decreases with column 
length. And as the length of time spent on the column 
increases, the resolution can decline as a result of diffu- 
sional spreading within each protein band. As the pro- 
tein-containing solution exits a column, successive 
portions (fractions) of this effluent are collected in test 
tubes. Each fraction can be tested for the presence of 
the protein of interest as well as other properties, such 
as ionic strength or total protein concentration. All frac- 
tions positive for the protein of interest can be com- 
bined as the product of this chromatographic step of the 
protein purification. 

Hi WORKED EXAMPLE 3-1 Ion Exchange of Peptides 

A biochemist wants to separate two peptides by ion- 
exchange chromatography. At the pH of the mobile 
phase to be used on the column, one peptide (A) has a 
net charge of -3, due to the presence of more Glu and 
Asp residues than Arg, Lys, and His residues. Peptide B 
has a net charge of +1. Which peptide would elute first 
from a cation-exchange resin? Which would elute first 
from an anion-exchange resin? 
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Polymer beads with negatively- 
charged functional groups 

Protein mixture is added 
to column containing 
cation exchangers 



Proteins move through the column at rates 
determined by their net charge at the pH 
being used. With cation exchangers, 
proteins with a more negative net charge 
move faster and elute earlier. 

(a) Ion-exchange chromatography 

FIGURE 3-17 Three chromatographic methods used in protein 
purification, (a) Ion-exchange chromatography exploits differences in 
the sign and magnitude of the net electric charges of proteins at a given 
pH. (b) Size-exclusion chromatography, also called gel filtration, sepa- 
rates proteins according to size, (c) Affinity chromatography separates 
proteins by their binding specificities. Further details of these methods 
are given in the text. 



Solution: A cation-exchange resin has negative charges 
and binds positively charged molecules, retarding their 
progress through the column. Peptide B, with its net 
positive charge, will interact more strongly with the 
cation-exchange resin than peptide A, and thus peptide 
A will elute first. On the anion-exchange resin, peptide B 
will elute first. Peptide A, being negatively charged, will 
be retarded by its interaction with the positively 
charged resin. 



Figure 3-17 shows two other variations of column 
chromatography in addition to ion exchange. Size- 
exclusion chromatography, also called gel filtration 
(Fig. 3-1 7b), separates proteins according to size. In this 
method, large proteins emerge from the column sooner 
than small ones — a somewhat counterintuitive result. 
The solid phase consists of cross-linked polymer beads 
with engineered pores or cavities of a particular size. 
Large proteins cannot enter the cavities and so take a 
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by size; larger molecules 
pass more freely, appearing 
in the earlier fractions. 
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A Purification Table for a Hypothetical Enzyme 




Fraction volume 


Total protein 


Activity 


Specific activity 


Procedure or step 


(ml) 


(mg) 


(units) 


(units/mg) 


1 . Crude cellular extract 


1,400 


10,000 


100,000 


10 


2 Precipitation with ammonium sulfate 


280 


3,000 


96,000 


32 


3. Ion-exchange chromatography 


90 


400 


80,000 


200 


4. Size-exclusion chromatography 


80 


100 


60,000 


600 


5. Affinity chromatography 


6 


3 


45,000 


15,000 



Note: All data represent the status of the sample after the designated procedure has been carried out. Activity and specific activity are defined on page 91. 



short (and rapid) path through the column, around the 
beads. Small proteins enter the cavities and are slowed 
by their more labyrinthine path through the column. 

Affinity chromatography is based on binding 
affinity (Fig. 3-1 7c). The beads in the column have a co- 
valently attached chemical group called a ligand — a 
group or molecule that binds to a macromolecule such 
as a protein. When a protein mixture is added to the col- 
umn, any protein with affinity for this ligand binds to the 
beads, and its migration through the matrix is retarded 
For example, if the biological function of a protein in- 
volves binding to ATP, then attaching ATP to the beads 
in the column creates an affinity matrix that can help 
purify the protein. As the protein solution moves through 
the column, ATP-binding proteins (including the protein 
of interest) bind to the matrix. After proteins that do not 
bind are washed through the column, the bound protein 
is eluted by a solution containing either a high concen- 
tration of salt or free ligand— in this case, ATP. Salt 
weakens the binding of the protein to the immobilized 
ligand, interfering with ionic interactions. Free ligand 
competes with the ligand attached to the beads, releas- 
ing the protein from the matrix; the protein product that 
elutes from the column is often bound to the ligand used 
to elute it. 

A modem refinement in chromatographic methods 
is HPLC, or high-performance liquid chromatogra- 
phy. HPLC makes use of high-pressure pumps that 
speed the movement of the protein molecules down the 
column, as well as higher-quality chromatographic ma- 
terials that can withstand the crushing force of the pres- 
surized flow. By reducing the transit time on the column, 
HPLC can limit diffusional spreading of protein bands 
and thus greatly improve resolution. 

The approach to purification of a protein that has 
not previously been isolated is guided both by estab- 
lished precedents and by common sense. In most cases, 
several different methods must be used sequentially to 
purify a protein completely, each method separating 
proteins on the basis of different properties. For exam- 
ple, if one step separates ATP-binding proteins from 
those that do not bind ATP, then the next step must sep- 
arate the various ATP-binding proteins on the basis of 



size or charge to isolate the particular protein that is 
wanted. The choice of methods is somewhat empirical, 
and many strategies may be tried before the most effec- 
tive one is found. Trial and error can often be minimized 
by basing the procedure on purification techniques de- 
veloped for similar proteins. Published purification pro- 
tocols are available for many thousands of proteins. 
Common sense dictates that inexpensive procedures 
such as salting out be used first, when the total volume 
and the number of contaminants are greatest. Chro- 
matographic methods are often impractical at early 
stages, because the amount of chromatographic 
medium needed increases with sample size. As each pu- 
rification step is completed, the sample size generally 
becomes smaller (Table 3-5), making it feasible to use 
more sophisticated (and expensive) chromatographic 
procedures at later stages. 

Proteins Can Be Separated and Characterized by 
Electrophoresis 

Another important technique for the separation of 
proteins is based on the migration of charged proteins 
in an electric field, a process called electrophoresis. 
These procedures are not generally used to purify 
proteins in large amounts, because simpler alterna- 
tives are usually available and electrophoretic meth- 
ods often adversely affect the structure and thus the 
function of proteins. Electrophoresis is, however, es- 
pecially useful as an analytical method. Its advantage 
is that proteins can be visualized as well as separated, 
permitting a researcher to estimate quickly the num- 
ber of different proteins in a mixture or the degree of 
purity of a particular protein preparation. Also, elec- 
trophoresis allows determination of crucial properties 
of a protein such as its isoelectric point and approxi- 
mate molecular weight. 

Electrophoresis of proteins is generally carried out 
in gels made up of the cross-linked polymer polyacryl- 
amide (Fig. 3-18). The polyacrylamide gel acts as a 
molecular sieve, slowing the migration of proteins ap- 
proximately in proportion to their charge-to-mass ra- 
tio. Migration may also be affected by protein shape. In 
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FIGURE 3-18 Electrophoresis, (a) Different samples are loaded in 
wells or depressions at the top of the polyacrylamide gel. The proteins 
move into the gel when an electric field is applied. The gel minimizes 
convection currents caused by small temperature gradients, as well as 
protein movements other than those induced by the electric field, 
(b) Proteins can be visualized after electrophoresis by treating the gel 
with a stain such as Coomassie blue, which binds to the proteins but not 
to the gel itself. Each band on the gel represents a different protein (or 
protein subunit); smaller proteins move through the gel more rapidly 
than larger proteins and therefore are found nearer the bottom of the 
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gel. This gel illustrates purification of the RecA protein of Escherichia 
coli (described in Chapter 25). The gene for the RecA protein was 
cloned (Chapter 9) so that its expression (synthesis of the protein) 
could be controlled. The first lane shows a set of standard proteins (of 
known M r ), serving as molecular weight markers. The next two lanes 
show proteins from E, coli cells before and after synthesis of RecA pro- 
tein was induced. The fourth lane shows the proteins in a crude cellu- 
lar extract. Subsequent lanes (left to right) show the proteins present 
after successive purification steps. The purified protein is a single 
polypeptide chain [M r -38,000), as seen in the rightmost lane. 



electrophoresis, the force moving the macromolecule 
is the electrical potential, E The electrophoretic mo- 
bility, /i, of a molecule is the ratio of its velocity, V, to 
the electrical potential. Electrophoretic mobility is also 
equal to the net charge, Z, of the molecule divided by 
the frictional coefficient,/, which reflects in part a pro- 
tein's shape. Thus: 

V Z 

The migration of a protein in a gel during electrophoresis 
is therefore a function of its size and its shape. 

An electrophoretic method commonly employed 
for estimation of purity and molecular weight makes 
use of the detergent sodium dodecyl sulfate (SDS) 
("dodecyl" denoting a 12-carbon chain). 

O 

Na + "0-S-0-(CH 2 ) n CH 3 
O 

Sodium dodecyl sulfate 
(SDS) 

SDS binds to most proteins in amounts roughly pro- 
portional to the molecular weight of the protein, about 



one molecule of SDS for every two amino acid 
residues. The bound SDS contributes a large net neg- 
ative charge, rendering the intrinsic charge of the pro- 
tein insignificant and conferring on each protein a 
similar charge-to-mass ratio. In addition, SDS binding 
partially unfolds proteins, such that most SDS-bound 
proteins assume a similar shape. Electrophoresis in 
the presence of SDS therefore separates proteins al- 
most exclusively on the basis of mass (molecular 
weight), with smaller polypeptides migrating more 
rapidly. After electrophoresis, the proteins are visual- 
ized by adding a dye such as Coomassie blue, which 
binds to proteins but not to the gel itself (Fig. 3-18b). 
Thus, a researcher can monitor the progress of a pro- 
tein purification procedure as the number of protein 
bands visible on the gel decreases after each new frac- 
tionation step. When compared with the positions to 
which proteins of known molecular weight migrate in 
the gel, the position of an unidentified protein can 
provide a good approximation of its molecular weight 
(Fig. 3-19). If the protein has two or more different 
sub units, the subunits are generally separated by the 
SDS treatment, and a separate band appears for each. 
(| SDS Gel Electrophoresis 
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FIGURE 3-19 Estimating the molecular weight of a protein. The elec- 
trophoretic mobility of a protein on an SDS polyacrylamide gel is 
related to its molecular weight, M r . (a) Standard proteins of known mo- 
lecular weight are subjected to electrophoresis (lane 1). These marker 
proteins can be used to estimate the molecular weight of an unknown 
protein (lane 2). (b) A plot of log M T of the marker proteins versus rela- 
tive migration during electrophoresis is linear, which allows the molec- 
ular weight of the unknown protein to be read from the graph. 
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Isoelectric focusing is a procedure used to deter- 
mine the isoelectric point (pi) of a protein (Fig. 3-20). A 
pH gradient is established by allowing a mixture of low 
molecular weight organic acids and bases (ampholytes; 
p. 79) to distribute themselves in an electric field gener- 
ated across the gel. When a protein mixture is applied, 
each protein migrates until it reaches the pH that matches 
its pi (Table 3-6). Proteins with different isoelectric 
points are thus distributed differently throughout the gel. 



Combining isoelectric focusing and SDS elec- 
trophoresis sequentially in a process called two- 
dimensional electrophoresis permits the resolution 
of complex mixtures of proteins (Fig. 3-21). This is a 
more sensitive analytical method than either elec- 
trophoretic method alone. Two-dimensional elec- 
trophoresis separates proteins of identical molecular 
weight that differ in pi, or proteins with similar pi values 
but different molecular weights. 
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FIGURE 3-20 Isoelectric focusing. This technique separates proteins according to their 
isoelectric points. A stable pH gradient is established in the gel by the addition of ap- 
propriate ampholytes. A protein mixture is placed in a well on the gel. With an applied 
electric field, proteins enter the gel and migrate until each reaches a pH equivalent to 
its pi. Remember that when pH = pi, the net charge of a protein is zero. 
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FIGURE 3-21 Two-dimensional electrophoresis, (a) Proteins are first 
separated by isoelectric focusing in a cylindrical gel. The gel is then 
laid horizontally on a second, slab-shaped gel, and the proteins are 
separated by SDS polyacrylamide gel electrophoresis. Horizontal sep- 
aration reflects differences in pi; vertical separation reflects differences 
in molecular weight, (b) More than 1 ,000 different proteins from E . colt 
can be resolved using this technique. 



Unseparated Proteins Can Be Quantified 

To purify a protein, it is essential to have a way of de- 
tecting and quantifying that protein in the presence of 
many other proteins at each stage of the procedure. Of- 
ten, purification must proceed in the absence of any in- 
formation about the size and physical properties of the 
protein or about the fraction of the total protein mass it 
represents in the extract. For proteins that are enzymes, 
the amount in a given solution or tissue extract can be 
measured, or assayed, in terms of the catalytic effect the 
enzyme produces — that is, the increase in the rate at 
which its substrate is converted to reaction products 
when the enzyme is present. For this purpose one must 
know (1) the overall equation of the reaction catalyzed, 
(2) an analytical procedure for determining the disap- 
pearance of the substrate or the appearance of a reaction 
product, (3) whether the enzyme requires cofactors 
such as metal ions or coenzymes, (4) the dependence of 
the enzyme activity on substrate concentration, (5) the 
optimum pH, and (6) a temperature zone in which the 
enzyme is stable and has high activity Enzymes are usu- 
ally assayed at their optimum pH and at some convenient 
temperature within the range 25 to 38°C. Also, very high 
substrate concentrations are generally used so that the 
initial reaction rate, measured experimentally, is propor- 
tional to enzyme concentration (Chapter 6). 

By international agreement, 1.0 unit of enzyme ac- 
tivity for most enzymes is defined as the amount of en- 
zyme causing transformation of 1.0 /zmol of substrate 
per minute at 25°C under optimal conditions of mea- 
surement (for some enzymes, this definition is inconven- 
ient, and a unit may be defined differently). The term 
activity refers to the total units of enzyme in a solution. 
The specific activity is the number of enzyme units 
per milligram of total protein (Fig. 3-22). The specific 
activity is a measure of enzyme purity: it increases dur- 
ing purification of an enzyme and becomes maximal and 
constant when the enzyme is pure (Table 3-5, p. 88). 
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FIGURE 3-22 Activity versus specific activity. The difference between 
these terms can be illustrated by considering two beakers of mar- 
bles. The beakers contain the same number of red marbles, but 
different numbers of marbles of other colors. If the marbles represent 
proteins, both beakers contain the same activity of the protein repre- 
sented by the red marbles. The second beaker, however, has the higher 
specific activity because red marbles represent a higher fraction of 
the total. 
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After each purification step, the activity of the 
preparation (in units of enzyme activity) is assayed, the 
total amount of protein is determined independently, 
and the ratio of the two gives the specific activity. Ac- 
tivity and total protein generally decrease with each 
step. Activity decreases because there is always some 
loss due to inactivation or nonideal interactions with 
chromatographic materials or other molecules in the 
solution. Total protein decreases because the objective 
is to remove as much unwanted or nonspecific protein 
as possible. In a successful step, the loss of nonspecific 
protein is much greater than the loss of activity; there- 
fore, specific activity increases even as total activity 
falls. The data are assembled in a purification table sim- 
ilar to Table 3-5. A protein is generally considered pure 
when further purification steps fail to increase specific 
activity and when only a single protein species can be 
detected (for example, by electrophoresis). 

For proteins that are not enzymes, other quantifica- 
tion methods are required. Transport proteins can be as- 
sayed by their binding to the molecule they transport, 
and hormones and toxins by the biological effect they 
produce; for example, growth hormones will stimulate 
the growth of certain cultured cells. Some structural 
proteins represent such a large fraction of a tissue mass 
that they can be readily extracted and purified without a 
functional assay. The approaches are as varied as the 
proteins themselves. 

SUMMARY 3.3 Working with Proteins 

■ Proteins are separated and purified on the basis of 
differences in their properties. Proteins can be 
selectively precipitated by the addition of certain 
salts. A wide range of chromatographic procedures 
makes use of differences in size, binding affinities, 
charge, and other properties. These include 
ion-exchange, size-exclusion, affinity, and 
high-performance liquid chromatography. 

■ Electrophoresis separates proteins on the basis 
of mass or charge. SDS gel electrophoresis and 
isoelectric focusing can be used separately or in 
combination for higher resolution. 

Primary 
structure 

FIGURE 3-23 Levels of structure in 
proteins. The primary structure consists 
of a sequence of amino acids linked to- 
gether by peptide bonds and includes 
any disulfide bonds. The resulting 
polypeptide can be arranged into units 
of secondary structure, such as an a 
helix. The helix is a part of the tertiary 
structure of the folded polypeptide, 
which is itself one of the subunits that 
make up the quaternary structure of 
the multisubunit protein, in this case Amino acid 
hemoglobin. residues 



■ All purification procedures require a method for 
quantifying or assaying the protein of interest in 
the presence of other proteins. Purification can be 
monitored by assaying specific activity. 

3.4 The Structure of Proteins: 
Primary Structure 

Purification of a protein is usually only a prelude to a de- 
tailed biochemical dissection of its structure and func- 
tion. What is it that makes one protein an enzyme, 
another a hormone, another a structural protein, and 
still another an antibody? How do they differ chemi- 
cally? The most obvious distinctions are structural, and 
to protein structure we now turn. 

For large macromolecules such as proteins, the tasks 
of describing and understanding structure are ap- 
proached at several levels of complexity, arranged in a 
kind of conceptual hierarchy. Four levels of protein struc- 
ture are commonly defined (Fig. 3-23). A description of 
all covalent bonds (mainly peptide bonds and disulfide 
bonds) linking amino acid residues in a polypeptide chain 
is its primary structure. The most important element of 
primary structure is the sequence of amino acid residues. 
Secondary structure refers to particularly stable 
arrangements of amino acid residues giving rise to recur- 
ring structural patterns. Tertiary structure describes all 
aspects of the three-dimensional folding of a polypeptide. 
When a protein has two or more polypeptide subunits, 
their arrangement in space is referred to as quaternary 
structure. Our exploration of proteins will eventually in- 
clude complex protein machines consisting of dozens to 
thousands of subunits. Primary structure is the focus of 
the remainder of this chapter; the higher levels of struc- 
ture are discussed in Chapter 4. 

The differences in primary structure can be especially 
informative. Each protein has a distinctive number and se- 
quence of amino acid residues. As we shall see in Chapter 
4, the primary structure of a protein determines how it 
folds up into its unique three-dimensional structure, and 
this in turn determines the function of the protein. We first 
consider empirical clues that amino acid sequence and 
protein function are closely linked, then describe how 
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amino acid sequence is determined; finally, we outline the 
many uses to which this information can be put. 

The Function of a Protein Depends on Its 
Amino Acid Sequence 

The bacterium Escherichia coli produces more than 
3,000 different proteins; a human has -25,000 genes en- 
coding a much larger number of proteins (through ge- 
netic processes discussed in Part III of this text). In both 
cases, each type of protein has a unique three-dimensional 
structure and this structure confers a unique function. 
Each type of protein also has a unique amino acid sequence. 
Intuition suggests that the amino acid sequence must play 
a fundamental role in determining the three-dimensional 
structure of the protein, and ultimately its function, but is 
this supposition correct? A quick survey of proteins and 
how they vary in amino acid sequence provides some em- 
pirical clues that help substantiate the important relation- 
ship between amino acid sequence and biological function. 

First, as we have already noted, proteins with differ- 
ent functions always have different amino acid se- 
quences. Second, thousands of human genetic diseases 
have been traced to the production of defective pro- 
teins. The defect can range from a single change in the 
amino acid sequence (as in sickle cell anemia, described 
in Chapter 5) to deletion of a larger portion of the 
polypeptide chain (as in most cases of Duchenne mus- 
cular dystrophy: a large deletion in the gene encoding 
the protein dystrophin leads to production of a short- 
ened, inactive protein). Thus we know that if the pri- 
mary structure is altered, the function of the protein 
may also be changed. Finally, on comparing functionally 
similar proteins from different species, we find that 
these proteins often have similar amino acid sequences. 
An extreme case is ubiquitin, a 76-residue protein in- 
volved in regulating the degradation of other proteins. 
The amino acid sequence of ubiquitin is identical in 
species as disparate as fruit flies and humans. 

Is the amino acid sequence absolutely fixed, or in- 
variant, for a particular protein? No; some flexibility is 
possible. An estimated 20% to 30% of the proteins in hu- 
mans are polymorphic, having amino acid sequence 
variants in the human population. Many of these varia- 
tions in sequence have little or no effect on the function 
of the protein. Furthermore, proteins that carry out a 
broadly similar function in distantly related species can 
differ greatly in overall size and amino acid sequence. 

Although the amino acid sequence in some regions 
of the primary structure might vary considerably with- 
out affecting biological function, most proteins contain 
crucial regions that are essential to their function and 
whose sequence is therefore conserved. The fraction of 
the overall sequence that is critical varies from protein 
to protein, complicating the task of relating sequence to 
three-dimensional structure, and structure to function. 
Before we can consider this problem further, however, 
we must examine how sequence information is obtained. 



The Amino Acid Sequences of Millions of Proteins 
Have Been Determined 

Two major discoveries in 1953 were of crucial impor- 
tance in the history of biochemistry. In that year, James 
D. Watson and Francis Crick deduced the double-helical 
structure of DNA and proposed a structural basis for its 
precise replication (Chapter 8). Their proposal illumi- 
nated the molecular reality behind the idea of a gene. In 
the same year, Frederick Sanger worked out the 
sequence of amino acid residues in the polypeptide 
chains of the hormone insulin (Fig. 3-24), surprising 
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FIGURE 3-24 Amino acid sequence of bovine insulin. The two 

polypeptide chains are joined by disulfide cross-linkages. The A chain 
is identical in human, pig, dog, rabbit, and sperm whale insulins. The 
B chains of the cow, pig, dog, goat, and horse are identical. 
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many researchers who had long thought that determin- 
ing the amino acid sequence of a polypeptide would be 
a hopelessly difficult task. It quickly became evident 
that the nucleotide sequence in DNA and the amino 
acid sequence in proteins were somehow related. Barely 
a decade after these discoveries, the genetic code relat- 
ing the nucleotide sequence of DNA to the amino acid 
sequence of protein molecules was elucidated (Chapter 
27). An enormous number of protein sequences can 
now be derived indirectly from the DNA sequences in 
the rapidly growing genome databases. However, mod- 
ern protein chemistry still makes frequent use of tradi- 
tional methods of polypeptide sequencing, which can 
reveal details not evident in the gene sequence, such as 
modifications that occur after proteins are synthesized. 
Chemical protein sequencing now complements the 
newer methods, providing multiple avenues to obtain 
amino acid sequence data. Such data are now critical to 
every area of biochemical investigation. 



Short Polypeptides Are Sequenced with 
Automated Procedures 

Various procedures are used to analyze protein primary 
structure. To start, biochemists have several strategies 
for labeling and identifying the amino-terminal amino 
acid residue (Fig. 3-2-oa). Sanger developed the 
reagent l-fluoro-2,4-dinitrobenzene (FDNB) for this 
purpose; other available reagents are dansyl chloride 
and dabsyl chloride, which yield derivatives that are 
more easily detectable than the dinitrophenyl deriva- 
tives. After the amino-terminal residue is labeled with 
one of these reagents, the polypeptide is hydrolyzed (in 
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FIGURE 3-25 Steps in sequencing a polypeptide, (a) Identification of 
the amino-terminal residue can be the first step in sequencing a 
polypeptide. Sanger's method for identifying the amino-terminal 
residue is shown here, (b) The Edman degradation procedure reveals 



the entire sequence of a peptide. For shorter peptides, this method 
alone readily yields the entire sequence, and step (a) is often omitted. 
Step (a) is useful in the case of larger polypeptides, which are often 
fragmented into smaller peptides for sequencing (see Fig. 3-27). 



6 m HC1) to its constituent amino acids and the labeled 
amino acid is identified. Because the hydrolysis stage 
destroys the polypeptide, this procedure cannot be used 
to sequence a polypeptide beyond its amino-terminal 
residue. However, it can help determine the number of 
chemically distinct polypeptides in a protein, provided 
each has a different amino-terminal residue. For example, 
two residues — Phe and Gly — would be labeled if insulin 
(Fig. 3-24) were subjected to this procedure. 

CH 3 CH 3 
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S0 2 C1 

Dansyl chloride Dabsyl chloride 

To sequence an entire polypeptide, a chemical 
method devised by Pehr Edman is usually employed. 
The Edman degradation procedure labels and re- 
moves only the amino-terminal residue from a peptide, 
leaving all other peptide bonds intact (Fig. 3-25b). The 
peptide is reacted with phenylisothiocyanate under 
mildly alkaline conditions, which converts the amino- 
terminal amino acid to a phenylthiocarbamoyl (PTC) 
adduct. The peptide bond next to the PTC adduct is 
then cleaved in a step carried out in anhydrous trifluo- 
roacetic acid, with removal of the amino-terminal 
amino acid as an anilinothiazolinone derivative. The 
derivatized amino acid is extracted with organic sol- 
vents, converted to the more stable phenylthiohy- 
dantoin derivative by treatment with aqueous acid, 
and then identified. The use of sequential reactions 
carried out under first basic and then acidic condi- 
tions provides a means of controlling the entire 
process. Each reaction with the amino-terminal 
amino acid can go essentially to completion without 
affecting any of the other peptide bonds in the pep- 
tide. After removal and identification of the amino- 
terminal residue, the new amino-terminal residue so 
exposed can be labeled, removed, and identified 
through the same series of reactions. This procedure 
is repeated until the entire sequence is determined. 
The Edman degradation is carried out in a machine, 
called a sequenator, that mixes reagents in the 
proper proportions, separates the products, identi- 
fies them, and records the results. These methods 
are extremely sensitive. Often, the complete amino 
acid sequence can be determined starting with only a 
few micrograms of protein. 

The length of polypeptide that can be accurately 
sequenced by the Edman degradation depends on the 
efficiency of the individual chemical steps. Consider a 
peptide beginning with the sequence Gly-Pro-Lys- at 
its amino terminus. If glycine were removed with 97% 
efficiency, 3% of the polypeptide molecules in the so- 
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lution would retain a Gly residue at their amino termi- 
nus. In the second Edman cycle, 94% of the liberated 
amino acids would be proline (97% of the Pro residues 
would be removed from the 97% of molecules ending 
in Pro), and 2.9% glycine (97% of Gly residues would 
be removed from the 3% of molecules still ending in 
Gly), while 3% of the polypeptide molecules would re- 
tain Gly (0.1%) or Pro (2.9%) residues at their amino 
terminus. At each cycle, peptides that did not react 
in earlier cycles would contribute amino acids to an 
ever-increasing background, eventually making it im- 
possible to determine which amino acid is next in 
the original peptide sequence. Modern sequenators 
achieve efficiencies of better than 99% per cycle, per- 
mitting the sequencing of more than 50 contiguous 
amino acid residues in a polypeptide. The primary 
structure of insulin, worked out by Sanger and col- 
leagues over a period of 10 years, could now be com- 
pletely determined in a day or two by direct sequencing 
in a protein sequenator. (As we'll discuss in Chapter 8, 
DNA sequencing is even more efficient.) 

Large Proteins Must Be Sequenced in 
Smaller Segments 

The overall accuracy of amino acid sequencing generally 
declines as the length of the polypeptide increases. 
The very large polypeptides found in proteins must 
be broken down into smaller pieces to be sequenced 
efficiently. There are several steps in this process. First, 
the protein is cleaved into a set of specific fragments 
by chemical or enzymatic methods. If any disulfide 
bonds are present, they must be broken. Each fragment 
is purified, then sequenced by the Edman procedure. 
Finally, the order in which the fragments appear in the 
original protein is determined and disulfide bonds (if 
any) are located. 

Breaking Disulfide Bonds Disulfide bonds interfere 
with the sequencing procedure. If a cystine residue 
(Fig. 3-7) has one of its peptide bonds cleaved by the 
Edman procedure, it may remain attached to another 
polypeptide strand via its disulfide bond. Disulfide 
bonds also interfere with the enzymatic or chemical 
cleavage of the polypeptide. Two approaches to irre- 
versible breakage of disulfide bonds are outlined in 
Figure 3-26. 

Cleaving the Polypeptide Chain Several methods 
can be used for fragmenting the polypeptide chain. 
Enzymes called proteases catalyze the hydrolytic 
cleavage of peptide bonds. Some proteases cleave only 
the peptide bond adjacent to particular amino acid 
residues (Table 3-7) and thus fragment a polypeptide 
chain in a predictable and reproducible way. A number 
of chemical reagents also cleave the peptide bond adja- 
cent to specific residues. 
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FIGURE 3-26 Breaking disulfide bonds in proteins. Two com- 
mon methods are illustrated. Oxidation of a cystine residue 
with performic acid produces two cysteic acid residues. Re- 
duction by dithiothreitol or /3-mercaptoethanol to form Cys 
residues must be followed by further modification of the reac- 
tive — SH groups to prevent re-formation of the disulfide bond. 
Acetylation by iodoacetate serves this purpose. 
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TABLE 3-7 



The Specificity of Some Common Methods for fragmenting Polypeptide Chains 
Reagent (biological source)* Cleavage points 1 



Trypsin (bovine pancreas) 

Submaxillarus protease (mouse submaxillary gland) 
Chymotrypsin (bovine pancreas) 

Staphylococcus aureus V8 protease (bacterium S. aureus) 
Asp-Af-protease (bacterium Pseudomonas fragi) 
Pepsin (porcine stomach) 

Endoproteinase Lys C (bacterium Lysobacter enzymogenes) 
Cyanogen bromide 



Lys, Arg (C) 
Arg(C) 

Phe, Trp, Tyr (C) 

Asp, Glu (C) 

Asp, Glu (N) 

Leu, Phe, Trp, Tyr (N) 

Lys (C) 

Met (C) 



* All reagents except cyanogen bromide are proteases. All are available from commercial sources. 

Residues furnishing the primary recognition point for the protease or reagent; peptide bond cleavage occurs on either the car- 
bonyl (C) or the amino (N) side of the indicated amino acid residues. 



Among proteases, the digestive enzyme trypsin cat- 
alyzes the hydrolysis of only those peptide bonds in 
which the carbonyl group is contributed by either a Lys 
or an Arg residue, regardless of the length or amino acid 
sequence of the chain. The number of smaller peptides 
produced by trypsin cleavage can thus be predicted 
from the total number of Lys or Arg residues in the 
original polypeptide, as determined by hydrolysis of an 
intact sample (Fig. 3-27). A polypeptide with three 
Lys and/or Arg residues (as in Fig. 3-27) will usually 
yield four smaller peptides on cleavage with trypsin. 
Moreover, all except one of these will have a carboxyl- 



terminal Lys or Arg. The fragments produced by trypsin 
(or other enzyme or chemical) action are then sepa- 
rated by chromatographic or electrophoretic methods. 

Sequencing the Peptides Each peptide fragment re- 
sulting from the action of trypsin is sequenced separately 
by the Edman procedure. 

Ordering the Peptide Fragments The order of the 
"trypsin fragments" in the original polypeptide chain 
must now be determined. Another sample of the intact 
polypeptide is cleaved into fragments using a different 
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enzyme or reagent, one that cleaves peptide bonds at 
points other than those cleaved by trypsin. For example, 
cyanogen bromide cleaves only those peptide bonds in 
which the carbonyl group is contributed by Met. The 
fragments resulting from this second procedure are 
then separated and sequenced as before. 

The amino acid sequences of each fragment obtained 
by the two cleavage procedures are examined, with the 
objective of finding peptides from the second procedure 
whose sequences establish continuity, because of overlaps, 
between the fragments obtained by the first cleavage pro- 
cedure (Fig. 3-27). Overlapping peptides obtained from the 
second fragmentation yield the correct order of the peptide 
fragments produced in the first. If the ammo-terminal 



amino acid has been identified before the original cleavage 
of the protein, this information can be used to establish 
which fragment is derived from the amino terminus. The 
two sets of fragments can be compared for possible errors 
in determining the amino acid sequence of each fragment. 
If the second cleavage procedure fails to establish continu- 
ity between all peptides from the first cleavage, a third or 
even a fourth cleavage method must be used to obtain a set 
of peptides that can provide the necessary overlap(s). 

Locating Disulfide Bonds If the primary structure 
includes disulfide bonds, their locations are determined 
in an additional step after sequencing is completed. A 
sample of the protein is again cleaved with a reagent 
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Conclusion 

Polypeptide has 38 
amino acid residues. Tryp- 
sin will cleave three times 
(at one R (Arg) and two 
K (Lys)) to give four frag- 
ments. Cyanogen bromide 
will cleave at two 
M (Met) to give three 
fragments. 



E (Glu) is amino- 
terminal residue. 
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CM ft With 
sequence 



GAS MALIK 
EGAAYHDFEPIDPR 
DCVHSD 
YLIACGPMTK 



(replaced at amino 
terminus because it 
begins with E (Glu). 

(T^3) placed at carboxyl 
terminus because it 
does not end with 
R (Arg) or K (Lys). 



C-i) EGAAYHDFEPIDPRGASM (C-3j overlaps with 



(C^2) TKDCVHSD 
(03) ALIKYLIACGPM 



(T^and(T-4), allowing 
them to be ordered. 



tnZu, EGAAYIiDFEPIDPRGASMAUKYLIA CGPMTKDC VHSD £S 



FIGURE 3-27 Cleaving proteins and sequencing and ordering the 

peptide fragments. First, the amino acid composition and amino- 
terminal residue of an intact sample are determined, Then any disulfide 
bonds are broken before fragmenting so that sequencing can proceed 
efficiently. In this example, there are only two Cys (C) residues and 



thus only one possibility for location of the disulfide bond. In polypep- 
tides with three or more Cys residues, the position of disulfide bonds 
can be determined as described in the text. (The one-letter symbols for 
amino acids are given in Table 3-1 .) 
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such as trypsin, this time without first breaking the 
disulfide bonds. The resulting peptides are separated by 
electrophoresis and compared with the original set of 
peptides generated by trypsin. For each disulfide bond, 
two of the original peptides will be missing and a new, 
larger peptide will appear. The two missing peptides 
represent the regions of the intact polypeptide that are 
linked by the disulfide bond. 

Amino Acid Sequences Can Also Be Deduced by 
Other Methods 

The approach outlined above is not the only way to 
determine amino acid sequences. New methods based on 
mass spectrometry permit the sequencing of short 
polypeptides (20 to 30 amino acid residues) in just a few 
minutes (Box 3-2). In addition, with the development of 
rapid DNA sequencing methods (Chapter 8), the elucida- 
tion of the genetic code (Chapter 27), and the develop- 
ment of techniques for isolating genes (Chapter 9), 
researchers can deduce the sequence of a polypeptide by 
detennining the sequence of nucleotides in the gene that 
codes for it (Fig. 3-28). The techniques used to deter- 
mine protein and DNA sequences are complementary. 



Amino acid 

sequence (protein) Gln-Tyr-Pro-Thr-Ile-Trp 
DNA sequence (gene) C AGTATC CTAC G ATTTGG 

FIGURE 3-28 Correspondence of DNA and amino acid sequences. 

Each amino acid is encoded by a specific sequence of three nucleotides 
in DNA. The genetic code is described in detail in Chapter 27. 

When the gene is available, sequencing the DNA can be 
faster and more accurate than sequencing the protein. 
Most proteins are now sequenced in this indirect way. If 
the gene has not been isolated, direct sequencing of pep- 
tides is necessary, and this can provide information (the 
location of disulfide bonds, for example) not available in a 
DNA sequence. In addition, a knowledge of the amino acid 
sequence of even a part of a polypeptide can greatly facil- 
itate the isolation of the corresponding gene (Chapter 9). 

The array of methods now available to analyze both 
proteins and nucleic acids is ushering in a new discipline 
of "whole cell biochemistry." The complete sequence of 
an organism's DNA, its genome, is now available for or- 
ganisms ranging from viruses to bacteria to multicellular 
eukaryotes (see Table 1-2). New genes are being discov- 
ered by the thousands, including many that encode 

{continued on page 100) 



BOX 3-2 



METHODS 



Investigating Proteins with Mass Spectn 



The mass spectrometer has long been an indispensable 
tool in chemistry. Molecules to be analyzed, referred to as 
analytes, are first ionized in a vacuum. When the newly 
charged molecules are introduced into an electric and/or 
magnetic field, their paths through the field are a function 
of their mass-to-charge ratio, mlz. This measured prop- 
erty of the ionized species can be used to deduce the 
mass (M) of the analyte with very high precision. 

Although mass spectrometry has been in use for 
many years, it could not be applied to macromolecules 
such as proteins and nucleic acids. The mlz measure- 
ments are made on molecules in the gas phase, and the 
heating or other treatment needed to transfer a macro- 
molecule to the gas phase usually caused its rapid de- 
composition. In 1988, two different techniques were 
developed to overcome this problem. In one, proteins 
are placed in a light-absorbing matrix. With a short pulse 
of laser light, the proteins are ionized and then desorbed 
from the matrix into the vacuum system. This process, 
known as matrix-assisted laser desorption/ioniza- 
tion mass spectrometry, or MALDI MS, has been 
successfully used to measure the mass of a wide range of 
macromolecules. In a second and equally successful 
method, macromolecules in solution are forced directly 
from the liquid to gas phase. A solution of analytes is 
passed through a charged needle that is kept at a high 
electrical potential, dispersing the solution into a fine 
mist of charged microdroplets. The solvent surrounding 
the macromolecules rapidly evaporates, and the result- 



ing multiply charged macromolecular ions are thus 
introduced nondestructively into the gas phase. This 
technique is called electrospray ionization mass 
spectrometry, or ESI MS. Protons added during pas- 
sage through the needle give additional charge to the 
macromolecule. The mlz of the molecule can be ana- 
lyzed in the vacuum chamber. 

Mass spectrometry provides a wealth of information 
for proteomics research, enzymology, and protein chem- 
istry in general. The techniques require only miniscule 
amounts of sample, so they can be readily applied to the 
small amounts of protein that can be extracted from a 
two-dimensional electrophoretic gel. The accurately 
measured molecular mass of a protein is one of the crit- 
ical parameters in its identification. Once the mass of a 
protein is accurately known, mass spectrometry is a 
convenient and accurate method for detecting changes 
in mass due to the presence of bound cof actors, bound 
metal ions, covalent modifications, and so on. 

The process for determining the molecular mass of 
a protein with ESI MS is illustrated in Figure 1. As it is 
injected into the gas phase, a protein acquires a variable 
number of protons, and thus positive charges, from the 
solvent. This creates a spectrum of species with differ- 
ent mass-to-charge ratios. Each successive peak corre- 
sponds to a species that differs from that of its 
neighboring peak by a charge difference of 1 and a mass 
difference of 1 (1 proton). The mass of the protein can 
be determined from any two neighboring peaks. 
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The measured m/z of one peak is 



(m/z) 2 



M + n 2 X 
n 2 



where M is the mass of the protein, n 2 is the number of 
charges, and X is the mass of the added groups (protons 
in this case). Similarly for the neighboring peak, 



(777/2)1 



M + (n 2 + 1)X 
n 2 + 1 



We now have two unknowns (M and rc 2 ) and two equa- 
tions. We can solve first for n 2 and then for M: 



n 2 



(m/z) 2 ~ X 
{mlz) 2 — (m/z)i 



M = n 2 [(m/z) 2 - X] 

This calculation using the m/z values for any two peaks 
in a spectrum such as that shown in Figure lb usually 
provides the mass of the protein (in this case, aerolysin 
k; 47,342 Da) with an error of only ±0.01%. Generating 
several sets of peaks, repeating the calculation, and av- 
eraging the results generally provides an even more ac- 
curate value for M, Computer algorithms can transform 
the m/z spectrum into a single peak that also provides a 
very accurate mass measurement (Fig. lb, inset). 

Mass spectrometry can also be used to sequence 
short stretches of polypeptide, an application that has 
emerged as an invaluable tool for quickly identifying un- 
known proteins. Sequence information is extracted using 
a technique called tandem MS, or MS/MS. A solution 
containing the protein under investigation is first treated 
with a protease or chemical reagent to hydrolyze it to a 
mixture of shorter peptides. The mixture is then injected 
into a device that is essentially two mass spectrometers 
in tandem (Fig. 2a, top). In the first, the peptide mixture 
is sorted and the ionized fragments are manipulated so 
that only one of the several types of peptides produced 
by cleavage emerges at the other end. The sample of the 
selected peptide, each molecule of which has a charge 
somewhere along its length, then travels through a vac- 
uum chamber between the two mass spectrometers. In 
this collision cell, the peptide is further fragmented by 
high-energy impact with a "collision gas," a small amount 
of a noble gas such as helium or argon that is bled into 
the vacuum chamber. This procedure is designed to frag- 
ment many of the peptide molecules in the sample, with 
each individual peptide broken in only one place, on av- 
erage. Most breaks occur at peptide bonds. This frag- 
mentation does not involve the addition of water (it is 
done in a near- vacuum), so the products may include 
molecular ion radicals such as carbonyl radicals (Fig. 2a, 
bottom). The charge on the original peptide is retained 
on one of the fragments generated from it. 
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FIGURE 1 Electrospray mass spectrometry of a protein, (a) A protein 
solution is dispersed into highly charged droplets by passage through a 
needle under the influence of a high-voltage electric field. The droplets 
evaporate, and the ions (with added protons in this case) enter the 
mass spectrometer for m/z measurement. The spectrum generated (b) 
is a family of peaks, with each successive peak (from right to left) cor- 
responding to a charged species increased by 1 in both mass and 
charge. Inset: a computer-generated transformation of this spectrum. 



The second mass spectrometer then measures the 
m/z ratios of all the charged fragments (uncharged 
fragments are not detected). This generates one or 
more sets of peaks. A given set of peaks (Fig. 2b) con- 
sists of all the charged fragments that were generated 
by breaking the same type of bond (but at different 
points in the peptide) and are derived from the same 
side of the bond breakage, either the carboxyl- or 
amino-terminal side. Each successive peak in a given set 
has one less amino acid than the peak before. The dif- 
ference in mass from peak to peak identifies the amino 
acid that was lost in each case, thus revealing the se- 
quence of the peptide. The only ambiguities involve 
leucine and isoleucine, which have the same mass. 

{continued on next page) 
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METHODS 



with Mass Spectrometry [continual) 
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The charge on the peptide can be retained on either 
the carboxyl- or amino4erminal fragment, and bonds other 
than the peptide bond can be broken in the fragmentation 
process, with the result that multiple sets of peaks are usu- 
ally generated. The two most prominent sets generally 
consist of charged fragments derived from breakage of the 
peptide bonds. The set consisting of the carboxyl-tenrrinal 



FIGURE 2 Obtaining protein sequence information with tandem MS. 

(a) After proteolytic hydrolysis, a protein solution is injected into a mass 
spectrometer (MS-1) The different peptides are sorted so that only one 
type is selected for further analysis. The selected peptide is further frag- 
mented in a chamber between the two mass spectrometers, and m/z for 
each fragment is measured in the second mass spectrometer (MS-2). 
Many of the ions generated during this second fragmentation result 
from breakage of the peptide bond, as shown. These are called b-type 
or y-type ions, depending on whether the charge is retained on the 
amino- or carboxyl-terminal side, respectively, (b) A typical spectrum 
with peaks representing the peptide fragments generated from a sample 
of one small peptide (10 residues). The labeled peaks are y-type ions. 
The large peak next to y 5 " is a doubly charged ion and is not part of the 
y set. The successive peaks differ by the mass of a particular amino acid 
in the original peptide. In this case, the deduced sequence was 
Phe-Pro-Gly-Gln-{lle/Leu)-Asn-Ala-Asp-(lle/Leu)-Arg. Note the am- 
biguity about He and Leu residues, because they have the same molec- 
ular mass. In this example, the set of peaks derived from y-type ions 
predominates, and the spectrum is greatly simplified as a result. This is 
because an Arg residue occurs at the carboxyl terminus of the peptide, 
and most of the positive charges are retained on this residue. 

fragments can be unambiguously distinguished from that 
consisting of the airmo-tenrunal fragments. Because the 
bond breaks generated between the spectrometers (in the 
collision cell) do not yield full carboxyl and amino groups 
at the sites of the breaks, the only intact a-amino and 
a-carboxyl groups on the peptide fragments are those at 
the very ends (Fig. 2a). The two sets of fragments can 
thereby be identified by the resulting slight differences in 
mass. The amino acid sequence derived from one set can 
be confirmed by the other, improving the confidence in the 
sequence information obtained. 

Even a short sequence is often enough to permit un- 
ambiguous association of a protein with its gene, if the 
gene sequence is known. Sequencing by mass spectrom- 
etry cannot replace the Edman degradation procedure 
for the sequencing of long polypeptides, but it is ideal 
for proteomics research aimed at cataloging the hun- 
dreds of cellular proteins that might be separated on a 
two-dimensional gel. 
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proteins with no known function. To describe the entire 
protein complement encoded by an organism's DNA, 
researchers have coined the term proteome. As de- 
scribed in Chapter 9, the new disciplines of genomics and 
proteomics are complementing work carried out on cellu- 
lar intermediary metabolism and nucleic acid metabolism 
to provide a new and increasingly complete picture of 
biochemistry at the level of cells and even organisms. 

Small Peptides and Proteins Can Be Chemically Synthesized 

Many peptides are potentially useful as pharmacologic 
agents, and their production is of considerable commercial 



importance. There are three ways to obtain a peptide: (1) 
purification from tissue, a task often made difficult by the 
vanishingly low concentrations of some peptides; (2) ge- 
netic engineering (Chapter 9); or (3) direct chemical syn- 
thesis. Powerful techniques now make direct chemical 
synthesis an attractive option in many cases. In addition to 
commercial applications, the synthesis of specific peptide 
portions of larger proteins is an increasingly important tool 
for the study of protein structure and function. 

The complexity of proteins makes the traditional 
synthetic approaches of organic chemistry impractical 
for peptides with more than four or five amino acid 
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residues. One problem is the difficulty of purifying the 
product after each step. RoshanKetab 021-66950639 

The major breakthrough in this technology was pro- 
vided by R. Bruce Merrifield in 1962. His innovation in- 
volved synthesizing a peptide while keeping it attached 
at one end to a solid support. The support is an insoluble 
polymer (resin) contained within a column, similar to 



that used for chromatographic procedures. The peptide 
is built up on this support one amino acid at a time, 
through a standard set of reactions in a repeating cycle 
(Fig. 3-29). At each successive step in the cycle, pro- 
tective chemical groups block unwanted reactions. 

The technology for chemical peptide synthesis is now 
automated. As in the sequencing reactions considered 
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FIGURE 3-29 Chemical synthesis of a peptide on an insoluble polymer support. 

Reactions ©through ©are necessary for the formation of each peptide bond. 
The 9-fluorenyImethoxycarbonyl (Fmoc) group (shaded blue) prevents un- 
wanted reactions at the a-amino group of the residue (shaded red). Chemical 
synthesis proceeds from the carboxyl terminus to the amino terminus, the re- 
verse of the direction of protein synthesis in vivo (Chapter 27). 
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TABLE 3-8 



ffect of Stepwise Yield on Overall 
Yield in Peptide Synthesis 



■ 



Number of residues in 
the final polypeptide 



Overall yield of final 
peptide (%) when the 
yield of each step is: 



96.0% 



99.8% 



11 


66 


98 


21 


44 


96 


31 


29 


94 


51 


13 


90 


100 


1.8 


82 



above, the most important limitation of the process is the 
efficiency of each chemical cycle, as can be seen by cal- 
culating the overall yields of peptides of various lengths 
when the yield for addition of each new amino acid is 
96.0% versus 99.8% (Table 3-8). Incomplete reaction at 
one stage can lead to formation of an impurity (in the 
form of a shorter peptide) in the next. The chemistry has 
been optimized to permit the synthesis of proteins of 100 
amino acid residues in a few days in reasonable yield. A 
very similar approach is used to synthesize nucleic acids 
(see Fig. 8-35). It is worth noting that this technology, 
impressive as it is, still pales when compared with biolog- 
ical processes. The same 100-residue protein would be 
synthesized with exquisite fidelity in about 5 seconds in a 
bacterial cell. 

A variety of new methods for the efficient ligation 
Qoining together) of peptides has made possible the as- 
sembly of synthetic peptides into larger polypeptides and 
proteins. With these methods, novel forms of proteins can 
be created with precisely positioned chemical groups, in- 
cluding those that might not normally be found in a cellu- 
lar protein. These novel forms provide new ways to test 
theories of enzyme catalysis, to create proteins with new 
chemical properties, and to design protein sequences 
that will fold into particular structures. This last applica- 
tion provides the ultimate test of our increasing ability 
to relate the primary structure of a peptide to the three- 
dimensional structure that it takes up in solution. 

Amino Acid Sequences Provide Important 
Biochemical Information 

Knowledge of the sequence of amino acids in a protein 
can offer insights into its three-dimensional structure 
and its function, cellular location, and evolution. Most of 
these insights are derived by searching for similarities 
between a protein of interest and previously studied 
proteins. Thousands of sequences are known and avail- 
able in databases accessible through the Internet. A 
comparison of a newly obtained sequence with this large 
bank of stored sequences often reveals relationships 
both surprising and enlightening. 



Exactly how the amino acid sequence determines 
three-dimensional structure is not understood in detail, 
nor can we always predict function from sequence. How- 
ever, protein families that have some shared structural or 
functional features can be readily identified on the basis of 
amino acid sequence similarities. Individual proteins are 
assigned to families based on the degree of similarity in 
amino acid sequence. Members of a family are usually 
identical across 25% or more of their sequences, and pro- 
teins in these families generally share at least some struc- 
tural and functional characteristics. Some families are 
denned, however, by identities involving only a few amino 
acid residues that are critical to a certain function. A num- 
ber of similar substructures, or "domains" (to be defined 
more fully in Chapter 4) , occur in many functionally unre- 
lated proteins. These domains often fold into structural 
configurations that have an unusual degree of stability or 
that are specialized for a certain environment. Evolution- 
ary relationships can also be inferred from the structural 
and functional similarities within protein families. 

Certain amino acid sequences serve as signals that de- 
termine the cellular location, chemical modification, and 
half-life of a protein. Special signal sequences, usually at 
the amino terminus, are used to target certain proteins for 
export from the cell; other proteins are targeted for distri- 
bution to the nucleus, the cell surface, the cytosol, or 
other cellular locations. Other sequences act as attach- 
ment sites for prosthetic groups, such as sugar groups in 
glycoproteins and lipids in lipoproteins. Some of these sig- 
nals are well characterized and are easily recognized in the 
sequence of a newly characterized protein (Chapter 27). 

KEY CONVENTION: Much of the functional information 
encapsulated in protein sequences comes in the form of 
consensus sequences. This term is applied to such 
sequences in DNA, RNA, or protein. When a series of 
related nucleic acid or protein sequences are compared, 
a consensus sequence is the one that reflects the most 
common base or amino acid at each position. Parts of 
the sequence that have particularly good agreement often 
represent evolutionarily conserved functional domains. A 
range of mathematical tools available on the Internet can 
be used to generate consensus sequences, or identify 
them in sequence databases. Box 3-3 illustrates common 
conventions for displaying consensus sequences. ■ 

Protein Sequences Can Elucidate the History 
of Life on Earth 

The simple string of letters denoting the amino acid se- 
quence of a protein belies the wealth of information this 
sequence holds. As more protein sequences have be- 
come available, the development of more powerful 
methods for extracting information from them has be- 
come a major biochemical enterprise. Analysis of the 
information available in the many, ever-expanding bio- 
logical databases, including gene and protein sequences 
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BOX 3-3 Consensus Sequences and Sequence Logos 



Consensus sequences can be represented in several 
ways. To illustrate two types of conventions, we use two 
examples of consensus sequences, shown in Figure 1: 
(a) an ATP-binding structure called a P loop (see Box 
12-2) and (b) a Ca 2+ -binding structure called an EF 
hand (see Fig. 12-11). The rules described here are 
adapted from those used by the sequence comparison 
website PROSITE (expasy.org/prosite); they use the 
standard one-letter codes for the amino acids. 



[AG]-x(4)-G-K-[ST]. 



GK; 



1 2345678 
(a) N C 

D-{W}-[DNS]-{ILVFYW}-[DENSTG]-[DNQGHRK]-{GP}- 
[LIVMC] - [DENQSTAGC] -x(2)- [DE] - [LIVMFYW] . 




8 9 10 11 12 13 



(b) N 



FIGURE 1 Representations of two consensus sequences, (a) P loop, an 
ATP-binding structure; (b) EF hand, a Ca 2+ -binding structure. 

In one type of consensus sequence designation 
(shown at the top of (a) and (b)), each position is sepa- 
rated from its neighbor by a hyphen. A position where 
any amino acid is allowed is designated x. Ambiguities 
are indicated by listing the acceptable amino acids for a 
given position between square brackets. For example, in 
(a) [AG] means Ala or Gly. If all but a few amino acids 
are allowed at one position, the amino acids that are not 
allowed are listed between curly brackets. For example, 
in (b) {W} means any amino acid except Trp. Repetition 



of an element of the pattern is indicated by following 
that element with a number or range of numbers be- 
tween parentheses. In (a), for example, x(4) means x-x- 
x-x, and x(2,4) means x-x, or x-x-x, or x-x-x-x. When a 
pattern is restricted to either the amino or carboxyl ter- 
minus of a sequence, that pattern starts with < or ends 
with >, respectively (not so for either example here). A 
period ends the pattern. Applying these rules to the 
consensus sequence in (a), either A or G can be found at 
the first position. Any amino acid can occupy the next 
four positions, followed by an invariant G and an invari- 
ant K. The last position is either S or T. 

Sequence logos provide a more informative and 
graphic representation of an amino acid (or nucleic acid) 
multiple sequence alignment. Each logo consists of a 
stack of symbols for each position in the sequence. The 
overall height of the stack (in bits) indicates the degree 
of sequence conservation at that position, while the 
height of each symbol in the stack indicates the relative 
frequency of that amino acid (or nucleotide). For amino 
acid sequences, the colors denote the characteristics of 
the amino acid: polar (G, S, T, Y, C, Q, N) green; basic (K, 
R, H) blue; acidic (D, E) red; and hydrophobic (A, V, L, I, 
P, W, F, M) black. The classification of amino acids in this 
scheme is somewhat different from that in Table 3-1 and 
Figure 3-5. The amino acids with aromatic side chains 
are subsumed into the nonpolar (F, W) and polar (Y) 
classifications. Glycine, always hard to group, is assigned 
to the polar group. Note that when multiple amino acids 
are acceptable at a particular position, they rarely occur 
with equal probability. One or a few usually predominate. 
The logo representation makes the predominance clear, 
and a conserved sequence in a protein is made obvious. 
However, the logo obscures some amino acid residues 
that may be allowed at a position, such as the Cys that 
occasionally occurs at position 8 of the EF hand in (b). 



and macromolecular structures, has given rise to the 
new field of bioinformatics. One outcome of this disci- 
pline is a growing suite of computer programs, many 
readily available on the Internet, that can be used by any 
scientist, student, or knowledgeable layperson. Each 
protein's function relies on its three-dimensional struc- 
ture, which in turn is determined largely by its primary 
structure. Thus, the biochemical information conveyed 
by a protein sequence is limited only by our own under- 
standing of structural and functional principles. The 
constantly evolving tools of bioinformatics make it 
possible to identify functional segments in new proteins 
and help establish both their sequence and their struc- 
tural relationships to proteins already in the databases. 
On a different level of inquiry, protein sequences are 
beginning to tell us how the proteins evolved and, ulti- 
mately, how life evolved on this planet. 



The field of molecular evolution is often traced to 
Emile Zuckerkandl and Linus Pauling, whose work in 
the mid-1960s advanced the use of nucleotide and pro- 
tein sequences to explore evolution. The premise is de- 
ceptively straightforward. If two organisms are closely 
related, the sequences of their genes and proteins 
should be similar. The sequences increasingly diverge as 
the evolutionary distance between two organisms in- 
creases. The promise of this approach began to be real- 
ized in the 1970s, when Carl Woese used ribosomal RNA 
sequences to define the Archaea as a group of living 
organisms distinct from the Bacteria and Eukarya (see 
Fig. 1-4). Protein sequences offer an opportunity to 
greatly refine the available information. With the advent 
of genome projects investigating organisms from bacte- 
ria to humans, the number of available sequences is 
growing at an enormous rate. This information can be 
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used to trace biological history. The challenge is in 
learning to read the genetic hieroglyphics. 

Evolution has not taken a simple linear path. Com- 
plexities abound in any attempt to mine the evolution- 
ary information stored in protein sequences. For a given 
protein, the amino acid residues essential for the activ- 
ity of the protein are conserved over evolutionary time. 
The residues that are less important to function may 
vary over time — that is, one amino acid may substitute 
for another — and these variable residues can provide 
the information to trace evolution. Amino acid substitu- 
tions are not always random, however. At some posi- 
tions in the primary structure, the need to maintain 
protein function may mean that only particular amino 
acid substitutions can be tolerated. Some proteins have 
more variable amino acid residues than others. For 
these and other reasons, different proteins can evolve at 
different rates. 

Another complicating factor in tracing evolutionary 
history is the rare transfer of a gene or group of genes 
from one organism to another, a process called lateral 
gene transfer. The transferred genes may be quite 
similar to the genes they were derived from in the origi- 
nal organism, whereas most other genes in the same two 
organisms may be quite distantly related. An example of 
lateral gene transfer is the recent rapid spread of 
antibiotic-resistance genes in bacterial populations. The 
proteins derived from these transferred genes would not 
be good candidates for the study of bacterial evolution, 
because they share only a very limited evolutionary his- 
tory with their "host" organisms. 

The study of molecular evolution generally focuses 
on families of closely related proteins. In most cases, 
the families chosen for analysis have essential func- 
tions in cellular metabolism that must have been pres- 
ent in the earliest viable cells, thus greatly reducing 
the chance that they were introduced relatively re- 
cently by lateral gene transfer. For example, a protein 
called EF-la (elongation factor la) is involved in the 
synthesis of proteins in all eukaryotes. A similar pro- 
tein, EF-Tu, with the same function, is found in bacte- 
ria. Similarities in sequence and function indicate that 
EF-la and EF-Tu are members of a family of proteins 
that share a common ancestor. The members of protein 
families are called homologous proteins, or ho- 
mologs. The concept of a homolog can be further re- 
fined. If two proteins in a family (that is, two 
homologs) are present in the same species, they are re- 
ferred to as paralogs. Homologs from different species 



are called orthologs. The process of tracing evolution 
involves first identifying suitable families of homolo- 
gous proteins and then using them to reconstruct evo- 
lutionary paths. 

Homologs are identified through the use of increas- 
ingly powerful computer programs that can directly 
compare two or more chosen protein sequences, or can 
search vast databases to find the evolutionary relatives 
of one selected protein sequence. The electronic search 
process can be thought of as sliding one sequence past 
the other until a section with a good match is found. 
Within this sequence alignment, a positive score is as- 
signed for each position where the amino acid residues 
in the two sequences are identical — the value of the 
score varying from one program to the next — to provide 
a measure of the quality of the alignment. The process 
has some complications. Sometimes the proteins being 
compared match well at, say, two sequence segments, 
and these segments are connected by less related se- 
quences of different lengths. Thus the two matching 
segments cannot be aligned at the same time. To handle 
this, the computer program introduces "gaps" in one of 
the sequences to bring the matching segments into 
register (Fig. 3-30). Of course, if a sufficient number of 
gaps are introduced, almost any two sequences could be 
brought into some sort of alignment. To avoid uninfor- 
mative alignments, the programs include penalties for 
each gap introduced, thus lowering the overall align- 
ment score. With electronic trial and error, the program 
selects the alignment with the optimal score that maxi- 
mizes identical amino acid residues while minimizing 
the introduction of gaps. 

Identical amino acids are often inadequate to identify 
related proteins or, more importantly, to detenrime how 
closely related the proteins are on an evolutionary time 
scale. A more useful analysis includes a consideration of 
the chemical properties of substituted amino acids. When 
amino acid substitutions are found within a protein fam- 
ily, many of the differences may be conservative — that 
is, an amino acid residue is replaced by a residue having 
similar chemical properties. For example, a Glu residue 
may substitute in one family member for the Asp residue 
found in another; both amino acids are negatively 
charged. Such a conservative substitution should logi- 
cally garner a higher score in a sequence alignment than 
does a nonconservative substitution, such as the replace- 
ment of the Asp residue with a hydrophobic Phe residue. 

For most efforts to find homologies and explore evo- 
lutionary relationships, protein sequences (derived either 



E. coli TGNRTIAVYDLGGGTFDISI IEID 
B. subtilis DEDQTILLYDLGGGTFDVSILELG 



FIGURE 3-30 Aligning protein sequences with the use of gaps. Shown 
here is the sequence alignment of a short section of the Hsp70 proteins 
(a widespread class of protein-folding chaperones) from two well-studied 



VDGEKTFEVLATNGDTHLGGEDFDSRLIHYL 
G TFEVRSTAGDNRLGGDDFDQVI I DHL- 



bacterial species, f. coli and Bacillus subtilis. Introduction of a gap in the 
B. subtilis sequence allows a better alignment of amino acid residues on 
either side of the gap. Identical amino acid residues are shaded. 
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FIGURE 3-31 A signature sequence in the EF-1a/EF-Tu protein family. 

The signature sequence (boxed) is a 12-residue insertion near the 
amino terminus of the sequence. Residues that align in all species are 
shaded yellow. Both archaea and eukaryotes have the signature, 
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LLKDRGFIDEKTVKEA 
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IGHVDHGKSTMVGR 
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IGHVDSGKSTTTG H [L I Y K r - G ■ !*■ r \ 

IGHVDHGKSTMVGR 
IGHVDHGKTTLT AA 



I EQH 



I EKF 
I EKF 
ITTV 
ITTV 



although the sequences of the insertions are quite distinct for the two 
groups. The variation in the signature sequence reflects the significant 
evolutionary divergence that has occurred at this site since it first ap- 
peared in a common ancestor of both groups. 



directly from protein sequencing or from the sequencing 
of the DNA encoding the protein) are superior to non- 
genic nucleic acid sequences (those that do not encode a 
protein or functional RNA). For a nucleic acid, with its 
four different types of residues, random alignment of non- 
homologous sequences will generally yield matches for at 
least 25% of the positions. Introduction of a few gaps can 
often increase the fraction of matched residues to 40% or 
more, and the probability of chance alignment of unre- 
lated sequences becomes quite high. The 20 different 
amino acid residues in proteins greatly lower the probabil- 
ity of uninformative chance alignments of this type. 

The programs used to generate a sequence align- 
ment are complemented by methods that test the relia- 
bility of the alignments. A common computerized test is 
to shuffle the amino acid sequence of one of the pro- 
teins being compared to produce a random sequence, 
then to instruct the program to align the shuffled se- 
quence with the other, unshuffled one. Scores are as- 
signed to the new alignment, and the shuffling and 
alignment process is repeated many times. The original 
alignment, before shuffling, should have a score signifi- 
cantly higher than any of those within the distribution 
of scores generated by the random alignments; this in- 
creases the confidence that the sequence alignment has 
identified a pair of homologs. Note that the absence of 
a significant alignment score does not necessarily mean 
that no evolutionary relationship exists between two 
proteins. As we shall see in Chapter 4, three-dimensional 
structural similarities sometimes reveal evolutionary 
relationships where sequence homology has been 
wiped away by time. 

Use of a protein family to explore evolution requires 
the identification of family members with similar molec- 
ular functions in the widest possible range of organisms. 
Information from the family can then be used to trace 
the evolution of those organisms. By analyzing the se- 
quence divergence in selected protein families, investi- 
gators can segregate organisms into classes based on 
their evolutionary relationships. This information must 
be reconciled with more classical examinations of the 
physiology and biochemistry of the organisms. 

Certain segments of a protein sequence may be 
found in the organisms of one taxonomic group but not 
in other groups; these segments can be used as signa- 



ture sequences for the group in which they are found. 
An example of a signature sequence is an insertion of 
12 amino acids near the amino terminus of the EF- 
la/EF-Tu proteins in all archaea and eukaryotes but not 
in bacteria (Fig. 3-31). This particular signature is one 
of many biochemical clues that can help establish the 
evolutionary relatedness of eukaryotes and archaea. 
Other signature sequences allow the establishment of 
evolutionary relationships among groups of organisms 
at many different taxonomic levels. 

By considering the entire sequence of a protein, 
researchers can now construct more elaborate evolu- 
tionary trees with many species in each taxonomic 
group. Figure 3-32 presents one such tree for bacte- 
ria, based on sequence divergence in the protein 
GroEL (a protein present in all bacteria that assists in 
the proper folding of proteins). The tree can be re- 
fined by basing it on the sequences of multiple pro- 
teins and by supplementing the sequence information 
with data on the unique biochemical and physiologi- 
cal properties of each species. There are many meth- 
ods for generating trees, each method with its own 
advantages and shortcomings, and many ways to rep- 
resent the resulting evolutionary relationships. In 
Figure 3-32, the free end points of lines are called 
"external nodes"; each represents an extant species, 
and each is so labeled. The points where two lines 
come together, the "internal nodes," represent ex- 
tinct ancestor species. In most representations (in- 
cluding Fig. 3-32), the lengths of the lines connecting 
the nodes are proportional to the number of amino 
acid substitutions separating one species from an- 
other. If we trace two extant species to a common in- 
ternal node (representing the common ancestor of 
the two species), the length of the branch connecting 
each external node to the internal node represents 
the number of amino acid substitutions separating 
one extant species from this ancestor. The sum of the 
lengths of all the line segments that connect an extant 
species to another extant species through a common 
ancestor reflects the number of substitutions separat- 
ing the two extant species. To determine how much 
time was needed for the various species to diverge, 
the tree must be calibrated by comparing it with in- 
formation from the fossil record and other sources. 
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FIGURE 3-32 Evolutionary tree derived from amino acid sequence 
comparisons. A bacterial evolutionary tree, based on the sequence di- 



vergence observed in the GroEL family of proteins. Also included in this 
tree (lower right) are the chloroplasts (chl.) of some nonbacterial species. 



As more sequence information is made available in 
databases, we can generate evolutionary trees based 
on multiple proteins. And we can refine these trees as 
additional genomic information emerges from increas- 
ingly sophisticated methods of analysis. All of this work 
moves us toward the goal of creating a detailed tree of 
life that describes the evolution and relationship of every 



organism on Earth. The story is a work in progress, of 
course (Fig. 3-33). The questions being asked and an- 
swered are fundamental to how humans view themselves 
and the world around them. The field of molecular evolu- 
tion promises to be among the most vibrant of the scien- 
tific frontiers in the twenty-first century. 
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FIGURE 3-33 A consensus tree of life. The tree shown here is based on 
analyses of many different protein sequences and additional genomic 
features. Branches shown as dashed lines remain under investigation. 



The tree presents only a fraction of the available information, as well as 
only a fraction of the issues remaining to be resolved. Each extant 
group shown is a complex evolutionary story unto itself. 
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SUMMARY 3.4 



The Structure of Proteins: 
Primary Structure 



Differences in protein function result from 
differences in amino acid composition and sequence. 
Some variations in sequence are possible for a 
particular protein, with little or no effect on function. 

Amino acid sequences are deduced by fragmenting 
polypeptides into smaller peptides with reagents 
known to cleave specific peptide bonds; detemrining 
the amino acid sequence of each fragment by the 
automated Edman degradation procedure; then 
ordering the peptide fragments by finding sequence 
overlaps between fragments generated by different 
reagents. A protein sequence can also be deduced 
from the nucleotide sequence of its corresponding 
gene in DNA. 

Short proteins and peptides (up to about 100 
residues) can be chemically synthesized. The 
peptide is built up, one amino acid residue at a time, 
while tethered to a solid support. 

Protein sequences are a rich source of information 
about protein structure and function, as well as the 
evolution of life on Earth. Sophisticated methods 
are being developed to trace evolution by analyzing 
the resultant slow changes in amino acid sequences 
of homologous proteins. 
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Problems 



1. Absolute Configuration of Citrulline The citrulline 
isolated from watermelons has the structure shown below. Is it 
a d- or L-amino acid? Explain. 

CH 2 (CH 2 ) 2 NH-C-NH 2 
NH 3 O 

COO" 

2. Relationship between the Titration Curve and the 
Acid-Base Properties of Glycine A 100 mL solution of 
0.1 m glycine at pH 1.72 was titrated with 2 m NaOH solution. 
The pH was monitored and the results were plotted as shown 
in the following graph. The key points in the titration are des- 
ignated I to V. For each of the statements (a) to (o), identify 
the appropriate key point in the titration and justify your 
choice. 

(a) Glycine is present predominantly as the species 
+ H 3 N— CH 2 — COOH 

(b) The average net charge of glycine is +\. 

(c) Half of the amino groups are ionized. 

(d) The pH is equal to the pK a of the carboxyl group. 

(e) The pH is equal to the pif a of the protonated amino 
group. 

(f) Glycine has its maximum buffering capacity. 

(g) The average net charge of glycine is zero. 

(h) The carboxyl group has been completely titrated (first 
equivalence point). 



(i) Glycine is completely titrated (second equivalence 
point). 

Q) The predominant species is + H 3 N — CH 2 — COO~ . 
(k) The average net charge of glycine is - 1 . 
(1) Glycine is present predominantly as a 50:50 mixture of 
+ H 3 N— CH 2 — COOH and + H 3 N — CH 2 — COO ~ . 
(m) This is the isoelectric point, 
(n) This is the end of the titration. 

(o) These are the worst pH regions for buffering power. 



pH 




0.5 



1.0 1.5 2.0 
OH - (equivalents) 



3. How Much Alanine Is Present as the Completely 
Uncharged Species? At a pH equal to the isoelectric point of 
alanine, the net charge on alanine is zero. Two structures can 
be drawn that have a net charge of zero, but the predominant 
form of alanine at its pi is zwitterionic. 



CH 3 o 

+ I # 
H 3 N— C— C^ 

H 



CH 3 
HoN— C— C" 



O 



O" 



I 



OH 



Zwitterionic 



Uncharged 



(a) Why is alanine predominantly zwitterionic rather than 
completely uncharged at its pi? 

(b) What fraction of alanine is in the completely un- 
charged form at its pi? Justify your assumptions. 

4. Ionization State of Histidine Each ionizable group of an 
amino acid can exist in one of two states, charged or neutral. 
The electric charge on the functional group is determined by 
the relationship between its pK a and the pH of the solution. 
This relationship is described by the Henderson-Hasselbalch 
equation. 

(a) Histidine has three ionizable functional groups. Write 
the equilibrium equations for its three ionizations and assign 
the proper pK a for each ionization. Draw the structure of histi- 
dine in each ionization state. What is the net charge on the his- 
tidine molecule in each ionization state? 

(b) Draw the structures of the predominant ionization 
state of histidine at pH 1, 4, 8, and 12. Note that the ionization 
state can be approximated by treating each ionizable group 
independently. 



Problems [l09] 



(c) What is the net charge of histidine at pH 1, 4, 8, and 
12? For each pH, will histidine migrate toward the anode (+) 
or cathode (-) when placed in an electric field? 

5. Separation of Amino Acids by Ion-Exchange Chro- 
matography Mixtures of amino acids can be analyzed by first 
separating the mixture into its components through ion- 
exchange chromatography Amino acids placed on a cation- 
exchange resin (see Fig. 3-1 7a) containing sulfonate 
( — SO3 ) groups flow down the column at different rates be- 
cause of two factors that influence their movement: (1) ionic 
attraction between the sulfonate residues on the column and 
positively charged functional groups on the amino acids, and 
(2) hydrophobic interactions between amino acid side chains 
and the strongly hydrophobic backbone of the polystyrene resin. 
For each pair of amino acids listed, determine which will be 
eluted first from the cation-exchange column by a pH 7.0 buffer. 

(a) Asp and Lys 

(b) Arg and Met 

(c) Glu and Val 

(d) Gly and Leu 

(e) Ser and Ala 

6. Naming the Stereoisomers of Isoleucine The struc- 
ture of the amino acid isoleucine is 

COO" 

+ I 

H 3 N— C— H 
I 

H-C-CH3 

I 

CH 2 

I 

CH 3 

(a) How many chiral centers does it have? 

(b) How many optical isomers? 

(c) Draw perspective formulas for all the optical isomers 
of isoleucine. 

7. Comparing the pK A Values of Alanine and Polyala- 
nine The titration curve of alanine shows the ionization of 
two functional groups with pif a values of 2.34 and 9.69, corre- 
sponding to the ionization of the carboxyl and the protonated 
amino groups, respectively. The titration of di-, tri-, and larger 
oligopeptides of alanine also shows the ionization of only two 
functional groups, although the experimental pK a values are 
different. The trend in pif a values is summarized in the table. 



Amino acid or peptide 




P#2 


Ala 


2.34 


9.69 


Ala-Ala 


3.12 


8.30 


Ala-Ala-Ala 


3.39 


8.03 


Ala-(Ala) n -Ala, n > 4 


3,42 


7.94 



(a) Draw the structure of Ala-Ala-Ala. Identify the func- 
tional groups associated with pK x and piT 2 . 

(b) Why does the value of pK 1 increase with each addi- 
tional Ala residue in the oligopeptide? 

(c) Why does the value of pK 2 decrease with each addi- 
tional Ala residue in the oligopeptide? 



8. The Size of Proteins What is the approximate molecular 
weight of a protein with 682 amino acid residues in a single 
polypeptide chain? 

9. The Number of Tryptophan Residues in Bovine 
Serum Albumin A quantitative amino acid analysis reveals 
that bovine serum albumin (BSA) contains 0.58% tryptophan 
(M T 204) by weight. 

(a) Calculate the minimum molecular weight of BSA (i.e., 
assume there is only one Trp residue per protein molecule). 

(b) Gel filtration of BSA gives a molecular weight estimate 
of 70,000. How many Trp residues are present in a molecule of 
serum albumin? 

10. Subunit Composition of a Protein A protein has a mo- 
lecular mass of 400 kDa when measured by gel nitration. When 
subjected to gel electrophoresis in the presence of sodium dode- 
cyl sulfate (SDS), the protein gives three bands with molecular 
masses of 180, 160, and 60 kDa. When electrophoresis is carried 
out in the presence of SDS and dithiothreitol, three bands are 
again formed, this time with molecular masses of 160, 90, and 60 
kDa. Determine the subunit composition of the protein. 

11. Net Electric Charge of Peptides A peptide has the 
sequence 

Glu-His-Trp-Ser-Gly-Leu-Arg-Pro-Gly 

(a) What is the net charge of the molecule at pH 3, 8, and 
11? (Use pK a values for side chains and terminal amino and 
carboxyl groups as given in Table 3-1.) 

(b) Estimate the pi for this peptide. 

12. Isoelectric Point of Pepsin Pepsin is the name given to 
a mix of several digestive enzymes secreted (as larger precur- 
sor proteins) by glands that line the stomach. These glands 
also secrete hydrochloric acid, which dissolves the particulate 
matter in food, allowing pepsin to enzymatically cleave indi- 
vidual protein molecules. The resulting mixture of food, HC1, 
and digestive enzymes is known as chyme and has a pH near 1 .5. 
What pi would you predict for the pepsin proteins? What func- 
tional groups must be present to confer this pi on pepsin? Which 
amino acids in the proteins would contribute such groups? 

13. The Isoelectric Point of Histones Histones are pro- 
teins found in eukaryotic cell nuclei, tightly bound to DNA, 
which has many phosphate groups. The pi of histones is very 
high, about 10.8. What amino acid residues must be present in 
relatively large numbers in histones? In what way do these 
residues contribute to the strong binding of histones to DNA? 

14. Solubility of Polypeptides One method for separating 
polypeptides makes use of their different solubilities. The sol- 
ubility of large polypeptides in water depends on the relative 
polarity of their R groups, particularly on the number of ion- 
ized groups: the more ionized groups there are, the more solu- 
ble the polypeptide. Which of each pair of the polypeptides 
that follow is more soluble at the indicated pH? 

(a) (Gly) 20 or (Glu) 20 atpH 7.0 

(b) (Lys-Ala) 3 or (Phe-Met) 3 at pH 7.0 

(c) (Ala-Ser-Gly) 5 or (Asn-Ser-His) 5 at pH 6.0 

(d) (Ala-Asp-Gly) 5 or (Asn-Ser-His) 5 at pH 3.0 
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15. Purification of an Enzyme A biochemist discovers and 
purifies a new enzyme, generating the purification table below. 





Total 






protein 


Artivitv 


Procedure 


( ma) 


( units ^ 


1 . Crude extract 


20,000 


4,000,000 


jriecipiiaiioit ^saixj 


0,UUU 


o Ann AAA 
o,UUU,UUU 


3. Precipitation (pH) 


4,000 


1,000,000 


4. Ion-exchange 


200 


800,000 


chromatography 




5. Affinity 


50 


750,000 


chromatography 






6. Size-exclusion 


45 


675,000 


chromatography 





(a) From the information given in the table, calculate the 
specific activity of the enzyme after each purification procedure. 

(b) Which of the purification procedures used for this en- 
zyme is most effective (i.e., gives the greatest relative increase 
in purity)? 

(c) Which of the purification procedures is least effective? 

(d) Is there any indication based on the results shown in 
the table that the enzyme after step 6 is now pure? What else 
could be done to estimate the purity of the enzyme preparation? 

16. Dialysis. A purified protein is in a Hepes (AT-(2-hydroxy- 
ethyl)piperazine-iV'-(2-ethanesulfonic acid)) buffer at pH 7 with 
500 mM NaCl. A sample (1 mL) of the protein solution is placed 
in a tube made of dialysis membrane and dialyzed against 1 L of 
the same Hepes buffer with 0 ium NaCl. Small molecules and ions 
(such as Na + , CF, and Hepes) can diffuse across the dialysis 
membrane, but the protein cannot. 

(a) Once the dialysis has come to equilibrium, what is the 
concentration of NaCl in the protein sample? Assume no vol- 
ume changes occur in the sample during the dialysis. 

(b) If the original 1 mL sample were dialyzed twice, succes- 
sively, against 100 mL of the same Hepes buffer with 0 mM NaCl, 
what would be the final NaCl concentration in the sample? 

17. Peptide Purification At pH 7.0, in what order would the 
following three peptides be eluted from a column filled with a 
cation-exchange polymer? Their amino acid compositions are: 

Protein A: Ala 10%, Glu 5%, Ser 5%, Leu 10%, Argl0%, 
His 5%, lie 10%, Phe 5%, Tyr 5%, Lys 10%, Gly 10%, Pro 5%, 
and Trp 10%. 

Protein B: Ala 5%, Val 5%, Gly 10%, Asp 5%, Leu 5%, Arg 
5%, He 5%, Phe 5%, Tyr 5%, Lys 5%, Trp 5%, Ser 5%, Thr 5%, 
Glu 5%, Asn 5%, Pro 10%, Met 5%, and Cys 5% 

Protein C: Ala 10%, Glu 10%, Gly 5%, Leu 5% Asp 10%, 
Arg 5% , Met 5% , Cys 5% , Tyr 5% , Phe 5% , His 5% , Val 5% , Pro 
5%, Thr 5%, Ser 5%, Asn 5%, and Gin 5%. 

18. Sequence Determination of the Brain Peptide 
Leucine Enkephalin A group of peptides that influence 
nerve transmission in certain parts of the brain has been iso- 
lated from normal brain tissue. These peptides are known as 
opioids, because they bind to specific receptors that also bind 
opiate drugs, such as morphine and naloxone. Opioids thus 



mimic some of the properties of opiates. Some researchers 
consider these peptides to be the brain's own painkillers. Using 
the information below, determine the amino acid sequence of 
the opioid leucine enkephalin. Explain how your structure is 
consistent with each piece of information. 

(a) Complete hydrolysis by 6 M HC1 at 110 °C followed by 
amino acid analysis indicated the presence of Gly, Leu, Phe, 
and Tyr, in a 2:1:1:1 molar ratio. 

(b) Treatment of the peptide with l-fluoro-2,4- 
dinitrobenzene followed by complete hydrolysis and chro- 
matography indicated the presence of the 2,4-dinitrophenyl 
derivative of tyrosine. No free tyrosine could be found. 

(c) Complete digestion of the peptide with chymotrypsin 
followed by chromatography yielded free tyrosine and leucine, 
plus a tripeptide containing Phe and Gly in a 1:2 ratio. 

19. Structure of a Peptide Antibiotic from Bacillus 
brevis Extracts from the bacterium Bacillus brevis contain 
a peptide with antibiotic properties. This peptide forms com- 
plexes with metal ions and seems to disrupt ion transport 
across the cell membranes of other bacterial species, killin g 
them. The structure of the peptide has been determined from 
the following observations. 

(a) Complete acid hydrolysis of the peptide followed by 
amino acid analysis yielded equimolar amounts of Leu, Orn, 
Phe, Pro, and Val. Orn is ornithine, an amino acid not pres- 
ent in proteins but present in some peptides. It has the 
structure 

H 

I 

H 3 N-CH 2 ~CH 2 -CH 2 -C-COO- 
+ NH 3 

(b) The molecular weight of the peptide was estimated as 
about 1,200. 

(c) The peptide failed to undergo hydrolysis when treated 
with the enzyme carboxypeptidase. This enzyme catalyzes the 
hydrolysis of the carboxyl-terminal residue of a polypeptide 
unless the residue is Pro or, for some reason, does not contain 
a free carboxyl group. 

(d) Treatment of the intact peptide with l-fluoro-2,4- 
dinitrobenzene, followed by complete hydrolysis and chro- 
matography, yielded only free amino acids and the following 
derivative: 

N0 2 

Q 2 N-^ ^MNTH— CH 2 -CH 2 -CH 2 -C-COQ- 
~~ + NH 3 

(Hint: The 2,4-dinitrophenyl derivative involves the amino 
group of a side chain rather than the a-amino group.) 

(e) Partial hydrolysis of the peptide followed by chro- 
matographic separation and sequence analysis yielded the fol- 
lowing di- and tripeptides (the ammo-terminal amino acid is 
always at the left) : 

Leu-Phe Phe-Pro Orn-Leu Val-Orn 

Val-Orn-Leu Phe-Pro-Val Pro-Val-Orn 
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Given the above information, deduce the amino acid sequence 
of the peptide antibiotic. Show your reasoning. When you have 
arrived at a structure, demonstrate that it is consistent with 
each experimental observation. 

20. Efficiency in Peptide Sequencing A peptide with the 
primary structure Lys-Arg-Pro-Leu-Ile-Asp-Gly-Ala is se- 
quenced by the Edman procedure. If each Edman cycle is 96% 
efficient, what percentage of the amino acids liberated in the 
fourth cycle will be leucine? Do the calculation a second time, 
but assume a 99% efficiency for each cycle. 

21. Sequence Comparisons Proteins called molecular 
chaperones (described in Chapter 4) assist in the process of 
protein folding. One class of chaperone found in organisms 
from bacteria to mammals is heat shock protein 90 (Hsp90) . 
All Hsp90 chaperones contain a 10 amino acid "signature se- 
quence," which allows for ready identification of these pro- 
teins in sequence databases. Two representations of this 
signature sequence are shown below. 

Y-x-[NQHD]-[KHR]-[DE]-[IVA]-F-[LM]-R-[ED]. 



8 9 10 



(a) In this sequence, which amino acid residues are invari- 
ant (conserved across all species)? 

(b) At which position(s) are amino acids limited to those 
with positively charged side chains? For each position, which 
amino acid is more commonly found? 

(c) At which positions are substitutions restricted to 
amino acids with negatively charged side chains? For each po- 
sition, which amino acid predominates? 

(d) There is one position that can be any amino acid, al- 
though one amino acid appears much more often than any other. 
What position is this, and which amino acid appears most often? 

22. Biochemistry Protocols: Your First Protein Purifi- 
cation As the newest and least experienced student in a bio- 
chemistry research lab, your first few weeks are spent washing 
glassware and labeling test tubes. You then graduate to making 
buffers and stock solutions for use in various laboratory proce- 
dures. Finally, you are given responsibility for purifying a pro- 
tein. It is citrate synthase (an enzyme of the citric acid cycle, 
to be discussed in Chapter 16), which is located in the mito- 
chondrial matrix. Following a protocol for the purification, you 
proceed through the steps below. As you work, a more experi- 
enced student questions you about the rationale for each 
procedure. Supply the answers. (Hint: See Chapter 2 for infor- 
mation about osmolarity; see p. 7 for information on separation 
of organelles from cells.) 

(a) You pick up 20 kg of beef hearts from a nearby slaugh- 
terhouse (muscle cells are rich in mitochondria, which supply 
energy for muscle contraction) . You transport the hearts on 
ice, and perform each step of the purification on ice or in a 
walk-in cold room. You homogenize the beef heart tissue in a 
high-speed blender in a medium containing 0.2 M sucrose, 



buffered to a pH of 7.2. Why do you use beef heart tissue, 
and in such large quantity? What is the purpose of keeping 
the tissue cold and suspending it in 0.2 m sucrose, at pH 
7.2? What happens to the tissue when it is homogenized? 

(b) You subject the resulting heart homogenate, which is 
dense and opaque, to a series of differential centrifugation 
steps. What does this accomplish? 

(c) You proceed with the purification using the super- 
natant fraction that contains mostly intact mitochondria. Next 
you osmotically lyse the mitochondria. The lysate, which is 
less dense than the homogenate, but still opaque, consists pri- 
marily of mitochondrial membranes and internal mitochondr- 
ial contents. To this lysate you add ammonium sulfate, a highly 
soluble salt, to a specific concentration. You centrifuge the so- 
lution, decant the supernatant, and discard the pellet. To the 
supernatant, which is clearer than the lysate, you add more 
ammonium sulfate. Once again, you centrifuge the sample, but 
this time you save the pellet because it contains the citrate 
synthase. What is the rationale for the two-step addition of 
the salt? 

(d) You solubilize the ammonium sulfate pellet containing 
the mitochondrial proteins and dialyze it overnight against 
large volumes of buffered (pH 7.2) solution. Why isn't ammo- 
nium sulfate included in the dialysis buffer? Why do you 
use the buffer solution instead of water? 

(e) You run the dialyzed solution over a size-exclusion 
chromatographic column. Following the protocol, you collect 
the first protein fraction that exits the column and discard the 
fractions that elute from the column later. You detect the pro- 
tein by measuring UV absorbance (at 280 run) by the fractions. 
What does the instruction to collect the first fraction tell 
you about the protein? Why is UV absorbance at 280 nm a 
good way to monitor for the presence of protein in the 
eluted fractions? 

(f) You place the fraction collected in (e) on a cation- 
exchange chromatographic column. After discarding the initial 
solution that exits the column (the flowthrough), you add a 
washing solution of higher pH to the column and collect the 
protein fraction that immediately elutes. Explain what you 
are doing. 

(g) You run a small sample of your fraction, now very re- 
duced in volume and quite clear (though tinged pink), on an 
isoelectric focusing gel. When stained, the gel shows three 
sharp bands. According to the protocol, the citrate synthase is 
the protein with a pi of 5.6, but you decide to do one more as- 
say of the protein's purity. You cut out the pi 5.6 band and sub- 
ject it to SDS polyacrylamide gel electrophoresis. The protein 
resolves as a single band. Why were you unconvinced of the 
purity of the "single" protein band on your isoelectric fo- 
cusing gel? What did the results of the SDS gel tell you? 
Why is it important to do the SDS gel electrophoresis after 
the isoelectric focusing? 



Data Analysis Problem 



23. Determining the Amino Acid Sequence of Insulin 

Figure 3-24 shows the amino acid sequence of the hormone in- 
sulin. This structure was determined by Frederick Sanger and 
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his coworkers. Most of this work is described in a series of arti- 
cles published in the Biochemical Journal from 1945 to 1955. 

When Sanger and colleagues began their work in 1945, it 
was known that insulin was a small protein consisting of two or 
four polypeptide chains linked by disulfide bonds. Sanger and 
his coworkers had developed a few simple methods for study- 
ing protein sequences 

Treatment with FDNB. FDNB (l-fluoro-2,4-dinitroben- 
zene) reacted with free amino (but not amido or guanidino) 
groups in proteins to produce dinitrophenyl (DNP) derivatives 
of amino acids: 




H 



Amine FDNB DNP-amine 

Acid Hydrolysis. Boiling a protein with 10% HC1 for sev- 
eral hours hydrolyzed all of its peptide and amide bonds. Short 
treatments produced short polypeptides; the longer the treat- 
ment, the more complete the breakdown of the protein into its 
amino acids. 

Oxidation of Cysteines. Treatment of a protein with per- 
formic acid cleaved all the disulfide bonds and converted all 
Cys residues to cysteic acid residues (Fig. 3-26). 

Paper Chromatography. This more primitive version of 
thin-layer chromatography (see Fig. 10-24) separated com- 
pounds based on their chemical properties, allowing identifi- 
cation of single amino acids and, in some cases, dipeptides. 
Thin-layer chromatography also separates larger peptides. 

As reported in his first paper (1945), Sanger reacted in- 
sulin with FDNB and hydrolyzed the resulting protein. He 
found many free amino acids, but only three DNP-amino 
acids: a-DNP-glycine (DNP group attached to the a-arnino 
group); a-DNP-phenylalanine; and e-DNP-lysine (DNP at- 
tached to the e-amino group), Sanger interpreted these results 
as showing that insulin had two protein chains: one with Gly at 
its amino terminus and one with Phe at its amino terminus. 
One of the two chains also contained a Lys residue, not at the 
amino terminus. He named the chain beginning with a Gly 
residue "A" and the chain beginning with Phe "B." 

(a) Explain how Sanger's results support his conclusions. 

(b) Are the results consistent with the known structure 
of insulin (Fig. 3-24)? 

In a later paper (1949), Sanger described how he used 
these techniques to determine the first few amino acids 
(amino-terminal end) of each insulin chain. To analyze the B 
chain, for example, he carried out the following steps: 

1. Oxidized insulin to separate the A and B chains. 

2. Prepared a sample of pure B chain with paper 
chromatography. 

3. Reacted the B chain with FDNB. 

4. Gently acid-hydrolyzed the protein so that some small 
peptides would be produced. 

E. Separated the DNP-peptides from the peptides that did 
not contain DNP groups. 



6. Isolated four of the DNP-peptides, which were named Bl 
through B4. 

7. Strongly hydrolyzed each DNP-peptide to give free amino 
acids. 

8. Identified the amino acids in each peptide with paper 
chromatography. 

The results were as follows: 

Bl: a-DNP-phenylalanine only 
B2: a-DNP-phenylalanine; valine 
B3: aspartic acid; a-DNP-phenylalanine; valine 
B4: aspartic acid; glutamic acid; a-DNP-phenylalanine; 
valine 

(c) Based on these data, what are the first four (amino- 
terminal) amino acids of the B chain? Explain your reasoning. 

(d) Does this result match the known sequence of insulin 
(Fig. 3-24)? Explain any discrepancies. 

Sanger and colleagues used these and related methods to 
determine the entire sequence of the A and B chains. Their se- 
quence for the A chain was as follows (amino terminus on left): 

15 10 

Gly-Ile-Val-Glx-Glx-Cys-Cys-Ala-Ser-Val- 

15 20 

Cys-Ser-Leu-Tyr-Glx-Leu-Glx-Asx-Tyr-Cys-Asx 

Because acid hydrolysis had converted all Asn to Asp and all 
Gin to Glu, these residues had to be designated Asx and Glx, 
respectively (exact identity in the peptide unknown). Sanger 
solved this problem by using protease enzymes that cleave 
peptide bonds, but not the amide bonds in Asn and Gin 
residues, to prepare short peptides. He then determined the 
number of amide groups present in each peptide by measuring 
the NH4 released when the peptide was acid-hydrolyzed. 
Some of the results for the A chain are shown below. The pep- 
tides may not have been completely pure, so the numbers 
were approximate — but good enough for Sanger's purposes. 



Peptide 


Peptide 


Number of amide 


name 


sequence 


groups in peptide 


Acl 


Cys-Asx 


0.7 


Apl5 


Tyr-Glx-Leu 


0.98 


Apl4 


TVr-Glx-Leu-Glx 


1.06 


Ap3 


Asx-Tyr-Cys-Asx 


2.10 


Apl 


Glx-Asx-TVr-Cys-Asx 


1.94 


Ap5pal 


Gly-Ile-Val-Glx 


0.15 


Ap5 


Gly-Ile-Val-Glx-Glx-Cys-Cys- 






Ala-Ser-Val-Cys-Ser-Leu 


1.16 



(e) Based on these data, determine the amino acid se- 
quence of the A chain. Explain how you reached your answer. 
Compare it with Figure 3-24. 
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Perhaps the more remarkable features of [myoglobin] are its complex- 
ity and its lack of symmetry. The arrangement seems to be almost to- 
tally lacking in the kind of regularities which one instinctively 
anticipates, and it is more complicated than has been predicted by 
any theory of protein structure. 

— John Kendrew, article in Nature, 1958 
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The covalent backbone of a typical protein contains 
hundreds of individual bonds. Because free rotation 
is possible around many of these bonds, the protein 
can assume a very large number of conformations. How- 
ever, each protein has a specific chemical or structural 
function, strongly suggesting that each has a unique 
three-dimensional structure (Fig. 4-1). By the late 
1920s, several proteins had been crystallized, includ- 
ing hemoglobin (M T 64,500) and the enzyme urease 
(M r 483,000). Given that, generally, the ordered array of 
molecules in a crystal can form only if the molecular 




FIGURE 4-1 Structure of the enzyme chymotrypsin, a globular protein. 

A molecule of glycine (blue) is shown for size comparison. The known 
three-dimensional structures of proteins are archived in the Protein 
Data Bank, or PDB (see Box 4-4). The image shown here was made 
using data from the PDB entry 6GCH. 



units are identical, the finding that many proteins could 
be crystallized was evidence that even very large pro- 
teins are discrete chemical entities with unique struc- 
tures. This conclusion revolutionized thinking about 
proteins and their functions. 

In this chapter, we examine how a sequence of 
amino acids in a polypeptide chain is translated into a 
discrete, three-dimensional protein structure. We em- 
phasize five themes. First, the three-dimensional struc- 
ture of a protein is determined by its amino acid 
sequence. Second, the function of a protein depends on 
its structure. Third, an isolated protein usually exists in 
one or a small number of stable structural forms. 
Fourth, the most important forces stabilizing the spe- 
cific structures maintained by a given protein are non- 
covalent interactions. Finally, amid the huge number of 
unique protein structures, we can recognize some com- 
mon structural patterns that help to organize our under- 
standing of protein architecture. 

These themes should not be taken to imply that 
proteins have static, unchanging, three-dimensional 
structures. Protein function often entails an intercon- 
version between two or more structural forms. The dy- 
namic aspects of protein structure will be explored in 
Chapters 5 and 6. An understanding of all levels of pro- 
tein structure is essential to the discussion of function 
in later chapters. 

4.1 Overview of Protein Structure 

The spatial arrangement of atoms in a protein is called 
its conformation. The possible conformations of a pro- 
tein include any structural state it can achieve without 
breaking covalent bonds. A change in conformation 
could occur, for example, by rotation about single 
bonds. Of the many conformations that are theoreti- 
cally possible in a protein containing hundreds of single 

[,,,] 
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bonds, one or (more commonly) a few generally pre- 
dominate under biological conditions. The need for 
multiple stable conformations reflects the changes that 
must take place in most proteins as they bind to other 
molecules or catalyze reactions. The conformations ex- 
isting under a given set of conditions are usually the 
ones that are thermodynamically the most stable — that 
is, having the lowest Gibbs free energy (G). Proteins in 
any of their functional, folded conformations are called 
native proteins. 

What principles determine the most stable confor- 
mations of a protein? An understanding of protein con- 
formation can be built stepwise from the discussion of 
primary structure in Chapter 3 through a consideration 
of secondary, tertiary, and quaternary structures. To this 
traditional approach we must add the newer emphasis 
on common and classifiable folding patterns, called su- 
persecondary structures or motifs, which provide an im- 
portant organizational context to this complex endeavor. 
We begin by introducing some guiding principles. 

A Protein's Conformation Is Stabilized Largely by 
Weak Interactions 

In the context of protein structure, the term stability 
can be defined as the tendency to maintain a native con- 
formation. Native proteins are only marginally stable; 
the AG separating the folded and unfolded states in typ- 
ical proteins under physiological conditions is in the 
range of only 20 to 65 kJ/mol. A given polypeptide chain 
can theoretically assume countless conformations, and 
as a result the unfolded state of a protein is character- 
ized by a high degree of conformational entropy. This 
entropy, and the hydrogen-bonding interactions of 
many groups in the polypeptide chain with the solvent 
(water), tend to maintain the unfolded state. The chem- 
ical interactions that counteract these effects and stabi- 
lize the native conformation include disulfide (covalent) 
bonds and the weak (noncovalent) interactions de- 
scribed in Chapter 2: hydrogen bonds and hydrophobic 
and ionic interactions. 

Many proteins do not have disulfide bonds. The en- 
vironment within most cells is highly reducing and thus 
precludes the formation of — S — S — bonds. In eukary- 
otes, disulfide bonds are found primarily in secreted, ex- 
tracellular proteins (for example, the hormone insulin). 
Disulfide bonds are also uncommon in bacterial pro- 
teins. However, thermophilic bacteria, as well as the ar- 
chaea, typically have many proteins with disulfide 
bonds, which stabilize proteins; this is presumably an 
adaptation to life at high temperatures. 

For the intracellular proteins of most organisms, 
weak interactions are especially important in the folding 
of polypeptide chains into their secondary and tertiary 
structures. The association of multiple polypeptides to 
form quaternary structures also relies on these weak 
interactions. 



About 200 to 460 kJ/mol are required to break a sin- 
gle covalent bond, whereas weak interactions can be 
disrupted by a mere 4 to 30 kJ/mol. Individual covalent 
bonds, such as disulfide bonds linking separate parts of 
a single polypeptide chain, are clearly much stronger 
than individual weak interactions. Yet, because they are 
so numerous, it is weak interactions that predominate as 
a stabilizing force in protein structure. In general, the 
protein conformation with the lowest free energy (that 
is, the most stable conformation) is the one with the 
maximum number of weak interactions. 

The stability of a protein is not simply the sum of 
the free energies of formation of the many weak interac- 
tions within it. For every hydrogen bond formed in a 
protein during folding, a hydrogen bond (of similar 
strength) between the same group and water was bro- 
ken. The net stability contributed by a given hydrogen 
bond, or the difference in free energies of the folded 
and unfolded states, may be close to zero. Ionic interac- 
tions may be either stabilizing or destabilizing. We must 
therefore look elsewhere to understand why a particular 
native conformation is favored. 

On carefully examining the contribution of weak in- 
teractions to protein stability, we find that hydrophobic 
interactions generally predominate. Pure water con- 
tains a network of hydrogen-bonded H 2 0 molecules. No 
other molecule has the hydrogen-bonding potential of 
water, and the presence of other molecules in an aque- 
ous solution disrupts the hydrogen bonding of water. 
When water surrounds a hydrophobic molecule, the op- 
timal arrangement of hydrogen bonds results in a highly 
structured shell, or solvation layer, of water around 
the molecule (see Fig. 2-7). The increased order of the 
water molecules in the solvation layer correlates with an 
unfavorable decrease in the entropy of the water. How- 
ever, when nonpolar groups cluster together, the extent 
of the solvation layer decreases because each group no 
longer presents its entire surface to the solution. The re- 
sult is a favorable increase in entropy. As described in 
Chapter 2, this increase in entropy is the major thermo- 
dynamic driving force for the association of hydrophobic 
groups in aqueous solution. Hydrophobic amino acid 
side chains therefore tend to cluster in a protein's 
interior, away from water. 

Under physiological conditions, the formation of 
hydrogen bonds in a protein is driven largely by this 
same entropic effect. Polar groups can generally form 
hydrogen bonds with water and hence are soluble in 
water. However, the number of hydrogen bonds per 
unit mass is generally greater for pure water than for 
any other liquid or solution, and there are limits to the 
solubility of even the most polar molecules as their 
presence causes a net decrease in hydrogen bonding 
per unit mass. Therefore, a solvation layer also forms to 
some extent around polar molecules. Even though the 
energy of formation of an intramolecular hydrogen 
bond between two polar groups in a macromolecule is 
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largely canceled by the elimination of such interactions 
between these polar groups and water, the release of 
structured water as intramolecular interactions form 
provides an entropic driving force for folding. Most of 
the net change in free energy as weak interactions form 
within a protein is therefore derived from the increased 
entropy in the surrounding aqueous solution resulting 
from the burial of hydrophobic surfaces. This more than 
counterbalances the large loss of conformational en- 
tropy as a polypeptide is constrained into its folded 
conformation. 

Hydrophobic interactions are clearly important in 
stabilizing conformation; the interior of a protein is gen- 
erally a densely packed core of hydrophobic amino acid 
side chains. It is also important that any polar or 
charged groups in the protein interior have suitable 
partners for hydrogen bonding or ionic interactions. 
One hydrogen bond seems to contribute little to the sta- 
bility of a native structure, but the presence of hydrogen- 
bonding groups without partners in the hydrophobic 
core of a protein can be so destabilizing that confor- 
mations containing these groups are often thermody- 
namically untenable. The favorable free-energy change 
resulting from the combination of several such groups 
with partners in the surrounding solution can be 
greater than the free-energy difference between the 
folded and unfolded states. In addition, hydrogen 
bonds between groups in a protein form cooperatively 
(formation of one makes the next one more likely) in 
repeating secondary structures that optimize hydrogen 
bonding, as described below. In this way, hydrogen 
bonds often have an important role in guiding the 
protein-folding process. 

The interaction of oppositely charged groups that 
form an ion pair, or salt bridge, can have either a stabiliz- 
ing or destabilizing effect on protein structure. As in 
the case of hydrogen bonds, charged amino acid side 
chains interact with water and salts when the protein is 
unfolded, and the loss of those interactions must be 
considered when evaluating the effect of a salt bridge on 
the overall stability of a folded protein. However, the 
strength of a salt bridge increases as it moves to an envi- 
ronment of lower dielectric constant, e (see p. 46): from 
the polar aqueous solvent (e near 80) to the nonpolar 
protein interior (e near 4). Salt bridges, especially those 
that are partly or entirely buried, can thus provide signif- 
icant stabilization to a protein structure. This trend ex- 
plains the increased occurrence of buried salt bridges in 
the proteins of thermophilic organisms. Ionic interac- 
tions also limit structural flexibility and confer a unique- 
ness to protein structure that nonspecific hydrophobic 
interactions cannot provide. 

Most of the structural patterns outlined in this 
chapter reflect two simple rules: (1) hydrophobic 
residues are largely buried in the protein interior, 
away from water; and (2) the number of hydrogen 
bonds and ionic interactions within the protein is max- 



imized, thus reducing the number of hydrogen- 
bonding and ionic groups that are not paired with a 
suitable partner. Insoluble proteins and proteins within 
membranes (which we examine in Chapter 11) follow 
somewhat different rules, because of their particular 
function or environment, but weak interactions are 
still critical structural elements. 

The Peptide Bond Is Rigid and Planar 

% Protein Architecture-Primary Structure Covalent bonds, 
too, place important constraints on the conformation of 
a polypeptide. In the late 1930s, Linus Pauling and 
Robert Corey embarked on a series of studies that laid 
the foundation for our current understanding of protein 
structure. They began with a careful analysis of the pep- 
tide bond. 
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Linus Pauling, 1901-1994 Robert Corey, 1897-1971 

The a carbons of adjacent amino acid residues are 
separated by three covalent bonds, arranged as 
C a — C — N — C a . X-ray diffraction studies of crystals 
of amino acids and of simple dipeptides and tripep- 
tides showed that the peptide C — N bond is somewhat 
shorter than the C — N bond in a simple amine and that 
the atoms associated with the peptide bond are co- 
planar. This indicated a resonance or partial sharing of 
two pairs of electrons between the carbonyl oxygen 
and the amide nitrogen (Fig. 4-2a). The oxygen has a 
partial negative charge and the nitrogen a partial pos- 
itive charge, setting up a small electric dipole. The six 
atoms of the peptide group lie in a single plane, with 
the oxygen atom of the carbonyl group trans to the hy- 
drogen atom of the amide nitrogen. From these find- 
ings Pauling and Corey concluded that the peptide 
C — N bonds, because of their partial double-bond 
character, cannot rotate freely. Rotation is permitted 
about the N — C a and the C a — C bonds. The backbone 
of a polypeptide chain can thus be pictured as a series 
of rigid planes, with consecutive planes sharing a com- 
mon point of rotation at G a (Fig. 4-2b). The rigid pep- 
tide bonds limit the range of conformations possible 
for a polypeptide chain. 
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The carbonyl oxygen has a partial negative 
charge and the amide nitrogen a partial positive 
charge, setting up a small electric dipole. 
Virtually all peptide bonds in proteins occur in 
this trans configuration; an exception is noted in 
Figure 4-7b. 
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FIGURE 4-2 The planar peptide group, (a) Each peptide bond has some 
double-bond character due to resonance and cannot rotate, (b) Three 
bonds separate sequential a carbons in a polypeptide chain. The N — C Q 
and C a — C bonds can rotate, described by dihedral angles designated (f> 
and t/f, respectively. The peptide C — N bond is not free to rotate. Other 
single bonds in the backbone may also be rotationally hindered, de- 
pending on the size and charge of the R groups, (c) The atoms and 
planes defining (d) By convention, <f> and t/r are 180° (or -180°) when 
the first and fourth atoms are farthest apart and the peptide is fully ex- 
tended. As the viewer looks out along the bond undergoing rotation 
(from either direction) the <j> and </r angles increase as the fourth atom 
rotates clockwise relative to the first. In a protein, some of the confor- 
mations shown here (e.g., 0°) are prohibited by steric overlap of atoms. 
In (b) through (d), the balls representing atoms are smaller than the van 
der Waal's radii for this scale. 



Peptide conformation is defined by three dihedral 
angles (also known as torsion angles) called <p (phi), if/ 
(psi), and co (omega), reflecting rotation about each of 
the three repeating bonds in the peptide backbone. A di- 
hedral angle is the angle at the intersection of two 
planes. In the case of peptides, the planes are defined by 
bond vectors in the peptide backbone. Two successive 
bond vectors describe a plane. Three successive bond 
vectors describe two planes (the central bond vector is 
common to both; Fig. 4-2 c), and the angle between 
these two planes is what we measure to describe protein 
conformation. 

KEY CONVENTION: The important dihedral angles in a 
peptide are defined by the three bond vectors con- 
necting four consecutive main-chain (peptide back- 
bone) atoms (Fig. 4-2c): <f> involves the C — N — C a — C 
bonds (with the rotation occurring about the N — C a 
bond), and if/ involves the N — C a — C — N bonds. Both 
<f> and if/ are defined as ±180° when the polypeptide is 
fully extended and all peptide groups are in the same 
plane (Fig. 4-2 d). As one looks down the central bond 



vector in the direction of the vector arrow (as depicted 
in Fig. 4-2c for ifi), the dihedral angles increase as the 
distal (fourth) atom is rotated clockwise (Fig. 4-2d). 
From the ±180° position, the dihedral angle increases 
from -180° to 0°, at which point the first and fourth 
atoms are eclipsed. The rotation can be continued 
from 0° to +180° (same position as -180°) to bring 
the structure back to the starting point. The third di- 
hedral angle, o>, is not often considered. It involves the 
C a — C — N — C a bonds. The central bond in this case 
is the peptide bond, where rotation is constrained. 
The peptide bond is normally (99.6% of the time) in 
the trans configuration, constraining cu to a value of 
±180°. For a rare cis peptide bond, a> = 0°. ■ 

In principle, cf> and if/ can have any value between 
-180° and +180°, but many values are prohibited by 
steric interference between atoms in the polypeptide 
backbone and amino acid side chains. The conformation 
in which both <f> and iff are 0° (Fig. 4-2d) is prohibited for 
this reason; this conformation is merely a reference 
point for describing the dihedral angles. Allowed values 
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FIGURE 4-3 Ramachandran plot for L-Ala residues. Peptide conforma- 
tions are defined by the values of <f> and ip. Conformations deemed pos- 
sible are those that involve little or no steric interference, based on 
calculations using known van der Waals radii and dihedral angles. The 
areas shaded dark blue represent conformations that involve no steric 
overlap and thus are fully allowed; medium blue indicates conforma- 
tions allowed at the extreme limits for unfavorable atomic contacts; the 
lightest blue indicates conformations that are permissible if a little flex- 
ibility is allowed in the dihedral angles. The yellow regions are confor- 
mations that are not allowed. The asymmetry of the plot results from 
the l stereochemistry of the amino acid residues. The plots for other l 
residues with unbranched side chains are nearly identical. Allowed 
ranges for branched residues such as Val, lie, and Thr are somewhat 
smaller than for Ala. The Gly residue, which is less sterically hindered, 
has a much broader range of allowed conformations. The range for Pro 
residues is greatly restricted because <f> is limited by the cyclic side 
chain to the range of -35° to -85°. 



for (f> and ip become evident when if/ is plotted versus (f> 
in a Ramachandran plot (Fig. 4-3), introduced by 
G. N. Ramachandran. 



SUMMARY 4.1 



Overview of Protein 
Structure 



Every protein has a three-dimensional structure 
that reflects its function. 

Protein structure is stabilized by multiple weak 
interactions. Hydrophobic interactions are the 
major contributors to stabilizing the globular form 
of most soluble proteins; hydrogen bonds and 
ionic interactions are optimized in the 
thermodynamically most stable structures. 

The nature of the covalent bonds in the polypeptide 
backbone places constraints on structure. The 
peptide bond has a partial double-bond character 
that keeps the entire six-atom peptide group in a 
rigid planar configuration. The N — C a and C a — C 
bonds can rotate to define the dihedral angles <f> 
and if/, respectively. 



4.2 Protein Secondary Structure 

The term secondary structure refers to any chosen 
segment of a polypeptide chain and describes the local 
spatial arrangement of its main-chain atoms, without re- 
gard to the conformation of its side chains or its rela- 
tionship to other segments. A regular secondary 
structure occurs when each dihedral angle, $ and fa re- 
mains the same or nearly the same throughout the seg- 
ment. There are a few types of secondary structure that 
are particularly stable and occur widely in proteins. The 
most prominent are the a helix and jS conformations; an- 
other common type is the /3 turn. Where a regular pat- 
tern is not found, the secondary structure is sometimes 
referred to as undefined or as a random coil. This last 
designation, however, does not properly describe the 
structure of these segments. The path of the polypep- 
tide backbone in almost any protein is not random; 
rather, it is typically unchanging and highly specific to 
the structure and function of that particular protein. 
Our discussion here focuses on the regular, common 
structures. 



The a Helix Is a Common Protein Secondary Structure 

9 Protein Architecture-a Helix Pauling and Corey were 
aware of the importance of hydrogen bonds in orienting 
polar chemical groups such as the C=0 and N — H groups 
of the peptide bond. They also had the experimental re- 
sults of William Astbury, who in the 1930s had conducted 
pioneering x-ray studies of proteins. Astbury demon- 
strated that the protein that makes up hair and porcupine 
quills (the fibrous protein a-keratin) has a regular 
structure that repeats every 5. 15 to 5.2 A. (The angstrom, 
A, named after the physicist Anders J. Angstrom, is equal 
to 0. 1 nm. Although not an SI unit, it is used universally by 
structural biologists to describe atomic distances — it is 
approximately the length of a typical C — H bond.) With 
this information and their data on the peptide bond, and 
with the help of precisely constructed models, Pauling 
and Corey set out to determine the likely conformations 
of protein molecules. 

The simplest arrangement the polypeptide chain can 
assume, given its rigid peptide bonds (but free rotation 
around its other, single bonds), is a helical structure, 
which Pauling and Corey called the a helix (Fig. 4-4). 
In this structure the polypeptide backbone is tightly 
wound around an imaginary axis drawn longitudinally 
through the middle of the helix, and the R groups of the 
amino acid residues protrude outward from the helical 
backbone. The repeating unit is a single turn of the helix, 
which extends about 5.4 A along the long axis, slightly 
greater than the periodicity Astbury observed on x-ray 
analysis of hair keratin. The amino acid residues in the 
prototypical a helix have conformations with <f> = -57° 
and ifj = -47°, and each helical turn includes 3.6 amino 
acid residues. The a-helical segments in proteins often 
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FIGURE 4-4 Models of the a helix, showing different aspects of its 
structure, (a) Ball-and-stick model showing the intrachain hydrogen 
bonds. The repeat unit is a single turn of the helix, 3.6 residues. (b)The 
a helix viewed from one end, looking down the longitudinal axis 
(derived from PDB ID 4TNC). Note the positions of the R groups, rep- 
resented by purple spheres. This ball-and-stick model, which empha- 
sizes the helical arrangement, gives the false impression that the helix 
is hollow, because the balls do not represent the van der Waals radii of 




(c) 



(d) 



the individual atoms, (c) As this space-filling model shows, the atoms 
in the center of the a helix are in very close contact, (d) Helical wheel 
projection of an a helix. This representation can be colored to identify 
surfaces with particular properties. The yellow residues, for example, 
could be hydrophobic and conform to an interface between the helix 
shown here and another part of the same or another polypeptide. The red 
and blue residues illustrate the potential for interaction of negatively and 
positively charged side chains separated by two residues in the helix. 



deviate slightly from these dihedral angles, and even 
vary somewhat within a single contiguous segment to 
produce subtle bends or kinks in the helical axis. In all 
proteins, the helical twist of the a helix is right-handed 
(Box 4-1). The a helix proved to be the predominant 
structure in a -keratins. More generally, about one- 



fourth of all amino acid residues in proteins are found in 
a helices, the exact fraction varying greatly from one 
protein to another. 

Why does the a helix form more readily than many 
other possible conformations? The answer is, in part, 
that an a helix makes optimal use of internal hydrogen 



BOX 4-1 



METHODS 



Knowing the Right Hand from the Lef 



There is a simple method for determining whether a 
helical structure is right-handed or left-handed. Make 
fists of your two hands with thumbs outstretched and 
pointing away from you. Looking at your right hand, 
think of a helix spiraling up your right thumb in the di- 
rection in which the other four fingers are curled as 
shown (clockwise). The resulting helix is right-handed. 
Your left hand will demonstrate a left-handed helix, 
which rotates in the counterclockwise direction as it 
spirals up your thumb. 



Left-handed 
helix 




Right-handed 
helix 
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bonds. The structure is stabilized by a hydrogen bond 
between the hydrogen atom attached to the electroneg- 
ative nitrogen atom of a peptide linkage and the elec- 
tronegative carbonyl oxygen atom of the fourth amino 
acid on the amino-terminal side of that peptide bond 
(Fig. 4-4a). Within the a helix, every peptide bond 
(except those close to each end of the helix) partici- 
pates in such hydrogen bonding. Each successive turn 
of the a helix is held to adjacent turns by three to four 
hydrogen bonds, conferring significant stability on the 
overall structure. 

Further model-building experiments have shown 
that an a helix can form in polypeptides consisting of ei- 
ther l- or D-amino acids. However, all residues must be 
of one stereoisomeric series; a D-amino acid will disrupt 
a regular structure consisting of L-amino acids, and vice 
versa. In principle, naturally occurring L-amino acids 
can form either right- or left-handed a helices, but ex- 
tended left-handed a helices are theoretically less stable 
and have not been observed in proteins. 



TABLE 4-1 



Propensity of Amino Acids to Take Up 



m a-Helical Conformation 



WORKED EXAMPLE 4-1 



Secondary Structure and 
Protein Dimensions 



What is the length of a polypeptide with 80 amino acid 
residues in a single contiguous a helix? 

Solution: An idealized a helix has 3.6 residues per turn 
and the rise along the helical axis is 5.4 A. Thus, the rise 
along the axis for each amino acid residue is 1.5 A. The 
length of the polypeptide is therefore 80 residues X 1.5 hi 
residue = 120 A. 



Amino Acid Sequence Affects Stability of the a Helix 

Not all polypeptides can form a stable a helix. Each 
amino acid residue in a polypeptide has an intrinsic 
propensity to form an a helix (Table 4-1), reflecting the 
properties of the R group and how they affect the ca- 
pacity of the adjoining main-chain atoms to take up the 
characteristic </> and ip angles. Alanine shows the great- 
est tendency to form a helices in most experimental 
model systems. 

The position of an amino acid residue relative to its 
neighbors is also important. Interactions between amino 
acid side chains can stabilize or destabilize the a-helical 
structure. For example, if a polypeptide chain has a long 
block of Glu residues, this segment of the chain will not 
form an a helix at pH 7.0. The negatively charged car- 
boxyl groups of adjacent Glu residues repel each other 
so strongly that they prevent formation of the a helix. 
For the same reason, if there are many adjacent Lys 
and/or Arg residues, with positively charged R groups at 
pH 7.0, they also repel each other and prevent formation 
of the a helix. The bulk and shape of Asn, Ser, Thr, and 
Cys residues can also destabilize an a helix if they are 
close together in the chain. 
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greater difficulty taking up the a-helical structure. Data are a composite derived from 
multiple experiments and experimental systems. 



The twist of an a helix ensures that critical interac- 
tions occur between an amino acid side chain and the 
side chain three (and sometimes four) residues away 
on either side of it. This is clear when the a helix is de- 
picted as a helical wheel (Fig. 4-4d). Positively charged 
amino acids are often found three residues away from 
negatively charged amino acids, permitting the forma- 
tion of an ion pair. Two aromatic amino acid residues 
are often similarly spaced, resulting in a hydrophobic 
interaction. 

A constraint on the formation of the a helix is the 
presence of Pro or Gly residues, which have the least 
proclivity to form a helices. In proline, the nitrogen 
atom is part of a rigid ring (see Fig. 4-7b), and rota- 
tion about the N — C a bond is not possible. Thus, a Pro 
residue introduces a destabilizing kink in an a helix. 
In addition, the nitrogen atom of a Pro residue in a 
peptide linkage has no substituent hydrogen to par- 
ticipate in hydrogen bonds with other residues. For 
these reasons, proline is only rarely found in an a he- 
lix. Glycine occurs infrequently in a helices for a dif- 
ferent reason: it has more conformational flexibility 
than the other amino acid residues. Polymers of 
glycine tend to take up coiled structures quite differ- 
ent from an a helix. 

A final factor affecting the stability of an a helix is 
the identity of the amino acid residues near the ends of 
the a-helical segment of the polypeptide. A small elec- 
tric dipole exists in each peptide bond (Fig. 4-2a). 
These dipoles are aligned through the hydrogen bonds 
of the helix, resulting in a net dipole along the helical 
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FIGURE 4-5 Helix dipole. The electric di- 
pole of a peptide bond (see Fig. 4-2a) is 
transmitted along an a-helical segment 
through the intrachain hydrogen bonds, re- 
sulting in an overall helix dipole. In this 
illustration, the amino and carbonyl con- 
stituents of each peptide bond are indi- 
cated by + and - symbols, respectively. 
Non-hydrogen-bonded amino and car- 
bonyl constituents of the peptide bonds 
near each end of the a-helical region are 



Carboxyl terminus shown in red. 

axis that increases with helix length (Fig. 4-5). The 
four amino acid residues at each end of the helix do not 
participate fully in the helix hydrogen bonds. The partial 
positive and negative charges of the helix dipole reside 
on the peptide amino and carbonyl groups near the 
amino-terminal and carboxyl-terminal ends, respec- 
tively. For this reason, negatively charged amino acids 
are often found near the amino terminus of the helical 
segment, where they have a stabilizing interaction with 
the positive charge of the helix dipole; a positively 
charged amino acid at the amino-terminal end is desta- 
bilizing. The opposite is true at the carboxyl-terminal 
end of the helical segment. 

In summary, five types of constraints affect the sta- 
bility of an a helix: (1) the intrinsic propensity of an 
amino acid residue to form an a helix; (2) the interac- 
tions between R groups, particularly those spaced three 
(or four) residues apart; (3) the bulkiness of adjacent R 
groups; (4) the occurrence of Pro and Gly residues; and 
(5) interactions between amino acid residues at the 
ends of the helical segment and the electric dipole in- 
herent to the a helix. The tendency of a given segment 
of a polypeptide chain to form an a helix therefore de- 
pends on the identity and sequence of amino acid 
residues within the segment. 

The f$ Conformation Organizes Polypeptide 
Chains into Sheets 

% Protein Architecture-/* Sheet In 1951, Pauling and Corey 
predicted a second type of repetitive structure, the 
f3 conformation. This is a more extended conforma- 
tion of polypeptide chains, and its structure has been 
confirmed by x-ray analysis. In the /3 conformation, 



the backbone of the polypeptide chain is extended 
into a zigzag rather than helical structure (Fig. 4-6). 
The zigzag polypeptide chains can be arranged side by 
side to form a structure resembling a series of pleats. 
In this arrangement, called a ft sheet, hydrogen bonds 
form between adjacent segments of polypeptide chain. 
The individual segments that form a /3 sheet are usu- 
ally nearby on the polypeptide chain, but can also be 

(a) Antiparallel 




6.5 A 



FIGURE 4-6 The p conformation of polypeptide chains. These top and 
side views reveal the R groups extending out from the sheet and em- 
phasize the pleated shape described by the planes of the peptide 
bonds, (An alternative name for this structure is /3-pleated sheet.) 
Hydrogen-bond cross-links between adjacent chains are also shown. 
The amino-terminal to carboxyl-terminal orientations of adjacent 
chains (arrows) can be the same or opposite, forming (a) an antiparal- 
(el jS sheet or (b) a parallel /3 sheet. 
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quite distant from each other in the linear sequence 
of the polypeptide; they may even be in different 
polypeptide chains. The R groups of adjacent amino 
acids protrude from the zigzag structure in opposite 
directions, creating the alternating pattern seen in 
the side views in Figure 4-6. 

The adjacent polypeptide chains in a /3 sheet can 
be either parallel or antiparallel (having the same or 
opposite amino-to-carboxyl orientations, respectively). 
The structures are somewhat similar, although the re- 
peat period is shorter for the parallel conformation (6.5 
A, vs. 7 A for antiparallel) and the hydrogen-bonding 
patterns are different. The idealized structures corre- 
spond to 4> = -119°, ift = +113° (parallel) and 4> = 
-139°, ifj = +135° (antiparallel); these values vary 
somewhat in real proteins, resulting in structural varia- 
tion, as seen above for a helices. 

Some protein structures limit the kinds of amino 
acids that can occur in the /3 sheet. When two or more 
j3 sheets are layered close together within a protein, 
the R groups of the amino acid residues on the touch- 
ing surfaces must be relatively small. /3-Keratins such 
as silk fibroin and the fibroin of spider webs have a 
very high content of Gly and Ala residues, the two 
amino acids with the smallest R groups. Indeed, in silk 
fibroin Gly and Ala alternate over large parts of the 
sequence. 

fi Turns Are Common in Proteins 

Q Protein Architecture-/* Turn In globular proteins, which 

have a compact folded structure, nearly one-third of the 
amino acid residues are in turns or loops where the 
polypeptide chain reverses direction (Fig. 4-7). These 



are the connecting elements that link successive runs 
of a helix or /3 conformation. Particularly common are 
P turns that connect the ends of two adjacent seg- 
ments of an antiparallel /3 sheet. The structure is a 180° 
turn involving four amino acid residues, with the car- 
bonyl oxygen of the first residue forming a hydrogen 
bond with the amino-group hydrogen of the fourth. The 
peptide groups of the central two residues do not par- 
ticipate in any inter-residue hydrogen bonding. Gly and 
Pro residues often occur in /3 turns, the former because 
it is small and flexible, the latter because peptide bonds 
involving the imino nitrogen of proline readily assume 
the cis configuration (Fig. 4-7b), a form that is particu- 
larly amenable to a tight turn. Of the several types of /3 
turns, the two shown in Figure 4-7a are the most com- 
mon. Beta turns are often found near the surface of a 
protein, where the peptide groups of the central two 
amino acid residues in the turn can hydrogen-bond with 
water. Considerably less common is the y turn, a three- 
residue turn with a hydrogen bond between the first 
and third residues. 

Common Secondary Structures Have Characteristic 
Dihedral Angles 

The a helix and the /3 conformation are the major 
repetitive secondary structures in a wide variety of 
proteins, although other repetitive structures exist in 
some specialized proteins (an example is collagen; see 
Fig. 4-12). Every type of secondary structure can be 
completely described by the dihedral angles <f> and 
ijj associated with each residue. As shown by a Ra- 
machandran plot, the a helix and /3 conformation fall 
within a relatively restricted range of sterically allowed 




(a) /3 Turns Type I Type II (b) Proline isomers 



FIGURE 4-7 Structures of p turns, (a) Type I and type Upturns are most (b) Trans and cis isomers of a peptide bond involving the imino nitro- 

common; type I turns occur more than twice as frequently as type II. gen of proline. Of the peptide bonds between amino acid residues 

Type II £ turns usually have Gfy as the third residue. Note the hydrogen other than Pro, more than 99.95% are in the trans configuration. For 

bond between the peptide groups of the first and fourth residues of the peptide bonds involving the imino nitrogen of proline, however, about 

bends. (Individual amino acid residues are framed by large blue circles.) 6% are in the cis configuration; many of these occur at ft turns. 
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FIGURE 4-8 Ramachandran plots showing a variety of structures. 

(a) The values of <f> and i// for various allowed secondary structures are 
overlaid on the plot from Figure 4-3. Although left-handed a helices 
extending over several amino acid residues are theoretically possi- 
ble, they have not been observed in proteins, (b) The values of 4> and 
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if/ for all the amino acid residues except Gly in the enzyme pyruvate 
kinase (isolated from rabbit) are overlaid on the plot of theoretically 
allowed conformations (Fig. 4-3). The small, flexible Gly residues 
were excluded because they frequently fall outside the expected 
(blue) ranges. 



structures (Fig. 4-8a). Most values of </> and iff taken 
from known protein structures fall into the expected 
regions, with high concentrations near the a helix and 
/3 conformation values as predicted (Fig. 4-8b). The 
only amino acid residue often found in a conformation 
outside these regions is glycine. Because its side chain 
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FIGURE 4-9 Circular dichroism spectroscopy. These spectra show 
polylysine entirely as a helix, as p conformation, or as a denatured, 
random coil. The y axis unit is a simplified version of the units most 
commonly used in CD experiments. Since the curves are different for 
a helix, (3 conformation, and random coil, the CD spectrum for a given 
protein can provide a rough estimate for the fraction of the protein 
made up of the two most common secondary structures. The CD spec- 
trum of the native protein can serve as a benchmark for the folded 
state, useful for monitoring denaturation or conformational changes 
brought about by changes in solution conditions. 



is small, a Gly residue can take part in many confor- 
mations that are sterically forbidden for other amino 
acids, 

Common Secondary Structures Can Be Assessed by 
Circular Dichroism 

Structural asymmetry in a molecule gives rise to differ- 
ences in absorption of left-handed versus right-handed 
plane-polarized light. Measurement of this difference is 
called circular dichroism (CD) spectroscopy. An or- 
dered structure, such as a folded protein, gives rise to an 
absorption spectrum that can have peaks or regions 
with both positive and negative values. For proteins, 
spectra are obtained in the far UV region (190 to 250 nm) . 
The light-absorbing entity, or chromophore, in this 
region is the peptide bond; a signal is obtained when the 
peptide bond is in a folded environment. The difference 
in molar extinction coefficients (see Box 3-1) for left- 
and right-handed plane-polarized light (Ae) is plotted as 
a function of wavelength. The a helix and j3 conforma- 
tions have characteristic CD spectra (Fig. 4-9). Using 
CD spectra, biochemists can determine whether pro- 
teins are properly folded, estimate the fraction of the 
protein that is folded in either of the common secondary 
structures, and monitor transitions between the folded 
and unfolded states. 

SUMMARY 4.2 Protein Secondary Structure 

■ Secondary structure is the local spatial arrangement 
of the main-chain atoms in a selected segment of 
a polypeptide chain. 
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■ The most common regular secondary structures are 
the a helix, the /3 conformation, and /3 turns. 

■ The secondary structure of a polypeptide segment 
can be completely defined if the 4> and if/ angles 
are known for all amino acid residues in that 
segment. 

■ Circular dichroism spectroscopy is a method for 
assessing common secondary structure and 
monitoring folding in proteins. 

4.3 Protein Tertiary and Quaternary 
Structures 

0 Protein Architecture — Introduction to Tertiary Structure The 

overall three-dimensional arrangement of all atoms in a 
protein is referred to as the protein's tertiary struc- 
ture. Whereas the term "secondary structure" refers 
to the spatial arrangement of amino acid residues that 
are adjacent in a segment of a polypeptide, tertiary 
structure includes longer-range aspects of amino acid 
sequence. Amino acids that are far apart in the polypep- 
tide sequence and are in different types of secondary 
structure may interact within the completely folded 
structure of a protein. The location of bends (including 
j8 turns) in the polypeptide chain and the direction and 
angle of these bends are determined by the number and 
location of specific bend-producing residues, such as 
Pro, Thr, Ser, and Gly. Interacting segments of polypep- 
tide chains are held in their characteristic tertiary 
positions by several kinds of weak interactions (and 
sometimes by covalent bonds such as disulfide cross- 
links) between the segments. 

Some proteins contain two or more separate 
polypeptide chains, or subunits, which may be identi- 
cal or different. The arrangement of these protein 
subunits in three-dimensional complexes constitutes 
quaternary structure. 

In considering these higher levels of structure, it is 
useful to classify proteins into two major groups: fibrous 
proteins, with polypeptide chains arranged in long 
strands or sheets, and globular proteins, with polypep- 
tide chains folded into a spherical or globular shape. The 
two groups are structurally distinct. Fibrous proteins 
usually consist largely of a single type of secondary struc- 
ture, and their tertiary structure is relatively simple. 



Globular proteins often contain several types of second- 
ary structure. The two groups also differ functionally: the 
structures that provide support, shape, and external pro- 
tection to vertebrates are made of fibrous proteins, 
whereas most enzymes and regulatory proteins are glob- 
ular proteins. 

Fibrous Proteins Are Adapted for a 
Structural Function 

H Protein Architecture— Tertiary Structure of Fibrous Proteins 

a-Keratin, collagen, and silk fibroin nicely illustrate the 
relationship between protein structure and biological 
function (Table 4-2). Fibrous proteins share properties 
that give strength and/or flexibility to the structures in 
which they occur. In each case, the fundamental struc- 
tural unit is a simple repeating element of secondary 
structure. All fibrous proteins are insoluble in water, a 
property conferred by a high concentration of hy- 
drophobic amino acid residues both in the interior of the 
protein and on its surface. These hydrophobic surfaces 
are largely buried as many similar polypeptide chains 
are packed together to form elaborate supramolecular 
complexes. The underlying structural simplicity of fi- 
brous proteins makes them particularly useful for illus- 
trating some of the fundamental principles of protein 
structure discussed above. 

a-Keratin The a-keratins have evolved for strength. 
Found only in mammals, these proteins constitute 
almost the entire dry weight of hair, wool, nails, claws, 
quills, horns, hooves, and much of the outer layer of 
skin. The a-keratins are part of a broader family of pro- 
teins called intermediate filament (IF) proteins. Other 
IF proteins are found in the cytoskeletons of animal 
cells. All IF proteins have a structural function and share 
the structural features exemplified by the a-keratins. 

The a-keratin helix is a right-handed a helix, the 
same helix found in many other proteins. Francis Crick 
and Linus Pauling in the early 1950s independently sug- 
gested that the a helices of keratin were arranged as a 
coiled coil. Two strands of a-keratin, oriented in parallel 
(with their amino termini at the same end) , are wrapped 
about each other to form a supertwisted coiled coil. 
The supertwisting amplifies the strength of the overall 
structure, just as strands are twisted to make a strong 



Secondary Structures and Properties of Some Fibrous Proteins 


Structure 


Characteristics 


Examples of occurrence 


a Helix, cross-linked by 
disulfide bonds 

p Conformation 

Collagen triple helix 


Tough, insoluble protective structures of 
varying hardness and flexibility 

Soft, flexible filaments 

High tensile strength, without stretch 


a-Keratin of hair, feathers, and nails 
Silk fibroin 

Collagen of tendons, bone matrix 
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rope (Fig. 4-10). The twisting of the axis of an a helix 
to form a coiled coil explains the discrepancy between 
the 5.4 A per turn predicted for an a helix by Pauling 
and Corey and the 5.15 to 5.2 A repeating structure ob- 
served in the x-ray diffraction of hair (p. 117). The heli- 
cal path of the supertwists is left-handed, opposite in 
sense to the a helix. The surfaces where the two a he- 
lices touch are made up of hydrophobic amino acid 
residues, their R groups meshed together in a regular in- 
terlocking pattern. This permits a close packing of the 
polypeptide chains within the left-handed supertwist. 
Not surprisingly, a-keratin is rich in the hydrophobic 
residues Ala, Val, Leu, He, Met, and Phe. 
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(b) Cross section of a hair 

FIGURE 4-10 Structure of hair, (a) Hair a-keratin is an elongated a he- 
lix with somewhat thicker elements near the amino and carboxyl ter- 
mini. Pairs of these helices are interwound in a left-handed sense to 
form two-chain coiled coils. These then combine in higher-order struc- 
tures called protofilaments and protofibrils. About four protofibrils — 
32 strands of a-keratin in all — combine to form an intermediate 
filament. The individual two-chain coiled coils in the various substruc- 
tures also seem to be interwound, but the handedness of the interwind- 
ing and other structural details are unknown, (b) A hair is an array of 
many a-keratin filaments, made up of the substructures shown in (a). 



An individual polypeptide in the a-keratin coiled 
coil has a relatively simple tertiary structure, domi- 
nated by an a-helical secondary structure with its heli- 
cal axis twisted in a left-handed superhelix. The 
intertwining of the two a-helical polypeptides is an ex- 
ample of quaternary structure. Coiled coils of this type 
are common structural elements in filamentous pro- 
teins and in the muscle protein myosin (see Fig. 5-27). 
The quaternary structure of a-keratin can be quite 
complex. Many coiled coils can be assembled into large 
supramolecular complexes, such as the arrangement of 
a-keratin to form the intermediate filament of hair 
(Fig. 4-10b). 

The strength of fibrous proteins is enhanced by co- 
valent cross-links between polypeptide chains in the 
multihelical "ropes" and between adjacent chains in a 
supramolecular assembly. In a-keratins, the cross-links 
stabilizing quaternary structure are disulfide bonds 
(Box 4-2). In the hardest and toughest a-keratins, such 
as those of rhinoceros horn, up to 18% of the residues 
are cysteines involved in disulfide bonds. 

Collagen Like the a-keratins, collagen has evolved to 
provide strength. It is found in connective tissue such as 
tendons, cartilage, the organic matrix of bone, and the 
cornea of the eye. The collagen helix is a unique sec- 
ondary structure Q<f> = -51°, \p = +153°) quite distinct 
from the a helix. It is left-handed and has three amino 
acid residues per turn (Fig. 4-11). Collagen is also a 




FIGURE 4-11 Structure of collagen. (Derived from PDB ID 1CGD) (a) The 
a chain of collagen has a repeating secondary structure unique to this pro- 
tein. The repeating tripeptide sequence Gly-X-Pro or Gly-X^-Hyp adopts 
a left-handed helical structure with three residues per turn. The repeating 
sequence used to generate this model is Gly-Pro-4-Hyp. (b) Space-filling 
model of the same a chain, (c) Three of these helices (shown here in gray 
blue, and purple) wrap around one another with a right-handed twist, 
(d) The three-stranded collagen superhelix shown from one end, in a ball- 
and-stick representation. Gly residues are shown in red. Glycine, because 
of its small size, is required at the tight junction where the three chains are 
in contact. The balls in this illustration do not represent the van der Waals 
radii of the individual atoms. The center of the three-stranded superhelix is 
not hollow, as it appears here, but very tightly packed. 
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Permanent Waving Is Biochemical Engineering 



When hair is exposed to moist heat, it can be stretched. 
At the molecular level, the a helices in the a-keratin of 
hair are stretched out until they arrive at the fully ex- 
tended jS conformation. On cooling they spontaneously 
revert to the a-helical conformation. The characteristic 
"stretchability" of a-keratins, and their numerous disul- 
fide cross-linkages, are the basis of permanent waving. 
The hair to be waved or curled is first bent around a form 
of appropriate shape. A solution of a reducing agent, usu- 
ally a compound containing a thiol or sulfhydryl group 
( — SH), is then applied with heat. The reducing agent 
cleaves the cross-linkages by reducing each disulfide 
bond to form two Cys residues. The moist heat breaks 
hydrogen bonds and causes the a-helical structure of the 
polypeptide chains to uncoil. After a time the reducing 
solution is removed, and an oxidizing agent is added to 
establish new disulfide bonds between pairs of Cys 
residues of adjacent polypeptide chains, but not the 
same pairs as before the treatment. After the hair is 
washed and cooled, the polypeptide chains revert to 



their a-helical conformation. The hair fibers now curl in 
the desired fashion because the new disulfide cross- 
linkages exert some torsion or twist on the bundles of a- 
helical coils in the hair fibers. The same process can be 
used to straighten hair that is naturally curly. A perma- 
nent wave (or hair straightening) is not truly permanent, 
because the hair grows; in the new hair replacing the old, 
the a-keratin has the natural pattern of disulfide bonds. 
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coiled coil, but one with distinct tertiary and quaternary 
structures: three separate polypeptides, called a chains 
(not to be confused with a helices), are supertwisted 
about each other (Fig. 4-1 lc). The superhelical twisting 
is right-handed in collagen, opposite in sense to the left- 
handed helix of the a chains. 

There are many types of vertebrate collagen. Typ- 
ically they contain about 35% Gly, 11% Ala, and 21% 
Pro and 4-Hyp (4-hydroxyproline, an uncommon 
amino acid; see Fig. 3-8a). The food product gelatin is 
derived from collagen; it has little nutritional value as 
a protein, because collagen is extremely low in many 
amino acids that are essential in the human diet. The 
unusual amino acid content of collagen is related to 
structural constraints unique to the collagen helix. 
The amino acid sequence in collagen is generally a re- 
peating tripeptide unit, Gly-X-Y, where X is often Pro, 
and Y is often 4-Hyp. Only Gly residues can be accom- 
modated at the very tight junctions between the indi- 
vidual a chains (Fig. 4-1 Id). The Pro and 4-Hyp 
residues permit the sharp twisting of the collagen he- 
lix. The amino acid sequence and the supertwisted 
quaternary structure of collagen allow a very close 
packing of its three polypeptides. 4-Hydroxyproline 
has a special role in the structure of collagen — and in 
human history (Box 4-3). 

The tight wrapping of the a chains in the collagen 
triple helix provides tensile strength greater than that 
of a steel wire of equal cross section. Collagen fibrils 
(Fig. 4-12) are supramolecular assemblies consisting 
of triple -helical collagen molecules (sometimes referred 
to as tropocollagen molecules) associated in a variety of 
ways to provide different degrees of tensile strength. 




Heads of collagen Cross-striations 
molecules 640 A (64 nm) 




Section of collagen 



FIGURE 4-12 Structure of collagen fibrils. Collagen (M r 300,000) is a 
rod-shaped molecule, about 3,000 A long and only 1 5 A thick. Its three 
helically intertwined a chains may have different sequences; each chain 
has about 1,000 amino acid residues. Collagen fibrils are made up of 
collagen molecules aligned in a staggered fashion and cross-linked for 
strength. The specific alignment and degree of cross-linking vary with 
the tissue and produce characteristic cross-striations in an electron mi- 
crograph. In the example shown here, alignment of the head groups of 
every fourth molecule produces striations 640 A (64 nm) apart. 
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MEDICINE 



Why Sailors, Explorers, and College Students Should Eat 
Their Fresh Fruits and Vegetables 



. . . from this misfortune, together with the unhealthi- 
ness of the country, where there never falls a drop of 
rain, we were stricken with the "camp-sickness," which 
was such that the flesh of our limbs all shrivelled up, and 
the skin of our legs became all blotched with black, 
mouldy patches, like an old jack-boot, and proud flesh 
came upon the gums of those of us who had the sick- 
ness, and none escaped from this sickness save through 
the jaws of death. The signal was this: when the nose be- 
gan to bleed, then death was at hand . . . 

— The Memoirs of the Lord of Joinville, ca. 1300 

This excerpt describes the plight of Louis IX's army 
toward the end of the Seventh Crusade (1248-1254), 
when the scurvy-weakened Crusader army was de- 
stroyed by the Egyptians. What was the nature of the 
malady afflicting these thirteenth-century soldiers? 

Scurvy is caused by lack of vitamin C, or ascorbic 
acid (ascorbate). Vitamin C is required for, among other 
things, the hydroxylation of proline and lysine in colla- 
gen; scurvy is a deficiency disease characterized by 
general degeneration of connective tissue. Manifesta- 
tions of advanced scurvy include numerous small hem- 
orrhages caused by fragile blood vessels, tooth loss, 
poor wound healing and the reopening of old wounds, 
bone pain and degeneration, and eventually heart fail- 
ure. Milder cases of vitamin C deficiency are accompa- 
nied by fatigue, irritability, and an increased severity of 
respiratory tract infections. Most animals make large 
amounts of vitamin C, converting glucose to ascorbate 
in four enzymatic steps. But in the course of evolution, 
humans and some other animals — gorillas, guinea pigs, 
and fruit bats — have lost the last enzyme in this path- 
way and must obtain ascorbate in their diet. Vitamin C 
is available in a wide range of fruits and vegetables. Un- 
til 1800, however, it was often absent in the dried foods 
and other food supplies stored for winter or for ex- 
tended travel. 

Scurvy was recorded by the Egyptians in 1500 bce, 
and it is described in the fifth century bce writings of 
Hippocrates. Yet it did not come to wide public notice 
until the European voyages of discovery from 1500 to 
1800. The first circumnavigation of the globe, led by 
Ferdinand Magellan (1520), was accomplished only 
with the loss of more than 80% of his crew to scurvy. 
During Jacques Cartier's second voyage to explore 
the St. Lawrence River (1535-1536), his band was 
threatened with complete disaster until the native 
Americans taught the men to make a cedar tea that 
cured and prevented scurvy (it contained vitamin C). 
Winter outbreaks of scurvy in Europe were gradually 




James Lind, 1716-1794; 
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1739-1 748 



eliminated in the nineteenth century as the cultivation 
of the potato, introduced from South America, became 
widespread. 

In 1747, James Lind, a Scottish surgeon in the Royal 
Navy, carried out the first controlled clinical study in 
recorded history. During an extended voyage on the 50- 
gun warship HMS Salisbury, Lind selected 12 sailors suf- 
fering from scurvy and separated them into groups of 
two. All 12 received the same diet, except that each group 
was given a different remedy for scurvy from among 
those recommended at the time. 
The sailors given lemons and or- 
anges recovered and returned to 
duty. The sailors given boiled ap- 
ple juice improved slightly The re- 
mainder continued to deteriorate. 
Lind's Treatise on the Scurvy was 
published in 1753, but inaction 
persisted in the Royal Navy for an- 
other 40 years. In 1795 the British 
admiralty finally mandated a ra- 
tion of concentrated lime or lemon 
juice for all British sailors (hence 
the name "limeys"). Scurvy continued to be a problem in 
some other parts of the world until 1932, when Hungarian 
scientist Albert Szent-Gyorgyi, and W. A. Waugh and C. G. 
King at the University of Pittsburgh, isolated and synthe- 
sized ascorbic acid. 

L-Ascorbic acid (vitamin C) is a white, odorless, 
crystalline powder. It is freely soluble in water and rela- 
tively insoluble in organic solvents. In a dry state, away 
from light, it is stable for a considerable length of time. 
The appropriate daily intake of this vitamin is still in 
dispute. The recommended daily allowance in the 
United States is 60 mg (Australia and the United 
Kingdom recommend 30 to 40 mg; Russia recommends 
100 mg). Along with citrus fruits and almost all other 
fresh fruits, other good sources of vitamin C include 
peppers, tomatoes, potatoes, and broccoli. The vitamin 
C of fruits and vegetables is destroyed by overcooking or 
prolonged storage. 

So why is ascorbate so necessary to good health? Of 
particular interest to us here is its role in the formation 
of collagen. As noted in the text, collagen is constructed 
of the repeating tripeptide unit Gly-X-Y, where X and Y 
are generally Pro or 4-Hyp — the proline derivative (4R)- 
L-hydroxyproline, which plays an essential role in the 
folding of collagen and in maintaining its structure. The 
proline ring is normally found as a mixture of two puck- 
ered conformations, called C 7 -endo and C r -exo (Fig. 1). 
The collagen helix structure requires the Pro residue in 
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FIGURE 1 The C y -endo conformation of proline and the C T -exo 
conformation of 4-hydroxyproline. 

the Y positions to be in the C r -exo conformation, and it 
is this conformation that is enforced by the hydroxyl 
substitution at C-4 in 4-Hyp. The collagen structure also 
requires that the Pro residue in the X positions have the 
C 7 -endo conformation, and introduction of 4-Hyp here 
can destabilize the helix. In the absence of vitamin C, 
cells cannot hydroxylate the Pro at the Y positions. This 
leads to collagen instability and the connective tissue 
problems seen in scurvy. 

The hydroxylation of specific Pro residues in procol- 
lagen, the precursor of collagen, requires the action of 
the enzyme prolyl 4-hydroxylase. This enzyme (Af r 
240,000) is an a 2 ^2 tetramer in all vertebrate sources. 
The proline-hydroxylating activity is found in the a sub- 
units. Each a subunit contains one atom of nonheme 
iron (Fe 2+ ), and the enzyme is one of a class of hydrox- 
ylases that require a-ketoglutarate in their reactions. 



In the normal prolyl 4-hydroxylase reaction (Fig. 2a) , 
one molecule of a-ketoglutarate and one of 0 2 bind to 
the enzyme. The a-ketoglutarate is oxidatively decar- 
boxylated to form C0 2 and succinate. The remaining 
oxygen atom is then used to hydroxylate an appropriate 
Pro residue in procollagen. No ascorbate is needed in 
this reaction. However, prolyl 4-hydroxylase also cat- 
alyzes an oxidative decarboxylation of a-ketoglutarate 
that is not coupled to proline hydroxylation (Fig. 2b). 
During this reaction the heme Fe 2+ becomes oxidized, 
inactivating the enzyme and preventing the proline hy- 
droxylation. The ascorbate consumed in the reaction is 
needed to restore enzyme activity — by reducing the 
heme iron. 

Scurvy remains a problem today, not only in remote 
regions where nutritious food is scarce but, surpris- 
ingly, on U.S. college campuses. The only vegetables 
consumed by some students are those in tossed salads, 
and days go by without these young adults consuming 
fruit. A 1998 study of 230 students at Arizona State Uni- 
versity revealed that 10% had serious vitamin C defi- 
ciencies, and 2 students had vitamin C levels so low that 
they probably had scurvy. Only half the students in the 
study consumed the recommended daily allowance of 
vitamin C. 

Eat your fresh fruit and vegetables. 
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FIGURE 2 Reactions catalyzed by prolyl 4-hydroxylase. (a) The normal reaction, coupled to proline hy- 
droxylation, which does not require ascorbate. The fate of the two oxygen atoms from 0 2 is shown in red. 
(b) The uncoupled reaction, in which a-ketoglutarate is oxidatively decarboxylated without hydroxylation 
of proline. Ascorbate is consumed stoichiometrically in this process as it is converted to dehydroascorbate. 
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The a chains of collagen molecules and the collagen 
molecules of fibrils are cross-linked by unusual types of 
covalent bonds involving Lys, HyLys (5-hydroxylysine; 
see Fig. 3-8a), or His residues that are present at a few 
of the X and Y positions. These links create uncommon 
amino acid residues such as dehydrohydroxylysinonor- 
leucine. The increasingly rigid and brittle character of 
aging connective tissue results from accumulated covalent 
cross-links in collagen fibrils. 



H— N 



/ 



!CH— CH 2 — CH 2 — CH 2 — CH=N— CH.v 



o=c 



Polypeptide Lys residue 
chain minus £-amino 

group (norleucine) 



OH 

HyLys 
residue 



N— H 



C=0 



Polypeptide 
chain 



Dehydrohydroxylysinonorleucine 

A typical mammal has more than 30 structural 
variants of collagen, particular to certain tissues 
and each somewhat different in sequence and func- 
tion. Some human genetic defects in collagen structure 
illustrate the close relationship between amino acid se- 
quence and three-dimensional structure in this pro- 
tein. Osteogenesis imperfecta is characterized by 
abnormal bone formation in babies; Ehlers-Danlos syn- 



drome is characterized by loose joints. Both conditions 
can be lethal, and both result from the substitution of 
an amino acid residue with a larger R group (such as 
Cys or Ser) for a single Gly residue in each a chain (a 
different Gly residue in each disorder). These single- 
residue substitutions have a catastrophic effect on col- 
lagen function because they disrupt the Gly-X-Y 
repeat that gives collagen its unique helical structure. 
Given its role in the collagen triple helix (Fig. 4-1 Id), 
Gly cannot be replaced by another amino acid residue 
without substantial deleterious effects on collagen 
structure. ■ 

Silk Fibroin Fibroin, the protein of silk, is produced by 
insects and spiders. Its polypeptide chains are predomi- 
nantly in the /3 conformation. Fibroin is rich in Ala and Gly 
residues, permitting a close packing of f3 sheets and an in- 
terlocking arrangement of Rgroups (Fig. 4-13). The over- 
all structure is stabilized by extensive hydrogen bonding 
between all peptide linkages in the polypeptides of each j3 
sheet and by the optimization of van der Waals interactions 
between sheets. Silk does not stretch, because the /3 con- 
formation is already highly extended (Fig. 4-6). However, 
the structure is flexible because the sheets are held to- 
gether by numerous weak interactions rather than by co- 
valent bonds such as the disulfide bonds in a-keratins. 
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FIGURE 4-13 Structure of silk. The fibers in silk cloth and in a spider 
web are made up of the protein fibroin, (a) Fibroin consists of layers of 
antiparallel sheets rich in Ala and Gly residues. The small side chains 
interdigitate and allow close packing of the sheets, as shown in the ball 
and stick view, (b) Strands of fibroin (blue) emerge from the spinnerets 
of a spider in this colorized electron micrograph. 
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I0X4-4 The Protein Data Bank 



The number of known three-dimensional protein struc- 
tures is now in the tens of thousands and more than 
doubles every couple of years. This wealth of informa- 
tion is revolutionizing our understanding of protein 
structure, the relation of structure to function, and the 
evolutionary paths by which proteins arrived at their 
present state, which can be seen in the family resem- 
blances that come to light as protein databases are 
sifted and sorted. One of the most important resources 
available to biochemists is the Protein Data Bank 
(PDB; www.rcsb.org). 

The PDB is an archive of experimentally deter- 
mined three-dimensional structures of biological 
macromolecules, containing virtually all of the macro- 
molecular structures (proteins, RNAs, DNAs, etc.) 
elucidated to date. Each structure is assigned an iden- 
tifying label (a four-character identifier called the PDB 



ID). Such labels are provided in the figure legends for 
every PDB-derived structure illustrated in this text so 
that students and instructors can explore the same 
structures on their own. The data files in the PDB 
describe the spatial coordinates of each atom whose 
position has been determined (many of the cataloged 
structures are not complete). Additional data files 
provide information on how the structure was deter- 
mined and its accuracy. The atomic coordinates can be 
converted into an image of the macromolecule using 
structure visualization software. Students are encour- 
aged to access the PDB and explore structures using 
visualization software linked to the database. Macro- 
molecular structure files can also be downloaded and 
explored on the desktop using free software such as 
RasMol, Protein Explorer, or FirstGlance in Jmol, avail- 
able at www.umass.edu/microbio/rasmol. 



Structural Diversity Reflects Functional Diversity in 
Globular Proteins 

In a globular protein, different segments of the polypep- 
tide chain (or multiple polypeptide chains) fold back on 
each other, generating a more compact shape than is 
seen in the fibrous proteins (Fig. 4-14). The folding 
also provides the structural diversity necessary for pro- 
teins to carry out a wide array of biological functions. 
Globular proteins include enzymes, transport proteins, 
motor proteins, regulatory proteins, immunoglobulins, 
and proteins with many other functions. 

Our discussion of globular proteins begins with the 
principles gleaned from the first protein structures to be 
elucidated. This is followed by a detailed description of 
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FIGURE 4-14 Globular protein structures are compact and varied. 
Human serum albumin (M r 64,500) has 585 residues in a single chain. 
Given here are the approximate dimensions its single polypeptide 
chain would have if it occurred entirely in extended /3 conformation or 
as an a helix. Also shown is the size of the protein in its native globu- 
lar form, as determined by x-ray crystallography; the polypeptide chain 
must be very compactly folded to fit into these dimensions. 



protein substructure and comparative categorization. 
Such discussions are possible only because of the vast 
amount of information available over the Internet from 
publicly accessible databases, particularly the Protein 
Data Bank (Box 4-4). 

Myoglobin Provided Early Clues about the Complexity of 
Globular Protein Structure 

£ Protein Architecture-Tertiary Structure of Small Globular 

Proteins, ii. Myoglobin The first breakthrough in under- 
standing the three-dimensional structure of a globular 
protein came from x-ray diffraction studies of myoglobin 
carried out by John Kendrew and his colleagues in the 
1950s. Myoglobin is a relatively small (M T 16,700), 
oxygen-binding protein of muscle cells. It functions both 
to store oxygen and to facilitate oxygen diffusion in rap- 
idly contracting muscle tissue. Myoglobin contains a sin- 
gle polypeptide chain of 153 amino acid residues of 
known sequence and a single iron protoporphyrin, or 
heme, group. The same heme group that is found in 
myoglobin is found in hemoglobin, the oxygen-binding 
protein of erythrocytes, and is responsible for the deep 
red-brown color of both myoglobin and hemoglobin. 
Myoglobin is particularly abundant in the muscles of div- 
ing mammals such as the whale, seal, and porpoise — so 
abundant that the muscles of these animals are brown. 
Storage and distribution of oxygen by muscle myoglobin 
permits diving mammals to remain submerged for long 
periods. The activities of myoglobin and other globin 
molecules are investigated in greater detail in Chapter 5. 

Figure 4-15 shows several structural representa- 
tions of myoglobin, illustrating how the polypeptide 
chain is folded in three dimensions— its tertiary struc- 
ture. The red group surrounded by protein is heme. The 
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FIGURE 4-15 Tertiary structure of sperm whale myoglobin. (PDB ID 
1MBO) Orientation of the protein is similar in (a) through (d); the 
heme group is shown in red. In addition to illustrating the myoglobin 
structure, this figure provides examples of several different ways to dis- 
play protein structure, (a) The polypeptide backbone in a ribbon 
representation of a type introduced by Jane Richardson, which high- 
lights regions of secondary structure. The a-helical regions are evident. 




(b) Surface contour image; this is useful for visualizing pockets in the 
protein where other molecules might bind, (c) Ribbon representation 
including side chains (blue) for the hydrophobic residues Leu, lie, Vat 
and Phe. (d) Space-filling model with all amino acid side chains. Each 
atom is represented by a sphere encompassing its van der Waals 
radius. The hydrophobic residues are again shown in blue; most are 
buried in the interior of the protein and thus not visible. 



backbone of the myoglobin molecule consists of eight 
relatively straight segments of a helix interrupted by 
bends, some of which are /3 turns. The longest a helix 
has 23 amino acid residues and the shortest only 7; all 
helices are right-handed. More than 70% of the residues 
in myoglobin are in these a-helical regions. X-ray analy- 
sis has revealed the precise position of each of the 
R groups, which occupy nearly all the space within the 
folded chain. 

Many important conclusions were drawn from the 
structure of myoglobin. The positioning of amino acid 
side chains reflects a structure that derives much of its 
stability from hydrophobic interactions. Most of the hy- 
drophobic R groups are in the interior of the molecule, 
hidden from exposure to water. All but two of the polar 
R groups are located on the outer surface of the mole- 
cule, and all are hydrated. The myoglobin molecule is so 
compact that its interior has room for only four mole- 
cules of water. This dense hydrophobic core is typical of 
globular proteins. The fraction of space occupied by 
atoms in an organic liquid is 0.4 to 0.6. In a globular 
protein the fraction is about 0.75, comparable to that in 
a crystal (in a typical crystal the fraction is 0.70 to 0.78, 
near the theoretical maximum). In this packed environ- 
ment, weak interactions strengthen and reinforce each 
other. For example, the nonpolar side chains in the core 
are so close together that short-range van der Waals in- 
teractions make a significant contribution to stabilizing 
hydrophobic interactions. 

Deduction of the structure of myoglobin confirmed 
some expectations and introduced some new elements 
of secondary structure. As predicted by Pauling and 
Corey, all the peptide bonds are in the planar trans con- 
figuration. The a helices in myoglobin provided the first 
direct experimental evidence for the existence of this 
type of secondary structure. Three of the four Pro 
residues are found at bends. The fourth Pro residue oc- 
curs within an a helix, where it creates a kink necessary 
for tight helix packing. 



The flat heme group rests in a crevice, or pocket, 
in the myoglobin molecule. The iron atom in the center 
of the heme group has two bonding (coordination) 
positions perpendicular to the plane of the heme 
(Fig. 4-16). One of these is bound to the R group of 
the His residue at position 93; the other is the site at 
which an 0 2 molecule binds. Within this pocket, the 
accessibility of the heme group to solvent is highly 
restricted. This is important for function, because free 
heme groups in an oxygenated solution are rapidly 
oxidized from the ferrous (Fe 2+ ) form, which is active 
in the reversible binding of 0 2 , to the ferric (Fe 3+ ) 
form, which does not bind 0 2 . 
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FIGURE 4-16 The heme group. This group is present in myoglobin, he- 
moglobin, cytochromes, and many other proteins (the heme proteins), 
(a) Heme consists of a complex organic ring structure, protoporphyrin, 
which binds an iron atom in its ferrous (Fe 2+ ) state. The iron atom has 
six coordination bonds, four in the plane of, and bonded to, the flat 
porphyrin molecule and two perpendicular to it. (b) In myoglobin and 
hemoglobin, one of the perpendicular coordination bonds is bound to 
a nitrogen atom of a His residue. The other is "open" and serves as the 
binding site for an 0 2 molecule. 
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As many different myoglobin structures were re- 
solved, investigators were able to observe the structural 
changes that accompany the binding of oxygen or other 
molecules and thus, for the first time, to understand the 
correlation between protein structure and function. 
Hundreds of proteins have now been subjected to simi- 
lar analysis. Today, nuclear magnetic resonance (NMR) 
spectroscopy and other techniques supplement x-ray 
diffraction data, providing more information on a pro- 
tein's structure (Box 4-5, p. 132). In addition, the se- 
quencing of the genomic DNA of many organisms 
(Chapter 9) has identified thousands of genes that en- 
code proteins of known sequence but, as yet, unknown 
function; this work continues apace. 

Globular Proteins Have a Variety of 
Tertiary Structures 

From what we now know about the tertiary structures of 
hundreds of globular proteins, it is clear that myoglobin 
illustrates just one of many ways in which a polypeptide 
chain can be folded. Table 4-3 shows the proportions of 
a helix and j3 conformations (expressed as percentage 
of residues in each type) in several small, single-chain, 
globular proteins. Each of these proteins has a distinct 
structure, adapted for its particular biological function, 
but together they share several important properties 
with myoglobin. Each is folded compactly, and in each 
case the hydrophobic amino acid side chains are ori- 
ented toward the interior (away from water) and the hy- 
drophilic side chains are on the surface. The structures 
are also stabilized by a multitude of hydrogen bonds and 
some ionic interactions. 

For the beginning student, the very complex terti- 
ary structures of globular proteins — some much larger 
than myoglobin — are best approached by focusing on 
common structural patterns, recurring in different and 



often unrelated proteins. The three-dimensional struc- 
ture of a typical globular protein can be considered an 
assemblage of polypeptide segments in the a-helical and 
/3-sheet conformations, linked by connecting segments. 
The structure can then be defined by how these seg- 
ments stack on one another and how the segments that 
connect them are arranged. 

To understand a complete three-dimensional struc- 
ture, we need to analyze its folding patterns. We begin 
by defining two important terms that describe protein 
structural patterns or elements in a polypeptide chain, 
and then turn to the folding rules. 

The first term is motif, also called a supersec- 
ondary structure or fold. A motif is simply a recog- 
nizable folding pattern involving two or more elements 
of secondary structure and the connection(s) between 
them. Although there is some confusing application of 
these three terms in the literature, they are generally 
used interchangeably. A motif can be very simple, such 
as two elements of secondary structure folded against 
each other, and represent only a small part of a protein. 
An example is a p-a-p loop (Fig. 4-17a). A motif can 
also be a very elaborate structure involving scores of 
protein segments folded together, such as the j8 barrel 
(Fig. 4-1 7b). In some cases, a single large motif may 
comprise the entire protein. The term encompasses any 
advantageous folding pattern and is useful for describ- 
ing such patterns. The segment defined as a motif may 
or may not be independently stable. We have already 
encountered one well-studied motif, the coiled coil of 
a-keratin, which is also found in some other proteins. 
Note that a motif is not a hierarchical structural ele- 
ment falling between secondary and tertiary structure. 
It is a folding pattern that can describe a small part of a 
protein or an entire polypeptide chain. The synony- 
mous term "supersecondary structure" is thus some- 
what misleading because it suggests hierarchy. 



TABLE 4-3 



Approximate Proportion of a Helix and 
f$ Conformation in Some Single-Chain 
Proteins 

Residues (%)* 



Protein (total residues) a Helix /3 Conformation 



Chymotrypsin (247) 


14 


45 


Ribonuclease (124) 


26 


35 


Carboxypeptidase (307) 


38 


17 


Cytochrome c (104) 


39 


0 


Lysozyme (129) 


40 


12 


Myoglobin (153) 


78 


0 



Source: Data from Cantor, C.R. & Schimmel, RR. (1980) Biophysical Chemistry, Part I: 
The Conformation of Biological Macromolecules, p. 100, W. H. Freeman and Company, 
New York 

♦Portions of the polypeptide chains not accounted for by a helix or p conformation 
consist of bends and irregularly coiled or extended stretches. Segments of a helix and 
p conformation sometimes deviate slightly from their normal dimensions and geometry. 





(a) p-a-/3 Loop (b) P Barrel 

FIGURE 4-17 Motifs, (a) A simple motif, the p-a-p loop, (b) A more 
elaborate motif, the p barrel. This p barrel is a single domain of 
a-hemolysin (a toxin that kills a cell by creating a hole in its 
membrane) from the bacterium Staphylococcus aureus (derived 
from PDB ID 7AHL). 
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BOX 4-5 




Methods for Determining the Three-Dimensional Structure 
of a Protein 




(a) 



X-Ray Diffraction 

The spacing of atoms in a crystal lattice can be deter- 
mined by measuring the locations and intensities of spots 
produced on photographic film by a beam of x rays of 
given wavelength, after the beam has been diffracted by 
the electrons of the atoms. For example, x-ray analysis of 
sodium chloride crystals shows that Na + and GT ions are 
arranged in a simple cubic lattice. The spacing of the dif- 
ferent kinds of atoms in complex organic molecules, even 
very large ones such as proteins, can also be analyzed by 
x-ray diffraction methods. However, the technique for an- 
alyzing crystals of complex molecules is far more labori- 
ous than for simple salt crystals. When the repeating 
pattern of the crystal is a molecule as large as, say, a pro- 
tein, the numerous atoms in the molecule yield thousands 
of diffraction spots that must be analyzed by computer. 

Consider how images are generated in a light mi- 
croscope. Light from a point source is focused on an ob- 
ject. The object scatters the light waves, and these 
scattered waves are recombined by a series of lenses to 
generate an enlarged image of the object. The smallest 
object whose structure can be determined by such a 
system— that is, the resolving power of the micro- 
scope—is determined by the wavelength of the light, in 
this case visible light, with wavelengths in the range of 
400 to 700 nm. Objects smaller than half the wave- 
length of the incident light cannot be resolved. To re- 
solve objects as small as proteins we must use x rays, 
with wavelengths in the range of 0.7 to 1.5 A (0.07 to 
0.15 nm). However, there are no lenses that can recom- 
bine x rays to form an image; instead, the pattern of 
diffracted x rays is collected directly and an image is re- 
constructed by mathematical techniques. 

The amount of information obtained from x-ray 
crystallography depends on the degree of structural or- 
der in the sample. Some important structural parame- 



ters were obtained from early studies of the diffraction 
patterns of the fibrous proteins arranged in regular ar- 
rays in hair and wool. However, the orderly bundles 
formed by fibrous proteins are not crystals— the mole- 
cules are aligned side by side, but not all are oriented in 
the same direction. More detailed three-dimensional 
structural information about proteins requires a highly 
ordered protein crystal. The structures of many proteins 
are not yet known, simply because they have proved dif- 
ficult to crystallize. Practitioners have compared making 
protein crystals to holding together a stack of bowling 
balls with cellophane tape. 

Operationally, there are several steps in x-ray struc- 
tural analysis (Fig. 1). A crystal is placed in an x-ray 
beam between the x-ray source and a detector, and a 
regular array of spots called reflections is generated. 
The spots are created by the diffracted x-ray beam, and 
each atom in a molecule makes a contribution to each 
spot. An electron-density map of the protein is recon- 
structed from the overall diffraction pattern of spots by 
a mathematical technique called a Fourier transform. In 
effect, the computer acts as a "computational lens." A 
model for the structure is then built that is consistent 
with the electron-density map. 

John Kendrew found that the x-ray diffraction 
pattern of crystalline myoglobin (isolated from muscles 
of the sperm whale) is very complex, with nearly 25,000 
reflections. Computer analysis of these reflections took 
place in stages. The resolution improved at each 
stage, until in 1959 the positions of virtually all the non- 
hydrogen atoms in the protein had been determined. 
The amino acid sequence of the protein, obtained by 
chemical analysis, was consistent with the molecular 
model. The structures of thousands of proteins, many of 
them much more complex than myoglobin, have since 
been determined to a similar level of resolution. 
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The physical environment in a crystal, of course, is 
not identical to that in solution or in a living cell. A crys- 
tal imposes a space and time average on the structure 
deduced from its analysis, and x-ray diffraction studies 
provide little information about molecular motion within 
the protein. The conformation of proteins in a crystal 
could in principle also be affected by nonphysiological 
factors such as incidental protein-protein contacts 
within the crystal. However, when structures derived 
from the analysis of crystals are compared with struc- 
tural information obtained by other means (such as 
NMR, as described below), the crystal-derived structure 
almost always represents a functional conformation of 
the protein. X-ray crystallography can be applied suc- 
cessfully to proteins too large to be structurally ana- 
lyzed by NMR. 

Nuclear Magnetic Resonance 

An advantage of nuclear magnetic resonance (NMR) 
studies is that they are carried out on macromolecules 
in solution, whereas x-ray crystallography is limited to 
molecules that can be crystallized. NMR can also illumi- 
nate the dynamic side of protein structure, including 
conformational changes, protein folding, and interac- 
tions with other molecules. 

NMR is a manifestation of nuclear spin angular 
momentum, a quantum mechanical property of atomic 
nuclei. Only certain atoms, including 1 H, 13 C, 15 N, 19 F, 
and 31 P, have the kind of nuclear spin that gives rise to 
an NMR signal. Nuclear spin generates a magnetic 
dipole. When a strong, static magnetic field is applied to a 
solution containing a single type of macromolecule, the 
magnetic dipoles are aligned in the field in one of two ori- 
entations, parallel (low energy) or antiparallel (high en- 
ergy). A short (—10 jjls) pulse of electromagnetic energy 
of suitable frequency (the resonant frequency, which is in 



FIGURE 1 Steps in determining the structure of sperm whale myoglo- 
bin by x-ray crystallography, (a) X-ray diffraction patterns are generated 
from a crystal of the protein, (b) Data extracted from the diffraction 
patterns are used to calculate a three-dimensional electron-density 
map. The electron density of only part of the structure, the heme, is 
shown, (c) Regions of greatest electron density reveal the location of 
atomic nuclei, and this information is used to piece together the final 
structure. Here, the heme structure is modeled into its electron-density 
map. (d) The completed structure of sperm whale myoglobin, includ- 
ing the heme (PDB ID 2MBW). 



the radio frequency range) is applied at right angles to the 
nuclei aligned in the magnetic field. Some energy is ab- 
sorbed as nuclei switch to the high-energy state, and the 
absorption spectrum that results contains information 
about the identity of the nuclei and their immediate 
chemical environment. The data from many such experi- 
ments on a sample are averaged, increasing the signal-to- 
noise ratio, and an NMR spectrum such as that in Figure 2 
is generated. 

X H is particularly important in NMR experiments be- 
cause of its high sensitivity and natural abundance. For 
macromolecules, *H NMR spectra can become quite com- 
plicated. Even a small protein has hundreds of X H atoms, 
typically resulting in a one-dimensional NMR spectrum 
too complex for analysis. Structural analysis of proteins 
became possible with the advent of two-dimensional 
NMR techniques (Fig. 3). These methods allow measure- 
ment of distance-dependent coupling of nuclear spins in 
nearby atoms through space (the nuclear Overhauser ef- 
fect (NOE), in a method dubbed NOESY) or the coupling 
of nuclear spins in atoms connected by covalent bonds 
(total correlation spectroscopy, or TOCSY). 

(continued on next page) 
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FIGURE 2 One-dimensional NMR spectrum of a globin from a marine 
biood worm. This protein and sperm whale myoglobin are very close 
structural analogs, belonging to the same protein structural family and 
sharing an oxygen-transport function. 
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BOX 4-5 



METHODS 



ethods for Determining the Three-Dimensional Structure of a 
Protein [continued from previous page) 



Translating a two-dimensional NMR spectrum into a 
complete three-dimensional structure can be a labori- 
ous process. The NOE signals provide some information 
about the distances between individual atoms, but for 
these distance constraints to be useful, the atoms giving 
rise to each signal must be identified. Complementary 
TOCSY experiments can help identify which NOE sig- 
nals reflect atoms that are linked by covalent bonds. 
Certain patterns of NOE signals have been associated 
with secondary structures such as a helices. Modern ge- 
netic engineering (Chapter 9) can be used to prepare 
proteins that contain the rare isotopes 13 C or 15 N. The 
new NMR signals produced by these atoms, and the 
coupling with X H signals resulting from these substitu- 
tions, help in the assignment of individual *H NOE sig- 
nals. The process is also aided by a knowledge of the 
amino acid sequence of the polypeptide. 

To generate a three-dimensional structure, re- 
searchers feed the distance constraints into a com- 



puter along with known geometric constraints such 
as chirality, van der Waals radii, and bond lengths 
and angles. The computer generates a family of 
closely related structures that represent the range 
of conformations consistent with the NOE distance 
constraints (Fig. 3c). The uncertainty in structures 
generated by NMR is in part a reflection of the mo- 
lecular vibrations (known as breathing) within a pro- 
tein structure in solution, discussed in more detail in 
Chapter 5. Normal experimental uncertainty can 
also play a role. 

Protein structures determined by both x-ray crystal- 
lography and NMR generally agree well. In some cases, 
the precise locations of particular amino acid side chains 
on the protein exterior are different, often because of ef- 
fects related to the packing of adjacent protein mole- 
cules in a crystal. The two techniques together are at the 
heart of the rapid increase in the availability of structural 
information about the macromolecules of living cells. 




FIGURE 3 Use of two-dimensional NMR to generate a three-dimensional 
structure of a globin, the same protein used to generate the data in Fig- 
ure 2. The diagonal in a two-dimensional NMR spectrum is equivalent to 
a one-dimensional spectrum. The off-diagonal peaks are NOE signals 
generated by close-range interactions of 1 H atoms that may generate sig- 
nals quite distant in the one-dimensional spectrum. Two such interac- 
tions are identified in (a), and their identities are shown with blue lines 
in (b) (PDB ID 1 VRF). Three lines are drawn for interaction 2 between a 
methyl group in the protein and a hydrogen on the heme. The methyl 
group rotates rapidly such that each of its three hydrogens contributes 
equally to the interaction and the NMR signal. Such information is used 
to determine the complete three-dimensional structure (PDB ID 1VRE), 
as in (c). The multiple lines shown for the protein backbone in (c) repre- 
sent the family of structures consistent with the distance constraints in 
the NMR data. The structural similarity with myoglobin (Fig. 1) is evi- 
dent. The proteins are oriented in the same way in both figures. 
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FIGURE 4-18 Structural domains in the polypeptide troponin C. (PDB 
ID 4TNC) This calcium-binding protein associated with muscle has 
two separate calcium-binding domains, indicated in blue and purple. 

The second term for describing structural patterns is 
domain. A domain, as defined by Jane Richardson in 
1981, is a part of a polypeptide chain that is independ- 
ently stable or could undergo movements as a single en- 
tity with respect to the entire protein. Polypeptides with 
more than a few hundred amino acid residues often fold 
into two or more domains, sometimes with different func- 
tions. In many cases, a domain from a large protein will 
retain its native three-dimensional structure even when 
separated (for example, by proteolytic cleavage) from the 
remainder of the polypeptide chain. In a protein with 
multiple domains, each domain may appear as a distinct 
globular lobe (Fig. 4-18); more commonly, extensive 
contacts between domains make individual domains hard 
to discern. Different domains often have distinct func- 
tions, such as the binding of small molecules or interac- 
tion with other proteins. Small proteins usually have only 
one domain (the domain is the protein). 

Folding of polypeptides is subject to an array of phys- 
ical and chemical constraints, and several rules have 
emerged from studies of common protein folding patterns. 

1 . Hydrophobic interactions make a large contribu- 
tion to the stability of protein structures. Burial of 
hydrophobic amino acid R groups so as to exclude 
water requires at least two layers of secondary 
structure. Simple motifs, such as the f3-a-(3 loop 
(Fig. 4-17a), create two such layers. 

2. Where they occur together in a protein, a helices 
and /3 sheets generally are found in different 
structural layers. This is because the backbone of 
a polypeptide segment in the /3 conformation 
(Fig. 4-6) cannot readily hydrogen-bond to an 

a helix aligned with it. 

3. Segments adjacent to each other in the amino acid 
sequence are usually stacked adjacent to each 
other in the folded structure. Distant segments of 
a polypeptide may come together in the tertiary 
structure, but this is not the norm. 

4. Connections between common elements of 
secondary structure cannot cross or form knots 

(Fig. 4-19a). 

5. The /3 conformation is most stable when the 
individual segments are twisted slightly in a right- 



handed sense. This influences both the arrange- 
ment of /3 sheets relative to one another and the 
path of the polypeptide connections between them. 
Two parallel /3 strands, for example, must be 
connected by a crossover strand (Fig. 4-1 9b). 
In principle, this crossover could have a right- or 
left-handed conformation, but in proteins it is 
almost always right-handed. Right-handed 
connections tend to be shorter than left-handed 
connections and tend to bend through smaller 
angles, making them easier to form. The twisting 
of p sheets also leads to a characteristic twisting 
of the structure formed by many such segments 
together, as seen in the jS barrel (4-1 7b) and 
twisted p sheet (Fig. 4-19c), which form the core 
of many larger structures. 

Following these rules, complex motifs can be built up 
from simple ones. For example, a series of /3-a-£ loops 
arranged so that the /3 strands form a barrel creates a 
particularly stable and common motif, the a/fi barrel 





(a) Typical connections 
in an all-/3 motif 



Crossover connection 
(not observed) 





(b) Right-handed connection 
between p strands 



Left-handed connection 
between p strands 
(very rare) 




(c) 



Twisted p sheet 



FIGURE 4-19 Stable folding patterns in proteins, (a) Connections 
between p strands in layered p sheets. The strands here are viewed 
from one end, with no twisting. Thick lines represent connections at 
the ends nearest the viewer; thin lines are connections at the far ends 
of the p strands. The connections at a given end (e.g., near the viewer) 
do not cross one other, (b) Because of the right-handed twist in 
P strands, connections between strands are generally right-handed. Left- 
handed connections must traverse sharper angles and are harder to 
form, (c) This twisted p sheet is from a domain of photolyase (a protein 
that repairs certain types of DNA damage) from E. coli (derived from 
PDB ID 1 DNP). Connecting loops have been removed so as to focus 
on the folding of the p sheet. 
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P-a-fi Loop alp Barrel 



FIGURE 4-20 Constructing large motifs from smaller ones. The alp 

barrel is a commonly occurring motif constructed from repetitions of 
the P-a-p loop motif. This a/p barrel is a domain of pyruvate kinase (a 
glycolytic enzyme) from rabbit (derived from PDB ID 1 PKN). 

(Fig. 4-20). In this structure, each parallel /3 segment 
is attached to its neighbor by an a-helical segment. All 
connections are right-handed. The a/p barrel is found in 
many enzymes, often with a binding site (for a cof actor 
or substrate) in the form of a pocket near one end of the 
barrel. Note that domains with similar folding patterns 
are said to have the same motif even though their con- 
stituent a helices and /3 sheets may differ in length. 

Protein Motifs Are the Basis for Protein 
Structural Classification 

|) Protein Architecture-Tertiary Structure of Large Globular Pro- 
teins, IV. Structural Classification of Proteins As we have seen, 
understanding the complexities of tertiary structure is 
made easier by considering substructures. Taking this 
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FIGURE 4-21 Organization of proteins based on motifs. Shown 
here are just a few of the hundreds of known stable motifs. They are 
divided into four classes: all a, all p, a/p, and a + p. Structural 
classification data from the SCOP (Structural Classification of Pro- 
teins) database (http://scop.mrc-lmb.cam.ac.uk/scop) are also pro- 



idea further, researchers have organized the complete 
contents of protein databases according to hierarchical 
levels of structure. All of these databases rely on data 
and information deposited in the Protein Data Bank. 
The Structural Classification of Proteins (SCOP) data- 
base is a good example of this important trend in bio- 
chemistry. At the highest level of classification, the 
SCOP database (http://scop.mrc-lmb.cam.ac.uk/scop) 
borrows a scheme already in common use, with four 
classes of protein structure: all a, all p, a/p (with a 
and (3 segments interspersed or alternating), and a + 
P (with a and /3 regions somewhat segregated). Each 
class includes tens to hundreds of different folding 
arrangements (motifs), built up from increasingly 
identifiable substructures. Some of the substructure 
arrangements are very common, others have been 
found in just one protein. Figure 4-21 shows a variety 
of motifs arrayed among the four classes of protein 
structure; this is just a minute sample of the hundreds 
of known motifs. The number of folding patterns is not 
infinite, however. As the rate at which new protein 
structures are elucidated has increased, the fraction of 
those structures containing a new motif has steadily 
declined. Fewer than 1,000 different folds or motifs 
may exist in all. Figure 4-21 also shows how proteins 
can be organized based on the presence of the various 
motifs. The top two levels of organization, class and 
fold, are purely structural. Below the fold level (see 
color key in Fig. 4-21), categorization is based on evo- 
lutionary relationships. 

Many examples of recurring domain or motif struc- 
tures are available, and these reveal that protein terti- 
ary structure is more reliably conserved than amino 
acid sequence. The comparison of protein structures 
can thus provide much information about evolution. 
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vided (see the color key). The PDB identifier (listed first for each 
structure) is the unique accession code given to each structure 
archived in the Protein Data Bank (www.rcsb.org). The a/p barrel 
(see Fig. 4-20) is another particularly common a/p motif. 

(Figure continues on facing page.) 
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Proteins with significant similarity in primary structure 
and/or with similar tertiary structure and function are 
said to be in the same protein family. A strong evolu- 
tionary relationship is usually evident within a protein 
family. For example, the globin family has many differ- 
ent proteins with both structural and sequence similar- 
ity to myoglobin (as seen in the proteins used as 
examples in Box 4-5 and in Chapter 5). Two or more 
families with little similarity in amino acid sequence 
sometimes make use of the same major structural mo- 
tif and have functional similarities; these families are 
grouped as superfamilies. An evolutionary relationship 
among families in a superfamily is considered probable, 
even though time and functional distinctions— that is, 
different adaptive pressures — may have erased many 
of the telltale sequence relationships. A protein family 
may be widespread in all three domains of cellular life, 
the Bacteria, Archaea, and Eukarya, suggesting a very 
ancient origin. Other families may be present in only a 
small group of organisms, indicating that the structure 
arose more recently. Tracing the natural history of 
structural motifs, using structural classifications in 
databases such as SCOP, provides a powerful comple- 
ment to sequence analyses in tracing evolutionary re- 
lationships. The SCOP database is curated manually, 
with the objective of placing proteins in the correct 
evolutionary framework based on conserved struc- 
tural features. 

Structural motifs become especially important in 
defining protein families and superfamilies. Improved 
classification and comparison systems for proteins lead 
inevitably to the elucidation of new functional relation- 
ships. Given the central role of proteins in living sys- 
tems, these structural comparisons can help illuminate 
every aspect of biochemistry, from the evolution of indi- 
vidual proteins to the evolutionary history of complete 
metabolic pathways. 

Several online databases and resources comple- 
ment the SCOP database for analysis of protein 
structure. The CATH (Class, Architecture, Topology, 
and Homologous Superfamily) database arranges the 
proteins in the PDB in a four-level hierarchy Other pro- 
grams allow the user to input the structure of a protein 
of interest and then find all the proteins in the PDB that 
are structurally similar to this protein or some part of it. 
These programs include VAST (Vector Alignment 
Search Tool), CE (Combinatorial Extension of the Opti- 
mal Paths), and FSSP (Fold Classification Based on 
Structure-Structure Alignment of Proteins). 

Protein Quaternary Structures Range from Simple Dimers 
to Large Complexes 

0 Protein Architecture -Quaternary Structure Many proteins 

have multiple polypeptide subunits (from two to hun- 
dreds) . The association of polypeptide chains can serve a 
variety of functions. Many multisubunit proteins have 
regulatory roles; the binding of small molecules may 



affect the interaction between subunits, causing large 
changes in the protein's activity in response to small 
changes in the concentration of substrate or regulatory 
molecules (Chapter 6). In other cases, separate subunits 
take on separate but related functions, such as catalysis 
and regulation. Some associations, such as the fibrous 
proteins considered earlier in this chapter and the coat 
proteins of viruses, serve primarily structural roles. 
Some very large protein assemblies are the site of com- 
plex, multistep reactions. For example, each ribosome, 
the site of protein synthesis, incorporates dozens of 
protein subunits along with a number of RNA molecules. 

A multisubunit protein is also referred to as a multi- 
mer. A multimer with just a few subunits is often called 
an oligomer. If a multimer has nonidentical subunits, 
the overall structure of the protein can be asymmetric 
and quite complicated. However, most multimers have 
identical subunits or repeating groups of nonidentical 
subunits, usually in symmetric arrangements. As noted 
in Chapter 3, the repeating structural unit in such a mul- 
timeric protein, whether a single subunit or a group of 
subunits, is called a protomer. Greek letters are some- 
times used to distinguish the individual subunits that 
make up a protomer. 

The first oligomeric protein to have its three- 
dimensional structure determined was hemoglobin 
(M T 64,500), which contains four polypeptide chains 
and four heme prosthetic groups, in which the iron 
atoms are in the ferrous (Fe 2+ ) state (Fig. 4-16). The 
protein portion, the globin, consists of two a chains 
(141 residues each) and two (3 chains (146 residues 
each). Note that in this case, a and (3 do not refer to 
secondary structures. Because hemoglobin is four 
times as large as myoglobin, much more time and ef- 
fort were required to solve its three-dimensional struc- 
ture by x-ray analysis, finally achieved by Max Perutz, 
John Kendrew, and their colleagues in 1959. The sub- 
units of hemoglobin are arranged in symmetric pairs 
(Fig. 4-22), each pair having one a and one ft subunit. 
Hemoglobin can therefore be described either as a 
tetramer or as a dimer of a/3 protomers. 




Max Perutz, 1914-2002 (left) John Kendrew, 1917-1997 (right) 
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FIGURE 4-22 Quaternary structure of de- 
oxyhemoglobin. (PDB ID 2HHB) X-ray dif- 
fraction analysis of deoxyhemoglobin 
(hemoglobin without oxygen molecules 
bound to the heme groups) shows how the 
four polypeptide subunits are packed to- 
gether, (a) A ribbon representation, (b) A sur- 
face contour model. The a subunits are 
shown in shades of gray; the j8 subunits in 
shades of blue. Note that the heme groups 
(red) are relatively far apart. 




Identical subunits of multimeric proteins are gener- 
ally arranged in one or a limited set of symmetric pat- 
terns. A description of the structure of these proteins 
requires an introduction to conventions used to define 
symmetries. Oligomers can have either rotational 
symmetry or helical symmetry; that is, individual 
subunits can be superimposed on others (brought to co- 
incidence) by rotation about one or more rotational axes 
or by a helical rotation. In proteins with rotational sym- 
metry, the subunits pack about the rotational axes to 
form closed structures. Proteins with helical symmetry 
tend to form more open-ended structures, with subunits 
added in a spiraling array. 

There are several forms of rotational symmetry. The 
simplest is cyclic symmetry, involving rotation about a 
single axis (Fig. 4-23a). If subunits can be superimposed 
by rotation about a single axis, the protein has a symmetry 



defined as G n (C for cyclic, n for the number of subunits 
related by the axis). The axis itself is described as an n- 
fold rotational axis. The a/5 protomers of hemoglobin (Fig. 
4-22) are related by C 2 symmetry. A somewhat more com- 
plicated rotational symmetry is dihedral symmetry, in 
which a twofold rotational axis intersects an n-fold axis at 
right angles; this symmetry is defined as D n (Fig. 4-23b). 
A protein with dihedral symmetry has 2n protomers. 

Proteins with cyclic or dihedral symmetry are par- 
ticularly common. More complex rotational symmetries 
are possible, but only a few are regularly encountered in 
proteins. One example is icosahedral symmetry. An 
icosahedron is a regular 12-cornered polyhedron with 
20 equilateral triangular faces (Fig. 4-23c). Each face 
can be brought to coincidence with another face by ro- 
tation about one or more of three axes. This is a com- 
mon structure in virus coats, or capsids. The human 
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FIGURE 4-23 Rotational symmetry in proteins, (a) In cyclic symmetry, 
subunits are related by rotation about a single n-fold axis, where n is 
the number of subunits so related. The axes are shown as black lines; 
the numbers are values of n. Only two of many possible C n arrange- 
ments are shown, (b) In dihedral symmetry, all subunits can be related 
by rotation about one or both of two axes, one of which is twofold. D 2 
symmetry is most common, (c) Icosahedral symmetry. Relating all 20 
triangular faces of an icosahedron requires rotation about one or more 
of three separate rotational axes: twofold, threefold, and fivefold. An 
end-on view of each of these axes is shown at the right. 
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FIGURE 4-24 Viral capsids. (a) Poliovirus (derived from PDB ID 2PLV) 
as rendered in the VIPER relational database for structural virology. The 
coat proteins of poliovirus assemble into an icosahedron 300 A in di- 
ameter. Icosahedral symmetry is a type of rotational symmetry (see Fig. 
4-2 3c). On the left is a surface contour image of the poliovirus capsid. 
The image at right was rendered at lower resolution, and the coat pro- 
tein subunits were colored to show the icosahedral symmetry, (b) To- 
bacco mosaic virus (derived from PDB ID 1VTM). This rod-shaped 
virus (as shown in the electron micrograph) is 3,000 A long and 1 80 A 
in diameter; it has helical symmetry. 

poliovirus has an icosahedral capsid (Fig. 4-24a). Each 
triangular face is made up of three protomers, each con- 
taining single copies of four different polypeptide 
chains, three of which are accessible at the outer sur- 
face. Sixty protomers form the 20 faces of the icosahe- 
dral shell, which encloses the genetic material (RNA). 

The other major type of symmetry found in oligomers, 
helical symmetry, also occurs in capsids. Tobacco mosaic 
virus is a right-handed helical filament made up of 2,130 
identical subunits (Fig. 4-24b). This cylindrical structure 
encloses the viral RNA. Proteins with subunits arranged in 
helical filaments can also form long, fibrous structures 
such as the actin filaments of muscle (see Fig. 5-28). 

SUMMARY 4.3 Protein Tertiary and 
Quaternary Structures 

■ Tertiary structure is the complete three-dimensional 
structure of a polypeptide chain. There are two 
general classes of proteins based on tertiary 
structure: fibrous and globular. 

■ Fibrous proteins, which serve mainly structural roles, 
have simple repeating elements of secondary structure. 



■ Globular proteins have more complicated tertiary 
structures, often containing several types of 
secondary structure in the same polypeptide 
chain. The first globular protein structure to be 
determined, by x-ray diffraction methods, was that 
of myoglobin. 

■ The complex structures of globular proteins can 
be analyzed by examining folding patterns called 
motifs, supersecondary structures, or folds. The 
thousands of known protein structures are 
generally assembled from a repertoire of only a 
few hundred motifs. Domains are regions of a 
polypeptide chain that can fold stably and 
independently. 

Quaternary structure results from interactions 
between the subunits of multisubunit (multimeric) 
proteins or large protein assemblies. Some 
multimeric proteins have a repeated unit consisting 
of a single subunit or a group of subunits, or 
protomer. Protomers are usually related by 
rotational or helical symmetry. 

4.4 Protein Denaturation and Folding 

All proteins begin their existence on a ribosome as a lin- 
ear sequence of amino acid residues (Chapter 27). This 
polypeptide must fold during and following synthesis to 
take up its native conformation. As we have seen, a na- 
tive protein conformation is only marginally stable. Mod- 
est changes in the protein's environment can bring 
about structural changes that can affect function. We 
now explore the transition that occurs between the 
folded and unfolded states. 

Loss of Protein Structure Results in Loss of Function 

Protein structures have evolved to function in particular 
cellular environments. Conditions different from those in 
the cell can result in protein structural changes, large and 
small. A loss of three-dimensional structure sufficient to 
cause loss of function is called denaturation. The dena- 
tured state does not necessarily equate with complete un- 
folding of the protein and randomization of conformation. 
Under most conditions, denatured proteins exist in a set of 
partially folded states, which as yet are poorly understood. 

Most proteins can be denatured by heat, which has 
complex effects on the weak interactions in a protein 
(primarily hydrogen bonds). If the temperature is in- 
creased slowly, a protein's conformation generally remains 
intact until an abrupt loss of structure (and function) 
occurs over a narrow temperature range (Fig. 4-25). The 
abruptness of the change suggests that unfolding is a co- 
operative process: loss of structure in one part of the pro- 
tein destabilizes other parts. The effects of heat on 
proteins are not readily predictable. The very heat-stable 
proteins of thermophilic bacteria and archaea have 
evolved to function at the temperature of hot springs 
(—100 °C). Yet the structures of these proteins often 
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FIGURE 4-25 Protein denaturation. Results are shown for proteins de- 
natured by two different environmental changes. In each case, the 
transition from the folded to the unfolded state is abrupt suggesting co- 
operativity in the unfolding process, (a) Thermal denaturation of horse 
apomyoglobin (myoglobin without the heme prosthetic group) and ri- 
bonuclease A (with its disulfide bonds intact; see Fig. 4-26). The mid- 
point of the temperature range over which denaturation occurs is 
called the melting temperature, or T m . Denaturation of apomyoglobin 
was monitored by circular dichroism {see Fig. 4-9), which measures 
the amount of helical structure in the protein. Denaturation of ribonu- 
clease A was tracked by monitoring changes in the intrinsic fluores- 
cence of the protein, which is affected by changes in the environment 
of Trp residues, (b) Denaturation of disulfide-intact ribonuclease A by 
guanidine hydrochloride {GdnHCl), monitored by circular dichroism. 

differ only slightly from those of homologous proteins de- 
rived from bacteria such as Escherichia coli. How these 
small differences promote structural stability at high tem- 
peratures is not yet understood. 

Proteins can also be denatured by extremes of pH, 
by certain miscible organic solvents such as alcohol or 
acetone, by certain solutes such as urea and guanidine 
hydrochloride, or by detergents. Each of these denatur- 
ing agents represents a relatively mild treatment in the 
sense that no covalent bonds in the polypeptide chain 
are broken. Organic solvents, urea, and detergents act 
primarily by disrupting the hydrophobic interactions 
that make up the stable core of globular proteins; ex- 
tremes of pH alter the net charge on the protein, causing 
electrostatic repulsion and the disruption of some hydro- 
gen bonding. The denatured structures obtained with 
these various treatments are not necessarily the same. 



Amino Add Sequence Determines Tertiary Structure 

The tertiary structure of a globular protein is determined 
by its amino acid sequence. The most important proof of 
this came from experiments showing that denaturation of 
some proteins is reversible. Certain globular proteins de- 
natured by heat, extremes of pH, or denaturing reagents 
will regain their native structure and their biological ac- 
tivity if returned to conditions in which the native confor- 
mation is stable. This process is called renaturation. 

A classic example is the denaturation and renatu- 
ration of ribonuclease A, demonstrated by Christian 
Anfinsen in the 1950s. Purified ribonuclease A denatures 
completely in a concentrated urea solution in the pres- 
ence of a reducing agent. The reducing agent cleaves the 
four disulfide bonds to yield eight Cys residues, and the 
urea disrupts the stabilizing hydrophobic interactions, 
thus freeing the entire polypeptide from its folded con- 
formation. Denaturation of ribonuclease is accompanied 
by a complete loss of catalytic activity When the urea and 
the reducing agent are removed, the randomly coiled, de- 
natured ribonuclease spontaneously refolds into its cor- 
rect tertiary structure, with full restoration of its catalytic 
activity (Fig. 4-26). The refolding of ribonuclease is so 
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FIGURE 4-26 Renaturation of unfolded, denatured ribonuclease. 

Urea denatures the ribonuclease, and mercaptoethanol (HOCH 2 CH 2 SH) 
reduces and thus cleaves the disulfide bonds to yield eight Cys residues. 
Renaturation involves reestablishing the correct disulfide cross-links. 
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accurate that the four intrachain disulfide bonds are re- 
formed in the same positions in the renatured molecule 
as in the native ribonuclease. Calculated mathematically, 
the eight Cys residues could recombine at random to 
form up to four disulfide bonds in 105 different ways. In 
fact, an essentially random distribution of disulfide bonds 
is obtained when the disulfides are allowed to re-form in 
the presence of denaturant (without reducing agent), in- 
dicating that weak bonding interactions are required for 
correct positioning of disulfide bonds and restoration of 
the native conformation. Later, similar results were ob- 
tained using chemically synthesized, catalytically active 
ribonuclease A. This eliminated the possibility that some 
minor contaminant in Anfinsen's purified ribonuclease 
preparation might have contributed to the renaturation of 
the enzyme, thus dispelling any remaining doubt that this 
enzyme folds spontaneously. 

The Anfinsen experiment provided the first evi- 
dence that the amino acid sequence of a polypeptide 
chain contains all the information required to fold the 
chain into its native, three-dimensional structure. Sub- 
sequent work has shown that this is true only for a mi- 
nority of proteins, many of them small and inherently 
stable. Even though all proteins have the potential to 
fold into their native structure, many require some 
assistance, as we shall see. 

Polypeptides Fold Rapidly by a Stepwise Process 

In living cells, proteins are assembled from amino acids 
at a very high rate. For example, E. coli cells can make a 
complete, biologically active protein molecule containing 
100 amino acid residues in about 5 seconds at 37 °C. How 
does the polypeptide chain arrive at its native conforma- 
tion? Let's assume conservatively that each of the amino 
acid residues could take up 10 different conformations 

FIGURE 4-27 A simulated folding pathway. The folding pathway of a 
36-residue segment of the protein villin (an actin-binding protein 
found principally in the microvilli lining the intestine) was simulated 
by computer. The process started with the randomly coiled peptide and 
3,000 surrounding water molecules in a virtual "water box/' The mo- 



on average, giving 10 different conformations for the 
polypeptide. Let's also assume that the protein folds 
spontaneously by a random process in which it tries out 
all possible conformations around every single bond in its 
backbone until it finds its native, biologically active form. 
If each conformation were sampled in the shortest possi- 
ble time (~10~ 13 second, or the time required for a sin- 
gle molecular vibration), it would take about 10 77 years 
to sample all possible conformations. Clearly, protein 
folding is not a completely random, trial-and-error 
process. There must be shortcuts. This problem was first 
pointed out by Cyrus Levinthal in 1968 and is sometimes 
called LevinthaTs paradox. 

The folding pathway of a large polypeptide chain is 
unquestionably complicated, and not all the principles 
that guide the process have been worked out. However, 
there are several plausible models. In one, the folding 
process is hierarchical. Local secondary structures form 
first. Certain amino acid sequences fold readily into a he- 
lices or j8 sheets, guided by constraints such as those re- 
viewed in our discussion of secondary structure. Ionic 
interactions, involving charged groups that are often near 
one another in the linear sequence of the polypeptide 
chain, can play an important role in guiding these early 
folding steps. Assembly of local structures is followed by 
longer-range interactions between, say, two a helices that 
come together to form stable supersecondary structures. 
The process continues until complete domains form and 
the entire polypeptide is folded (Fig. 4-27). Notably, 
proteins dominated by close-range interactions (between 
pairs of residues generally located near each other in the 
polypeptide sequence) tend to fold faster than proteins 
with more complex folding patterns and many long-range 
interactions between different segments. 

In an alternative model, folding is initiated by a 
spontaneous collapse of the polypeptide into a compact 



lecular motions of the peptide and the effects of the water molecules 
were taken into account in mapping the most likely paths to the final 
structure among the countless alternatives. The simulated folding took 
place in a theoretical time span of 1 ms. 
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state, mediated by hydrophobic interactions among 
nonpolar residues. The state resulting from this "hy- 
drophobic collapse" may have a high content of second- 
ary structure, but many amino acid side chains are not 
entirely fixed. The collapsed state is often referred to as 
a molten globule. Most proteins probably fold by a 
process that incorporates features of both models. In- 
stead of following a single pathway, a population of pep- 
tide molecules may take a variety of routes to the same 
end point, with the number of different partly folded 
conformational species decreasing as folding nears 
completion. 

Thermodynamically, the folding process can be 
viewed as a kind of free-energy funnel (Fig- 4-28). The 
unfolded states are characterized by a high degree of 
conformational entropy and relatively high free energy. 
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FIGURE 4-28 The thermodynamics of protein folding depicted as a 
free-energy funnel. At the top, the number of conformations, and 
hence the conformational entropy, is large. Only a small fraction of the 
intramolecular interactions that will exist in the native conformation 
are present. As folding progresses, the thermodynamic path down the 
funnel reduces the number of states present (decreases entropy), in- 
creases the amount of protein in the native conformation, and de- 
creases the free energy. Depressions on the sides of the funnel 
represent semistable folding intermediates, which in some cases may 
slow the folding process. 



As folding proceeds, the narrowing of the funnel repre- 
sents a decrease in the number of conformational 
species present. Small depressions along the sides of the 
free-energy funnel represent semistable intermediates 
that can briefly slow the folding process. At the bottom 
of the funnel, an ensemble of folding intermediates has 
been reduced to a single native conformation (or one of 
a small set of native conformations). 

Thermodynamic stability is not evenly distributed 
over the structure of a protein — the molecule has regions 
of high and low stability. For example, a protein may have 
two stable domains joined by a segment with lower struc- 
tural stability, or one small part of a domain may have a 
lower stability than the remainder. The regions of low sta- 
bility allow a protein to alter its conformation between 
two or more states. As we shall see in the next two chap- 
ters, variations in the stability of regions within a protein 
are often essential to protein function. 

As our understanding of protein folding and protein 
structure improves, increasingly sophisticated com- 
puter programs for predicting the structure of proteins 
from their amino acid sequence are being developed. 
Prediction of protein structure is a specialty field of 
bioinformatics, and progress in this area is monitored 
with a biennial test called the CASP (Critical Assessment 
of Structural Prediction) competition. Entrants from 
around the world vie to predict the structure of an as- 
signed protein (whose structure has been detennined but 
not yet published). The most successful teams are invited 
to present their results at a CASP conference. Completely 
reliable solutions to the complex problem of predicting 
protein structure are not yet available, but the success 
and rigor of new approaches is being enriched by CASP. 

Some Proteins Undergo Assisted Folding 

Not all proteins fold spontaneously as they are synthe- 
sized in the cell. Folding for many proteins requires 
molecular chaperones, proteins that interact with 
partially folded or improperly folded polypeptides, 
facilitating correct folding pathways or providing 
microenvironments in which folding can occur. Two 
classes of molecular chaperones have been well studied. 
Both are found in organisms ranging from bacteria to hu- 
mans. The first class is a family of proteins called Hsp70 
(see Fig. 3-30) , which generally have a molecular weight 
near 70,000 and are more abundant in cells stressed by 
elevated temperatures (hence, heat shock proteins of M T 
70,000, or Hsp70). Hsp70 proteins bind to regions of un- 
folded polypeptides that are rich in hydrophobic 
residues, preventing inappropriate aggregation. These 
chaperones thus "protect" both proteins subject to de- 
naturation by heat and new peptide molecules being syn- 
thesized (and not yet folded). Hsp70 proteins also block 
the folding of certain proteins that must remain unfolded 
until they have been translocated across a membrane 
(as described in Chapter 27). Some chaperones also 
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(T) DnaJ binds to the 
unfolded or partially 
folded protein and 
then to DnaK. 



Unfolded 
protein 




) DnaJ stimulates ATP 
hydrolysis by DnaK. 
DnaK-ADP binds tightly 
to the unfolded protein. 



Folded 
protein 
(native 

conformation) 



(4) ATP binds to 
DnaK and the 
protein dissociates. 



FIGURE 4-29 Chaperones in protein folding. The cyclic pathway by 
which chaperones bind and release polypeptides is illustrated for the E. 
coli chaperone proteins DnaK and DnaJ, homologs of the eukaryotic 
chaperones Hsp70 and Hsp40. The chaperones do not actively pro- 
mote the folding of the substrate protein, but instead prevent aggrega- 
tion of unfolded peptides For a population of polypeptide molecules, 



GrpE 



ADP + GrpE (+ DnaJ ?) 

CD 

(3) In bacteria, the 
nucleotide-exchange 
factor GrpE stimulates 
release of ADP. 

some fraction of the molecules released at the end of the cycle are in 
the native conformation. The remainder are rebound by DnaK or di- 
verted to the chaperonin system (GroEL; see Fig. 4-30). In bacteria, a 
protein called GrpE interacts transiently with DnaK late in the cycle 
(step©), promoting dissociation of ADP and possibly DnaJ. No eu- 
karyotic analog of GrpE is known. 



facilitate the quaternary assembly of oligomeric proteins. 
The Hsp70 proteins bind to and release polypeptides in a 
cycle that uses energy from ATP hydrolysis and involves 
several other proteins (including a class called Hsp40) . 
Figure 4-29 illustrates chaperone-assisted folding as 
elucidated for the chaperones DnaK and DnaJ mE. coli, 
homologs of the eukaryotic Hsp70 and Hsp40 chaper- 
ones. DnaK and DnaJ were first identified as proteins 
required for in vitro replication of certain viral DNA 
molecules (hence the "Dna" designation). 

The second class of chaperones is the chaperonins. 
These are elaborate protein complexes required for the 
folding of some cellular proteins that do not fold 
spontaneously. InE. coli an estimated 10% to 15% of cel- 
lular proteins require the resident chaperonin system, 
called GroEL/GroES, for folding under normal conditions 
(up to 30% require this assistance when the cells are heat 
stressed). The chaperonins first became known when 
they were found to be necessary for the growth of certain 
bacterial viruses (hence the designation "Gro"). Unfolded 
proteins are bound within pockets in the GroEL complex, 



and the pockets are capped transiently by the GroES "lid" 
(Fig. 4-30). GroEL undergoes substantial conforma- 
tional changes, coupled to ATP hydrolysis and the bind- 
ing and release of GroES, which promote folding of the 
bound polypeptide. The mechanism by which the 
GroEL/GroES chaperonin facilitates folding is not known 
in detail, but it depends on the size and interior surface 
properties of the cavity where folding occurs. 

Finally, the folding pathways of some proteins re- 
quire two enzymes that catalyze isomerization reac- 
tions. Protein disulfide isomerase (PDI) is a widely 
distributed enzyme that catalyzes the interchange, or 
shuffling, of disulfide bonds until the bonds of the native 
conformation are formed. Among its functions, PDI 
catalyzes the elimination of folding intermediates with 
inappropriate disulfide cross-links. Peptide prolyl cis- 
trans isomerase (PPI) catalyzes the interconversion 
of the cis and trans isomers of Pro residue peptide 
bonds (Fig. 4-7b), which can be a slow step in the fold- 
ing of proteins that contain some Pro peptide bonds in 
the cis conformation. 
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conformation. 
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FIGURE 4-30 Chaperonins in protein folding, (a) A proposed pathway 
for the action of the E. coli chaperonins GroEL (a member of the Hsp60 
protein family) and GroES. Each GroEL complex consists of two large 
pockets formed by two heptameric rings (each subunit M r 57,000). 
GroES, also a heptamer (subunit M r 10,000), blocks one of the GroEL 
pockets, (b) Surface and cut-away images of the GroEIVGroES complex 
(PDB ID 1 AON). The cut-away (right) illustrates the large interior space 
within which other proteins are bound. 




Protein folding is likely to be a more complex pro- 
cess in the densely packed cellular environment than in 
the test tube. More classes of proteins that facilitate pro- 
tein folding may well be discovered. 

Defects in Protein Folding May Be the Molecular Basis for a 
Wide Range of Human Genetic Disorders 

rl Despite the many processes that assist in protein 
I folding, misfolding does occur. In fact, protein 
misfolding is a substantial problem in all cells, and a 
quarter or more of all polypeptides synthesized may be 



destroyed because they do not fold correctly. In some 
cases, the misfolding causes or contributes to the devel- 
opment of serious disease. 

Many conditions, including type 2 diabetes, 
Alzheimer's disease, Huntington's disease, and Parkin- 
son's disease, arise from a common misfolding mecha- 
nism. In most cases, a soluble protein that is normally 
secreted from the cell is secreted in a misfolded state 
and converted into an insoluble extracellular amyloid 
fiber. The diseases are collectively referred to as amy- 
loidoses. The fibers are highly ordered and un- 
branched, with a diameter of 7 to 10 nm and a high 
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degree of /3-sheet structure. The )3 strands are ori- 
ented perpendicular to the axis of the fiber. In some 
amyloid fibers the overall structure forms a long, two- 
layered j3 sheet such as that shown for amyloid-/? pep- 
tide in Figure 4-31. 

Many proteins can take on the amyloid fibril struc- 
ture as an alternative to their normal folded conforma- 
tions, and most of these proteins have a concentration of 
aromatic amino acid residues in a core region of jS sheet. 
The proteins are secreted in an incompletely folded con- 
formation. The core jS sheet (or some part of it) forms 
before the rest of the protein folds correctly, and the /3 
sheets from two or more incompletely folded protein 
molecules associate to begin forming an amyloid fibril. 
The fibril grows in the extracellular space. Other parts of 
the protein then fold differently, remaining on the out- 
side of the /3-sheet core in the growing fibril (Fig. 4-31a; 
the effect of aromatic residues in stabilizing the struc- 
ture is shown in Fig. 4-3 lc). Because most of the 
protein molecules fold normally, the onset of symptoms 
in the amyloidoses is often very slow. If a person inherits 
a mutation such as substitution with an aromatic residue 
at a position that favors formation of amyloid fibrils, dis- 
ease symptoms may begin at an earlier age. A decreased 
capacity for removing misfolded proteins may also con- 
tribute to these diseases. 
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Some amyloidoses are systemic, involving many tis- 
sues. Primary systemic amyloidosis is caused by deposi- 
tion of fibrils consisting of misfolded immunoglobulin 
light chains (described in Chapter 5), or fragments of 
light chains derived from proteolytic degradation. The 
mean age of onset is about 65 years. Patients have symp- 
toms including fatigue, hoarseness, swelling, and weight 
loss, and many die within the first year after diagnosis. 
The kidneys or heart are often most affected. Some 
amyloidoses are associated with other types of disease. 
Patients with certain chronic infectious or inflammatory 
diseases such as rheumatoid arthritis, tuberculosis, cys- 
tic fibrosis, and some cancers can experience a sharp in- 
crease in secretion of an amyloid-prone polypeptide 
called serum amyloid A (SAA) protein. This protein, or 
fragments of it, deposits in the connective tissue of the 

FIGURE 4-31 Formation of disease-causing amyloid fibrils, (a) Protein 
molecules whose normal structure includes regions of /3 sheet undergo 
partial folding. In a small number of the molecules, before folding is 
complete, the /3-sheet regions of one polypeptide associate with the 
same region in another polypeptide, forming the nucleus of an amy- 
loid. Additional protein molecules slowly associate with the amyloid 
and extend it to form a fibril, (b) The amyloid-/? peptide, which plays a 
major role in Alzheimer's disease, is derived from a larger transmem- 
brane protein called amyloid-/? precursor protein or APR This protein 
is found in most human tissues. When it is part of the larger protein, the 
peptide is composed of two a-helical segments spanning the mem- 
brane. When the external and internal domains (each of which have 
independent functions) are cleaved off by dedicated proteases, the 
remaining and relatively unstable amyloid-/? peptide leaves the mem- 
brane and loses its a-helical structure. It can then assemble slowly into 
amyloid fibrils (c), which contribute to the characteristic plaques on the 
exterior of nervous tissue in people with Alzheimer's. Amyloid is rich in 
0-sheet structure, with the p strands arranged perpendicular to the axis 
of the amyloid fibril. In amyloid-/? peptide, the structure takes the form 
of an extended two-layer parallel 0 sheet. Others may take the form of 
left-handed /?-helices (see Fig. 4-21 ). 
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spleen, kidney, and liver, and around the heart. People 
with this condition, known as secondary systemic amy- 
loidosis, have a wide range of symptoms, depending on 
the organs initially affected. The disease is generally fa- 
tal within a few years. More than 80 amyloidoses are as- 
sociated with mutations in transthyretin (a protein that 
binds to and transports thyroid hormones, distributing 
them throughout the body and brain). A variety of mu- 
tations in this protein lead to amyloid deposition con- 
centrated around different tissues, thus producing 
different symptoms. Amyloidoses are also associated 
with inherited mutations in the proteins lysozyme, fib- 
rinogen A a-chain, and apolipoproteins A-I and A-II; all 
of these proteins are described in later chapters. 

Some amyloid diseases are associated with particu- 
lar organs. The amyloid-prone protein is generally se- 
creted only by the affected tissue, and its locally high 
concentration leads to amyloid deposition around that 
tissue (although some of the protein may be distributed 
systemically). One common site of amyloid deposition 
is near the pancreatic islet /3 cells, responsible for in- 
sulin secretion and regulation of glucose metabolism 
(p. 924). Secretion by /3 cells of a small (37 amino 
acids) peptide called islet amyloid polypeptide (IAPP), 
or amylin, can lead to amyloid deposits around the 
islets, gradually destroying the cells. A healthy human 
adult has 1 to 1.5 million pancreatic f3 cells. With pro- 
gressive loss of these cells, glucose homeostasis is af- 
fected and eventually, when 50% or more of the cells 
are lost, the condition matures into type 2 (adult onset) 
diabetes mellitus. 

The amyloid deposition diseases that trigger neu- 
rodegeneration, particularly in older adults, are a special 
class of localized amyloidoses. Alzheimer's disease is as- 
sociated with extracellular amyloid deposition by neu- 



rons, involving a protein called amyloid /3-peptide. These 
amyloid deposits seem to be the primary cause of 
Alzheimer's, but a second type of amyloid-like aggrega- 
tion, involving a protein called tau, also occurs intracellu- 
larly (in neurons) in patients with Alzheimer's. Inherited 
mutations in the tau protein do not result in Alzheimer's, 
but they cause a frontotemporal dementia and parkinson- 
ism (a condition with symptoms resembling Parkinson's 
disease) that can be equally devastating. 

Several other neurodegenerative conditions involve 
intracellular aggregation of misfolded proteins. In Parkin- 
son's disease, the misfolded form of the protein a-synu- 
clein aggregates into spherical filamentous masses called 
Lewy bodies. Huntington's disease involves the protein 
huntingtin, which has a long polyglutamine repeat. In 
some individuals, the polyglutamine repeat is longer 
than normal and a more subtle type of intracellular ag- 
gregation occurs. The relationship of some of these neu- 
rodegenerative conditions to amyloidoses has been 
debated, but the aggregates are known to have high de- 
grees of (3 structure and insolubility that suggest some 
common structures and mechanisms of formation. 

Protein misfolding need not lead to amyloid forma- 
tion to cause serious disease. For example, cystic fibro- 
sis is caused by defects in a membrane-bound protein 
called cysticjibrosis transmembrane conductance regu- 
lator (CFTR), which acts as a channel for chloride ions. 
The most common cystic fibrosis-causing mutation is 
the deletion of a Phe residue at position 508 in CFTR, 
which causes improper protein folding. Most of this pro- 
tein is then degraded and its normal function is lost (see 
Box 11-3). Many of the disease-related mutations in col- 
lagen (p. 128) also cause defective folding. A particu- 
larly remarkable type of protein misfolding is seen in the 
prion diseases (Box 4-6). ■ 
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Death by Misfolding: The Prion Diseases 



A misfolded brain protein seems to be the causative agent 
of several rare degenerative brain diseases in mammals. 
Perhaps the best known of these is bovine spongiform en- 
cephalopathy (BSE; also known as mad cow disease). Re- 
lated diseases include kuru and Creutzfeldt-Jakob 
disease in humans, scrapie in sheep, and chronic wasting 
disease in deer and elk. These diseases are also referred 
to as spongiform encephalopathies, because the diseased 
brain frequently becomes riddled with holes (Fig. 1) . Pro- 
gressive deterioration of the brain leads to a spectrum of 
neurological symptoms, including weight loss, erratic be- 
havior, problems with posture, balance, and coordination, 
and loss of cognitive function. The diseases are fatal. 

In the 1960s, investigators found that prepara- 
tions of the disease-causing agents seemed to lack nu- 
cleic acids. At this time, Tikvah Alper suggested that 
the agent was a protein. Initially, the idea seemed 

(continued on next page) 




FIGURE 1 Stained section of cerebral cortex from autopsy of a patient 
with Creutzfeldt-Jakob disease shows spongiform (vacuolar) degenera- 
tion, the most characteristic neurohistological feature. The yellowish 
vacuoles are intracellular and occur mostly in pre- and postsynaptic 
processes of neurons. The vacuoles in this section vary in diameter 
from 20 to 100 fim. 
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Death by Misfolding: The Prion Diseases 
(continued from previous page) 




heretical. All disease-causing agents known up to that 
time — viruses, bacteria, fungi, and so on — contained 
nucleic acids, and their virulence was related to ge- 
netic reproduction and propagation. However, four 
decades of investigations, pursued most notably by 
Stanley Prusiner, have provided evidence that spongi- 
form encephalopathies are different. 

The infectious agent has been traced to a single 
protein (M r 28,000), which Prusiner dubbed prion 
(proteinaceous infectious only) protein (PrP). Prion 
protein is a normal constituent of brain tissue in all 
mammals. Its role is not known in detail, but it may 
have a molecular signaling function. Strains of mice 
lacking the gene for PrP (and thus the protein itself) 
suffer no obvious ill effects. Illness occurs only when 
the normal cellular PrP, or PrP c , occurs in an altered 
conformation called PrP Sc (Sc denotes scrapie). The 
interaction of PrP Sc with PrP c converts the latter to 
PrP Sc , initiating a domino effect in which more and 
more of the brain protein converts to the disease- 
causing form. The mechanism by which the presence 
of PrP Sc leads to spongiform encephalopathy is not 
understood. 

In inherited forms of prion diseases, a mutation in 
the gene encoding PrP produces a change in one amino 
acid residue that is believed to make the conversion of 



PrP c to PrP Sc more likely. A complete understanding of 
prion diseases awaits new information on how prion pro- 
tein affects brain function. Structural information about 
PrP is beginning to provide insights into the molecular 
process that allows the prion proteins to interact so as to 
alter their conformation (Fig. 2). 





FIGURE 2 Structure of the globular domain of human PrP in 
monomeric (left) and dimeric (right) forms. The second subunit is gray 
to highlight the dramatic conformational change in the green a helix 
(now flipped downward) when the intertwined dimer is formed. 



SUMMARY 4.4 Protein Denaturation and 
Folding 

■ The three-dimensional structure and the function 
of proteins can be destroyed by denaturation, 
demonstrating a relationship between structure 
and function. Some denatured proteins can 
renature spontaneously to form biologically active 
protein, showing that tertiary structure is 
determined by amino acid sequence. 

Protein folding in cells probably involves 
multiple pathways. Initially, regions of secondary 
structure may form, followed by folding into 
supersecondary structures. Large ensembles of 
folding intermediates are rapidly brought to a 
single native conformation. 

■ For many proteins, folding is facilitated by Hsp70 
chaperones and by chaperonins. Disulfide bond 
formation and the cis-trans isomerization of Pro 
peptide bonds are catalyzed by specific enzymes. 

Protein misfolding is the molecular basis of a 
wide range of human diseases, including the 
amyloidoses. 
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Problems 

1. Properties of the Peptide Bond In x-ray studies of crys- 
talline peptides, Linus Pauling and Robert Corey found that 
the C — N bond in the peptide link is intermediate in length 
(1.32 A) between a typical C — N single bond (1.49 A) and a 
C=N double bond (1.27 A). They also found that the peptide 
bond is planar (all four atoms attached to the C — N group are 
located in the same plane) and that the two a-carbon atoms 
attached to the C — N are always trans to each other (on oppo- 
site sides of the peptide bond) : 

(a) What does the length of the C — N bond in the peptide 
linkage indicate about its strength and its bond order (i.e., 
whether it is single, double, or triple)? 

(b) What do the observations of Pauling and Corey tell us 
about the ease of rotation about the C — N peptide bond? 

2. Structural and Functional Relationships in Fibrous 
Proteins William Astbury discovered that the x-ray diffrac- 
tion pattern of wool shows a repeating structural unit spaced 
about 5.2 A along the length of the wool fiber. When he 
steamed and stretched the wool, the x-ray pattern showed a 
new repeating structural unit at a spacing of 7,0 A. Steaming 
and stretching the wool and then letting it shrink gave an x-ray 
pattern consistent with the original spacing of about 5.2 A. Al- 
though these observations provided important clues to the 
molecular structure of wool, Astbury was unable to interpret 
them at the time. 
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(a) Given our current understanding of the structure of 
wool, interpret Astbury's observations 

(t>) When wool sweaters or socks are washed in hot water 
or heated in a dryer, they shrink. Silk, on the other hand, does 
not shrink under the same conditions. Explain. 

3. Rate of Synthesis of Hair a-Keratin Hair grows at a 
rate of 15 to 20 cm/yr All this growth is concentrated at the 
base of the hair fiber, where a-keratin filaments are synthe- 
sized inside living epidermal cells and assembled into ropelike 
structures (see Fig. 4-10). The fundamental structural ele- 
ment of a-keratin is the a helix, which has 3,6 amino acid 
residues per turn and a rise of 5.4 A per turn (see Fig 4-4a). 
Assuming that the biosynthesis of a-helical keratin chains is 
the rate-limiting factor in the growth of hair, calculate the rate 
at which peptide bonds of a-keratin chains must be synthe- 
sized (peptide bonds per second) to account for the observed 
yearly growth of hair. 

4. Effect of pH on the Conformation of a-Helical Sec- 
ondary Structures The unfolding of the a helix of a 
polypeptide to a randomly coiled conformation is accompa- 
nied by a large decrease in a property called specific rota- 
tion, a measure of a solution's capacity to rotate plane- 
polarized light. Polyglutamate, a polypeptide made up of 
only l-G1u residues, has the a-helical conformation at pH 3 
When the pH is raised to 7, there is a large decrease in the 
specific rotation of the solution Similarly, polylysine (L-Lys 
residues) is an a helix at pH 10, but when the pH is lowered 
to 7 the specific rotation also decreases, as shown by the fol- 
lowing graph. 
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What is the explanation for the effect of the pH changes on the 
conformations of poly(Glu) and poly(Lys)? Why does the 
transition occur over such a narrow range of pH? 

5. Disulfide Bonds Determine the Properties of Many 
Proteins Some natural proteins are rich in disulfide bonds, and 
their mechanical properties (tensile strength, viscosity, hard- 
ness, etc.) are correlated with the degree of disulfide bonding. 

(a) Glutenin, a wheat protein rich in disulfide bonds, is 
responsible for the cohesive and elastic character of dough 
made from wheat flour. Similarly, the hard, tough nature of 
tortoise shell is due to the extensive disulfide bonding in its 



a-keratin. What is the molecular basis for the correlation 
between disulfide-bond content and mechanical properties of 
the protein? 

(b) Most globular proteins are denatured and lose their 
activity when briefly heated to 65 °C. However, globular pro- 
teins that contain multiple disulfide bonds often must be 
heated longer at higher temperatures to denature them. One 
such protein is bovine pancreatic trypsin inhibitor (BPTI), 
which has 58 amino acid residues in a single chain and con- 
tains three disulfide bonds. On cooling a solution of denatured 
BPTI, the activity of the protein is restored. What is the mo- 
lecular basis for this property? 

6. Amino Acid Sequence and Protein Structure Our 

growing understanding of how proteins fold allows re- 
searchers to make predictions about protein structure based 
on primary amino acid sequence data. Consider the following 
amino acid sequence. 

I 23456789 10 

He -Ala -His -Thr -Tyr -Gly -Pro -Phe -Glu -Ala - 

II 12 13 14 15 16 17 18 19 20 

Ala - Met -Cys - Lys - Tip - Glu - Ala - Gin - Pro - Asp - 

21 22 23 24 25 26 27 28 

Gly -Met - Glu -Cys-Ala-Phe -His -Arg 

(a) Where might bends or f3 turns occur? 

(b) Where might intrachain disulfide cross-linkages be 
formed? 

(c) Assuming that this sequence is part of a larger globular 
protein, indicate the probable location (the external surface or 
interior of the protein) of the following amino acid residues: 
Asp, He, Thr, Ala, Gin, Lys. Explain your reasoning. (Hint: See 
the hydropathy index in Table 3-1.) 

7. Bacteriorhodopsin in Purple Membrane Proteins 

Under the proper environmental conditions, the salt-loving 
archaean Halobacterium halobium synthesizes a membrane 
protein (M T 26,000) known as bacteriorhodopsin, which is 
purple because it contains retinal (see Fig. 10-21). Molecules 
of this protein aggregate into "purple patches" in the cell 
membrane. Bacteriorhodopsin acts as a light-activated proton 
pump that provides energy for cell functions. X-ray analysis of 
this protein reveals that it consists of seven parallel a-helical 
segments, each of which traverses the bacterial cell mem- 
brane (thickness 45 A). Calculate the minimum number of 
amino acid residues necessary for one segment of a helix to 
traverse the membrane completely Estimate the fraction of 
the bacteriorhodopsin protein that is involved in membrane- 
spanning helices. (Use an average amino acid residue weight 
of 110) 

8. Protein Structure Terminology Is myoglobin a motif, a 
domain, or a complete three-dimensional structure? 

9. Pathogenic Action of Bacteria That Cause Gas 
Gangrene The highly pathogenic anaerobic bacterium 
Clostridium perfringens is responsible for gas gangrene, a 
condition in which animal tissue structure is destroyed. This 
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bacterium secretes an enzyme that efficiently catalyzes the 
hydrolysis of the peptide bond indicated in red: 



— X— Gly— Pro— Y— 



H 2 0 



— X— COCT + H 3 N— Gly— Pro— Y— 

where X and Y are any of the 20 common amino acids. How 
does the secretion of this enzyme contribute to the invasive- 
ness of this bacterium in human tissues? Why does this en- 
zyme not affect the bacterium itself? 

10. Number of Polypeptide Chains in a Multisubunit Pro- 
tein A sample (660 mg) of an oligomeric protein of M T 132,000 
was treated with an excess of l-fluoro-2,4-dinitrobenzene 
(Sanger's reagent) under slightly alkaline conditions until the 
chemical reaction was complete. The peptide bonds of the 
protein were then completely hydrolyzed by heating it 
with concentrated HC1. The hydrolysate was found to contain 
5.5 mg of the following compound: 



NO. C N H V 

i • CH 

0 2 N-( ^NH-C-COOH 

2,4-Dinitrophenyl derivatives of the a-amino groups of other 
amino acids could not be found. 

(a) Explain how this information can be used to determine 
the number of polypeptide chains in an oligomeric protein. 

(b) Calculate the number of polypeptide chains in this 
protein. 

(c) What other protein analysis technique could you 
employ to determine whether the polypeptide chains in this 
protein are similar or different? 

11. Predicting Secondary Structure Which of the follow- 
ing peptides is more likely to take up an a-helical structure, 
and why? 

(a) LKAENDEAARAMSEA 

(b) CRAGGFPWDQPGTSN 

12. Amyloid Fibers in Disease Several small aro- 
matic molecules, such as phenol red (used as a non- 
toxic drug model) , have been shown to inhibit the formation of 
amyloid in laboratory model systems. A goal of the research on 
these small aromatic compounds is to find a drug that would 
efficiently inhibit the formation of amyloid in the brain in peo- 
ple with incipient Alzheimer's disease. 

(a) Suggest why molecules with aromatic substituents 
would disrupt the formation of amyloid. 

(b) Some researchers have suggested that a drug used to 
treat Alzheimer's disease may also be effective in treating type 
2 (adult onset) diabetes mellitus. Why might a single drug be 
effective in treating these two different conditions? 
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13. Protein Modeling on the Internet A group of patients 
with Crohn's disease (an inflammatory bowel disease) under- 



went biopsies of their intestinal mucosa in an attempt to identify 
the causative agent. Researchers identified a protein that was 
present at higher levels in patients with Crohn's disease than in 
patients with an unrelated inflammatory bowel disease or in un- 
affected controls. The protein was isolated, and the following 
partial amino acid sequence was obtained (reads left to right): 



EAELCPDRCI 
SQRIQTNNNP 
FQVTVRDPSG 
TAELKICRVN 
KEDIEVYFTG 
VFRTPPYADP 
SEPMEFQYLP 
SIMKKSPFSG 
VPKPAPQPYP 



HSFQNLGIQC 

FQVPIEEQRG 

RPLRLPPVLP 

RNSGSCLGGD 

PGWEARGSFS 

SLQAPVRVSM 

DTDDRHRIEE 

PTDPRPPPRR 



VKKRDLEQAI 

DYDLNAVRLC 

HPIFDNRAPN 

EIFLLCDKVQ 

QADVHRQVAI 

QLRRPSDREL 

KRKRTYETFK 

IAVPSRSSAS 



(a) You can identify this protein using a protein database 
on the Internet. Some good places to start include Protein 
Information Resource (PIR; pir.georgetown.edu), Structural 
Classification of Proteins (SCOP; http://scop.mrc-lmb.cam. 
ac.uk/scop), and Prosite (http://expasy.org/prosite). 

At your selected database site, follow links to the sequence 
comparison engine. Enter about 30 residues from the protein se- 
quence in the appropriate search field and submit it for analysis. 
What does this analysis tell you about the identity of the protein? 

(b) Try using different portions of the amino acid se- 
quence. Do you always get the same result? 

(c) A variety of websites provide information about the 
three-dimensional structure of proteins, Find information 
about the protein's secondary, tertiary, and quaternary struc- 
ture using database sites such as the Protein Data Bank (PDB; 
www.rcsb.org) or SCOP. 

(d) In the course of your Web searches, what did you learn 
about the cellular function of the protein? 
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14. Mirror-Image Proteins As noted in Chapter 3, "The 
amino acid residues in protein molecules are exclusively l 
stereoisomers." It is not clear whether this selectivity is neces- 
sary for proper protein function or is an accident of evolution. 
To explore this question, Milton and colleagues (1992) pub- 
lished a study of an enzyme made entirely of D stereoisomers. 
The enzyme they chose was HIV protease, a proteolytic en- 
zyme made by HIV that converts inactive viral pre-proteins to 
their active forms. 

Previously, Wlodawer and coworkers (1989) had reported 
the complete chemical synthesis of HIV protease from L-amino 
acids (the L-enzyme), using the process shown in Figure 3-29. 
Normal HIV protease contains two Cys residues at positions 67 
and 95. Because chemical synthesis of proteins containing Cys 
is technically difficult, Wlodawer and colleagues substituted 
the synthetic amino acid L-a-amino-n-butyric acid (Aba) for 
the two Cys residues in the protein. In the authors' words, this 
was done so as to "reduce synthetic difficulties associated with 
Cys deprotection and ease product handling." 
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The Three-Dimensional Structure of Proteins 



(a) The structure of Aba is shown below. Why was this a 
suitable substitution for a Cys residue? Under what circum- 
stances would it not be suitable? 

O o 

V 
i 

H C — CH 9 — CHo 

i 

+ NH 3 

L-a-Amino-/i-butyric acid 

Wlodawer and coworkers denatured the newly synthe- 
sized protein by dissolving it in 6 m guanidine HC1, and then 
allowed it to fold slowly by dialyzing away the guanidine 
against a neutral buffer (10% glycerol, 25 mM NaP0 4 , pH 7). 

(b) There are many reasons to predict that a protein 
synthesized, denatured, and folded in this manner would not 
be active. Give three such reasons. 

(c) Interestingly, the resulting L-protease was active. 
What does this finding tell you about the role of disulfide bonds 
in the native HIV protease molecule? 

In their new study, Milton and coworkers synthesized HIV 
protease from D-amino acids, using the same protocol as the 
earlier study (Wlodawer et al.). Formally, there are three 
possibilities for the folding of the D-protease: it would give 
(1) the same shape as the L-protease; (2) the mirror image of 
the L-protease, or (3) something else, possibly inactive. 

(d) For each possibility, decide whether or not it is a likely 
outcome and defend your position. 

In fact, the D-protease was active: it cleaved a particular 
synthetic substrate and was inhibited by specific inhibitors. To 
examine the structure of the d- and L-enzymes, Milton and 



coworkers tested both forms for activity with d and L forms of 
a chiral peptide substrate and for inhibition by d and l forms of 
a chiral peptide-analog inhibitor. Both forms were also tested 
for inhibition by the achiral inhibitor Evans blue. The findings 
are given in the table. 

Inhibition 

Substrate Peptide 

hydrolysis inhibitor Evans 
HIV ~ blue 

protease D-substrate D-protease D-inhibitor L-inhibitor (achiral) 

L-protease + + + 

D-protease + + + 



(e) Which of the three models proposed above is 
supported by these data? Explain your reasoning. 

(f) Why does Evans blue inhibit both forms of the protease? 

(g) Would you expect chymotrypsin to digest the D- 
protease? Explain your reasoning. 

(h) Would you expect total synthesis from D-amino acids 
followed by renaturation to yield active enzyme for any 
enzyme? Explain your reasoning. 
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Since the proteins participate in one way or another in all chemical 
processes in the living organism, one may expect highly significant 
information for biological chemistry from the elucidation of their 
structure and their transformations. 

— Emil Fischer, article in Berichte der deutschen chemischen 

Gesellschaft zu Berlin, 7906 
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Knowing the three-dimensional structure of a pro- 
tein is an important part of understanding how the 
protein functions. However, the structure shown in 
two dimensions on a page is deceptively static. Proteins 
are dynamic molecules whose functions almost invari- 
ably depend on interactions with other molecules, and 
these interactions are affected in physiologically impor- 
tant ways by sometimes subtle, sometimes striking 
changes in protein conformation. In this chapter, we ex- 
plore how proteins interact with other molecules and 
how their interactions are related to dynamic protein 
structure. The importance of molecular interactions to a 
protein's function can hardly be overemphasized. In 
Chapter 4, we saw that the function of fibrous proteins 
as structural elements of cells and tissues depends on 
stable, long-term quaternary interactions between iden- 
tical polypeptide chains. As we shall see in this chapter, 
the functions of many other proteins involve interac- 
tions with a variety of different molecules. Most of these 
interactions are fleeting, though they may be the basis of 
complex physiological processes such as oxygen trans- 
port, immune function, and muscle contraction — the 
topics we examine in this chapter. The proteins that 
carry out these processes illustrate the following key 
principles of protein function, some of which will be fa- 
miliar from the previous chapter: 

The functions of many proteins involve the re- 
versible binding of other molecules. A molecule 
bound reversibly by a protein is called a ligand. A 
ligand may be any kind of molecule, including an- 
other protein. The transient nature of protein- 



ligand interactions is critical to life, allowing an or- 
ganism to respond rapidly and reversibly to chang- 
ing environmental and metabolic circumstances. 

A ligand binds at a site on the protein called the bind- 
ing site, which is complementary to the ligand in size, 
shape, charge, and hydrophobic or hydrophilic char- 
acter. Furthermore, the interaction is specific: the 
protein can discriminate among the thousands of dif- 
ferent molecules in its environment and selectively 
bind only one or a few. A given protein may have sep- 
arate binding sites for several different ligands. These 
specific molecular interactions are crucial in main- 
taining the high degree of order in a living system. 
(This discussion excludes the binding of water, which 
may interact weakly and nonspecifically with many 
parts of a protein. In Chapter 6, we consider water as 
a specific ligand for many enzymes.) 

Proteins are flexible. Changes in conformation may 
be subtle, reflecting molecular vibrations and small 
movements of amino acid residues throughout the 
protein. A protein flexing in this way is sometimes 
said to "breathe." Changes in conformation may also 
be quite dramatic, with major segments of the pro- 
tein structure moving as much as several nanome- 
ters. Specific conformational changes are frequently 
essential to a protein's function. 

The binding of a protein and ligand is often coupled 
to a conformational change in the protein that 
makes the binding site more complementary to the 
ligand, permitting tighter binding. The structural 
adaptation that occurs between protein and ligand 
is called induced fit. 

In a multisubunit protein, a conformational change 
in one subunit often affects the conformation of 
other subunits. 

Interactions between ligands and proteins may be 
regulated, usually through specific interactions with 
one or more additional ligands. These other ligands 
may cause conformational changes in the protein 
that affect the binding of the first ligand. 

[l53j 
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Enzymes represent a special case of protein 
function. Enzymes bind and chemically transform other 
molecules — they catalyze reactions. The molecules acted 
upon by enzymes are called reaction substrates rather 
than ligands, and the ligand-binding site is called the cat- 
alytic site or active site. In this chapter we emphasize 
the noncatalytic functions of proteins. In Chapter 6 we 
consider catalysis by enzymes, a central topic in bio- 
chemistry. You will see that the themes of this chapter — 
binding, specificity, and conformational change — are 
continued in the next chapter, with the added element of 
proteins participating in chemical transformations. 

5.1 Reversible Binding of a Protein to a 
Ligand: Oxygen-Binding Proteins 

Myoglobin and hemoglobin may be the most-studied and 
best-understood proteins. They were the first proteins 
for which three-dimensional structures were deter- 
mined, and these two molecules illustrate almost every 
aspect of that most central of biochemical processes: 
the reversible binding of a ligand to a protein. This clas- 
sic model of protein function tells us a great deal about 
how proteins work. g Oxygen-Binding Proteins-Myoglobin: 
Oxygen Storage 

Oxygen Can Bind to a Heme Prosthetic Group 

Oxygen is poorly soluble in aqueous solutions (see Table 
2-3) and cannot be carried to tissues in sufficient quan- 
tity if it is simply dissolved in blood serum. Diffusion of 
oxygen through tissues is also ineffective over distances 
greater than a few millimeters. The evolution of larger, 
multicellular animals depended on the evolution of pro- 
teins that could transport and store oxygen. However, 
none of the amino acid side chains in proteins are suited 



X X 



NH HN 



(a) X X 

FIGURE 5-1 Heme. The heme group is present in myoglobin, hemo- 
globin, and many other proteins, designated heme proteins. Heme 
consists of a complex organic ring structure, protoporphyrin IX, with a 
bound iron atom in its ferrous (Fe 2+ ) state, (a) Porphyrins, of which 
protoporphyrin IX is only one example, consist of four pyrrole rings 



for the reversible binding of oxygen molecules. This role 
is filled by certain transition metals, among them iron 
and copper, that have a strong tendency to bind oxygen. 
Multicellular organisms exploit the properties of metals, 
most commonly iron, for oxygen transport. However, 
free iron promotes the formation of highly reactive oxy- 
gen species such as hydroxyl radicals that can damage 
DNA and other macromolecules. Iron used in cells is 
therefore bound in forms that sequester it and/or make it 
less reactive. In multicellular organisms — especially 
those in which iron, in its oxygen-carrying capacity, 
must be transported over large distances — iron is often 
incorporated into a protein-bound prosthetic group 
called heme (or haem). (Recall from Chapter 3 that a 
prosthetic group is a compound permanently associated 
with a protein that contributes to the protein's function.) 

Heme consists of a complex organic ring structure, 
protoporphyrin, to which is bound a single iron atom 
in its ferrous (Fe 2+ ) state (Fig. 5-1). The iron atom has 
six coordination bonds, four to nitrogen atoms that are 
part of the flat porphyrin ring system and two perpen- 
dicular to the porphyrin. The coordinated nitrogen 
atoms (which have an electron-donating character) help 
prevent conversion of the heme iron to the ferric (Fe 3+ ) 
state. Iron in the Fe 2+ state binds oxygen reversibly; in 
the Fe 3+ state it does not bind oxygen. Heme is found 
in many oxygen-transporting proteins, as well as in 
some proteins, such as the cytochromes, that partici- 
pate in oxidation-reduction (electron-transfer) reac- 
tions (Chapter 19). 

Free heme molecules (heme not bound to protein) 
leave Fe 2+ with two "open" coordination bonds. Simul- 
taneous reaction of one 0 2 molecule with two free heme 
molecules (or two free Fe 2+ ) can result in irreversible 
conversion of Fe 2+ to Fe 3+ . In heme-containing pro- 
teins, this reaction is prevented by sequestering each 
heme deep within the protein structure. Thus, access to 




linked by methene bridges, with substitutions at one or more of the 
positions denoted X. (b, c) Two representations of heme (derived from 
PDB ID 1CCR). The iron atom of heme has six coordination bonds: 
four in the plane of, and bonded to, the flat porphyrin ring system, and 
(d) two perpendicular to it. 
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Edge view 



N-Fe Oo 

/ i 

Histidine Plane of 
residue porphyrin 
ring system 

FIGURE 5-2 The heme group viewed from the side. This view shows 
the two coordination bonds to Fe 2+ that are perpendicular to the por- 
phyrin ring system. One is occupied by a His residue, sometimes 
called the proximal His; the other is the binding site for oxygen. The 
remaining four coordination bonds are in the plane of, and bonded to, 
the flat porphyrin ring system. 

the two open coordination bonds is restricted. One of 
these two coordination bonds is occupied by a side- 
chain nitrogen of a His residue. The other is the binding 
site for molecular oxygen (0 2 ) (Fig. 5-2). When oxy- 
gen binds, the electronic properties of heme iron 
change; this accounts for the change in color from the 
dark purple of oxygen-depleted venous blood to the 
bright red of oxygen-rich arterial blood. Some small mol- 
ecules, such as carbon monoxide (CO) and nitric oxide 
(NO), coordinate to heme iron with greater affinity than 
does 0 2 . When a molecule of CO is bound to heme, 0 2 
is excluded, which is why CO is highly toxic to aerobic 
organisms (a topic explored later, in Box 5-1). By 
surrounding and sequestering heme, oxygen-binding 
proteins regulate the access of CO and other small mol- 
ecules to the heme iron. 

Myoglobin Has a Single Binding Site for Oxygen 

Myoglobin (M T 16,700; abbreviated Mb) is a relatively 
simple oxygen-binding protein found in almost all mam- 
mals, primarily in muscle tissue. As a transport protein, 
it facilitates oxygen diffusion in muscle. Myoglobin is 
particularly abundant in the muscles of diving mammals 
such as seals and whales, where it also has an oxygen- 
storage function for prolonged excursions undersea. 
Proteins very similar to myoglobin are widely distrib- 
uted, occurring even in some single-celled organisms. 

Myoglobin is a single polypeptide of 153 amino acid 
residues with one molecule of heme. It is typical of the 
family of proteins called globins, all of which have sim- 
ilar primary and tertiary structures. The polypeptide is 
made up of eight a -helical segments connected by 
bends (Fig. 5-3). About 78% of the amino acid residues 
in the protein are found in these a helices. 

Any detailed discussion of protein function in- 
evitably involves protein structure. In the case of myo- 
globin, we first introduce some structural conventions 
peculiar to globins. As seen in Figure 5-3, the helical 




FIGURE 5-3 Myoglobin. (PDB ID 1 MBO) The eight a-helical segments 
(shown here as cylinders) are labeled A through H. Nonhelical 
residues in the bends that connect them are labeled AB, CD, EF, and so 
forth, indicating the segments they interconnect. A few bends, includ- 
ing BC and DE, are abrupt and do not contain any residues; these are 
not normally labeled. (The short segment visible between D and E is an 
artifact of the computer representation.) The heme is bound in a pocket 
made up largely of the E and F helices, although amino acid residues 
from other segments of the protein also participate. 

segments are named A through H. An individual amino 
acid residue is designated either by its position in the 
amino acid sequence or by its location in the sequence of 
a particular a-helical segment. For example, the His 
residue coordinated to the heme in myoglobin, His 93 (the 
93rd residue from the amino-terminal end of the myoglo- 
bin polypeptide sequence), is also called His F8 (the 8th 
residue in a helix F). The bends in the structure are 
designated AB, CD, EF, FG, and so forth, reflecting the 
a-helical segments they connect. 

Protein-Ligand Interactions Can Be 
Described Quantitatively 

The function of myoglobin depends on the protein's ability 
not only to bind oxygen but also to release it when and 
where it is needed. Function in biochemistry often revolves 
around a reversible protein-ligand interaction of this type. 
A quantitative description of this interaction is therefore a 
central part of many biochemical investigations. 

In general, the reversible binding of a protein (P) to 
a ligand (L) can be described, by a simple equilibrium 
expression: 

P + L PL (5-1) 
The reaction is characterized by an equilibrium con- 
stant, K a , such that 

K »-[P][L]-k d (52) 
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where k a and k d are rate constants (more on these be- 
low). The term K a is an association constant (not to 
be confused with the K a that denotes an acid dissocia- 
tion constant; p. 58) that describes the equilibrium 
between the complex and the unbound components of 
the complex. The association constant provides a meas- 
ure of the affinity of the ligand L for the protein. K a has 
units of m" 1 ; a higher value of K a corresponds to a higher 
affinity of the ligand for the protein. 

The equilibrium term K a is also equivalent to the ra- 
tio of the rates of the forward (association) and reverse 
(dissociation) reactions that form the PL complex. The 
association rate is described by a rate constant fc a , and 
dissociation by the rate constant k d . As discussed fur- 
ther in the next chapter, rate constants are proportion- 
ality constants, describing the fraction of a pool of 
reactant that reacts in a given amount of time. When the 
reaction involves one molecule, such as the dissociation 
reaction PL -> P + L, the reaction is first order and the 
rate constant (/c d ) has units of reciprocal time (s _1 ). 
When the reaction involves two molecules, such as the 
association reaction P + L -> PL, it is called second or- 
der, and its rate constant (/c a ) has units of m" 1 s -1 . 

KEY CONVENTION; Equilibrium constants are denoted with 
a capital K and rate constants with a lower case k. ■ 

A rearrangement of the first part of Equation 5-2 shows 
that the ratio of bound to free protein is directly propor- 
tional to the concentration of free ligand: 



K a [L] 



[PL] 
[PI 



(5-3) 



When the concentration of the ligand is much greater 
than the concentration of ligand-binding sites, the 
binding of the ligand by the protein does not apprecia- 
bly change the concentration of free (unbound) lig- 
and — that is, [L] remains constant. This condition is 
broadly applicable to most ligands that bind to proteins 
in cells and simplifies our description of the binding 
equilibrium. 

We can now consider the binding equilibrium from 
the standpoint of the fraction, 6 (theta), of ligand- 
binding sites on the protein that are occupied by ligand: 



binding sites occupied [PL] 



[P] 



(5-4) 



total binding sites [PL] 

Substituting if a [L][P] for [PL] (see Eqn 5-3) and rear 
ranging terms gives 



K a [h][P] 



K a [L] 



[L] 



K a [L][P] + [P] KJL] + 1 



[L] 



_1 



(5-5) 



The value of K a can be determined from a plot of 0 ver- 
sus the concentration of free ligand, [L] (Fig, 5-4a). 
Any equation of the form x = y/(y+ z) describes a hy- 
perbola, and 6 is thus found to be a hyperbolic function 
of [L]. The fraction of ligand-binding sites occupied ap- 
proaches saturation asymptotically as [L] increases. The 
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FIGURE 5-4 Graphical representations of ligand binding. The fraction 
of ligand-binding sites occupied, 0, is plotted against the concentration 
of free ligand. Both curves are rectangular hyperbolas, (a) A hypothet- 
ical binding curve for a ligand L. The [L] at which half of the available 
ligand-binding sites are occupied is equivalent to MK a; or K d . The 
curve has a horizontal asymptote at 6 - 1 and a vertical asymptote (not 
shown) at [L] = -MK a . (b) A curve describing the binding of oxygen to 
myoglobin. The partial pressure of 0 2 in the air above the solution is 
expressed in kilopascals (kPa). Oxygen binds tightly to myoglobin, 
with a P 50 of only 0.26 kPa. 

[L] at which half of the available ligand-binding sites are 
occupied (that is, 0 = 0.5) corresponds to \/K a . 

It is more common (and intuitively simpler) , however, 
to consider the dissociation constant, K d , which is the 
reciprocal of K a (K d = and is given in units of molar 
concentration (m). K d is the equilibrium constant for the 
release of ligand. The relevant expressions change to 

[P][L] _ k d 
[PL] k a 



(5-6) 



[PL] 



[P][L] 

K d 

[L] 

fL] +gj 



(5-7) 



g (5-8) 



When [L] equals K d , half of the ligand-binding sites are 
occupied. As [L] falls below K d , progressively less of the 
protein has ligand bound to it. In order for 90% of 
the available ligand-binding sites to be occupied, [L] 
must be nine times greater than K d . 

In practice, K d is used much more often than K a to 
express the affinity of a protein for a ligand. Note that 
a lower value of K d corresponds to a higher affinity of 
ligand for the protein. The mathematics can be reduced 
to simple statements: K d is equivalent to the molar 
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Some Protein Dissociation Constants 


Protein 


Ligand 


K d (m)* 


Avidin (egg white) 


Biotin 


1 X 10" 15 


Insulin receptor (human) 


Insulin 


1 x lO" 10 


Anti-HIV lnuriunoglobulin (human) ^ 


gp4i ^riiv-i sunace pruLemj 


A V If) -10 


Nickel-binding protein (E. coll) 


Ni 2+ 


1 x 10" 7 


Calmodulin (rat)* 


Ca 2+ 


3 x 10" 6 






2 x 10" 5 




Typical receptor-ligand interactions 





Sequence-specific protein-DNA 



Biotin-avidin 

i 
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Color bars indicate the range of dissociation constants typical of various classes of interactions in bio- 
logical systems. A few interactions, such as that between the protein avidin and the enzyme cofactor 
biotin, fall outside the normal ranges. The avidin-biotin interaction is so tight it may be considered 
irreversible. Sequence-specific protein-DNA interactions reflect proteins that bind to a particular 
sequence of nucleotides in DNA, as opposed to general binding to any DNA site. 

*A reported dissociation constant is valid only for the particular solution conditions under which it was measured. K d values for a 
protein-ligand interaction can be altered, sometimes by several orders of magnitude, by changes in the solution's salt concentration, pH, 
or other variables. 

tThis immunoglobulin was isolated as part of an effort to develop a vaccine against HIV. Immunoglobulins {described later in the chapter) 
are highly variable, and the K d reported here should not be considered characteristic of all immunoglobulins. 

Calmodulin has four binding sites for calcium. The values shown reflect the highest- and lowest-affinity binding sites observed in one set 
of measurements. 



concentration of ligand at which half of the available lig- 
and-binding sites are occupied. At this point, the protein 
is said to have reached half-saturation with respect to li- 
gand binding. The more tightly a protein binds a ligand, 
the lower the concentration of ligand required for half 
the binding sites to be occupied, and thus the lower the 
value of K d , Some representative dissociation constants 
are given in Table 5-1 ; the scale shows typical ranges for 
dissociation constants found in biological systems. 



WORKED EXAMPLE 5-1 



Receptor-Ligand 
Dissociation Constants 



Two proteins, X and Y, bind to the same ligand, A, with 
the binding curves shown below. 




What is the dissociation constant, for each protein? 
Which protein (X or Y) has a greater affinity for ligand A? 

Solution: We can determine the dissociation constants 
by inspecting the graph. Since 6 represents the fraction 
of binding sites occupied by ligand, the concentration 
of ligand at which half the binding sites are occupied — 
that is, the point where the binding curve crosses the 
line where 6 = 0.5 — is the dissociation constant. For X, 
K d = 2 /am; for Y, K d = 6 jjlm. Because X is half-saturated 
at a lower [A], it has a higher affinity for the ligand. 



The binding of oxygen to myoglobin follows the pat- 
terns discussed above. However, because oxygen is a 
gas, we must make some minor adjustments to the equa- 
tions so that laboratory experiments can be carried out 
more conveniently. We first substitute the concentration 
of dissolved oxygen for [L] in Equation 5-8 to give 



[0 2 ] 



[0 2 ] + K d 



(5-9) 



As for any ligand, K d equals the [0 2 ] at which half of the 
available ligand-binding sites are occupied, or [O 2 ]o,5- 
Equation 5-9 thus becomes 



0 = 



[0 2 ] 



[0 2 1 + [OJo.5 



(5-10) 
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In experiments using oxygen as a ligand, it is the partial 
pressure of oxygen (p0 2 ) in the gas phase above the so- 
lution that is varied, because this is easier to measure 
than the concentration of oxygen dissolved in the solu- 
tion. The concentration of a volatile substance in solution 
is always proportional to the local partial pressure of the 
gas. So, if we define the partial pressure of oxygen at 
[C>2]o 5 as ^50) substitution in Equation 5-10 gives 



p0 2 



p0 2 + P 5l 



§ (5-11) 



A binding curve for myoglobin that relates 6 to p0 2 is 
shown in Figure 5-4b. 

Protein Structure Affects How Ligands Bind 

The binding of a ligand to a protein is rarely as simple as 
the above equations would suggest. The interaction is 
greatly affected by protein structure and is often ac- 
companied by conformational changes. For example, 
the specificity with which heme binds its various ligands 
is altered when the heme is a component of myoglobin. 
Carbon monoxide binds to free heme molecules more 
than 20,000 times better than does 0 2 (that is, the K a or 
P 50 for CO binding to free heme is more than 20,000 
times lower than that for 0 2 ) , but it binds only about 200 
times better than 0 2 when the heme is bound in myo- 
globin. The difference may be partly explained by steric 
hindrance. When 0 2 binds to free heme, the axis of the 
oxygen molecule is positioned at an angle to the Fe — O 
bond (Fig. 5-5a). In contrast, when CO binds to free 
heme, the Fe, C, and O atoms lie in a straight line 
(Fig. 5-5b). In both cases, the binding reflects the 
geometry of hybrid orbitals in each ligand. In myoglobin, 
His 64 (His E7), on the 0 2 -binding side of the heme, is too 
far away to coordinate with the heme iron, but it does in- 
teract with a ligand bound to heme. This residue, called 
the distal His (as distinct from the proximal His, His 
F8), forms a hydrogen bond with 0 2 (Fig. 5-5c) but may 
preclude the linear binding of CO, providing one expla- 
nation for the selectively diminished binding of CO to 
heme in myoglobin (and hemoglobin). A reduction in 
CO binding is physiologically important, because CO is 
alow-level byproduct of cellular metabolism. Other 
factors, not yet well-defined, also may modulate the 
interaction of heme with CO in these proteins. 

The binding of 0 2 to the heme in myoglobin also de- 
pends on molecular motions, or "breathing," in the pro- 
tein structure. The heme molecule is deeply buried in 
the folded polypeptide, with no direct path for oxygen to 
move from the surrounding solution to the ligand-binding 
site. If the protein were rigid, 0 2 could not enter or leave 
the heme pocket at a measurable rate. However, rapid 
molecular flexing of the amino acid side chains produces 
transient cavities in the protein structure, and 0 2 makes 
its way in and out by moving through these cavities. 
Computer simulations of rapid structural fluctuations in 
myoglobin suggest that there are many such pathways. 
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FIGURE 5-5 Steric effects caused by ligand binding to the heme of myo- 
globin, (a) Oxygen binds to heme with the 0 2 axis at an angle, a binding 
conformation readily accommodated by myoglobin, (b) Carbon monox- 
ide binds to free heme with the CO axis perpendicular to the plane of the 
porphyrin ring. When binding to the heme in myoglobin, CO is forced to 
adopt a slight angle because the perpendicular arrangement is sterically 
blocked by His E7, the distal His. This effect weakens the binding of CO 
to myoglobin, (c) Another view of the heme of myoglobin (derived from 
PDB ID 1MBO), showing the arrangement of key amino acid residues 
around the heme. The bound 0 2 is hydrogen-bonded to the distal His, 
His E7 (His 64 ), further facilitating the binding of 0 2 . 

One major route is provided by rotation of the side chain 
of the distal His (His 64 ), which occurs on a nanosecond 
(10~ 9 s) time scale. Even subtle conformational changes 
can be critical for protein activity. 

Hemoglobin Transports Oxygen in Blood 

Q Oxygen-Binding Proteins — Hemoglobin: Oxygen Transport 

Nearly all the oxygen carried by whole blood in animals 
is bound and transported by hemoglobin in erythrocytes 
(red blood cells). Normal human erythrocytes are small 
(6 to 9 ^im in diameter), biconcave disks. They are 
formed from precursor stem cells called hemocyto- 
blasts. In the maturation process, the stem cell 
produces daughter cells that form large amounts of 
hemoglobin and then lose their intracellular organelles — 
nucleus, mitochondria, and endoplasmic reticulum. 
Erythrocytes are thus incomplete, vestigial cells, unable 
to reproduce and, in humans, destined to survive for 
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only about 120 days. Their main function is to carry 
hemoglobin, which is dissolved in the cytosol at a very 
high concentration (—34% by weight). 

In arterial blood passing from the lungs through the 
heart to the peripheral tissues, hemoglobin is about 96% 
saturated with oxygen. In the venous blood returning to 
the heart, hemoglobin is only about 64% saturated. Thus, 
each 100 mL of blood passing through a tissue releases 
about one-third of the oxygen it carries, or 6.5 mL of 0 2 
gas at atmospheric pressure and body temperature. 

Myoglobin, with its hyperbolic binding curve for 
oxygen (Fig. 5-4b), is relatively insensitive to small 
changes in the concentration of dissolved oxygen and so 
functions well as an oxygen-storage protein. Hemoglo- 
bin, with its multiple subunits and 0 2 -binding sites, is 
better suited to oxygen transport. As we shall see, inter- 
actions between the subunits of a multimeric protein 
can permit a highly sensitive response to small changes 
in ligand concentration. Interactions among the sub- 
units in hemoglobin cause conformational changes that 
alter the affinity of the protein for oxygen. The modula- 
tion of oxygen binding allows the 0 2 -transport protein 
to respond to changes in oxygen demand by tissues. 

Hemoglobin Subunits Are Structurally 
Similar to Myoglobin 

Hemoglobin (M r 64,500; abbreviated Hb) is roughly 
spherical, with a diameter of nearly 5.5 nm. It is a 
tetrameric protein containing four heme prosthetic 
groups, one associated with each polypeptide chain. 
Adult hemoglobin contains two types of globin, two a 
chains (141 residues each) and two (3 chains (146 
residues each). Although fewer than half of the amino 
acid residues are identical in the polypeptide sequences 
of the a and /3 subunits, the three-dimensional struc- 
tures of the two types of subunits are very similar. Fur- 
thermore, their structures are very similar to that of 
myoglobin (Fig. 5-6), even though the amino acid se- 
quences of the three polypeptides are identical at only 
27 positions (Fig. 5-7). All three polypeptides are 
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FIGURE 5-6 Comparison of the structures of myoglobin (PDB ID 
1 MBO) and the j8 subunit of hemoglobin (derived from PDB ID 1 HGA). 
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FIGURE 5-7 The amino acid sequences of whale myoglobin and the a 
and p chains of human hemoglobin. Dashed lines mark helix bound- 
aries To align the sequences optimally, short gaps must be introduced 
into both Hb sequences where a few amino acids are present in the 
other, compared sequences. With the exception of the missing D helix 
in Hba, this alignment permits the use of the helix lettering convention 
that emphasizes the common positioning of amino acid residues that 
are identical in all three structures (shaded). Residues shaded in pink 
are conserved in all known globins. Note that the common helix-letter- 
and-number designation for amino acids does not necessarily corre- 
spond to a common position in the linear sequence of amino acids in 
the polypeptides For example, the distal His residue is His E7 in all 
three structures, but corresponds to His 64 , His 58 , and His 63 in the 
linear sequences of Mb, Hba, and Hb/3, respectively. Nonhelical 
residues at the amino and carboxyl termini, beyond the first (A) and 
last (H) a-helical segments, are labeled NA and HC, respectively. 
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FIGURE 5-8 Dominant interactions between hemoglobin subunits. 

In this representation, a subunits are light and (3 subunits are dark. The 
strongest subunit interactions (highlighted) occur between unlike sub- 
units. When oxygen binds, the contact changes little, but there 
is a large change at the a^ 2 contact, with several ion pairs broken 
(PDB ID 1 HGA). 

members of the globin family of proteins. The helix- 
naming convention described for myoglobin is also 
applied to the hemoglobin polypeptides, except that 
the a subunit lacks the short D helix. The heme-binding 
pocket is made up largely of the E and F helices in each 
of the subunits. 

The quaternary structure of hemoglobin features 
strong interactions between unlike subunits. The 
interface (and its a 2 /3 2 counterpart) involves more than 
30 residues, and its interaction is sufficiently strong that 
although mild treatment of hemoglobin with urea tends 
to disassemble the tetramer into a/3 dimers, these 
dimers remain intact. The a^ 2 (and a 2 p 1 ) interface in- 
volves 19 residues (Fig. 5-8). Hydrophobic interac- 
tions predominate at all the interfaces, but there are 
also many hydrogen bonds and a few ion pairs (or salt 
bridges), whose importance is discussed below. 

Hemoglobin Undergoes a Structural Change 
on Binding Oxygen 

X-ray analysis has revealed two major conformations of 
hemoglobin: the R state and the T state. Although oxy- 
gen binds to hemoglobin in either state, it has a signifi- 
cantly higher affinity for hemoglobin in the R state. 
Oxygen binding stabilizes the R state. When oxygen is 
absent experimentally, the T state is more stable and is 
thus the predominant conformation of deoxyhemoglo- 
bin. T and R originally denoted "tense" and "relaxed," 
respectively, because the T state is stabilized by a 
greater number of ion pairs, many of which lie at the 
ai£ 2 (and a 2 /3 1 ) interface (Fig. 5-9). The binding of 0 2 
to a hemoglobin subunit in the T state triggers a change 
in conformation to the R state. When the entire protein 
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FIGURE 5-9 Some ion pairs that stabilize theT state of deoxyhemo- 
globin. (a) Close-up view of a portion of a deoxyhemoglobin molecule 
in theT state (PDB ID 1 HGA). Interactions between the ion pairs His 
HC3 and Asp FG1 of the /3 subunit (blue) and between Lys C5 of the a 
subunit (gray) and His HC3 (its a-carboxyl group) of the £ subunit are 
shown with dashed lines, (Recall that HC3 is the carboxyl-terminal 
residue of the j8 subunit) (b) Interactions between these ion pairs, and 
between others not shown in (a), are schematized in this representa- 
tion of the extended polypeptide chains of hemoglobin. 



undergoes this transition, the structures of the individ- 
ual subunits change little, but the a/3 subunit pairs slide 
past each other and rotate, narrowing the pocket be- 
tween the f3 subunits (Fig. 5-10). In this process, some 
of the ion pairs that stabilize the T state are broken and 
some new ones are formed. 

Max Perutz proposed that the T — > R transition is 
triggered by changes in the positions of key amino acid 
side chains surrounding the heme. In the T state, the 
porphyrin is slightly puckered, causing the heme iron 
to protrude somewhat on the proximal His (His F8) 
side. The binding of 0 2 causes the heme to assume a 
more planar conformation, shifting the position of the 
proximal His and the attached F helix (Fig. 5-11). 
These changes lead to adjustments in the ion pairs at 
the ai/3 2 interface. 

Hemoglobin Binds Oxygen Cooperatively 

Hemoglobin must bind oxygen efficiently in the lungs, 
where the p0 2 is about 13.3 kPa, and release oxygen in 
the tissues, where the p0 2 is about 4 kPa. Myoglobin, or 
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T state 

FIGURE 5 -10 TheT -> R transition. (PDB ID 1 HGA and 1 BBB) In these 
depictions of deoxyhemoglobin, as in Figure 5-9, the p subunits are 
blue and the a subunits are gray. Positively charged side chains and 
chain termini involved in ion pairs are shown in blue, their negatively 
charged partners in red. The Lys C5 of each a subunit and Asp FG1 of 
each p subunit are visible but not labeled (compare Fig. 5-9a). Note 
that the molecule is oriented slightly differently than in Figure 5-9. The 
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FIGURE 5- VI Changes in conformation near heme on 0 2 binding to 
deoxyhemoglobin. (Derived from PDB ID 1 HGA and 1 BBB) The shift 
in the position of helix F when heme binds 0 2 is thought to be one of 
the adjustments that triggers the T -> R transition. 

any protein that binds oxygen with a hyperbolic binding 
curve, would be ill-suited to this function, for the reason 
illustrated in Plgmre 5-1 2 . A protein that bound 0 2 with 
high affinity would bind it efficiently in the lungs but 
would not release much of it in the tissues. If the protein 
bound oxygen with a sufficiently low affinity to release it 
in the tissues, it would not pick up much oxygen in 
the lungs. 

Hemoglobin solves the problem by undergoing a 
transition from a low-affinity state (the T state) to a 
high-affinity state (the R state) as more 0 2 molecules are 
bound. As a result, hemoglobin has a hybrid S-shaped, 
or sigmoid, binding curve for oxygen (Fig. 5-12). A 
single-subunit protein with a single ligand-binding site 




R state 

transition from the T state to the R state shirts the subunit pairs sub- 
stantially, affecting certain ion pairs. Most noticeably, the His HC3 
residues at the carboxyl termini of the p subunits, which are involved 
in ion pairs in the T state, rotate in the R state toward the center of the 
molecule, where they are no longer in ion pairs. Another dramatic 
result of the T -> R transition is a narrowing of the pocket between the 
P subunits. 

cannot produce a sigmoid binding curve — even if bind- 
ing elicits a conformational change — because each 
molecule of ligand binds independently and cannot af- 
fect ligand binding to another molecule. In contrast, 0 2 
binding to individual subunits of hemoglobin can alter 
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FIGURE 5-12 A sigmoid (cooperative) binding curve. A sigmoid bind- 
ing curve can be viewed as a hybrid curve reflecting a transition from 
a low-affinity to a high-affinity state. Because of its cooperative bind- 
ing, as manifested by a sigmoid binding curve, hemoglobin is more 
sensitive to the small differences in 0 2 concentration between the tis- 
sues and the lungs, allowing it to bind oxygen in the lungs (where p0 2 
is high) and release it in the tissues (where p0 2 is low). 
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the affinity for 0 2 in adjacent subunits. The first mole- 
cule of 0 2 that interacts with deoxyhemoglobin binds 
weakly, because it binds to a subunit in the T state. Its 
binding, however, leads to conformational changes that 
are communicated to adjacent subunits, making it eas- 
ier for additional molecules of 0 2 to bind. In effect, the 
T — » R transition occurs more readily in the second sub- 
unit once 0 2 is bound to the first subunit. The last 
(fourth) 0 2 molecule binds to a heme in a subunit that is 
already in the R state, and hence it binds with much 
higher affinity than the first molecule. 

An allosteric protein is one in which the binding 
of a ligand to one site affects the binding properties of 
another site on the same protein. The term "allosteric" 
derives from the Greek alios, "other," and stereos, 
"solid" or "shape." Allosteric proteins are those having 
"other shapes," or conformations, induced by the binding 
of ligands referred to as modulators. The conformational 
changes induced by the modulator(s) interconvert more- 
active and less-active forms of the protein. The modula- 
tors for allosteric proteins may be either inhibitors or 
activators. When the normal ligand and modulator are 
identical, the interaction is termed homotropic. When 
the modulator is a molecule other than the normal lig- 
and, the interaction is heterotropic. Some proteins 
have two or more modulators and therefore can have 
both homotropic and heterotropic interactions. 

Cooperative binding of a ligand to a multimeric pro- 
tein, such as we observe with the binding of 0 2 to hemo- 
globin, is a form of allosteric binding. The binding of one 
ligand affects the affinities of any remaining unfilled 
binding sites, and 0 2 can be considered as both a ligand 
and an activating homotropic modulator. There is only 
one binding site for 0 2 on each subunit, so the allosteric 
effects giving rise to cooperativity are mediated by con- 
formational changes transmitted from one subunit to an- 
other by subunit-subunit interactions. A sigmoid binding 
curve is diagnostic of cooperative binding. It permits a 
much more sensitive response to ligand concentration 
and is important to the function of many multisubunit 
proteins. The principle of alios tery extends readily to 
regulatory enzymes, as we shall see in Chapter 6. 

Cooperative conformational changes depend on 
variations in the structural stability of different parts of 
a protein, as described in Chapter 4. The binding sites of 
an allosteric protein typically consist of stable segments 
in proximity to relatively unstable segments, with the 
latter capable of frequent changes in conformation or 
disorganized motion (Fig. 5-13). When a ligand binds, 
the moving parts of the protein's binding site may be sta- 
bilized in a particular conformation, affecting the con- 
formation of adjacent polypeptide subunits. If the entire 
binding site were highly stable, then few structural 
changes could occur in this site or be propagated to 
other parts of the protein when a ligand binds. 

As is the case with myoglobin, ligands other than 
oxygen can bind to hemoglobin. An important example 
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FIGURE 5-13 Structural changes in a multisubunit protein undergo- 
ing cooperative binding to ligand. Structural stability is not uniform 
throughout a protein molecule. Shown here is a hypothetical dimeric 
protein, with regions of high (blue), medium (green), and low (red) 
stability. The ligand-binding sites are composed of both high- and low- 
stability segments, so affinity for ligand is relatively low. The confor- 
mational changes that occur as ligand binds convert the protein from a 
low- to a high-affinity state, a form of induced fit. 

is carbon monoxide, which binds to hemoglobin about 
250 times better than does oxygen. Human exposure to 
CO can have tragic consequences (Box 5-1). 

Cooperative Ligand Binding Can Be 
Described Quantitatively 

Cooperative binding of oxygen by hemoglobin was first 
analyzed by Archibald Hill in 1910. From this work came 
a general approach to the study of cooperative ligand 
binding to multisubunit proteins. 

For a protein with n binding sites, the equilibrium of 
Equation 5-1 becomes 

p + nL PL„ (5-12) 

and the expression for the association constant becomes 

[PLJ 



[PHLF 



(5-13) 
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Carbon Monoxide: A Stealthy Killer 



BOX 5-1 

Lake Powell, Arizona, August 2000. A family was vacation- 
ing in a rented houseboat. They turned on the electrical 
generator to power an air conditioner and a television. 
About 15 minutes later, two brothers, aged 8 and 11, 
jumped off the swim deck at the stern. Situated immedi- 
ately below the deck was the exhaust port for the generator. 
Within two minutes, both boys were overcome by the car- 
bon monoxide in the exhaust, which had become concen- 
trated in the space under the deck. Both drowned. These 
deaths, along with a series of deaths in the 1990s that were 
linked to houseboats of similar design, eventually led to the 
recall and redesign of the generator exhaust assembly 

Carbon monoxide (CO), a colorless, odorless gas, is re- 
sponsible for more than half of yearly deaths due to poison- 
ing worldwide. CO has an approximately 250-fold greater 
affinity for hemoglobin than does oxygen. Consequently, 
relatively low levels of CO can have substantial and tragic 
effects. When CO combines with hemoglobin, the complex 
is referred to as carboxyhemoglobin, or COHb. 

Some CO is produced by natural processes, but lo- 
cally high levels generally result only from human activi- 
ties. Engine and furnace exhausts are important sources, 
as CO is a byproduct of the incomplete combustion of fos- 
sil fuels. In the United States alone, nearly 4,000 people 
succumb to CO poisoning each year, both accidentally 
and intentionally Many of the accidental deaths involve 
undetected CO buildup in enclosed spaces, such as when 
a household furnace malfunctions or leaks, venting CO 
into a home. However, CO poisoning can also occur in 
open spaces, as unsuspecting people at work or play in- 
hale the exhaust from generators, outboard motors, trac- 
tor engines, recreational vehicles, or lawn mowers. 

Carbon monoxide levels in the atmosphere are rarely 
dangerous, ranging from less than 0.05 parts per million 
(ppm) in remote and uninhabited areas to 3 to 4 ppm 
in some cities of the northern hemisphere. In the United 
States, the government-mandated (Occupational Safety 
and Health Administration, OSHA) limit for CO at work- 
sites is 50 ppm for people working an eight-hour shift. 
The tight binding of CO to hemoglobin means that COHb 
can accumulate over time as people are exposed to a 
constant low-level source of CO. 

In an average, healthy individual, 1% or less of the 
total hemoglobin is complexed as COHb. Since CO is a 
product of tobacco smoke, many smokers have COHb 
levels in the range of 3% to 8% of total hemoglobin, and 
the levels can rise to 15% for chain-smokers. COHb 
levels equilibrate at 50% in people who breathe air con- 
taining 570 ppm of CO for several hours. Reliable meth- 
ods have been developed that relate CO content in the 
atmosphere to COHb levels in the blood (Fig. 1). In tests 
of houseboats with a generator exhaust like the one re- 
sponsible for the Lake Powell deaths, CO levels reached 



6,000 to 30,000 ppm under the swim deck, and atmos- 
pheric 0 2 levels under the deck declined from 21% to 
12%. Even above the swim deck, CO levels of up to 
7,200 ppm were detected, high enough to cause death 
within a few minutes. 

How is a human affected by COHb? At levels of less 
than 10% of total hemoglobin, symptoms are rarely ob- 
served. At 15% , the individual experiences mild headaches. 
At 20% to 30%, the headache is severe and is generally ac- 
companied by nausea, dizziness, confusion, disorientation, 
and some visual disturbances; these symptoms are gener- 
ally reversed if the individual is treated with oxygen. At 
COHb levels of 30% to 50%, the neurological symptoms be- 
come more severe, and at levels near 50%, the individual 
loses consciousness and can sink into coma. Respiratory 
failure may follow. With prolonged exposure, some damage 
becomes permanent. Death normally occurs when COHb 
levels rise above 60%. Autopsy on the boys who died at 
Lake Powell revealed COHb levels of 59% and 52% . 

Binding of CO to hemoglobin is affected by many 
factors, including exercise (Fig. 1) and changes in air 
pressure related to altitude. Because of their higher 
base levels of COHb, smokers exposed to a source of 
CO often develop symptoms faster than nonsmokers. 
Individuals with heart, lung, or blood diseases that re- 
duce the availability of oxygen to tissues may also ex- 
perience symptoms at lower levels of CO exposure. 
Fetuses are at particular risk for CO poisoning, be- 
cause fetal hemoglobin has a somewhat higher affinity 
for CO than adult hemoglobin. Cases of CO exposure 
have been recorded in which the fetus died but the 
mother recovered. 

(continued on next page) 
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Carbon monoxide (ppm) 

FIGURE 1 Relationship between levels of COHb in blood and concen- 
tration of CO in the surrounding air. Four different conditions of expo- 
sure are shown, comparing the effects of short versus extended 
exposure, and exposure at rest versus exposure during light exercise. 
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Carbon Monoxide: A Stealthy Killer 

(continued from previous page) 




It may seem surprising that the loss of half of one's 
hemoglobin to COHb can prove fatal— we know that 
people with any of several anemic conditions manage to 
function reasonably well with half the usual complement 
of active hemoglobin. However, the binding of CO to he- 
moglobin does more than remove protein from the pool 
available to bind oxygen. It also affects the affinity of the 
remaining hemoglobin subunits for oxygen. As CO binds 
to one or two subunits of a hemoglobin tetramer, the 
affinity for 0 2 is increased substantially in the remaining 
subunits (Fig. 2). Thus, a hemoglobin tetramer with two 
bound CO molecules can efficiently bind 0 2 in the 
lungs— but it releases very little of it in the tissues. 
Oxygen deprivation in the tissues rapidly becomes severe. 
To add to the problem, the effects of CO are not limited 
to interference with hemoglobin function. CO binds to 
other heme proteins and a variety of metalloproteins. 
The effects of these interactions are not yet well under- 
stood, but they may be responsible for some of the 
longer-term effects of acute but nonfatal CO poisoning. 

When CO poisoning is suspected, rapid evacuation 
of the person away from the CO source is essential, but 
this does not always result in rapid recovery. When an 
individual is moved from the CO-polluted site to a nor- 
mal, outdoor atmosphere, 0 2 begins to replace the CO in 
hemoglobin— but the COHb levels drop only slowly. The 
half-time is 2 to 6,5 hours, depending on individual and 
environmental factors. If 100% oxygen is administered 
with a mask, the rate of exchange can be increased 
about fourfold; the half-time for 0 2 -CO exchange can be 
reduced to tens of minutes if 100% oxygen at a pressure 
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FIGURE 2 Several oxygen-binding curves: for normal hemoglobin, he- 
moglobin from an anemic individual with only 50% of her hemoglo- 
bin functional, and hemoglobin from an individual with 50% of his 
hemoglobin subunits complexed with CO. The p0 2 in human lungs 
and tissues is indicated 

of 3 atm (303 kPa) is supplied. Thus, rapid treatment by 
a properly equipped medical team is critical. 

Carbon monoxide detectors in all homes are highly 
recommended. This is a simple and inexpensive mea- 
sure to avoid possible tragedy. After completing the re- 
search for this box, we immediately purchased several 
new CO detectors for our homes. 



The expression for 6 (see Eqn 5-8) is 
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Rearranging, then taking the log of both sides, yields 
e [L] n 



1-0 K* 



log 



n log [L] - log K d 



(5-15) 



§ (5-16) 



where K d = [L]g 5 . 

Equation 5-16 is the Hill equation, and a plot of log 
[0/(1 - 0)] versus log [L] is called a Hill plot. Based on the 
equation, the Hill plot should have a slope of n. However, 
the experimentally determined slope actually reflects not 
the number of binding sites but the degree of interaction 
between them. The slope of a Hill plot is therefore de- 
noted by n H , the Hill coefficient, which is a measure of 
the degree of cooperativity. If n H equals 1, ligand binding 
is not cooperative, a situation that can arise even in a mul- 
tisubunit protein if the subunits do not communicate. An 



n u of greater than 1 indicates positive cooperativity in lig- 
and binding. This is the situation observed in hemoglobin, 
in which the binding of one molecule of ligand facilitates 
the binding of others. The theoretical upper limit for n H is 
reached when n H = n r In this case the binding would be 
completely cooperative: all binding sites on the protein 
would bind ligand simultaneously, and no protein mole- 
cules partially saturated with ligand would be present un- 
der any conditions. This limit is never reached in practice, 
and the measured value of n H is always less than the ac- 
tual number of ligand-binding sites in the protein. 

An n H of less than 1 indicates negative cooperativ- 
ity, in which the binding of one molecule of ligand im- 
pedes the binding of others. Well-documented cases of 
negative cooperativity are rare. 

To adapt the Hill equation to the binding of oxygen 
to hemoglobin we must again substitute p0 2 for [L] and 



log 



nlogp0 2 - nlogP 50 



(5-17) 
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FIGURE 5-14 Hill plots for oxygen binding to myoglobin and hemo- 
globin. When n H = 1, there is no evident cooperativity. The maximum 
degree of cooperativity observed for hemoglobin corresponds approxi- 
mately to n H = 3. Note that while this indicates a high level of cooper- 
ativity r> H is less than n, the number of 0 2 -binding sites in hemoglobin. 
This is normal for a protein that exhibits allosteric binding behavior. 



subunits of a cooperatively binding protein are function- 
ally identical, that each subunit can exist in (at least) two 
conformations, and that all subunits undergo the transi- 
tion from one conformation to the other simultaneously. 
In this model, no protein has individual subunits in dif- 
ferent conformations. The two conformations are in 
equilibrium. The ligand can bind to either conformation 
but binds each with different affinity. Successive binding 
of ligand molecules to the low-affinity conformation 
(which is more stable in the absence of ligand) makes a 
transition to the high-affinity conformation more likely. 

In the second model, the sequential model 
(Fig. 5-1 5b), proposed in 1966 by Daniel Koshland and 
colleagues, ligand binding can induce a change of con- 
formation in an individual subunit. A conformational 
change in one subunit makes a similar change in an 
adjacent subunit, as well as the binding of a second lig- 
and molecule, more likely. There are more potential 
intermediate states in this model than in the concerted 
model. The two models are not mutually exclusive; the 
concerted model may be viewed as the "all-or-none" lim- 
iting case of the sequential model. In Chapter 6 we use 
these models to investigate allosteric enzymes. 



Hill plots for myoglobin and hemoglobin are given in 
Figure 5-14. 

Two Models Suggest Mechanisms for 
Cooperative Binding 

Biochemists now know a great deal about the T and R 
states of hemoglobin, but much remains to be learned 
about how the T — > R transition occurs. Two models for 
the cooperative binding of ligands to proteins with mul- 
tiple binding sites have greatly influenced thinking 
about this problem. 

The first model was proposed by Jacques Monod, 
Jeffries Wyman, and Jean-Pierre Changeux in 1965, and 
is called the MWC model or the concerted model 
(Fig. 5-15a). The concerted model assumes that the 
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Hemoglobin Also Transports H + and C0 2 

In addition to carrying nearly all the oxygen required by 
cells from the lungs to the tissues, hemoglobin carries 
two end products of cellular respiration — H + and C0 2 — 
from the tissues to the lungs and the kidneys, where 
they are excreted. The C0 2) produced by oxidation of 
organic fuels in mitochondria, is hydrated to form bicar- 
bonate: 



C0 2 + H 2 0 



H + + HC0 3 " 



This reaction is catalyzed by carbonic anhydrase, an 

enzyme particularly abundant in erythrocytes. Carbon 
dioxide is not very soluble in aqueous solution, and bub- 
bles of C0 2 would form in the tissues and blood if it were 
not converted to bicarbonate. As you can see from the 
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FIGURE 5-15 Two general models for the intercon- 
version of inactive and active forms of a protein dur- 
ing cooperative ligand binding. Although the models 
may be applied to any protein— including any en- 
zyme (Chapter 6) — that exhibits cooperative binding, 
we show here four subunits because the model was 
originally proposed for hemoglobin, (a) In the con- 
certed, or all-or-none, model (MWC model), all sub- 
units are postulated to be in the same conformation, 
either all O (low affinity or inactive) or all □ (high 
affinity or active). Depending on the equilibrium, K v 
between O and □ forms, the binding of one or more 
ligand molecules (L) will pull the equilibrium toward 
the □ form. Subunits with bound L are shaded, (b) In 
the sequential model, each individual subunit can be 
in either the O or □ form. A very large number of 
conformations is thus possible. 
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reaction catalyzed by carbonic anhydrase, the hydration 
of C0 2 results in an increase in the H + concentration (a 
decrease in pH) in the tissues. The binding of oxygen by 
hemoglobin is profoundly influenced by pH and C0 2 
concentration, so the interconversion of C0 2 and bicar- 
bonate is of great importance to the regulation of 
oxygen binding and release in the blood. 

Hemoglobin transports about 40% of the total H + 
and 15% to 20% of the C0 2 formed in the tissues to 
the lungs and kidneys. (The remainder of the H + is ab- 
sorbed by the plasma's bicarbonate buffer; the remain- 
der of the C0 2 is transported as dissolved HCO3 and 
C0 2 .) The binding of H + and C0 2 is inversely related 
to the binding of oxygen. At the relatively low pH and 
high C0 2 concentration of peripheral tissues, the 
affinity of hemoglobin for oxygen decreases as H + and 
C0 2 are bound, and 0 2 is released to the tissues. Con- 
versely, in the capillaries of the lung, as C0 2 is ex- 
creted and the blood pH consequently rises, the 
affinity of hemoglobin for oxygen increases and the 
protein binds more 0 2 for transport to the peripheral 
tissues. This effect of pH and C0 2 concentration on 
the binding and release of oxygen by hemoglobin is 
called the Bohr effect, after Christian Bohr, the Dan- 
ish physiologist (and father of physicist Niels Bohr) 
who discovered it in 1904. 

The binding equilibrium for hemoglobin and one 
molecule of oxygen can be designated by the reaction 

Hb + 0 2 Hb0 2 

but this is not a complete statement. To account for the 
effect of H + concentration on this binding equilibrium, 
we rewrite the reaction as 

HHb + + 0 2 Hb0 2 + H + 

where HHb + denotes a protonated form of hemoglobin. 
This equation tells us that the 0 2 -saturation curve of hemo- 
globin is influenced by the H + concentration (Fig. 5-16). 
Both 0 2 and H + are bound by hemoglobin, but with inverse 
affinity. When the oxygen concentration is high, as in the 
lungs, hemoglobin binds 0 2 and releases protons. When 
the oxygen concentration is low, as in the peripheral tis- 
sues, H + is bound and 0 2 is released. 

Oxygen and H + are not bound at the same sites in 
hemoglobin. Oxygen binds to the iron atoms of the 
hemes, whereas H + binds to any of several amino acid 
residues in the protein. A major contribution to the 
Bohr effect is made by His 146 (His HC3) of the p sub- 
units. When protonated, this residue forms one of the 
ion pairs— to Asp 94 (Asp FG1)— that helps stabilize 
deoxyhemoglobin in the T state (Fig. 5-9). The ion 
pair stabilizes the protonated form of His HC3, giving 
this residue an abnormally high piT a in the T state. 
The pK a falls to its normal value of 6.0 in the R state 
because the ion pair cannot form, and this residue is 
largely unprotonated in oxyhemoglobin at pH 7.6, the 
blood pH in the lungs. As the concentration of H + 
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FIGURE 5-16 Effect of pH on oxygen binding to hemoglobin. The pH 

of blood is 7.6 in the lungs and 7.2 in the tissues. Experimental mea- 
surements on hemoglobin binding are often performed at pH 7.4. 

rises, protonation of His HC3 promotes release of 
oxygen by favoring a transition to the T state. Proto- 
nation of the amino-terminal residues of the a sub- 
units, certain other His residues, and perhaps other 
groups has a similar effect. 

Thus we see that the four polypeptide chains of he- 
moglobin communicate with each other not only about 
0 2 binding to their heme groups but also about H + bind- 
ing to specific amino acid residues. And there is still 
more to the story. Hemoglobin also binds C0 2) again in a 
manner inversely related to the binding of oxygen. Car- 
bon dioxide binds as a carbamate group to the a-amino 
group at the amino-terminal end of each globin chain, 
forming carbaminohemoglobin: 



C+ H 2 N-C-C C-N-C-C— 

O R O O R O 

Amino-terminal Carbamino-terminal 
residue residue 

This reaction produces H + , contributing to the Bohr ef- 
fect. The bound carbamates also form additional salt 
bridges (not shown in Fig. 5-9) that help to stabilize the 
T state and promote the release of oxygen. 

When the concentration of carbon dioxide is high, 
as in peripheral tissues, some C0 2 binds to hemoglobin 
and the affinity for 0 2 decreases, causing its release. 
Conversely, when hemoglobin reaches the lungs, the 
high oxygen concentration promotes binding of 0 2 and 
release of C0 2 . It is the capacity to communicate ligand- 
binding information from one polypeptide subunit to the 
others that makes the hemoglobin molecule so beauti- 
fully adapted to integrating the transport of 0 2 , C0 2 , 
and H + by erythrocytes. 
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Oxygen Binding to Hemoglobin Is Regulated 
by 2,3-Bisphosphogiycerate 

The interaction of 2,3-bisphosphoglycerate (BPG) 

with hemoglobin molecules further refines the function 
of hemoglobin, and provides an example of heterotropic 
allosteric modulation. 
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BPG is present in relatively high concentrations in ery- 
throcytes. When hemoglobin is isolated, it contains 
substantial amounts of bound BPG, which can be diffi- 
cult to remove completely. In fact, the 0 2 -binding curves 
for hemoglobin that we have examined to this point 
were obtained in the presence of bound BPG. 2,3-Bis- 
phosphoglycerate is known to greatly reduce the affinity 
of hemoglobin for oxygen — there is an inverse relation- 
ship between the binding of 0 2 and the binding of BPG. 
We can therefore describe another binding process 
for hemoglobin: 



HbBPG + 0 2 



Hb0 2 + BPG 



BPG binds at a site distant from the oxygen-binding 
site and regulates the 0 2 -binding affinity of hemoglobin 
in relation to the p0 2 in the lungs. BPG is important in 
the physiological adaptation to the lower p0 2 at high al- 
titudes. For a healthy human at sea level, the binding of 
0 2 to hemoglobin is regulated such that the amount of 
0 2 delivered to the tissues is nearly 40% of the maxi- 
mum that could be carried by the blood (Fig, 5-17). 
Imagine that this person is suddenly transported from 
sea level to an altitude of 4,500 meters, where the p0 2 
is considerably lower. The delivery of 0 2 to the tissues 
is now reduced. However, after just a few hours at the 
higher altitude, the BPG concentration in the blood has 
begun to rise, leading to a decrease in the affinity of he- 
moglobin for oxygen. This adjustment in the BPG level 
has only a small effect on the binding of 0 2 in the lungs 
but a considerable effect on the release of 0 2 in the tis- 
sues. As a result, the delivery of oxygen to the tissues is 
restored to nearly 40% of the 0 2 that can be trans- 
ported by the blood. The situation is reversed when the 
person returns to sea level. The BPG concentration in 
erythrocytes also increases in people suffering from hy- 
poxia, lowered oxygenation of peripheral tissues due 
to inadequate functioning of the lungs or circulatory 
system. 
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FIGURE 5-17 Effect of BPG on oxygen binding to hemoglobin. The 

BPG concentration in normal human blood is about 5 mM at sea level 
and about 8 mM at high altitudes. Note that hemoglobin binds to 
oxygen quite tightly when BPG is entirely absent, and the binding 
curve seems to be hyperbolic. In reality the measured Hill coefficient 
for 0 2 -binding cooperativity decreases only slightly (from 3 to about 
2.5) when BPG is removed from hemoglobin, but the rising part of the 
sigmoid curve is confined to a very small region close to the origin. At 
sea level, hemoglobin is nearly saturated with 0 2 in the lungs, but just 
over 60% saturated in the tissues, so the amount of 0 2 released in the 
tissues is about 38% of the maximum that can be carried in the blood. 
At high altitudes, 0 2 delivery declines by about one-fourth, to 30% of 
maximum. An increase in BPG concentration, however, decreases the 
affinity of hemoglobin for 0 2 , so approximately 37% of what can be 
carried is again delivered to the tissues. 

The site of BPG binding to hemoglobin is the cavity 
between the 0 subunits in the T state (Fig. 5-18). This 
cavity is lined with positively charged amino acid 
residues that interact with the negatively charged 
groups of BPG. Unlike 0 2 , only one molecule of BPG is 
bound to each hemoglobin tetramer. BPG lowers hemo- 
globin's affinity for oxygen by stabilizing the T state. The 
transition to the R state narrows the binding pocket for 
BPG, precluding BPG binding. In the absence of BPG, 
hemoglobin is converted to the R state more easily. 

Regulation of oxygen binding to hemoglobin by BPG 
has an important role in fetal development. Because a 
fetus must extract oxygen from its mother's blood, fetal 
hemoglobin must have greater affinity than the maternal 
hemoglobin for 0 2 . The fetus synthesizes y subunits 
rather than subunits, forming a 2 y 2 hemoglobin. This 
tetramer has a much lower affinity for BPG than normal 
adult hemoglobin, and a correspondingly higher affinity 

for 0 2 . 3 Oxygen-Binding Proteins-Hemoglobin Is Susceptible 
to Allosteric Regulation 



168 j Protein Function 




(a) 




FIGURE 5-18 Binding of BPG to deoxyhemoglobin. (a) BPG binding 
stabilizes the T state of deoxyhemoglobin (PDB ID 1HGA), shown here 
as a mesh surface image, (b) The negative charges of BPG interact with 
several positively charged groups (shown in blue in this surface 



contour image) that surround the pocket between the (3 subunits in the 
T state, (c) The binding pocket for BPG disappears on oxygenation, 
following transition to the R state (PDB ID 1 BBB). (Compare (b) and 
(c) with Fig. 5-10 ) 



Sickle-Celi Anemia Is a Molecular Disease of Hemoglobin 

The hereditary human disease sickie-cell anemia 
demonstrates strikingly the importance of amino 
acid sequence in determining the secondary, tertiary, 
and quaternary structures of globular proteins, and thus 
their biological functions. Almost 500 genetic variants of 
hemoglobin are known to occur in the human population; 
all but a few are quite rare. Most variations consist of dif- 
ferences in a single amino acid residue. The effects on he- 
moglobin structure and function are often minor but can 
sometimes be extraordinary. Each hemoglobin variation 
is the product of an altered gene. The variant genes are 
called alleles. Because humans generally have two copies 
of each gene, an individual may have two copies of one al- 
lele (thus being homozygous for that gene) or one copy of 
each of two different alleles (thus heterozygous). 

Sickle-cell anemia occurs in individuals who inherit 
the allele for sickle-cell hemoglobin from both parents. 
The erythrocytes of these individuals are fewer and also 
abnormal. In addition to an unusually large number of 
immature cells, the blood contains many long, thin, sickle- 
shaped erythrocytes (Fig. 5-19). When hemoglobin from 
sickle cells (called hemoglobin S) is deoxygenated, it be- 
comes insoluble and forms polymers that aggregate into 
tubular fibers (Fig. 5-20). Normal hemoglobin (hemoglo- 
bin A) remains soluble on deoxygenation. The insoluble 
fibers of deoxygenated hemoglobin S cause the deformed, 
sickle shape of the erythrocytes, and the proportion of 
sickled cells increases greatly as blood is deoxygenated. 

The altered properties of hemoglobin S result from 
a single amino acid substitution, a Val instead of a Glu 
residue at position 6 in the two /3 chains. The R group of 
valine has no electric charge, whereas glutamate has a 
negative charge at pH 7.4. Hemoglobin S therefore has 
two fewer negative charges than hemoglobin A (one 
fewer on each /3 chain). Replacement of the Glu residue 
by Val creates a "sticky" hydrophobic contact point at 
position 6 of the (3 chain, which is on the outer surface of 



the molecule. These sticky spots cause deoxyhemoglo- 
bin S molecules to associate abnormally with each other, 
forming the long, fibrous aggregates characteristic of 
this disorder. £ Oxygen-Binding Proteins -Defects in Hb Lead 
to Serious Genetic Disease 

Sickle-cell anemia, as we have noted, occurs in indi- 
viduals homozygous for the sickle-cell allele of the gene 
encoding the (3 subunit of hemoglobin. Individuals who re- 
ceive the sickle-cell allele from only one parent and are 
thus heterozygous experience a milder condition called 
sickle-cell trait; only about 1% of their erythrocytes be- 
come sickled on deoxygenation. These individuals may 
live completely normal lives if they avoid vigorous exercise 
and other stresses on the circulatory system. 

Sickle-cell anemia is life-threatening and painful. Peo- 
ple with this disease suffer repeated crises brought on by 
physical exertion. They become weak, dizzy, and short of 
breath, and they also experience heart murmurs and an in- 
creased pulse rate. The hemoglobin content of their blood 
is only about half the normal value of 15 to 16 g/100 mL, 




< a > 2 jum (b) 

FIGURE 5-19 A comparison of (a) uniform, cup-shaped, normal ery- 
throcytes with (b) the variably shaped erythrocytes seen in sickle-cell 
anemia, which range from normal to spiny or sickle-shaped. 
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FIGURE 5-20 Normal and sickle-cell hemoglobin, (a) Subtle differences 
between the conformations of hemoglobin A and hemoglobin S result from 
a single amino acid change in the 0 chains, (b) As a result of this change, 
deoxyhemoglobin S has a hydrophobic patch on its surface, which causes 
the molecules to aggregate into strands that align into insoluble fibers. 



because sickled cells are very fragile and rupture easily; 
this results in anemia ("lack of blood"). An even more se- 
rious consequence is that capillaries become blocked by 
the long, abnormally shaped cells, causing severe pain 
and interfering with normal organ function — a major fac- 
tor in the early death of many people with the disease. 

Without medical treatment, people with sickle-cell 
anemia usually die in childhood. Curiously, the fre- 
quency of the sickle-cell allele in populations is unusu- 



ally high in certain parts of Africa. Investigation into this 
matter led to the finding that in heterozygous individu- 
als, the allele confers a small but significant resistance to 
lethal forms of malaria. Natural selection has resulted in 
an allele population that balances the deleterious effects 
of the homozygous condition against the resistance to 
malaria afforded by the heterozygous condition. ■ 

SUMMARY 5.1 Reversible Binding 

of a Protein to a Ligand: 
Oxygen-Binding Proteins 

■ Protein function often entails interactions with 
other molecules. A protein binds a molecule, known 
as a ligand, at its binding site. Proteins may 
undergo conformational changes when a ligand 
binds, a process called induced fit. In a 
multisubunit protein, the binding of a ligand 

to one subunit may affect ligand binding to other 
subunits. Ligand binding can be regulated. 

■ Myoglobin contains a heme prosthetic group, which 
binds oxygen. Heme consists of a single atom of 
Fe 2+ coordinated within a porphyrin. Oxygen binds 
to myoglobin reversibly; this simple reversible 
binding can be described by an association constant 
K a or a dissociation constant K d . For a monomeric 
protein such as myoglobin, the fraction of binding 
sites occupied by a ligand is a hyperbolic function 
of ligand concentration. 

■ Normal adult hemoglobin has four heme-containing 
subunits, two a and two /3, similar in structure 

to each other and to myoglobin. Hemoglobin exists 
in two interchangeable structural states, T and R. 
The T state is most stable when oxygen is not 
bound. Oxygen binding promotes transition to the 
R state. 

a Oxygen binding to hemoglobin is both allosteric 
and cooperative. As 0 2 binds to one binding site, 
the hemoglobin undergoes conformational changes 
that affect the other binding sites— an example 
of allosteric behavior. Conformational changes 
between the T and R states, mediated by 
subunit-subunit interactions, result in cooperative 
binding; this is described by a sigmoid binding 
curve and can be analyzed by a Hill plot. 

■ Two major models have been proposed to explain 
the cooperative binding of ligands to multisubunit 
proteins: the concerted model and the sequential 
model. 

■ Hemoglobin also binds H + and C0 2 , resulting in the 
formation of ion pairs that stabilize the T state and 
lessen the protein's affinity for 0 2 (the Bohr effect). 
Oxygen binding to hemoglobin is also modulated 
by 2,3-bisphosphoglycerate, which binds to and 
stabilizes the T state. 
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■ Sickle-cell anemia is a genetic disease caused by a 
single amino acid substitution (Glu 6 to Val 6 ) in each 
P chain of hemoglobin. The change produces a 
hydrophobic patch on the surface of the 
hemoglobin that causes the molecules to aggregate 
into bundles of fibers. This homozygous condition 
results in serious medical complications. 

5.2 Complementary Interactions between 
Proteins and Ligands: The Immune System 
and Immunoglobulins 

We have seen how the conformations of oxygen-binding 
proteins affect and are affected by the binding of small 
ligands (0 2 or CO) to the heme group. However, most 
protein-ligand interactions do not involve a prosthetic 
group. Instead, the binding site for a ligand is more often 
like the hemoglobin binding site for BPG — a cleft in the 
protein lined with amino acid residues, arranged to make 
the binding interaction highly specific. Effective discrim- 
ination between ligands is the norm at binding sites, even 
when the ligands have only minor structural differences. 

All vertebrates have an immune system capable of 
distinguishing molecular "self" from "nonself" and then 
destroying what is identified as nonself. In this way, the 
immune system eliminates viruses, bacteria, and other 
pathogens and molecules that may pose a threat to the 
organism. On a physiological level, the immune re- 
sponse is an intricate and coordinated set of interactions 
among many classes of proteins, molecules, and cell 
types. At the level of individual proteins, the immune re- 
sponse demonstrates how an acutely sensitive and spe- 
cific biochemical system is built upon the reversible 
binding of ligands to proteins. 

The Immune Response Features a Specialized 
Array of Cells and Proteins 

Immunity is brought about by a variety of leukocytes 
(white blood cells), including macrophages and lym- 
phocytes, all of which develop from undifferentiated 
stem cells in the bone marrow. Leukocytes can leave the 
bloodstream and patrol the tissues, each cell producing 
one or more proteins capable of recognizing and binding 
to molecules that might signal an infection. 

The immune response consists of two complemen- 
tary systems, the humoral and cellular immune systems. 
The humoral immune system (Latin humor, "fluid") 
is directed at bacterial infections and extracellular 
viruses (those found in the body fluids), but can also 
respond to individual foreign proteins. The cellular 
immune system destroys host cells infected by viruses 
and also destroys some parasites and foreign tissues. 

At the heart of the humoral immune response are sol- 
uble proteins called antibodies or immunoglobulins, 
often abbreviated Ig. Immunoglobulins bind bacteria, 



viruses, or large molecules identified as foreign and target 
them for destruction. Making up 20% of blood protein, 
the immunoglobulins are produced by B lymphocytes, 
or B cells, so named because they complete their devel- 
opment in the £>one marrow. 

The agents at the heart of the cellular immune re- 
sponse are a class of T lymphocytes, or T cells (so 
called because the latter stages of their development 
occur in the thymus), known as cytotoxic T cells (T c 
cells, also called killer T cells). Recognition of infected 
cells or parasites involves proteins called T-cell recep- 
tors on the surface of T c cells. Receptors are proteins, 
usually found on the outer surface of cells and extend- 
ing through the plasma membrane; they recognize and 
bind extracellular ligands, triggering changes inside 
the cell. 

In addition to cytotoxic T cells, there are helper T 
cells (T H cells), whose function it is to produce solu- 
ble signaling proteins called cytokines, which include 
the interleukins. T H cells interact with macrophages. 
The T H cells participate only indirectly in the destruc- 
tion of infected cells and pathogens, stimulating the 
selective proliferation of those T c and B cells that can 
bind to a particular antigen. This process, called clonal 
selection, increases the number of immune system 
cells that can respond to a particular pathogen. The im- 
portance of T H cells is dramatically illustrated by the 
epidemic produced by HIV (human immunodeficiency 
virus), the virus that causes AIDS (acquired immune 
deficiency syndrome). The primary targets of HIV 
infection are T H cells. Elimination of these cells pro- 
gressively incapacitates the entire immune system. 
Table 5-2 summarizes the functions of some leukocytes 
of the immune system. 

Each recognition protein of the immune system, ei- 
ther a T-cell receptor or an antibody produced by a B cell, 
specifically binds some particular chemical structure, 



TABLE 5-2 



Cell type 



me Types of Leukocytes Associated 
with the Immune System 

Function 




Macrophages 

B lymphocytes (B cells) 

T lymphocytes (T cells) 
Cytotoxic (killer) 
T cells (T c ) 

Helper T cells (T H ) 



Ingest large particles and 
cells by phagocytosis 

Produce and secrete 
antibodies 

Interact with infected host 
cells through receptors 
on T-cell surface 

Interact with macrophages 
and secrete cytokines 
(interleukins) that 
stimulate T c , T H , and 
B cells to proliferate. 
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distinguishing it from virtually all others. Humans are 
capable of producing more than 10 8 different antibodies 
with distinct binding specificities. Given this extraordi- 
nary diversity, any chemical structure on the surface of 
a virus or invading cell will most likely be recognized and 
bound by one or more antibodies. Antibody diversity is 
derived from random reassembly of a set of im- 
munoglobulin gene segments through genetic recombi- 
nation mechanisms that are discussed in Chapter 25 
(see Fig. 25-26). 

A specialized lexicon is used to describe the unique 
interactions between antibodies or T-cell receptors and 
the molecules they bind. Any molecule or pathogen 
capable of eliciting an immune response is called an 
antigen. An antigen may be a virus, a bacterial cell wall, 
or an individual protein or other macromolecule. A com- 
plex antigen may be bound by several different antibod- 
ies. An individual antibody or T-cell receptor binds only 
a particular molecular structure within the antigen, 
called its antigenic determinant or epitope. 

It would be unproductive for the immune system to 
respond to small molecules that are common intermedi- 
ates and products of cellular metabolism. Molecules of M r 
<5,000 are generally not antigenic. However, when small 
molecules are covalently attached to large proteins in the 
laboratory, they can be used to elicit an immune response. 



These small molecules are called haptens. The antibod- 
ies produced in response to protein-linked haptens will 
then bind to the same small molecules in their free form. 
Such antibodies are sometimes used in the development 
of analytical tests described later in this chapter or as 
catalytic antibodies (see Box 6-3). We now turn to a 
more detailed description of antibodies and their binding 
properties. 

Antibodies Have Two Identical Antigen-Binding Sites 

Immunoglobulin G (IgG) is the major class of anti- 
body molecule and one of the most abundant proteins in 
the blood serum. IgG has four polypeptide chains: two 
large ones, called heavy chains, and two light chains, 
linked by noncovalent and disulfide bonds into a com- 
plex of M T 150,000. The heavy chains of an IgG molecule 
interact at one end, then branch to interact separately 
with the light chains, forming a Y-shaped molecule 
(Fig. 5-21). At the "hinges" separating the base of an 
IgG molecule from its branches, the immunoglobulin can 
be cleaved with proteases. Cleavage with the protease 
papain liberates the basal fragment, called Fc because it 
usually crystallizes readily, and the two branches, called 
Fab, the antigen-binding fragments. Each branch has a 
single antigen-binding site. 



Antigen- Antigen- 
binding binding 

site site 

\ „ Papain / 

/■ ^ cleavage 3$r/ 

■ VXv H site* V R . /% 



Fab 



(a) 




C = constant domain 
V = variable domain 
H, L = heavy, light chains 



FIGURE 5-21 Immunoglobulin G. (a) Pairs of heavy and light chains 
combine to form a Y-shaped molecule. Two antigen-binding sites are 
formed by the combination of variable domains from one light (V L ) and 
one heavy (V H ) chain. Cleavage with papain separates the Fab and Fc 
portions of the protein in the hinge region. The Fc portion of the mole- 
cule also contains bound carbohydrate (shown in (b)). (b) A ribbon 



model of the first complete IgG molecule to be crystallized and struc- 
turally analyzed (PDB ID 1 1GT). Although the molecule has two iden- 
tical heavy chains (two shades of blue) and two identical light chains 
(two shades of red), it crystallized in the asymmetric conformation 
shown here. Conformational flexibility may be important to the func- 
tion of immunoglobulins. 
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Antibody Antigen- antibody complex 

FIGURE 5-22 Binding of IgG to an antigen. To generate an optimal 
fit for the antigen, the binding sites of IgG often undergo slight confor- 
mational changes. Such induced fit is common to many protein-ligand 
interactions. 

The fundamental structure of immunoglobulins 
was first established by Gerald Edelman and Rodney 
Porter. Each chain is made up of identifiable domains; 
some are constant in sequence and structure from one 
IgG to the next, others are variable. The constant do- 
mains have a characteristic structure known as the im- 
munoglobulin fold, a well-conserved structural motif 
in the all-jS class of proteins (Chapter 4). There are 
three of these constant domains in each heavy chain 
and one in each light chain. The heavy and light chains 
also have one variable domain each, in which most of 
the variability in amino acid sequence is found. The 
variable domains associate to create the antigen-binding 
site (Fig. 5-21, Fig. 5-22). 

In many vertebrates, IgG is but one of five classes 
of immunoglobulins. Each class has a characteristic type 
of heavy chain, denoted a, 8, e, y, and for IgA, IgD, 
IgE, IgG, and IgM, respectively. Two types of light chain, 
k and A, occur in all classes of immunoglobulins. The 
overall structures of IgD and IgE are similar to that of 
IgG. IgM occurs either in a monomeric, membrane- 
bound form or in a secreted form that is a cross-linked 
pentamer of this basic structure (Fig. 5-23). IgA, 
found principally in secretions such as saliva, tears, and 
milk, can be a monomer, dimer, or trimer. IgM is the first 
antibody to be made by B lymphocytes and the major 
antibody in the early stages of a primary immune 
response. Some B cells soon begin to produce IgD (with 
the same antigen-binding site as the IgM produced by 
the same cell), but the particular function of IgD is 
less clear. 

The IgG described above is the major antibody in 
secondary immune responses, which are initiated by a 
class of B cells called memory B cells. As part of the or- 
ganism's ongoing immunity to antigens already encoun- 
tered and dealt with, IgG is the most abundant 
immunoglobulin in the blood. When IgG binds to an in- 
vading bacterium or virus, it activates certain leukocytes 
such as macrophages to engulf and destroy the invader, 
and also activates some other parts of the immune re- 
sponse. Receptors on the macrophage surface recognize 



chains 




FIGURE 5-23 IgM pentamer of immunoglobulin units. The pentamer 
is cross-linked with disulfide bonds (yellow). The J chain is a polypep- 
tide of M r 20,000 found in both IgA and IgM. 

and bind the Fc region of IgG. When these Fc receptors 
bind an antibody-pathogen complex, the macrophage 
engulfs the complex by phagocytosis (Fig. 5-24). 

HIgE plays an important role in the allergic re- 
sponse, interacting with basophils (phagocytic 
leukocytes) in the blood and with histamine-secreting 
cells called mast cells, which are widely distributed in 
tissues. This immunoglobulin binds, through its Fc re- 
gion, to special Fc receptors on the basophils or mast 
cells. In this form, IgE serves as a receptor for antigen. If 
antigen is bound, the cells are induced to secrete hista- 
mine and other biologically active amines that cause di- 
lation and increased permeability of blood vessels. 
These effects on the blood vessels are thought to facili- 
tate the movement of immune system cells and proteins 
to sites of inflammation. They also produce the symp- 
toms normally associated with allergies. Pollen or other 
allergens are recognized as foreign, triggering an im- 
mune response normally reserved for pathogens. ■ 




FIGURE 5-24 Phagocytosis of an antibody-bound virus by a 
macrophage. The Fc regions of antibodies bound to the virus now bind 
to Fc receptors on the surface of a macrophage, triggering the 
macrophage to engulf and destroy the virus. 
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(a) Conformation with 
no antigen bound 



(b) Antigen bound 
(but not shown) 



(c) Antigen bound 
(shown) 



FIGURE 5-25 Induced fit in the binding of an antigen to IgG. The mol- 
ecule here, shown in surface contour, is the Fab fragment of an IgG. 
The antigen is a small peptide derived from HIV. Two residues in the 
heavy chain (blue) and one in the light chain (pink) are colored to pro- 
vide visual points of reference, (a) View of the Fab fragment in the ab- 
sence of antigen, looking down on the antigen-binding site (PDB ID 



1 GGC). (b) The same view, but with the Fab fragment in the "bound" 
conformation (PDB ID 1GGI); the antigen is omitted to provide an 
unobstructed view of the altered binding site. Note how the binding 
cavity has enlarged and several groups have shifted position, (c) The 
same view as (b), but with the antigen in the binding site, pictured as 
a red stick structure. 



Antibodies Bind Tightly and Specifically to Antigen 

The binding specificity of an antibody is determined by 
the amino acid residues in the variable domains of its 
heavy and light chains. Many residues in these domains 
are variable, but not equally so. Some, particularly 
those lining the antigen-binding site, are hypervariable — 
especially likely to differ. Specificity is conferred by chem- 
ical complementarity between the antigen and its specific 
binding site, in terms of shape and the location of charged, 
nonpolar, and hydrogen-bonding groups. For example, a 
binding site with a negatively charged group may bind an 
antigen with a positive charge in the complementary posi- 
tion. In many instances, complementarity is achieved in- 
teractively as the structures of antigen and bmding site 
influence each other as they come closer together. Con- 
formational changes in the antibody and/or the antigen 
then allow the complementary groups to interact fully. 
This is an example of induced fit. The complex of a pep- 
tide derived from HIV (a model antigen) and an Fab mol- 
ecule, shown in Figure 5-25, illustrates some of these 
properties. The changes in structure observed on antigen 
binding are particularly striking in this example. 

A typical antibody-antigen interaction is quite 
strong, characterized by K d values as low as 10~ 10 m (re- 
call that a lower^ d corresponds to a stronger binding in- 
teraction; see Table 5-1). The K d reflects the energy 
derived from the various ionic, hydrogen-bonding, hy- 
drophobic, and van der Waals interactions that stabilize 
the binding. The binding energy required to produce a 
K d of 10" 10 m is about 65 kJ/mol. 

The Antibody-Antigen Interaction Is the Basis for a 
Variety of Important Analytical Procedures 

The extraordinary binding affinity and specificity of an- 
tibodies make them valuable analytical reagents. Two 
types of antibody preparations are in use: polyclonal and 



monoclonal. Polyclonal antibodies are those pro- 
duced by many different B lymphocytes responding to 
one antigen, such as a protein injected into an animal. 
Cells in the population of B lymphocytes produce anti- 
bodies that bind specific, different epitopes within the 
antigen. Thus, polyclonal preparations contain a mix- 
ture of antibodies that recognize different parts of the 
protein. Monoclonal antibodies, in contrast, are syn- 
thesized by a population of identical B cells (a clone) 
grown in cell culture. These antibodies are homoge- 
neous, all recognizing the same epitope. The techniques 
for producing monoclonal antibodies were developed by 
Georges Kohler and Cesar Milstein. 




Georges Kohler, 1946-1995 Cesar Milstein, 1927-2002 



The specificity of antibodies has practical uses. A 
selected antibody can be covalently attached to a resin 
and used in a chromatography column of the type shown 
in Figure 3-1 7c. When a mixture of proteins is added to 
the column, the antibody specifically binds its target 
protein and retains it on the column while other proteins 
are washed through. The target protein can then be 
eluted from the resin by a salt solution or some other 
agent. This is a powerful protein analytical tool. 

In another versatile analytical technique, an anti- 
body is attached to a radioactive label or some other 
reagent that makes it easy to detect. When the antibody 
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(T) Coat surface with sample M S ! 
(antigens). 

(2) Block unoccupied sites p. 
with nonspecific protein. 

(3) Incubate with primary 
antibody against specific 
antigen. 

(4) Incubate with secondary 
antibody-enzyme complex 
that binds primary antibody . 

(5) Add substrate. ^ 

(6) Formation of colored + 
product indicates presence 

of specific antigen. 
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FIGURE 5-26 Antibody techniques. The specific 
reaction of an antibody with its antigen is the 
basis of several techniques that identify and 
quantify a specific protein in a complex sample, 
(a) A schematic representation of the general 
method, (b) An ELISA to test for the presence of 
herpes simplex virus (HSV) antibodies in blood 
samples. Wells were coated with an HSV anti- 
gen, to which antibodies against HSV will bind. 
The second antibody is anti-human IgG linked 
to horseradish peroxidase. Following comple- 
tion of the steps shown in (a), blood samples 
with greater amounts of HSV antibody turn 
brighter yellow, (c) An immunoblot. Lanes 1 to 3 
are from an SDS gel; samples from successive 
stages in the purification of a protein kinase 
were separated and stained with Coomassie 
blue. Lanes 4 to 6 show the same samples, but 
these were electrophoretically transferred to a 
nitrocellulose membrane after separation on an 
SDS gel. The membrane was then "probed" with 
antibody against the protein kinase. The num- 
bers between the SDS gel and the immunoblot 
indicate M r in thousands. 



binds the target protein, the label reveals the presence 
of the protein in a solution or its location in a gel or even 
a living cell. Several variations of this procedure are 
illustrated in Figure 5-26. 

An ELISA (enzyme-linked immunosorbent assay) 
can be used to rapidly screen for and quantify an antigen 
in a sample (Fig. 5-26b). Proteins in the sample are ad- 
sorbed to an inert surface, usually a 96-well polystyrene 
plate. The surface is washed with a solution of an inex- 
pensive nonspecific protein (often casein from nonfat 
dry milk powder) to block proteins introduced in subse- 
quent steps from adsorbing to unoccupied sites. The 
surface is then treated with a solution containing the 
primary antibody — an antibody against the protein of in- 
terest. Unbound antibody is washed away, and the sur- 
face is treated with a solution containing a secondary 
antibody — antibody against the primary antibody — 
linked to an enzyme that catalyzes a reaction that forms 
a colored product. After unbound secondary antibody is 
washed away, the substrate of the antibody-linked en- 
zyme is added. Product formation (monitored as color 
intensity) is proportional to the concentration of the 
protein of interest in the sample. 

In an immunoblot assay (Fig. 5-26c), proteins 
that have been separated by gel electrophoresis are 
transferred electrophoretically to a nitrocellulose 



membrane. The membrane is blocked (as described 
above for ELISA), then treated successively with pri- 
mary antibody, secondary antibody linked to enzyme, 
and substrate. A colored precipitate forms only along 
the band containing the protein of interest. Im- 
munoblotting allows the detection of a minor compo- 
nent in a sample and provides an approximation of its 
molecular weight. S Immunoblotting 

We will encounter other aspects of antibodies in 
later chapters. They are extremely important in medi- 
cine and can tell us much about the structure of proteins 
and the action of genes. 

SUMMARY 5.2 Complementary Interactions 
between Proteins and 
Ligands: The Immune System 
and Immunoglobulins 

■ The immune response is mediated by interactions 
among an array of specialized leukocytes and their 
associated proteins. T lymphocytes produce 
T-cell receptors. B lymphocytes produce 
immunoglobulins. In a process called clonal selection, 
helper T cells induce the proliferation of B cells and 
cytotoxic T cells that produce immunoglobulins 
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or of T-cell receptors that bind to a specific 
antigen. 

m Humans have five classes of immunoglobulins, 
each with different biological functions. The most 
abundant class is IgG, a Y-shaped protein with two 
heavy and two light chains. The domains near 
the upper ends of the Y are hypervariable within 
the broad population of IgGs and form two 
antigen-binding sites. 

■ A given immunoglobulin generally binds to only a 
part, called the epitope, of a large antigen. Binding 
often involves a conformational change in the IgG, 
an induced fit to the antigen. 

5.3 Protein Interactions Modulated by 
Chemical Energy: Actin, Myosin, and 
Molecular Motors 

Organisms move. Cells move. Organelles and macromole- 
cules within cells move. Most of these movements arise 
from the activity of a fascinating class of protein-based 
molecular motors. Fueled by chemical energy, usually de- 
rived from ATP, large aggregates of motor proteins un- 
dergo cyclic conformational changes that accumulate into 
a unified, directional force— the tiny force that pulls apart 
chromosomes in a dividing cell, and the immense force 
that levers a pouncing, quarter-ton jungle cat into the air. 

The interactions among motor proteins, as you 
might predict, feature complementary arrangements of 
ionic, hydrogen-bonding, hydrophobic, and van der 
Waals interactions at protein binding sites. In motor pro- 
teins, however, these interactions achieve exceptionally 
high levels of spatial and temporal organization. 

Motor proteins underlie the contraction of muscles, 
the migration of organelles along microtubules, the rota- 
tion of bacterial flagella, and the movement of some pro- 
teins along DNA. Proteins called kinesins and dyneins 
move along microtubules in cells, pulling along organelles 
or reorganizing chromosomes during cell division. An in- 
teraction of dynein with microtubules brings about the 
motion of eukaryotic flagella and cilia. Flagellar motion in 
bacteria involves a complex rotational motor at the base 
of the flagellum (see Fig. 19-39). Helicases, polymerases, 
and other proteins move along DNA as they carry out 
their functions in DNA metabolism (Chapter 25). Here, 
we focus on the well-studied example of the contractile 
proteins of vertebrate skeletal muscle as a paradigm for 
how proteins translate chemical energy into motion. 

The Major Proteins of Muscle Are Myosin and Actin 

The contractile force of muscle is generated by the in- 
teraction of two proteins, myosin and actin. These pro- 
teins are arranged in filaments that undergo transient 
interactions and slide past each other to bring about 
contraction. Together, actin and myosin make up more 
than 80% of the protein mass of muscle. 



Myosin (M T 540,000) has six subunits: two heavy 
chains (each of M T 220,000) and four light chains (each 
of M T 20,000). The heavy chains account for much of the 
overall structure. At their carboxyl termini, they are 
arranged as extended a helices, wrapped around each 
other in a fibrous, left-handed coiled coil similar to that 
of a-keratin (Fig. 5-27a). At its amino terminus, each 




FIGURE 5-27 Myosin, (a) Myosin has two heavy chains (in two shades 
of pink), the carboxyl termini forming an extended coiled coil (tail) and 
the amino termini having globular domains (heads). Two light chains 
(blue) are associated with each myosin head, (b) Cleavage with trypsin 
and papain separates the myosin heads (S1 fragments) from the tails, 
(c) Ribbon representation of the myosin S1 fragment (from coordinates 
supplied by Ivan Rayment). The heavy chain is in gray, the two light 
chains in two shades of blue. 
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heavy chain has a large globular domain containing a 
site where ATP is hydrolyzed. The light chains are asso- 
ciated with the globular domains. When myosin is 
treated briefly with the protease trypsin, much of the fi- 
brous tail is cleaved off, dividing the protein into compo- 
nents called light and heavy meromyosin (Fig. 5-2 7b). 
The globular domain — called myosin subfragment 1, or 
SI, or simply the myosin head group — is liberated from 
heavy meromyosin by cleavage with papain. The SI 
fragment is the motor domain that makes muscle con- 
traction possible. SI fragments can be crystallized, and 
their overall structure as determined by Ivan Rayment 
and Hazel Holden is shown in Figure 5-27c. 

In muscle cells, molecules of myosin aggregate to 
form structures called thick filaments (Fig. 5-28a). 
These rodlike structures are the core of the contractile 
unit. Within a thick filament, several hundred myosin 
molecules are arranged with their fibrous "tails" associ- 
ated to form a long bipolar structure. The globular do- 
mains project from either end of this structure, in 
regular stacked arrays. 

The second major muscle protein, actin, is abun- 
dant in almost all eukaryotic cells. In muscle, mole- 
cules of monomeric actin, called G-actin (globular 
actin; M r 42,000), associate to form a long polymer 
called F-actin (filamentous actin). The thin filament 
consists of F-'actin (Fig. 5-28b), along with the proteins 
troponin and tropomyosin (discussed below). The fila- 
mentous parts of thin filaments assemble as successive 
monomeric actin molecules add to one end. On addition, 
each monomer binds ATP, then hydrolyzes it to ADP, so 
every actin molecule in the filament is complexed to 
ADP. This ATP hydrolysis by actin functions only in the 
assembly of the filaments; it does not contribute directly 
to the energy expended in muscle contraction. Each 
actin monomer in the thin filament can bind tightly and 
specifically to one myosin head group (Fig. 5-28c). 

Additional Proteins Organize the Thin and Thick Filaments 
into Ordered Structures 

Skeletal muscle consists of parallel bundles of muscle 
fibers, each fiber a single, very large, multinucleated 
cell, 20 to 100 fim in diameter, formed from many cells 
fused together; a single fiber often spans the length of 
the muscle. Each fiber contains about 1,000 myofibrils, 
2 fim in diameter, each consisting of a vast number of 
regularly arrayed thick and thin filaments complexed to 
other proteins (Fig. 5-29). A system of flat membra- 
nous vesicles called the sarcoplasmic reticulum sur- 
rounds each myofibril. Examined under the electron 
microscope, muscle fibers reveal alternating regions of 
high and low electron density, called the A bands and I 
bands (Fig. 5-29b, c). The A and I bands arise from the 
arrangement of thick and thin filaments, which are 
aligned and partially overlapping. The I band is the 
region of the bundle that in cross section would contain 
only thin filaments. The darker A band stretches the 
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FIGURE 5-28 The major components of muscle, (a) Myosin aggre- 
gates to form a bipolar structure called a thick filament, (b) F-actin is a 
filamentous assemblage of G-actin monomers that polymerize two by 
two, giving the appearance of two filaments spiraling about one 
another in a right-handed fashion, (c) Space-filling model of an actin 
filament (shades of red) with one myosin head (gray and two shades of 
blue) bound to an actin monomer within the filament (from coordi- 
nates supplied by Ivan Rayment). 

length of the thick filament and includes the region 
where parallel thick and thin filaments overlap. Bisect- 
ing the I band is a thin structure called the Z disk, per- 
pendicular to the thin filaments and serving as an 
anchor to which the thin filaments are attached. The A 
band too is bisected by a thin line, the M line or M disk, 
a region of high electron density in the middle of the 
thick filaments. The entire contractile unit, consisting of 
bundles of thick filaments interleaved at either end with 
bundles of thin filaments, is called the sarcomere. The 
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FIGURE 5-29 Skeletal muscle, (a) Muscle 
fibers consist of single, elongated, multinu- 
cleated cells that arise from the fusion of 
many precursor cells. The fibers are made 
up of many myofibrils (only six are shown 
here for simplicity) surrounded by the 
membranous sarcoplasmic reticulum. The 
organization of thick and thin filaments in 
a myofibril gives it a striated appearance. 
When muscle contracts, the I bands nar- 
row and the Z disks come closer together, 
as seen in electron micrographs of (b) re- 
laxed and (c) contracted muscle. 
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arrangement of interleaved bundles allows the thick and 
thin filaments to slide past each other (by a mechanism 
discussed below), causing a progressive shortening of 
each sarcomere (Fig. 5-30). 

The thin actin filaments are attached at one end to 
the Z disk in a regular pattern. The assembly includes 
the minor muscle proteins a-actinin, desmin, and vi- 
mentin. Thin filaments also contain a large protein 



called nebulin (—7,000 amino acid residues), thought 
to be structured as an a helix that is long enough to 
span the length of the filament. The M line similarly or- 
ganizes the thick filaments. It contains the proteins 
paramyosin, C-protein, and M-protein. Another class 
of proteins called titins, the largest single polypeptide 
chains discovered thus far (the titin of human cardiac 
muscle has 26,926 amino acid residues), link the thick 
filaments to the Z disk, providing additional organization 
to the overall structure. Among their structural func- 
tions, the proteins nebulin and titin are believed to act 
as "molecular rulers," regulating the length of the thin 
and thick filaments, respectively. Titin extends from the 
Z disk to the M line, regulating the length of the sarco- 
mere itself and preventing overextension of the muscle. 
The characteristic sarcomere length varies from one 
muscle tissue to the next in a vertebrate, largely due to 
the different titin variants in the tissues. 



Thin Thick 
filament filament 




FIGURE 5-30 Muscle contraction. Thick 
filaments are bipolar structures created by 
the association of many myosin molecules, 
(a) Muscle contraction occurs by the slid- 
ing of the thick and thin filaments past each 
other so that the Z disks in neighboring I 
bands draw closer together, (b) The thick 
and thin filaments are interleaved such that 
each thick filament is surrounded by six 
thin filaments. 
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ATP binds to myosin head, 
causing dissociation 
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As tightly bound ATP is hydrolyzed, 
a conformational change occurs. 
ADP and Pi remain associated 
with the myosin head, 
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P A release triggers a "power stroke," 
a conformational change in the myosin 
(?) ADP * head that moves actin and myosin 
filaments relative to one another. 
ADP is released in the process. 




FIGURE 5-31 Molecular mechanism of muscle contraction. Conforma- 
tional changes in the myosin head that are coupled to stages in the ATP 
hydrolytic cycle cause myosin to successively dissociate from one actin 
subunit, then associate with another farther along the actin filament. In 
this way the myosin heads slide along the thin filaments, drawing the 
thick filament array into the thin filament array (see Fig. 5-30). 



Myosin Thick Filaments Slide along Actin Thin Filaments 

The interaction between actin and myosin, like that be- 
tween all proteins and ligands, involves weak bonds. 
When ATP is not bound to myosin, a face on the myosin 
head group binds tightly to actin (Fig. 5-3 1 ). When ATP 
binds to myosin and is hydrolyzed to ADP and phos- 
phate, a coordinated and cyclic series of conformational 
changes occurs in which myosin releases the F-actin 
subunit and binds another subunit farther along the 
thin filament. 

The cycle has four major steps (Fig. 5-31). In step 
CD, ATP binds to myosin and a cleft in the myosin mol- 
ecule opens, disrupting the actin-myosin interaction so 
that the bound actin is released. ATP is then hydrolyzed 
in step (2), causing a conformational change in the pro- 
tein to a "high-energy" state that moves the myosin 
head and changes its orientation in relation to the actin 
thin filament. Myosin then binds weakly to an F-actin 
subunit closer to the Z disk than the one just released. 
As the phosphate product of ATP hydrolysis is released 
from myosin in step (3), another conformational change 
occurs in which the myosin cleft closes, strengthening 
the myosin-actin binding. This is followed quickly by 
step (4), a "power stroke" during which the conforma- 
tion of the myosin head returns to the original resting 
state, its orientation relative to the bound actin chang- 
ing so as to pull the tail of the myosin toward the Z disk. 
ADP is then released to complete the cycle. Each cycle 
generates about 3 to 4 pN (piconewtons) of force and 
moves the thick filament 5 to 10 nm relative to the thin 
filament. 

Because there are many myosin heads in a thick fil- 
ament, at any given moment some (probably 1% to 3%) 
are bound to thin filaments. This prevents thick fila- 
ments from slipping backward when an individual 
myosin head releases the actin subunit to which it was 
bound. The thick filament thus actively slides forward 
past the adjacent thin filaments. This process, coordi- 
nated among the many sarcomeres in a muscle fiber, 
brings about muscle contraction. 

The interaction between actin and myosin must be 
regulated so that contraction occurs only in response to 
appropriate signals from the nervous system. The regu- 
lation is mediated by a complex of two proteins, 
tropomyosin and troponin (Fig. 5-32). Tropomyosin 
binds to the thin filament, blocking the attachment sites 
for the myosin head groups. Troponin is a Ca 2+ -binding 
protein. A nerve impulse causes release of Ca 2+ from 
the sarcoplasmic reticulum. The released Ca 2+ binds 
to troponin (another protein-ligand interaction) and 
causes a conformational change in the tropomyosin- 
troponin complexes, exposing the myosin-binding sites 
on the thin filaments. Contraction follows. 

Working skeletal muscle requires two types of mo- 
lecular functions that are common in proteins — binding 
and catalysis. The actin-myosin interaction, a protein- 
ligand interaction like that of immunoglobulins with 
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FIGURE 5-32 Regulation of muscle contraction by tropomyosin and 
troponin. Tropomyosin and troponin are bound to F-actin in the thin 
filaments. In the relaxed muscle, these two proteins are arranged 
around the actin filaments so as to block the binding sites for myosin. 
Tropomyosin is a two-stranded coiled coil of a helices, the same struc- 
tural motif as in a-keratin (see Fig. 4-1 0). It forms head-to-tail polymers 
twisting around the two actin chains. Troponin is attached to the actin- 
tropomyosin complex at regular intervals of 38.5 nm. Troponin con- 
sists of three different subunits: I, C, and T. Troponin I prevents binding 
of the myosin head to actin; troponin C has a binding site for Ca 2+ ; and 
troponin T links the entire troponin complex to tropomyosin. When the 
muscle receives a neural signal to initiate contraction, Ca 2+ is released 
from the sarcoplasmic reticulum (see Fig. 5-29a) and binds to troponin 
C. This causes a conformational change in troponin C, which alters the 
positions of troponin I and tropomyosin so as to relieve the inhibition 
by troponin I and allow muscle contraction. 



antigens, is reversible and leaves the participants un- 
changed. When ATP binds myosin, however, it is hy- 
drolyzed to ADP and P it Myosin is not only an 
actin-binding protein, it is also an ATPase — an enzyme. 
The function of enzymes in catalyzing chemical trans- 
formations is the topic of the next chapter. 



SUMMARY 5.3 



Protein Interactions 
Modulated by Chemical 
Energy: Actin, Myosin, and 
Molecular Motors 

Protein-ligand interactions achieve a special degree 
of spatial and temporal organization in motor 
proteins. Muscle contraction results from 
choreographed interactions between myosin and 
actin, coupled to the hydrolysis of ATP by myosin. 

Myosin consists of two heavy and four light chains, 
forming a fibrous coiled coil (tail) domain and a 
globular (head) domain. Myosin molecules are 
organized into thick filaments, which slide past thin 
filaments composed largely of actin. ATP hydrolysis 
in myosin is coupled to a series of conformational 
changes in the myosin head, leading to dissociation 
of myosin from one F-actin subunit and its 
eventual reassociation with another, farther along 
the thin filament. The myosin thus slides along the 
actin filaments. 

Muscle contraction is stimulated by the release 
of Ca 2+ from the sarcoplasmic reticulum. The 
Ca 2+ binds to the protein troponin, leading to a 
conformational change in a troponin-tropomyosin 
complex that triggers the cycle of actin-myosin 
interactions. 



Terms in bold are defined in 

ligand 153 
binding site 153 
induced fit 153 
heme 154 
porphyrin 154 
globins 155 

equilibrium expression 155 
association constant, 

K a 156 
dissociation constant, 

K d 156 
allosteric protein 162 
Hill equation 164 
Bohr effect 166 
lymphocytes 1 70 
antibody 170 



Further Reading 



the glossary. 

immunoglobulin 170 
B lymphocytes or B 

cells 170 
T lymphocytes or T 

cells 170 
antigen 171 
epitope 171 
hapten 171 
immunoglobulin fold 172 
polyclonal antibodies 1 73 
monoclonal 

antibodies 173 
ELISA 174 
myosin 175 
actin 176 
sarcomere 176 



Oxygen-Binding Proteins 

Ackers, G.K. & Hazzard, J.H. (1993) Transduction of binding 
energy into hemoglobin cooperativity. Trends Biochem Sci. 18, 
385-390 

Changeux, J.P. & Edelstein, S.J. (2005) Allosteric mechanisms of 
signal transduction Science 308, 1424-1428. 

Koshland, D.E., Jr., Nemethy, G., & Filmer, D. (1966) Comparison 
of experimental binding data and theoretical models in proteins 
containing subunits. Biochemistry 6, 365-385. 

The paper that introduced the sequential model. 

Monod, J., Wyman, J., & Changeux, J.-P. (1965) On the nature of 
allosteric transitions: a plausible model. J. Mol Biol. 12, 88-118. 
The concerted model was first proposed in this landmark paper. 

Olson, J.S. & Phillips, G.N., Jr. (1996) Kinetic pathways and 
barriers for ligand binding to myoglobin. J. Biol. Chem. 271, 
17,593-17,596. 

Perutz, M.F. (1989) Myoglobin and haemoglobin: role of distal 
residues in reactions with haem ligands. Trends Biochem. Sci. 
14, 42^4. 

Perutz, M.F., Wilkinson, A.J., Paoli, M., & Dodson, G.G. (1998) 
The stereochemical mechanism of the cooperative effects in hemo- 
globin revisited. Annu„ Rev. Biophys. Biomol. Struct 27, 1-34. 

Squires, J.E. (2002) Artificial blood. Science 295, 1002-1005. 

A nice description of the imposing technical challenge of 
mimicking the highly evolved oxygen-transport function of blood. 

Immune System Proteins 

Cooper, M.D. & Alder, M.N. (2006) The evolution of adaptive 
immune systems Cell 124, 815-822. 

An interesting essay tracing the origins of our immune system. 

Davies, D.R. & Chacko, S. (1993) Antibody structure. Acc Chem. 
Res 26, 421-427. 

Davies, D.R., Padlan, E.A., & Sheriff, S. (1990) Antibody-antigen 
complexes. Annu Rev Biochem 59, 439-473. 
Kindt, T.J., Osborne, B.A., & Goldsby, R.A. (2007) Kuby 
Immunology, 6th edn, W. H. Freeman and Company, New York. 
Ploegh, H.L. (1998) Viral strategies of immune evasion. Science 
280, 248-253 



100 Protein Function 



Thomsen, A.R., Nansen, A., & Christensen, J.P. (1998) Virus- 
induced T cell activation and the inflammatory response Curr. Top. 
Microbiol Immunol 231, 99-123 

Yewdell, J.W. & Haeryfar, S.M.M. (2005) Understanding presenta- 
tion of viral antigens to CD8(+) T cells in vivo: the key to rational 
vaccine design, Annu Rev Immunol 23,651-682 

Molecular Motors 

Finer, J.T., Simmons, R.M., & Spudich, J.A. (1994) Single 
myosin molecule mechanics: piconewton forces and nanometre 
steps. Nature 368, 113-119. 

Modern techniques reveal the forces affecting individual motor 
proteins. 

Geeves, M.A. & Holmes, K.C. (1999) Structural mechanism of 
muscle contraction Annu Rev Biochem 68, 687-728. 

Huxley, H.E. (1998) Getting to grips with contraction: the interplay 
of structure and biochemistry Trends Biochem Sci 23,84-87 

An interesting historical perspective on deciphering the 
mechanism of muscle contraction. 

Labeit, S. & Kolmerer, B, (1995) Titins: giant proteins in charge of 
muscle ultrastructure and elasticity. Science 270, 293-296 

A structural and functional description of some of the largest 
proteins 

Molloy, J.E. & Veigel, C. (2003) Myosin motors walk the walk 
Science 300, 2045-2046 

Rayment, I. (1996) The structural basis of the myosin ATPase 
activity J. Biol Chem 271, 15,850-15,853 

Examines the muscle contraction mechanism from a structural 
perspective. 

Rayment, I. & Holden, H.M. (1994) The three-dimensional 
structure of a molecular motor Trends Biochem Sci 19,129-134. 

Spudich, J.A. (1994) How molecular motors work. Nature 372, 
515-518 

Vale, R.D. (2003) The molecular motor toolbox for intracellular 
transport Cell 112, 467-480 

Problems 

1. Relationship between Affinity and Dissociation Con- 
stant Protein A has a binding site for ligand X with a K d of 
10" 6 m. Protein B has a binding site for ligand X with a K d of 
10~ 9 m. Which protein has a higher affinity for ligand X? Ex- 
plain your reasoning, Convert the K d to K a for both proteins. 

2. Negative Cooperativity Which of the following situa- 
tions would produce a Hill plot with n H < 1.0? Explain your 
reasoning in each case, 

(a) The protein has multiple subunits, each with a single 
ligand-binding site. Binding of ligand to one site decreases the 
binding affinity of other sites for the ligand. 

(b) The protein is a single polypeptide with two ligand- 
binding sites, each having a different affinity for the ligand. 

(c) The protein is a single polypeptide with a single ligand- 
binding site. As purified, the protein preparation is heteroge- 
neous, containing some protein molecules that are partially 
denatured and thus have a lower binding affinity for the ligand. 

3. Affinity for Oxygen of Hemoglobin What is the effect of 
the following changes on the 0 2 affinity of hemoglobin? (a) A 
drop in the pH of blood plasma from 7,4 to 7.2. (b) A decrease 
in the partial pressure of C0 2 in the lungs from 6 kPa (holding 



one's breath) to 2 kPa (normal), (c) An increase in the BPG 
level from 5 mM (normal altitudes) to 8 itim (high altitudes), 
(d) An increase in CO from 1.0 parts per million (ppm) in a 
normal indoor atmosphere to 30 ppm in a home that has a mal- 
functioning or leaking furnace. 

4. Reversible Ligand Binding The protein calcineurin binds 
to the protein calmodulin with an association rate of 8.9 X 
10 3 M -1 s -1 and an overall dissociation constant, K d , of 10 nM. 
Calculate the dissociation rate, k d , including appropriate units. 

5. Cooperativity in Hemoglobin Under appropriate condi- 
tions, hemoglobin dissociates into its four subunits. The iso- 
lated a subunit binds oxygen, but the 0 2 -saturation curve is 
hyperbolic rather than sigmoid. In addition, the binding of oxy- 
gen to the isolated a subunit is not affected by the presence of 
H + , C0 2 , or BPG. What do these observations indicate about 
the source of the cooperativity in hemoglobin? 

6. Comparison of Fetal and Maternal Hemoglobins 

Studies of oxygen transport in pregnant mammals show that 
the 0 2 -saturation curves of fetal and maternal blood are 
markedly different when measured under the same conditions. 
Fetal erythrocytes contain a structural variant of hemoglobin, 
HbF, consisting of two a and two y subunits (a 2 y 2 ), whereas 
maternal erythrocytes contain HbA (a 2 jS 2 ) 

(a) Which hemoglobin has a higher affinity for oxygen un- 
der physiological conditions, HbA or HbF? Explain, 

(b) What is the physiological significance of the different 
0 2 affinities? 

(c) When all the BPG is carefully removed from samples of 
HbA and HbF, the measured 0 2 -saturation curves (and conse- 
quently the 0 2 affinities) are displaced to the left. However, 
HbA now has a greater affinity for oxygen than does HbF. 
When BPG is reintroduced, the 0 2 -saturation curves return to 
normal, as shown in the graph. What is the effect of BPG on 
the 0 2 affinity of hemoglobin? How can the above information 
be used to explain the different 0 2 affinities of fetal and ma- 
ternal hemoglobin? 



1.0 




p0 2 (kPa) 

B9 7. Hemoglobin Variants There are almost 500 
naturally occurring variants of hemoglobin. Most are 
the result of a single amino acid substitution in a globin 
polypeptide chain. Some variants produce clinical illness, 
though not all variants have deleterious effects. A brief 
sample follows. 
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HbS (sickle-cell Hb): substitutes a Val for a Glu on the surface 
Hb Cowtown: eliminates an ion pair involved in T-state 
stabilization 

Hb Memphis: substitutes one uncharged polar residue for 

another of similar size on the surface 
Hb Bibba: substitutes a Pro for a Leu involved in an a helix 
Hb Milwaukee: substitutes a Glu for a Val 
Hb Providence: substitutes an Asn for a Lys that normally 

projects into the central cavity of the tetramer 
Hb Philly: substitutes a Phe for a Tyr, disrupting hydrogen 

bonding at the a x px interface 

Explain your choices for each of the following: 

(a) The Hb variant least likely to cause pathological 
symptoms. 

(b) The variant (s) most likely to show pi values different 
from that of HbA on an isoelectric focusing gel. 

(c) The variant (s) most likely to show a decrease in BPG 
binding and an increase in the overall affinity of the hemoglo- 
bin for oxygen. 

8. Oxygen Binding and Hemoglobin Structure A team of 
biochemists uses genetic engineering to modify the interface 
region between hemoglobin subunits. The resulting hemoglo- 
bin variants exist in solution primarily as a/3 dimers (few, if 
any, a 2 jS 2 tetramers form). Are these variants likely to bind 
oxygen more weakly or more tightly? Explain your answer, 

9. Reversible (but Tight) Binding to an Antibody An 

antibody binds to an antigen with a K d of 5 X 10~ 8 m. At 
what concentration of antigen will 6 be (a) 0.2, (b) 0.5, (c) 0.6, 
(d) 0.8? 

10. Using Antibodies to Probe Structure-Function Rela- 
tionships in Proteins A monoclonal antibody binds to G-actin 
but not to F-actin. What does this tell you about the epitope 
recognized by the antibody? 

B9 11. The Immune System and Vaccines A host or- 
Kfl ganism needs time, often days, to mount an immune 
response against a new antigen, but memory cells permit a 
rapid response to pathogens previously encountered. A vac- 
cine to protect against a particular viral infection often con- 
sists of weakened or killed virus or isolated proteins from a 
viral protein coat. When injected into a human patient, the 
vaccine generally does not cause an infection and illness, but it 
effectively "teaches" the immune system what the viral parti- 
cles look like, stimulating the production of memory cells. On 
subsequent infection, these cells can bind to the virus and trig- 
ger a rapid immune response. Some pathogens, including HIV, 
have developed mechanisms to evade the immune system, 
making it difficult or impossible to develop effective vaccines 
against them. What strategy could a pathogen use to evade the 
immune system? Assume that a host's antibodies and/or T-cell 
receptors are available to bind to any structure that might ap- 
pear on the surface of a pathogen and that, once bound, the 
pathogen is destroyed. 

12. How We Become a "Stiff" When a vertebrate dies, its 
muscles stiffen as they are deprived of ATP, a state called rigor 
mortis. Explain the molecular basis of the rigor state. 



13. Sarcomeres from Another Point of Mew The symmetry 
of thick and thin filaments in a sarcomere is such that six thin fila- 
ments ordinarily surround each thick filament in a hexagonal array. 
Draw a cross section (transverse cut) of a myofibril at the following 
points: (a) at the M line; (b) through the I band; (c) through the 
dense region of the A band; (d) through the less dense region of the 
A band, adjacent to the M line (see Fig. 5-29b, c). 

Biochemistry on the Internet 

14. lysozyme and Antibodies To fully appreciate how pro- 
teins function in a cell, it is helpful to have a three-dimensional 
view of how proteins interact with other cellular components. 
Fortunately, this is possible using Web-based protein data- 
bases and three-dimensional molecular viewing utilities. Some 
molecular viewers require that you download a program or 
plug-in; some can be problematic when used with certain 
operating systems or browsers; some require the use of 
command-line code; some have a more user-friendly interface. 
We suggest you go to www.umass.edu/microbio/rasmol and 
look at the information about RasMol, Protein Explorer, and 
Jmol First Glance. Choose the viewer most compatible with 
your operating system, browser, and level of expertise. Then 
download and instaD any software or plug-ins you may need. 

In this exercise you will examine the interactions between 
the enzyme lysozyme (Chapter 4) and the Fab portion of the 
anti-lysozyme antibody. Use the PDB identifier 1FDL to explore 
the structure of the IgGl Fab fragment-lysozyme complex 
(antibody-antigen complex). To answer the following ques- 
tions, use the information on the Structure Summary page at 
the Protein Data Bank (www.rcsb.org), and view the structure 
using RasMol, Protein Explorer, or Jmol First Glance. 

(a) Which chains in the three-dimensional model corre- 
spond to the antibody fragment and which correspond to the 
antigen, lysozyme? 

(b) What type of secondary structure predominates in this 
Fab fragment? 

(c) How many amino acid residues are in the heavy and 
light chains of the Fab fragment? In lysozyme? Estimate the 
percentage of the lysozyme that interacts with the antigen- 
binding site of the antibody fragment. 

(d) Identify the specific amino acid residues in lysozyme 
and in the variable regions of the Fab heavy and light chains that 
are situated at the antigen-antibody interface. Are the residues 
contiguous in the primary sequence of the polypeptide chains? 

15. Exploring Reversible Interactions of Proteins and 
Ligands with Living Graphs 0 Use the living graphs for 
Equations 5-8, 5-11, 5-14, and 5-16 to work through the following 
exercises. 

(a) Reversible binding of a ligand to a simple protein, with- 
out cooperativity. For Equation 5-8, set up a plot of 0 versus 
[L] (vertical and horizontal axes, respectively). Examine the 
plots generated when K d is set at 5, 10, 20, and 100 fiM. Higher 
affinity of the protein for the ligand means more binding at 
lower ligand concentrations. Suppose that four different pro- 
teins exhibit these four different K d values for ligand L. Which 
protein would have the highest affinity for L? 
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Examine the plot generated when K d = 10 fiM. How much 
does 6 increase when [L] increases from 0.2 to 0.4 ^m? How 
much does 6 increase when [L] increases from 40 to 80 ^M? 

You can do the same exercise for Equation 5-11. Convert 
[L] to p0 2 and K d to P 50 . Examine the curves generated when 
P 50 is set at 0.5, 1, 2, and 10 kPa. For the curve generated 
when P 50 = 1 kPa, how much does 8 change when the p0 2 in- 
creases from 0.02 to 0,04 kPa? From 4 to 8 kPa? 

(b) Cooperative binding of a ligand to a multisubunit pro- 
tein. Using Equation 5-14, generate a binding curve for a pro- 
tein and ligand with K d = 10 jjlm and n = 3. Note the altered 
definition of K d in Equation 5-16, On the same plot, add a 
curve for a protein with K d = 20 /iM and n = 3. Now see how 
both curves change when you change to n = 4. Generate Hill 
plots (Eqn 5-16) for each of these cases. For K d = 10 jjlm and 
n = 3, what is 6 when [L] = 20 /am? 

(c) Explore these equations further by varying all the pa- 
rameters used above. 



Data Analysis Problem 

16. Protein Function During the 1980s, the structures of 
actin and myosin were known only at the resolution shown in 
Figure 5-28a, b. Although researchers knew that the SI portion 
of myosin binds to actin and hydrolyzes ATP, there was a sub- 
stantial debate about where in the myosin molecule the contrac- 
tile force was generated. At the time, two competing models 
were proposed for the mechanism of force generation in myosin. 

In the "hinge" model, SI bound to actin, but the pulling 
force was generated by contraction of the "hinge region" in the 
myosin tail. The hinge region is in the heavy meromyosin por- 
tion of the myosin molecule, near where trypsin cleaves off 
light meromyosin (see Fig. 5-27b) . This is roughly the point la- 
beled "Two supercoiled a helices" in Figure 5-2 7a. In the "SI" 
model, the pulling force was generated in the SI "head" itself 
and the tail was just for structural support. 

Many experiments had been performed but provided no 
conclusive evidence. In 1987, James Spudich and his colleagues 
at Stanford University published a study that, although not 
conclusive, went a long way toward resolving this controversy. 

Recombinant DNA techniques were not sufficiently devel- 
oped to address this issue in vivo, so Spudich and colleagues used 
an interesting in vitro motility assay. The alga Nitella has extremely 
long cells, often several centimeters in length and about 1 mm in di- 
ameter. These cells have actin fibers that run along their long axes, 
and the cells can be cut open along their length to expose the actin 
fibers. Spudich and his group had observed that plastic beads 
coated with myosin would "walk" along these fibers in the presence 
of ATP, just as myosin would do in contracting muscle. 

For these experiments, they used a more well-defined 
method for attaching the myosin to the beads. The "beads" 
were clumps of killed bacterial (Staphylococcus aureus) 
cells. These cells have a protein on their surface that binds to 
the Fc region of antibody molecules (Fig. 5-21a). The anti- 
bodies, in turn, bind to several (unknown) places along the tail 
of the myosin molecule. When bead-antibody-myosin com- 
plexes were prepared with intact myosin molecules, they 
would move along Nitella actin fibers in the presence of ATP. 



(a) Sketch a diagram showing what a bead-antibody- 
myosin complex might look like at the molecular level. 

(b) Why was ATP required for the beads to move along the 
actin fibers? 

(c) Spudich and coworkers used antibodies that bound to 
the myosin tail. Why would this experiment have failed if they 
had used an antibody that bound to the part of SI that nor- 
mally binds to actin? Why would this experiment have failed if 
they had used an antibody that bound to actin? 

To help focus in on the part of myosin responsible for 
force production, Spudich and his colleagues used trypsin to 
produce two partial myosin molecules (see Fig. 5-27): (1) 
heavy meromyosin (HMM), made by briefly digesting myosin 
with trypsin; HMM consists of SI and the part of the tail that 
includes the hinge; and (2) short heavy meromyosin (SHMM), 
made from a more extensive digestion of HMM with trypsin; 
SHMM consists of SI and a shorter part of the tail that does not 
include the hinge. Brief digestion of myosin with trypsin pro- 
duces HMM and light meromyosin (Fig. 5-27), by cleavage of 
a single specific peptide bond in the myosin molecule. 

(d) Why might trypsin attack this peptide bond first rather 
than other peptide bonds in myosin? 

Spudich and colleagues prepared bead-antibody-myosin 
complexes with varying amounts of myosin, HMM, and SHMM, 
and measured their speeds along Nitella actin fibers in the 
presence of ATP The graph below sketches their results. 
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Density of myosin or myosin fragment bound to beads 

(e) Which model ("SI" or "hinge") is consistent with these 
results? Explain your reasoning. 

(f) Provide a plausible explanation for why the speed of 
the beads increased with increasing myosin density. 

(g) Provide a plausible explanation for why the speed of 
the beads reached a plateau at high myosin density. 

The more extensive trypsin digestion required to produce 
SHMM had a side effect: another specific cleavage of the 
myosin polypeptide backbone in addition to the cleavage in 
the tail. This second cleavage was in the SI head, 

(h) Based on this information, why is it surprising that 
SHMM was still capable of moving beads along actin fibers? 

(i) As it turns out, the tertiary structure of the SI head 
remains intact in SHMM. Provide a plausible explanation of 
how the protein remains intact and functional even though the 
polypeptide backbone has been cleaved and is no longer 
continuous. 

Reference 

Hynes, T.R., Block, S.M., White, B.T., & Spudich, J.A. 
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volved in force production. Cell 48, 953-963. 



One way in which this condition might be fulfilled would be if the 
molecules when combined with the enzyme, lay slightly further apart 
than their equilibrium distance when [covalently joinedl, but nearer 
than their equilibrium distance when free, . . . Using Fischer's lock and 
key simile, the key does not fit the lock quite perfectly but exercises a 
certain strain on it. 

B. 5. Haidane, Enzymes, 1930 

Catalysis can be described formally in terms of a stabilization of the 
transition state through tight binding to the catalyst. 

—William R }encks, artidv in Advances in Enzymology, 1 H75 
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There are two fundamental conditions for life. First, 
the organism must be able to self -replicate (a topic 
considered in Part III); second, it must be able to 
catalyze chemical reactions efficiently and selectively 
The central importance of catalysis may seem surpris- 
ing, but it is easy to demonstrate. As described in 
Chapter 1 , living systems make use of energy from the 
environment. Many of us, for example, consume sub- 
stantial amounts of sucrose — common table sugar — as a 
kind of fuel, usually in the form of sweetened foods and 
drinks. The conversion of sucrose to C0 2 and H 2 0 in the 
presence of oxygen is a highly exergonic process, re- 
leasing free energy that we can use to think, move, taste, 
and see. However, a bag of sugar can remain on the shelf 
for years without any obvious conversion to C0 2 and 
H 2 0. Although this chemical process is thermodynami- 
cally favorable, it is very slow! Yet when sucrose is con- 
sumed by a human (or almost any other organism), it 
releases its chemical energy in seconds. The difference 
is catalysis. Without catalysis, chemical reactions such 
as sucrose oxidation could not occur on a useful time 
scale, and thus could not sustain life. 

In this chapter, then, we turn our attention to the 
reaction catalysts of biological systems: the enzymes, 
the most remarkable and highly specialized proteins. 
Enzymes have extraordinary catalytic power, often far 
greater than that of synthetic or inorganic catalysts. 



They have a high degree of specificity for their sub- 
strates, they accelerate chemical reactions tremen- 
dously, and they function in aqueous solutions under 
very mild conditions of temperature and pH. Few nonbi- 
ological catalysts have all these properties. 

Enzymes are central to every biochemical process. 
Acting in organized sequences, they catalyze the hun- 
dreds of stepwise reactions that degrade nutrient mole- 
cules, conserve and transform chemical energy, and 
make biological macromolecules from simple precursors. 

The study of enzymes has immense practical impor- 
tance. In some diseases, especially inheritable genetic 
disorders, there may be a deficiency or even a total 
absence of one or more enzymes. Other disease condi- 
tions may be caused by excessive activity of an enzyme. 
Measurements of the activities of enzymes in blood 
plasma, erythrocytes, or tissue samples are important in 
diagnosing certain illnesses. Many drugs act through 
interactions with enzymes. Enzymes are also important 
practical tools in chemical engineering, food technology, 
and agriculture. 

We begin with descriptions of the properties of en- 
zymes and the principles underlying their catalytic 
power, then introduce enzyme kinetics, a discipline that 
provides much of the framework for any discussion of 
enzymes. Specific examples of enzyme mechanisms are 
then provided, illustrating principles introduced earlier 
in the chapter. We end with a discussion of how enzyme 
activity is regulated. 

6.1 An Introduction to Enzymes 

Much of the history of biochemistry is the history of 
enzyme research. Biological catalysis was first recog- 
nized and described in the late 1700s, in studies on the 
digestion of meat by secretions of the stomach. Re- 
search continued in the 1800s with examinations of the 
conversion of starch to sugar by saliva and various plant 
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extracts. In the 1850s, Louis Pasteur concluded that fer- 
mentation of sugar into alcohol by yeast is catalyzed by 
"ferments." He postulated that these ferments were in- 
separable from the structure of living yeast cells; this 
view, called vitalism, prevailed for decades. Then in 
1897 Eduard Buchner discovered that yeast extracts 
could ferment sugar to alcohol, proving that fermenta- 
tion was promoted by molecules that continued to 
function when removed from cells. Buchner's experi- 
ment at once marked the end of vitalistic notions and 
the dawn of the science of biochemistry. Frederick W. 
Kiihne later gave the name enzymes to the molecules 
detected by Buchner. 

The isolation and crystallization of urease by James 
Sumner in 1926 was a breakthrough in early enzyme 
studies. Sumner found that urease crystals consisted 
entirely of protein, and he postulated that all enzymes 
are proteins. In the absence of other examples, this idea 
remained controversial for some time. Only in the 1930s 
was Sumner's conclusion widely accepted, after John 
Northrop and Moses Kunitz crystallized pepsin, trypsin, 
and other digestive enzymes and found them also to be 
proteins. During this period, J. B. S. Haldane wrote a 
treatise entitled Enzymes. Although the molecular na- 
ture of enzymes was not yet fully appreciated, Haldane 
made the remarkable suggestion that weak bonding in- 
teractions between an enzyme and its substrate might be 
used to catalyze a reaction. This insight lies at the heart 
of our current understanding of enzymatic catalysis. 

Since the latter part of the twentieth century, thou- 
sands of enzymes have been purified, their structures 
elucidated, and their mechanisms explained. 

Most Enzymes Are Proteins 

With the exception of a small group of catalytic RNA 
molecules (Chapter 26), all enzymes are proteins. Their 
catalytic activity depends on the integrity of their native 
protein conformation. If an enzyme is denatured or dis- 
sociated into its subunits, catalytic activity is usually 
lost. If an enzyme is broken down into its component 
amino acids, its catalytic activity is always destroyed. 
Thus the primary, secondary, tertiary, and quaternary 
structures of protein enzymes are essential to their cat- 
alytic activity. 



Enzymes, like other proteins, have mo- 
lecular weights ranging from about 12,000 
to more than 1 million. Some enzymes re- 
quire no chemical groups for activity 
other than their amino acid residues. Oth- 
ers require an additional chemical compo- 
nent called a cofactor — either one or 
more inorganic ions, such as Fe 2+ , Mg 2+ , 
Mn 2+ , or Zn 2+ (Table 6-1), or a complex 
organic or metalloorganic molecule called 
a coenzyme. Coenzymes act as transient 
carriers of specific functional groups 
(Table 6-2). Most are derived from vita- 
mins, organic nutrients required in small amounts in the 
diet. We consider coenzymes in more detail as we en- 
counter them in the metabolic pathways discussed in 
Part II. Some enzymes require both a coenzyme and one 
or more metal ions for activity. A coenzyme or metal ion 
that is very tightly or even covalently bound to the en- 
zyme protein is called a prosthetic group. A complete, 
catalytically active enzyme together with its bound 
coenzyme and/or metal ions is called a holoenzyme. 
The protein part of such an enzyme is called the apoen- 
zyme or apoprotein. Finally, some enzyme proteins are 
modified covalently by phosphorylation, glycosylation, 
and other processes. Many of these alterations are in- 
volved in the regulation of enzyme activity. 

Enzymes Are Classified by the Reactions They Catalyze 

Many enzymes have been named by adding the suffix 
"-ase" to the name of their substrate or to a word or 
phrase describing their activity. Thus urease catalyzes 
hydrolysis of urea, and DNA polymerase catalyzes the 
polymerization of nucleotides to form DNA. Other en- 
zymes were named by their discoverers for a broad 
function, before the specific reaction catalyzed was 



TABLE 6-1 


Some Inorganic Ions That Serve as 
Cofactors for Enzymes 


Ions 


Enzymes 


Cu 2+ 


Cytochrome oxidase 


Fe 2+ or Fe 3+ 


Cytochrome oxidase, catalase, 




peroxidase 


K + 


Pyruvate kinase 


Mg 2+ 


Hexokinase, glucose 6-phosphatase, 




pyruvate kinase 


Mn 2+ 


Arginase, ribonucleotide reductase 


Mo 


Dinitrogenase 


Ni 2+ 


Urease 


Se 


Glutathione peroxidase 


Zn 2+ 


Carbonic anhydrase, alcohol 



dehydrogenase, carboxypeptidases 
A and B 



6.1 An Introduction to Enzymes Ql85^j 



TABLE 6-2 



Coenzyme 



Some Coenzymes That Serve as Transient Carriers of Specific Atoms or Functional Groups 

Examples of chemical groups transferred Dietary precursor in mammals 



Biocytin 
Coenzyme A 

5 ' -Deoxyadenosylcobalamin 
(coenzyme B 12 ) 

Flavin adenine dinucleotide 

Lipoate 

Nicotinamide adenine dinucleotide 
Pyridoxal phosphate 
Tetrahydrofolate 
Thiamine pyrophosphate 



C0 2 

Acyl groups 

H atoms and alkyl groups 
Electrons 

Electrons and acyl groups 
Hydride ion (:H~) 
Amino groups 
One-carbon groups 
Aldehydes 



Biotin 

Pantothenic acid and other compounds 
Vitamin B 12 

Riboflavin (vitamin B 2 ) 
Not required in diet 
Nicotinic acid (niacin) 
Pyridoxine (vitamin B 6 ) 
Folate 

Thiamine (vitamin B x ) 



Note: The structures and modes of action of these coenzymes are described in Part I 



known. For example, an enzyme known to act in the di- 
gestion of foods was named pepsin, from the Greek pep- 
sis, "digestion," and lysozyme was named for its ability 
to lyse (break down) bacterial cell walls. Still others 
were named for their source: trypsin, named in part 
from the Greek tryein, "to wear down," was obtained by 
rubbing pancreatic tissue with glycerin. Sometimes the 
same enzyme has two or more names, or two different 
enzymes have the same name. Because of such ambigui- 
ties, and the ever-increasing number of newly discovered 
enzymes, biochemists, by international agreement, have 
adopted a system for naming and classifying enzymes. 
This system divides enzymes into six classes, each with 
subclasses, based on the type of reaction catalyzed 
(Table 6-3). Each enzyme is assigned a four-part classifi- 
cation number and a systematic name, which identifies 
the reaction it catalyzes. As an example, the formal sys- 
tematic name of the enzyme catalyzing the reaction 

ATP + D-glucose > ADP + D-glucose 6-phosphate 

is ATP:glucose phosphotransferase, which indicates that 
it catalyzes the transfer of a phosphoryl group from ATP 
to glucose. Its Enzyme Commission number (E.C. num- 
ber) is 2.7.1.1. The first number (2) denotes the class 
name (transferase); the second number (7), the sub- 



class (phosphotransferase); the third number (1), a 
phosphotransferase with a hydroxyl group as acceptor; 
and the fourth number (1), D-glucose as the phosphoryl 
group acceptor. For many enzymes, a common name is 
more frequently used — in this case hexokinase. A com- 
plete list and description of the thousands of known en- 
zymes is maintained by the Nomenclature Committee of 
the International Union of Biochemistry and Molecular 
Biology (www.chem.qmul.ac.uk/iubmb/enzyme). This 
chapter is devoted primarily to principles and properties 
common to all enzymes. 

SUMMARY 6.1 An Introduction to Enzymes 

■ Life depends on powerful and specific catalysts: 
the enzymes. Almost every biochemical reaction is 
catalyzed by an enzyme. 

■ With the exception of a few catalytic RNAs, all known 
enzymes are proteins. Many require nonprotein 
coenzymes or cofactors for their catalytic function. 

■ Enzymes are classified according to the type of 
reaction they catalyze. All enzymes have formal 
E.C. numbers and names, and most have trivial 
names. 



International Classification of Enzyme 

Class no. Class name Type of reaction catalyzed 



1 Oxidoreductases Transfer of electrons (hydride ions or H atoms) 

2 Transferases Group transfer reactions 

3 Hydrolases Hydrolysis reactions (transfer of functional groups to water) 

4 Lyases Addition of groups to double bonds, or formation of double bonds by removal of groups 

5 Isomerases Transfer of groups within molecules to yield isomeric forms 

6 Ligases Formation of C—C, C— S, C— 0, and C— N bonds by condensation reactions coupled to 

cleavage of ATP or similar cofactor 




FIGURE 6-1 Binding of a substrate to an enzyme at the active site. 

The enzyme chymotrypsin, with bound substrate in red (PDB ID 
7GCH). Some key active-site amino acid residues appear as a red 
splotch on the enzyme surface. 

6.2 How Enzymes Work 

The enzymatic catalysis of reactions is essential to living 
systems. Under biologically relevant conditions, uncat- 
alyzed reactions tend to be slow — most biological 
molecules are quite stable in the neutral-pH, mild- 
temperature, aqueous environment inside cells. Further- 
more, many common chemical processes are unfavorable 
or unlikely in the cellular environment, such as the tran- 
sient formation of unstable charged intermediates or the 
collision of two or more molecules in the precise orienta- 
tion required for reaction. Reactions required to digest 
food, send nerve signals, or contract a muscle simply do 
not occur at a useful rate without catalysis. 

An enzyme circumvents these problems by provid- 
ing a specific environment within which a given reaction 
can occur more rapidly. The distinguishing feature of an 
enzyme-catalyzed reaction is that it takes place within 
the confines of a pocket on the enzyme called the active 
site (Fig. 6-1). The molecule that is bound in the 
active site and acted upon by the enzyme is called the 
substrate. The surface of the active site is lined with 
amino acid residues with substituent groups that bind 
the substrate and catalyze its chemical transformation. 
Often, the active site encloses a substrate, sequestering 
it completely from solution. The enzyme-substrate com- 
plex, whose existence was first proposed by Charles- 
Adolphe Wurtz in 1880, is central to the action of 
enzymes. It is also the starting point for mathematical 
treatments that define the kinetic behavior of enzyme- 
catalyzed reactions and for theoretical descriptions of 
enzyme mechanisms. 

Enzymes Affect Reaction Rates, Not Equilibria 

A simple enzymatic reaction might be written 

E + S ES EP E + P (6-1) 



where E, S, and P represent the enzyme, substrate, and 
product; ES and EP are transient complexes of the en- 
zyme with the substrate and with the product. 

To understand catalysis, we must first appreciate 
the important distinction between reaction equilibria 
and reaction rates. The function of a catalyst is to in- 
crease the rate of a reaction. Catalysts do not affect 
reaction equilibria. Any reaction, such as S ^ P, can be 
described by a reaction coordinate diagram (Fig. 6-2), a 
picture of the energy changes during the reaction. As dis- 
cussed in Chapter 1, energy in biological systems is de- 
scribed in terms of free energy, G. In the coordinate 
diagram, the free energy of the system is plotted against 
the progress of the reaction (the reaction coordinate). 
The starting point for either the forward or the reverse 
reaction is called the ground state, the contribution 
to the free energy of the system by an average molecule 
(S or P) under a given set of conditions. 

KEY CONVENTION: To describe the free-energy changes for 
reactions, chemists define a standard set of conditions 
(temperature 298 K; partial pressure of each gas 1 atm, 
or 101.3 kPa; concentration of each solute 1 m) and ex- 
press the free-energy change for a reacting system under 
these conditions as AG°, the standard free-energy 
change. Because biochemical systems commonly in- 
volve H + concentrations far below 1 m, biochemists de- 
fine a biochemical standard free-energy change, 
AG ro , the standard free-energy change at pH 7.0; we 
employ this definition throughout the book. A more 
complete definition of AG'° is given in Chapter 13. ■ 

The equilibrium between S and P reflects the differ- 
ence in the free energies of their ground states. In the 
example shown in Figure 6-2, the free energy of the 
ground state of P is lower than that of S, so AG' ° for the 
reaction is negative and the equilibrium favors P. The 
position and direction of equilibrium are not affected by 
any catalyst. 



Transition state [%) 
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FIGURE 6-2 Reaction coordinate diagram. The free energy of the system 
is plotted against the progress of the reaction S -> P. A diagram of this 
kind is a description of the energy changes during the reaction, and the 
horizontal axis (reaction coordinate) reflects the progressive chemical 
changes (e.g., bond breakage or formation) as S is converted to P. The ac- 
tivation energies, AC*, for the S -> P and P -» S reactions are indicated. 
AC'° is the overall standard free-energy change in the direction S -> P. 
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A favorable equilibrium does not mean that the 
S->P conversion will occur at a detectable rate. The 
rate of a reaction is dependent on an entirely different 
parameter. There is an energy barrier between S and P: 
the energy required for alignment of reacting groups, 
formation of transient unstable charges, bond re- 
arrangements, and other transformations required for 
the reaction to proceed in either direction. This is illus- 
trated by the energy "hill" in Figures 6-2 and 6-3. To 
undergo reaction, the molecules must overcome this 
barrier and therefore must be raised to a higher energy 
level. At the top of the energy hill is a point at which de- 
cay to the S or P state is equally probable (it is downhill 
either way). This is called the transition state. The 
transition state is not a chemical species with any signif- 
icant stability and should not be confused with a reac- 
tion intermediate (such as ES or EP). It is simply a 
fleeting molecular moment in which events such as bond 
breakage, bond formation, and charge development 
have proceeded to the precise point at which decay to 
either substrate or product is equally likely. The differ- 
ence between the energy levels of the ground state and 
the transition state is the activation energy, AG*. The 
rate of a reaction reflects this activation energy: a higher 
activation energy corresponds to a slower reaction. 
Reaction rates can be increased by raising the tempera- 
ture and/or pressure, thereby increasing the number of 
molecules with sufficient energy to overcome the energy 
barrier. Alternatively, the activation energy can be low- 
ered by adding a catalyst (Fig. 6-3). Catalysts enhance 
reaction rates by lowering activation energies. 

Enzymes are no exception to the rule that catalysts 
do not affect reaction equilibria. The bidirectional ar- 
rows in Equation 6-1 make this point: any enzyme that 
catalyzes the reaction S — > P also catalyzes the reaction 
P — » S. The role of enzymes is to accelerate the inter- 
conversion of S and P. The enzyme is not used up in the 
process, and the equilibrium point is unaffected. How- 
ever, the reaction reaches equilibrium much faster when 
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FIGURE 6-3 Reaction coordinate diagram comparing enzyme- 
catalyzed and uncatalyzed reactions. In the reaction S — > P, the ES and 

EP intermediates occupy minima in the energy progress curve of the 
enzyme-catalyzed reaction. The terms AC*ncat an d AC* a t correspond 
to the activation energy for the uncatalyzed reaction and the overall 
activation energy for the catalyzed reaction, respectively. The activa- 
tion energy is lower when the enzyme catalyzes the reaction. 



the appropriate enzyme is present, because the rate of 
the reaction is increased. 

This general principle is illustrated in the conver- 
sion of sucrose and oxygen to carbon dioxide and water: 

C^H^On + 120 2 12C0 2 + 11H 2 0 

This conversion, which takes place through a series of 
separate reactions, has a very large and negative AG' 0 , 
and at equilibrium the amount of sucrose present is neg- 
ligible. Yet sucrose is a stable compound, because the 
activation energy barrier that must be overcome before 
sucrose reacts with oxygen is quite high. Sucrose can be 
stored in a container with oxygen almost indefinitely 
without reacting. In cells, however, sucrose is readily 
broken down to C0 2 and H 2 0 in a series of reactions cat- 
alyzed by enzymes. These enzymes not only accelerate 
the reactions, they organize and control them so that 
much of the energy released is recovered in other chem- 
ical forms and made available to the cell for other tasks. 
The reaction pathway by which sucrose (and other sug- 
ars) is broken down is the primary energy-yielding path- 
way for cells, and the enzymes of this pathway allow the 
reaction sequence to proceed on a biologically useful 
time scale. 

Any reaction may have several steps, involving the 
formation and decay of transient chemical species called 
reaction intermediates.* A reaction intermediate is 
any species on the reaction pathway that has a finite 
chemical lifetime (longer than a molecular vibration, 
-10" 13 seconds). When the S^P reaction is catalyzed 
by an enzyme, the ES and EP complexes can be consid- 
ered intermediates, even though S and P are stable 
chemical species (Eqn 6-1); the ES and EP complexes 
occupy valleys in the reaction coordinate diagram (Fig. 
6-3). Additional, less stable chemical intermediates of- 
ten exist in the course of an enzyme-catalyzed reaction. 
The interconversion of two sequential reaction interme- 
diates thus constitutes a reaction step. When several 
steps occur in a reaction, the overall rate is determined 
by the step (or steps) with the highest activation en- 
ergy; this is called the rate-limiting step. In a simple 
case, the rate-limiting step is the highest-energy point in 
the diagram for interconversion of S and P. In practice, 
the rate-limiting step can vary with reaction conditions, 
and for many enzymes several steps may have similar 
activation energies, which means they are all partially 
rate-limiting. 

Activation energies are energy barriers to chemical 
reactions. These barriers are crucial to life itself. The rate 
at which a molecule undergoes a particular reaction 

* In this chapter, step and intermediate refer to chemical species in 
the reaction pathway of a single enzyme-catalyzed reaction. In the con- 
text of metabolic pathways involving many enzymes (discussed in Part 
II), these terms are used somewhat differently. An entire enzymatic 
reaction is often referred to as a "step" in a pathway, and the product 
of one enzymatic reaction (which is the substrate for the next enzyme 
in the pathway) is referred to as an "intermediate," 
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decreases as the activation barrier for that reaction in- 
creases. Without such energy barriers, complex macro- 
molecules would revert spontaneously to much simpler 
molecular forms, and the complex and highly ordered 
structures and metabolic processes of cells could not 
exist. Over the course of evolution, enzymes have devel- 
oped to lower activation energies selectively for reactions 
that are needed for cell survival. 

Reaction Rates and Equilibria Have Precise 
Thermodynamic Definitions 

Reaction equilibria are inextricably linked to the 
standard free-energy change for the reaction, AG' 0 , 
and reaction rates are linked to the activation energy, 
AG*. A basic introduction to these thermodynamic re- 
lationships is the next step in understanding how en- 
zymes work. 

An equilibrium such as S^P is described by 
an equilibrium constant, K e(l , or simply K (p. 24). 
Under the standard conditions used to compare bio- 
chemical processes, an equilibrium constant is denoted 
r eq (or iT) : 



From thermodynamics, the relationship between K' eq 
and AG' 0 can be described by the expression 

AG' 0 = -RT In K' eq (6 _ 3 ) 

where R is the gas constant, 8.315 J/mol ■ K, and Tis the 
absolute temperature, 298 K (25 °C). Equation 6-3 is 
developed and discussed in more detail in Chapter 13. 
The important point here is that the equilibrium con- 
stant is directly related to the overall standard free- 
energy change for the reaction (Table 6-4). A large 
negative value for AG' 0 reflects a favorable reaction 
equilibrium — but as already noted, this does not mean 
the reaction will proceed at a rapid rate. 



The rate of any reaction is determined by the con- 
centration of the reactant (or reactants) and by a rate 
constant, usually denoted by k. For the unimolecular 
reaction S^P, the rate (or velocity) of the reaction, V— 
representing the amount of S that reacts per unit time — 
is expressed by a rate equation: 

V=klS] (6-4) 

In this reaction, the rate depends only on the concentra- 
tion of S. This is called a first-order reaction. The factor k 
is a proportionality constant that reflects the probability 
of reaction under a given set of conditions (pH, tempera- 
ture, and so forth). Here, A; is a first-order rate constant 
and has units of reciprocal time, such as s -1 . If a first-or- 
der reaction has a rate constant k of 0.03 s" 1 , this may be 
interpreted (qualitatively) to mean that 3% of the avail- 
able S will be converted to P in 1 s. A reaction with a rate 
constant of 2,000 s -1 will be over in a small fraction of a 
second. If a reaction rate depends on the concentration 
of two different compounds, or if the reaction is between 
two molecules of the same compound, the reaction is 
second order and k is a second-order rate constant, with 
units of M _1 s _1 . The rate equation then becomes 

V= fc[SJ[S 2 ] (6-5) 

From transition-state theory we can derive an expres- 
sion that relates the magnitude of a rate constant to the 
activation energy: 

k = ^ e -AG*mT (6 _ 6) 
h 

where k is the Boltzmann constant and h is Planck's 
constant. The important point here is that the relation- 
ship between the rate constant k and the activation 
energy AG* is inverse and exponential. In simplified 
terms, this is the basis for the statement that a lower 
activation energy means a faster reaction rate. 

Now we turn from what enzymes do to how they 
do it. 



| TABLE 6-4 


Relationship between K'^ and AG'° 


^eq 


A<?'° (kJ/mol) 


10" 6 


34.2 


10" 5 


28.5 


10" 4 


22.8 


10" 3 


17.1 


10" 2 


11.4 


10" 1 


5.7 


1 


0.0 


10 1 


-5.7 


10 2 


-11.4 


10 3 


-17.1 



Note: The relationship is calculated from AG'° = -RT\r\ K' eQ (Eqn 6-3), 



A Few Principles Explain the Catalytic Power and 
Specificity of Enzymes 

Enzymes are extraordinary catalysts. The rate en- 
hancements they bring about are in the range of 5 to 17 
orders of magnitude (Table 6-5). Enzymes are also very 
specific, readily discriminating between substrates with 
quite similar structures. How can these enormous and 
highly selective rate enhancements be explained? What 
is the source of the energy for the dramatic lowering of 
the activation energies for specific reactions? 

The answer to these questions has two distinct but in- 
terwoven parts. The first lies in the rearrangement of co- 
valent bonds during an enzyme-catalyzed reaction. 
Chemical reactions of many types take place between sub- 
strates and enzymes' functional groups (specific amino 
acid side chains, metal ions, and coenzymes). Catalytic 
functional groups on an enzyme may form a transient 
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Some Rate Enhancements Produced 
by Enzymes 



Cyclophilin 


10 6 


Carbonic anhydrase 


10 7 


Triose phosphate isomerase 


10 9 


Carboxypeptidase A 


10 11 


Phosphoglucomutase 


10 12 


Succinyl-CoA transferase 


10 13 


Urease 


10 14 


Orotidine monophosphate decarboxylase 


10 17 



covalent bond with a substrate and activate it for reaction, 
or a group may be transiently transferred from the sub- 
strate to the enzyme. In many cases, these reactions occur 
only in the enzyme active site. Covalent interactions be- 
tween enzymes and substrates lower the activation energy 
(and thereby accelerate the reaction) by providing an al- 
ternative, lower-energy reaction path. The specific types 
of rearrangements that occur are described in Section 6.4. 

The second part of the explanation lies in the non- 
covalent interactions between enzyme and substrate. 
Much of the energy required to lower activation energies 
is derived from weak, noncovalent interactions between 
substrate and enzyme. What really sets enzymes apart 
from most other catalysts is the formation of a specific 
ES complex. The interaction between substrate and en- 
zyme in this complex is mediated by the same forces that 
stabilize protein structure, including hydrogen bonds 
and hydrophobic and ionic interactions (Chapter 4). For- 
mation of each weak interaction in the ES complex is ac- 
companied by release of a small amount of free energy 
that stabilizes the interaction. The energy derived from 
enzyme-substrate interaction is called binding energy, 
Atr B . Its significance extends beyond a simple stabiliza- 
tion of the enzyme-substrate interaction. Binding energy 
is a major source of free energy used by enzymes to 
lower the activation energies of reactions. 

Two fundamental and interrelated principles pro- 
vide a general explanation for how enzymes use nonco- 
valent binding energy: 

1. Much of the catalytic power of enzymes is ulti- 
mately derived from the free energy released in 
forming many weak bonds and interactions between 
an enzyme and its substrate. This binding energy 
contributes to specificity as well as to catalysis. 

2. Weak interactions are optimized in the reaction 
transition state; enzyme active sites are comple- 
mentary not to the substrates per se but to the 
transition states through which substrates pass as 
they are converted to products during an enzy- 
matic reaction. 

These themes are critical to an understanding of 
enzymes, and they now become our primary focus. 



Weak Interactions between Enzyme and Substrate Are 
Optimized in the Transition State 

How does an enzyme use binding energy to lower the 
activation energy for a reaction? Formation of the ES 
complex is not the explanation in itself, although some 
of the earliest considerations of enzyme mechanisms 
began with this idea. Studies on enzyme specificity 
carried out by Emil Fischer led him to propose, in 1894, 
that enzymes were structurally complementary to their 
substrates, so that they fit together like a lock and key 
(Fig. 6-4). This elegant idea, that a specific (exclusive) 
interaction between two biological molecules is medi- 
ated by molecular surfaces with complementary shapes, 
has greatly influenced the development of biochemistry, 
and such interactions lie at the heart of many biochemi- 
cal processes. However, the "lock and key" hypothesis 
can be misleading when applied to enzymatic catalysis. 
An enzyme completely complementary to its substrate 
would be a very poor enzyme, as we can demonstrate. 




FIGURE 6-4 Complementary shapes of a substrate and its binding site 
on an enzyme. The enzyme dihydrofolate reductase with its substrate 
NADP + (red), unbound (top) and bound (bottom); another bound sub- 
strate, tetrahydrofolate (yellow), is also visible (PDB ID 1RA2). In this 
model, the NADP + binds to a pocket that is complementary to it in 
shape and ionic properties, an illustration of Emil Fischer's "lock and 
key' 7 hypothesis of enzyme action. In reality, the complementarity be- 
tween protein and ligand (in this case substrate) is rarely perfect, as we 
saw in Chapter 5. 
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Consider an imaginary reaction, the breaking of a 
magnetized metal stick. The uncatalyzed reaction is 
shown in Figure 6-5a. Let's examine two imaginary 
enzymes — two "stickases" — that could catalyze this re- 
action, both of which employ magnetic forces as a para- 
digm for the binding energy used by real enzymes. We 
first design an enzyme perfectly complementary to the 
substrate (Fig. 6-5b). The active site of this stickase is a 
pocket lined with magnets. To react (break), the stick 
must reach the transition state of the reaction, but the 
stick fits so tightly in the active site that it cannot bend, 
because bending would eliminate some of the magnetic 
interactions between stick and enzyme. Such an enzyme 
impedes the reaction, stabilizing the substrate instead. 
In a reaction coordinate diagram (Fig. 6-5b), this kind 
of ES complex would correspond to an energy trough 
from which the substrate would have difficulty escaping. 
Such an enzyme would be useless. 

The modern notion of enzymatic catalysis, first pro- 
posed by Michael Polanyi (1921) and Haldane (1930), 



was elaborated by Linus Pauling in 1946: in order to cat- 
alyze reactions, an enzyme must be complementary to 
the reaction transition state. This means that optimal 
interactions between substrate and enzyme occur only in 
the transition state. Figure 6-5c demonstrates how such 
an enzyme can work. The metal stick binds to the stick- 
ase, but only a subset of the possible magnetic interac- 
tions are used in forming the ES complex. The bound 
substrate must still undergo the increase in free energy 
needed to reach the transition state. Now, however, the 
increase in free energy required to draw the stick into a 
bent and partially broken conformation is offset, or "paid 
for," by the magnetic interactions (binding energy) that 
form between the enzyme and substrate in the transition 
state. Many of these interactions involve parts of the 
stick that are distant from the point of breakage; thus in- 
teractions between the stickase and nonreacting parts of 
the stick provide some of the energy needed to catalyze 
stick breakage. This "energy payment" translates into a 
lower net activation energy and a faster reaction rate. 



(a) No enzyme 



Substrate 
(metal stick) 



Transition state 
(bent stick) 



Products 
(broken stick) 




(b) Enzyme complementary to substrate 

Magnets 
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(c) Enzyme complementary to transition state 




ES 



FIGURE 6-5 An imaginary enzyme (stickase) designed to catalyze 
breakage of a metal stick, (a) Before the stick is broken, it must first 
be bent (the transition state). In both stickase examples, magnetic in- 
teractions take the place of weak bonding interactions between en- 
zyme and substrate, (b) A stickase with a magnet-lined pocket 
complementary in structure to the stick (the substrate) stabilizes the 
substrate. Bending is impeded by the magnetic attraction between 
stick and stickase. (c) An enzyme with a pocket complementary to 
the reaction transition state helps to destabilize the stick, contributing 
to catalysis of the reaction. The binding energy of the magnetic 
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interactions compensates for the increase in free energy required to 
bend the stick. Reaction coordinate diagrams (right) show the energy 
consequences of complementarity to substrate versus complementar- 
ity to transition state (EP complexes are omitted). AG M , the difference 
between the transition-state energies of the uncatalyzed and cat- 
alyzed reactions, is contributed by the magnetic interactions be- 
tween the stick and stickase. When the enzyme is complementary to 
the substrate (b), the ES complex is more stable and has less free en- 
ergy in the ground state than substrate alone. The result is an increase 
in the activation energy. 
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FIGURE 6-6 Role of binding energy in catalysis. To lower the activation 
energy for a reaction, the system must acquire an amount of energy 
equivalent to the amount by which AC* is lowered. Much of this energy 
comes from binding energy (AC B ) contributed by formation of weak 
noncovalent interactions between substrate and enzyme in the transi- 
tion state The role of AC B is analogous to that of AC M in Figure 6-5. 

Real enzymes work on an analogous principle. Some 
weak interactions are formed in the ES complex, but the 
full complement of such interactions between substrate 
and enzyme is formed only when the substrate reaches 
the transition state. The free energy (binding energy) re- 
leased by the formation of these interactions partially off- 
sets the energy required to reach the top of the energy 
hill. The summation of the unfavorable (positive) activa- 
tion energy AG + and the favorable (negative) binding en- 
ergy AG B results in a lower net activation energy (Fig. 
6-6). Even on the enzyme, the transition state is not a sta- 
ble species but a brief point in time that the substrate 
spends atop an energy hill. The enzyme-catalyzed reac- 
tion is much faster than the uncatalyzed process, however, 
because the hill is much smaller. The important principle 
is that weak binding interactions between the enzyme 
and the substrate provide a substantial driving force 
for enzymatic catalysis. The groups on the substrate 
that are involved in these weak interactions can be at 
some distance from the bonds that are broken or changed. 
The weak interactions formed only in the transition state 
are those that make the primary contribution to catalysis. 

The requirement for multiple weak interactions to 
drive catalysis is one reason why enzymes (and some 
coenzymes) are so large An enzyme must provide func- 
tional groups for ionic, hydrogen-bond, and other inter- 
actions, and also must precisely position these groups 
so that binding energy is optimized in the transition 
state. Adequate binding is accomplished most readily 
by positioning a substrate in a cavity (the active site) 
where it is effectively removed from water. The size of 
proteins reflects the need for superstructure to keep 
interacting groups properly positioned and to keep the 
cavity from collapsing. 

Binding Energy Contributes to Reaction Specificity 
and Catalysis 

Can we demonstrate quantitatively that binding energy 
accounts for the huge rate accelerations brought about 



by enzymes? Yes. As a point of reference, Equation 6-6 
allows us to calculate that AG* must be lowered by 
about 5.7 kJ/mol to accelerate a first-order reaction by a 
factor of ten, under conditions commonly found in cells. 
The energy available from formation of a single weak in- 
teraction is generally estimated to be 4 to 30 kJ/mol. The 
overall energy available from a number of such interac- 
tions is therefore sufficient to lower activation energies 
by the 60 to 100 kJ/mol required to explain the large rate 
enhancements observed for many enzymes. 

The same binding energy that provides energy for 
catalysis also gives an enzyme its specificity, the ability 
to discriminate between a substrate and a competing 
molecule. Conceptually, specificity is easy to distinguish 
from catalysis, but this distinction is much more difficult 
to make experimentally, because catalysis and specificity 
arise from the same phenomenon. If an enzyme active site 
has functional groups arranged optimally to form a vari- 
ety of weak interactions with a particular substrate in the 
transition state, the enzyme will not be able to interact to 
the same degree with any other molecule. For example, if 
the substrate has a hydroxyl group that forms a hydrogen 
bond with a specific Glu residue on the enzyme, any mol- 
ecule lacking a hydroxyl group at that particular position 
will be a poorer substrate for the enzyme. In addition, any 
molecule with an extra functional group for which the en- 
zyme has no pocket or binding site is likely to be excluded 
from the enzyme. In general, specificity is derived from 
the formation of many weak interactions between the en- 
zyme and its specific substrate molecule. 

The importance of binding energy to catalysis can 
be readily demonstrated. For example, the glycolytic 
enzyme triose phosphate isomerase catalyzes the inter- 
conversion of glyceraldehyde 3-phosphate and dihy- 
droxyacetone phosphate: 

HC=0 H 2 C-OH 

si k I 

HC — OH v C=0 

triose 

3 CH 2 OPOi" P h0S P hate CHaOPOi" 

isomerase 

Glyceraldehyde Dihydroxyacetone 
3-phosphate phosphate 

This reaction rearranges the carbonyl and hy- 
droxyl groups on carbons 1 and 2. However, more than 
80% of the enzymatic rate acceleration has been traced 
to enzyme-substrate interactions involving the phos- 
phate group on carbon 3 of the substrate. This was de- 
termined by comparing the enzyme-catalyzed reactions 
with glyceraldehyde 3-phosphate and with glyceralde- 
hyde (no phosphate group at position 3) as substrate. 

The general principles outlined above can be illus- 
trated by a variety of recognized catalytic mechanisms. 
These mechanisms are not mutually exclusive, and a 
given enzyme might incorporate several types in its 
overall mechanism of action. 

Consider what needs to occur for a reaction to take 
place. Prominent physical and thermodynamic factors 
contributing to AG*, the barrier to reaction, might include: 
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(1) the entropy (freedom of motion) of molecules in so- 
lution, which reduces the possibility that they will react 
together; (2) the solvation shell of hydrogen-bonded wa- 
ter that surrounds and helps to stabilize most biomole- 
cules in aqueous solution; (3) the distortion of substrates 
that must occur in many reactions; and (4) the need for 
proper alignment of catalytic functional groups on the en- 
zyme. Binding energy can be used to overcome all these 
barriers. 

First, a large restriction in the relative motions of 
two substrates that are to react, or entropy reduction, 
is one obvious benefit of binding them to an enzyme. 
Binding energy holds the substrates in the proper orien- 
tation to react — a substantial contribution to catalysis, 
because productive collisions between molecules in solu- 
tion can be exceedingly rare. Substrates can be precisely 
aligned on the enzyme, with many weak interactions be- 
tween each substrate and strategically located groups on 
the enzyme clamping the substrate molecules into the 
proper positions. Studies have shown that constraining 
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the motion of two reactants can produce rate enhance- 
ments of many orders of magnitude (Fig. 6-7). 

Second, formation of weak bonds between substrate 
and enzyme results in desolvation of the substrate. En- 
zyme-substrate interactions replace most or all of the hy- 
drogen bonds between the substrate and water. Third, 
binding energy involving weak interactions formed only 
in the reaction transition state helps to compensate ther- 
modynamically for any distortion, primarily electron re- 
distribution, that the substrate must undergo to react. 

Finally, the enzyme itself usually undergoes a 
change in conformation when the substrate binds, 
induced by multiple weak interactions with the sub- 
strate. This is referred to as induced fit, a mechanism 
postulated by Daniel Koshland in 1958. The motions can 
affect a small part of the enzyme near the active site, or 
can involve changes in the positioning of entire domains. 
Typically, a network of coupled motions occurs through- 
out the enzyme that ultimately brings about the re- 
quired changes in the active site. Induced fit serves to 
bring specific functional groups on the enzyme into the 
proper position to catalyze the reaction. The conforma- 
tional change also permits formation of additional weak 
bonding interactions in the transition state. In either 
case, the new enzyme conformation has enhanced cat- 
alytic properties. As we have seen, induced fit is a com- 
mon feature of the reversible binding of ligands to 
proteins (Chapter 5). Induced fit is also important in the 
interaction of almost every enzyme with its substrate. 
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FIGURE 6-7 Rate enhancement by entropy reduction. Shown here are 
reactions of an ester with a carboxylate group to form an anhydride. The 
R group is the same in each case, (a) For this bimolecular reaction, the 



rate constant k is second order, with units of t 



(b) When the two 



reacting groups are in a single molecule, and thus have less freedom of 
motion, the reaction is much faster. For this unimolecular reaction, k has 
units of s" 1 . Dividing the rate constant for (b) by the rate constant for (a) 
gives a rate enhancement of about 1 0 5 m. (The enhancement has units of 
molarity because we are comparing a unimolecular and a bimolecular 
reaction.) Put another way if the reactant in (b) were present at a con- 
centration of 1 m, the reacting groups would behave as though they were 
present at a concentration of 1 0 5 m. Note that the reactant in (b) has free- 
dom of rotation about three bonds (shown with curved arrows), but this 
still represents a substantial reduction of entropy over (a). If the bonds 
that rotate in (b) are constrained as in (c), the entropy is reduced further 
and the reaction exhibits a rate enhancement of 10 8 m relative to (a). 



Specific Catalytic Groups Contribute to Catalysis 

In most enzymes, the binding energy used to form the ES 
complex is just one of several contributors to the overall 
catalytic mechanism. Once a substrate is bound to an en- 
zyme, properly positioned catalytic functional groups aid 
in the cleavage and formation of bonds by a variety of 
mechanisms, including general acid-base catalysis, cova- 
lent catalysis, and metal ion catalysis. These are distinct 
from mechanisms based on binding energy, because they 
generally involve transient covalent interaction with a 
substrate or group transfer to or from a substrate. 

General Acid-Base Catalysis Many biochemical re- 
actions involve the formation of unstable charged inter- 
mediates that tend to break down rapidly to their 
constituent reactant species, thus impeding the reaction 
(Fig. 6-8). Charged intermediates can often be stabi- 
lized by the transfer of protons to or from the substrate 
or intermediate to form a species that breaks down more 
readily to products. For nonenzymatic reactions, the 
proton transfers can involve either the constituents of 
water alone or other weak proton donors or acceptors. 
Catalysis of the type that uses only the H + (H 3 0 + ) or 
OH~ ions present in water is referred to as specific 
acid-base catalysis. If protons are transferred between 
the intermediate and water faster than the intermediate 
breaks down to reactants, the intermediate is effectively 
stabilized every time it forms. No additional catalysis 
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down rapidly 
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When proton transfer 
to or from H 2 0 is 
faster than the 
rate of breakdown 
of intermediates, 
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other proton donors 
or acceptors does not 
increase the rate of 
the reaction. 
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than the rate of breakdown 
of intermediates, only a 
fraction of the intermediates 
formed are stabilized. 
The presence of alternative 
proton donors (HA) or 
acceptors (B •) increases 
the rate of the reaction. 
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FIGURE 6-8 How a catalyst circumvents unfavorable charge develop- 
ment during cleavage of an amide. The hydrolysis of an amide bond, 
shown here, is the same reaction as that catalyzed by chymotrypsin and 
other proteases. Charge development is unfavorable and can be cir- 
cumvented by donation of a proton by H 3 0 + (specific acid catalysis) or 
HA (general acid catalysis), where HA represents any acid. Similarly, 
charge can be neutralized by proton abstraction by OH~ (specific base 
catalysis) or B: (general base catalysis), where B: represents any base. 

mediated by other proton acceptors or donors will oc- 
cur. In many cases, however, water is not enough. The 
term general acid-base catalysis refers to proton 
transfers mediated by other classes of molecules. For 
nonenzymatic reactions in aqueous solutions, this oc- 
curs only when the unstable reaction intermediate 
breaks down to reactants faster than protons can be 
transferred to or from water. Many weak organic acids 
can supplement water as proton donors in this situation, 
or weak organic bases can serve as proton acceptors. 

In the active site of an enzyme, a number of amino 
acid side chains can similarly act as proton donors and 
acceptors (Fig, f>~9). These groups can be precisely 
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FIGURE 6-9 Amino acids in general acid-base catalysis. Many organic 
reactions are promoted by proton donors (general acids) or proton 
acceptors (general bases). The active sites of some enzymes contain 
amino acid functional groups, such as those shown here, that can par- 
ticipate in the catalytic process as proton donors or proton acceptors. 

positioned in an enzyme active site to allow proton 
transfers, providing rate enhancements of the order of 
10 2 to 10 5 . This type of catalysis occurs on the vast ma- 
jority of enzymes. In fact, proton transfers are the most 
common biochemical reactions. 

Covalent Catalysis In covalent catalysis, a transient 
covalent bond is formed between the enzyme and the 
substrate. Consider the hydrolysis of a bond between 
groups A and B: 

h 2 o 



A— B 



A + B 



In the presence of a covalent catalyst (an enzyme with a 
nucleophilic group X:) the reaction becomes 



A— B + X: 



A— X + B 



A + X: + B 



This alters the pathway of the reaction, and it results in 
catalysis only when the new pathway has a lower acti- 
vation energy than the uncatalyzed pathway. Both of the 
new steps must be faster than the uncatalyzed reaction. 
A number of amino acid side chains, including all those 
in Figure 6-9, and the functional groups of some enzyme 
cofactors can serve as nucleophiles in the formation of 
covalent bonds with substrates. These covalent com- 
plexes always undergo further reaction to regenerate 
the free enzyme. The covalent bond formed between the 
enzyme and the substrate can activate a substrate for 
further reaction in a manner that is usually specific to 
the particular group or coenzyme. 

Metal Ion Catalysis Metals, whether tightly bound to 
the enzyme or taken up from solution along with the 
substrate, can participate in catalysis in several ways. 



^194] Enzymes 



Ionic interactions between an enzyme-bound metal and a 
substrate can help orient the substrate for reaction or 
stabilize charged reaction transition states. This use of 
weak bonding interactions between metal and substrate 
is similar to some of the uses of enzyme-substrate bind- 
ing energy described earlier. Metals can also mediate 
oxidation-reduction reactions by reversible changes in the 
metal ion's oxidation state. Nearly a third of all known en- 
zymes require one or more metal ions for catalytic activity. 

Most enzymes combine several catalytic strate- 
gies to bring about a rate enhancement. A good exam- 
ple is the use of covalent catalysis, general acid-base 
catalysis, and transition-state stabilization in the reac- 
tion catalyzed by chymotrypsin, detailed in Section 6.4. 

SUMMARY 6.2 How Enzymes Work 

■ Enzymes are highly effective catalysts, commonly 
enhancing reaction rates by a factor of 10 5 to 10 17 . 

■ Enzyme-catalyzed reactions are characterized by 
the formation of a complex between substrate and 
enzyme (an ES complex). Substrate binding occurs 
in a pocket on the enzyme called the active site. 

■ The function of enzymes and other catalysts is to 
lower the activation energy, AG*, for a reaction and 
thereby enhance the reaction rate. The equilibrium 
of a reaction is unaffected by the enzyme. 

■ A significant part of the energy used for enzymatic 
rate enhancements is derived from weak interactions 
(hydrogen bonds and hydrophobic and ionic 
interactions) between substrate and enzyme. The 
enzyme active site is structured so that some of 
these weak interactions occur preferentially in 

the reaction transition state, thus stabilizing the 
transition state. The need for multiple interactions 
is one reason for the large size of enzymes. The 
binding energy, AG B , can be used to lower substrate 
entropy or to cause a conformational change in 
the enzyme (induced fit). Binding energy also 
accounts for the exquisite specificity of enzymes 
for their substrates. 

m Additional catalytic mechanisms employed by 
enzymes include general acid-base catalysis, 
covalent catalysis, and metal ion catalysis. Catalysis 
often involves transient covalent interactions 
between the substrate and the enzyme, or group 
transfers to and from the enzyme, so as to provide 
a new, lower-energy reaction path. 

6.3 Enzyme Kinetics as an Approach to 
Understanding Mechanism 

Biochemists commonly use several approaches to study 
the mechanism of action of purified enzymes. The 
three-dimensional structure of the protein provides im- 
portant information, which is enhanced by classical 



protein chemistry and modern methods of site-directed 
mutagenesis (changing the amino acid sequence of a 
protein by genetic engineering; see Fig. 9-11). These 
technologies permit enzymologists to examine the role 
of individual amino acids in enzyme structure and ac- 
tion. However, the oldest approach to understanding 
enzyme mechanisms, and the one that remains most 
important, is to determine the rate of a reaction and 
how it changes in response to changes in experimental 
parameters, a discipline known as enzyme kinetics. 
We provide here a basic introduction to the kinetics of 
enzyme-catalyzed reactions. More advanced treat- 
ments are available in the sources cited at the end of 
the chapter. 

Substrate Concentration Affects the Rate of 
Enzyme-Catalyzed Reactions 

A key factor affecting the rate of a reaction catalyzed by 
an enzyme is the concentration of substrate, [S]. How- 
ever, studying the effects of substrate concentration is 
complicated by the fact that [S] changes during the 
course of an in vitro reaction as substrate is converted to 
product. One simplifying approach in kinetics experi- 
ments is to measure the initial rate (or initial velocity) , 
designated V 0 (Fig. 6-10). In a typical reaction, the en- 
zyme may be present in nanomolar quantities, whereas 
[S] may be five or six orders of magnitude higher. If only 
the beginning of the reaction is monitored (often the 
first 60 seconds or less), changes in [S] can be limited to 
a few percent, and [S] can be regarded as constant. V Q 
can then be explored as a function of [S], which is ad- 
justed by the investigator. The effect on V 0 of varying [S] 
when the enzyme concentration is held constant is 
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FIGURE 6-10 Initial velocities of enzyme-catalyzed reactions. A 

theoretical enzyme catalyzes the reaction S ^± P, and is present at a 
concentration sufficient to catalyze the reaction at a maximum veloc- 
ity/ tmax, of 1 AM/min. The Michael is constant, K m (explained in the 
text), is 0.5 /am. Progress curves are shown for substrate concentrations 
below, at, and above the K m . The rate of an enzyme-catalyzed reaction 
declines as substrate is converted to product. A tangent to each curve 
taken at time = 0 defines the initial velocity, V 0f of each reaction. 
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FIGURE 6-1 1 Effect of substrate concentration on the initial velocity of 
an enzyme-catalyzed reaction. The maximum velocity, V max/ is extrapo- 
lated from the plot, because V 0 approaches but never quite reaches 
^max- The substrate concentration at which V 0 is half maximal is K m the 
Michaelis constant. The concentration of enzyme in an experiment such 
as this is generally so low that [S] >> [E] even when [S] is described as 
low or relatively low. The units shown are typical for enzyme-catalyzed 
reactions and are given only to help illustrate the meaning of V 0 and [S]. 
(Note that the curve describes part of a rectangular hyperbola, with one 
asymptote at V max . If the curve were continued below [S] = 0, it would 
approach a vertical asymptote at [S] = -K m ) 

shown in Figure 6-11. At relatively low concentrations 
of substrate, V 0 increases almost linearly with an in- 
crease in [S]. At higher substrate concentrations, V 0 in- 
creases by smaller and smaller amounts in response to 
increases in [S]. Finally, a point is reached beyond which 
increases in V 0 are vanishingly small as [S] increases. 
This plateau-like V 0 region is close to the maximum 
velocity, F max 

The ES complex is the key to understanding this ki- 
netic behavior, just as it was a starting point for our dis- 
cussion of catalysis. The kinetic pattern in Figure 6-11 
led Victor Henri, following the lead of Wurtz, to propose 
in 1903 that the combination of an enzyme with its sub- 
strate molecule to form an ES complex is a necessary 
step in enzymatic catalysis. This idea was expanded into 
a general theory of enzyme action, particularly by 
Leonor Michaelis and Maud Menten in 1913. They pos- 
tulated that the enzyme first combines reversibly with 
its substrate to form an enzyme-substrate complex in a 
relatively fast reversible step: 

E + S ES (6-7) 

The ES complex then breaks down in a slower second 
step to yield the free enzyme and the reaction product P: 

ES 4^ E + P (6-8) 

k 2 

Because the slower second reaction (Eqn 6-8) must 
limit the rate of the overall reaction, the overall rate 
must be proportional to the concentration of the species 
that reacts in the second step, that is, ES, 
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At any given instant in an enzyme-catalyzed reac- 
tion, the enzyme exists in two forms, the free or un- 
combined form E and the combined form ES. At low 
[S], most of the enzyme is in the uncombined form E. 
Here, the rate is proportional to [S] because the equi- 
librium of Equation 6-7 is pushed toward formation of 
more ES as [S] increases. The maximum initial rate of 
the catalyzed reaction (V max ) is observed when virtu- 
ally all the enzyme is present as the ES complex and 
[E] is vanishingly small. Under these conditions, the 
enzyme is "saturated" with its substrate, so that fur- 
ther increases in [S] have no effect on rate. This condi- 
tion exists when [S] is sufficiently high that essentially 
all the free enzyme has been converted to the ES form. 
After the ES complex breaks down to yield the product 
P, the enzyme is free to catalyze reaction of another 
molecule of substrate. The saturation effect is a distin- 
guishing characteristic of enzymatic catalysts and is re- 
sponsible for the plateau observed in Figure 6-11. The 
pattern seen in Figure 6-11 is sometimes referred to as 
saturation kinetics. 

When the enzyme is first mixed with a large excess of 
substrate, there is an initial period, the pre-steady 
state, during which the concentration of ES builds up. 
This period is usually too short to be easily observed, last- 
ing just microseconds, and is not evident in Figure 6-10. 
The reaction quickly achieves a steady state in which 
[ES] (and the concentrations of any other intermediates) 
remains approximately constant over time. The concept 
of a steady state was introduced by G. E. Briggs and Hal- 
dane in 1925. The measured V 0 generally reflects the 
steady state, even though V 0 is limited to the early part of 
the reaction, and analysis of these initial rates is referred 
to as steady-state kinetics. 

The Relationship between Substrate Concentration and 
Reaction Rate Can Be Expressed Quantitatively 

The curve expressing the relationship between [S] and 
V Q (Fig. 6-11) has the same general shape for most en- 
zymes (it approaches a rectangular hyperbola), which 
can be expressed algebraically by the Michaelis- 
Menten equation. Michaelis and Menten derived this 
equation starting from their basic hypothesis that 
the rate-limiting step in enzymatic reactions is the 
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breakdown of the ES complex to product and free en- 
zyme. The equation is 



V [SI 



g (6-9) 



The important terms are [S], F 0 , V max , and a constant 
called the Michaelis constant, K m . All these terms are 
readily measured experimentally. 

Here we develop the basic logic and the algebraic 
steps in a modern derivation of the Michaelis-Menten 
equation, which includes the steady-state assumption 
introduced by Briggs and Haldane. The derivation starts 
with the two basic steps of the formation and break- 
down of ES (Eqns 6-7 and 6-8). Early in the reaction, 
the concentration of the product, [P], is negligible, and 
we make the simplifying assumption that the reverse re- 
action, P->S (described by /c_ 2 ), can be ignored. This 
assumption is not critical but it simplifies our task. The 
overall reaction then reduces to 



E + S 4^ ES 



E + P 



(6-10) 



V 0 is determined by the breakdown of ES to form prod- 
uct, which is determined by [ES]: 



V 0 = fc 2 [ES] 



(6-11) 



Because [ES] in Equation 6-1 1 is not easily measured ex- 
perimentally, we must begin by finding an alternative ex- 
pression for this term. First, we introduce the term [E t ], 
representing the total enzyme concentration (the sum 
of free and substrate-bound enzyme). Free or unbound 
enzyme can then be represented by [E t ] - [ES]. Also, 
because [S] is ordinarily far greater than [E t ], the 
amount of substrate bound by the enzyme at any given 
time is negligible compared with the total [S]. With these 
conditions in mind, the following steps lead us to an ex- 
pression for V 0 in terms of easily measurable parameters. 

Step 1 The rates of formation and breakdown of ES 
are determined by the steps governed by the rate con- 
stants k x (formation) and k- x + k 2 (breakdown to reac- 
tants and products, respectively), according to the 
expressions 

Rate of ES formation = ^([EJ - [ES])[S] (6-12) 

Rate of ES breakdown = jfe.^ES] + k 2 [E3] (6-13) 

Step 2 We now make an important assumption: that 
the initial rate of reaction reflects a steady state in which 
[ES] is constant— that is, the rate of formation of ES 
is equal to the rate of its breakdown. This is called 
the steady-state assumption. The expressions in 
Equations 6-12 and 6-13 can be equated for the steady 
state, giving 

&i([E t ] - [ES])[S] = £_JES] + & 2 [ES] (6-14) 

Step 3 In a series of algebraic steps, we now solve 
Equation 6-14 for [ES]. First, the left side is multiplied 
out and the right side simplified to give 



Asi[EJ[S] - MESHS] = (&_! + fc 2 )[ES] (6-15) 

Adding the term /qfESHS] to both sides of the equation 
and simplifying gives 

ME t ][S] = ttxIS] + k. 1 + k 2 )[ES] (6-16) 

We then solve this equation for [ES]: 

*i[E t ][S] 



[ES] 



MS] + k- x + k 2 



(6-17) 



This can now be simplified further, combining the rate 
constants into one expression: 



[ES] 



[E t ][S] 



[S] + (k- ± + * 2 )/Ax 



(6-18) 



The term (k- 1 + k 2 )/k l is defined as the Michaelis 
constant, K m . Substituting this into Equation 6-18 sim- 
plifies the expression to 



[ES] = 



m is] 

iC t [S] 



(6-19) 



Step 4 We can now express V 0 in terms of [ES]. Substi- 
tuting the right side of Equation 6-19 for [ES] in Equa- 
tion 6-11 gives 



jglEj] IS] 

*m + IS] 



(6-20) 



This equation can be further simplified. Because the 
maximum velocity occurs when the enzyme is saturated 
(that is, with [ES] = [E t ]) V max can be defined as fc 2 [E t ]. 
Substituting this in Equation 6-20 gives Equation 6-9: 



VmaxtS] 

K m + [S] 



This is the Michaelis-Menten equation, the rate 

equation for a one-substrate enzyme-catalyzed reac- 
tion. It is a statement of the quantitative relationship be- 
tween the initial velocity V 0} the maximum velocity 
F max , and the initial substrate concentration [S], all re- 
lated through the Michaelis constant K m . Note that K m 
has units of concentration. Does the equation fit experi- 
mental observations? Yes; we can confirm this by con- 
sidering the limiting situations where [S] is very high or 
very low, as shown in Figure 6-12. 

An important numerical relationship emerges from 
the Michaelis-Menten equation in the special case when 
V 0 is exactly one-half V max (Fig. 6-12). Then 



' max r max 



v max [S] 



2 K m + [S] 
On dividing by F max , we obtain 
1 [S] 



2 K m + [S] 
Solving for K m , we get K m + [S] = 2[S], or 



/f m = [S], when y 0 = -y ma , 



(6-21) 



(6-22) 



(6-23) 
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[S] (mM) 

FIGURE 6-12 Dependence of initial velocity on substrate concentration. 

This graph shows the kinetic parameters that define the limits of the curve 
at high and low [S]. At low [S], K m » [S] and the [S] term in the denom- 
inator of the Michael is-Menten equation (Eqn 6-9) becomes insignificant. 
The equation simplifies to V 0 = V max [$]/K m and V 0 exhibits a linear de- 
pendence on [S], as observed here. At high [S], where [S] >> K m the K m 
term in the denominator of the Michael is-Menten equation becomes in- 
significant and the equation simplifies to V 0 = \/ max ; this is consistent with 
the plateau observed at high [S]. The Michaelis-Menten equation is there- 
fore consistent with the observed dependence of V 0 on [S], and the shape 
of the curve is defined by the terms V max /K m at low [S] and V max at high [S]. 



This is a very useful, practical definition of K m : K m is 
equivalent to the substrate concentration at which V 0 is 
one-half 7 max . 

The Michaelis-Menten equation (Eqn 6-9) can be 
algebraically transformed into versions that are useful in 
the practical determination of K m and V max (Box 6-1) 
and, as we describe later, in the analysis of inhibitor ac- 
tion (see Box 6-2 on page 202). 

Kinetic Parameters Are Used to Compare 
Enzyme Activities 

It is important to distinguish between the Michaelis- 
Menten equation and the specific kinetic mechanism on 
which it was originally based. The equation describes 
the kinetic behavior of a great many enzymes, and all en- 
zymes that exhibit a hyperbolic dependence of V 0 on [S] 
are said to follow Michaelis-Menten kinetics. The 
practical rule that K m = [S] when V 0 = % V max (Eqn 
6-23) holds for all enzymes that follow Michaelis- 
Menten kinetics. (The most important exceptions to 
Michaelis-Menten kinetics are the regulatory enzymes, 
discussed in Section 6.5.) However, the Michaelis- 
Menten equation does not depend on the relatively sim- 
ple two-step reaction mechanism proposed by Michaelis 



BOX 6-1 Transformations of the Michaelis-Menten Equation: The Double-Reciprocal Plot 



The Michaelis-Menten equation 

0 K m + [S] 

can be algebraically transformed into equations that are 
more useful in plotting experimental data. One common 
transformation is derived simply by taking the recipro- 
cal of both sides of the Michaelis-Menten equation: 

1 _ K m + [S] 
V 0 V max [S] 

Separating the components of the numerator on the 
right side of the equation gives 

1_ = __gm_ IS] 

V 0 V max [S] V max [S] 
which simplifies to 

1 _ -Km 1 A 

V 0 " VmaxtS] V m ax 

This form of the Michaelis-Menten equation is called the 
Lineweaver-Burk equation. For enzymes obeying the 
Michaelis-Menten relationship, a plot of 1/V 0 versus 1/[S] 
(the "double reciprocal" of the V 0 versus [S] plot 
we have been using to this point) yields a straight line 
(Fig. 1). This line has a slope of iT m /F max , an intercept of 
l/V max on the 1/V 0 axis, and an intercept of —l/K m on 
the 1/[S] axis. The double-reciprocal presentation, also 



called a Lineweaver-Burk plot, has the great advantage 
of allowing a more accurate determination of K max , 
which can only be approximated from a simple plot of 
V 0 versus [S] (see Fig. 6-12). 

Other transformations of the Michaelis-Menten 
equation have been derived, each with some particular 
advantage in analyzing enzyme kinetic data. (See Prob- 
lem 14 at the end of this chapter.) 

The double-reciprocal plot of enzyme reaction rates 
is very useful in distinguishing between certain types of 
enzymatic reaction mechanisms (see Fig. 6-14) and in 
analyzing enzyme inhibition (see Box 6-2). 
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FIGURE 1 A double-reciprocal or Lineweaver-Burk plot 
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TABLE 6-6 



Enzyme 



and Menten (Eqn 6-10). Many enzymes that 
follow Michaelis-Menten kinetics have quite 
different reaction mechanisms, and en- 
zymes that catalyze reactions with six or 
eight identifiable steps often exhibit the 
same steady-state kinetic behavior. Even 
though Equation 6-23 holds true for many 
enzymes, both the magnitude and the real 
meaning of V max and K m can differ from one 
enzyme to the next. This is an important 
limitation of the steady-state approach to 
enzyme kinetics. The parameters F" max and 
K m can be obtained experimentally for any 
given enzyme, but by themselves they pro- 
vide little information about the number, 
rates, or chemical nature of discrete steps in the reac- 
tion. Steady-state kinetics nevertheless is the standard 
language by which biochemists compare and character- 
ize the catalytic efficiencies of enzymes. 

Interpreting F max and K m Figure 6-12 shows a sim- 
ple graphical method for obtaining an approximate value 
for K m . A more convenient procedure, using a double- 
reciprocal plot, is presented in Box 6-1. The K m can 
vary greatly from enzyme to enzyme, and even for differ- 
ent substrates of the same enzyme (Table 6-6). The term 
is sometimes used (often inappropriately) as an indicator 
of the affinity of an enzyme for its substrate. The actual 
meaning of K m depends on specific aspects of the reac- 
tion mechanism such as the number and relative rates of 
the individual steps. For reactions with two steps, 



K m for Some Enzymes and Substrates 

Substrate 



K m (mM) 



Hexokinase (brain) 

Carbonic anhydrase 
Chymotrypsin 

£-Galactosidase 
Threonine dehydratase 



ATP 0.4 

D-Glucose 0.05 
D-Fructose 1.5 

HCO3 26 

Glycyltyrosinylglycine 1 08 

A^-Benzoyltyrosinamide 2.5 

D-Lactose 4.0 

L-Threonine 5.0 



For example, consider the quite common situation 
where product release, EP -> E + P, is rate-limiting. 
Early in the reaction (when [P] is low), the overall reac- 
tion can be described by the scheme 



E + S 



EP 



E + P 



(6-25) 



&2 + & - 1 



(6-24) 



In this case, most of the enzyme is in the EP form at 
saturation, and V max = fc 3 [E t ]. It is useful to define a 
more general rate constant, A: cat , to describe the limiting 
rate of any enzyme-catalyzed reaction at saturation. If 
the reaction has several steps and one is clearly rate- 
limiting, k CSit is equivalent to the rate constant for that 
limiting step. For the simple reaction of Equation 6-10, 
^cat = ^2- For the reaction of Equation 6-25, k cat = k 3 . 
When several steps are partially rate-limiting, k cat can 
become a complex function of several of the rate con- 
stants that define each individual reaction step. In the 



When k 2 is rate-limiting, k 2 «k- Y and K m reduces to 
k-ilki, which is defined as the dissociation constant, 

K d , of the ES complex. Where these conditions hold, K m 
does represent a measure of the affinity of the enzyme for 
its substrate in the ES complex. However, this scenario 
does not apply for most enzymes. Sometimes k 2 » k- u 
and theniT m = k 2 /k 1 . In other cases, k 2 and k- 1 are com- 
parable and K m remains a more complex function of all 
three rate constants (Eqn 6-24). The Michaelis-Menten 
equation and the characteristic saturation behavior of the 
enzyme still apply, but K m cannot be considered a simple 
measure of substrate affinity. Even more 
common are cases in which the reaction goes 
through several steps after formation of ES; 
K m can then become a very complex function 
of many rate constants. 

The quantity V max also varies greatly 
from one enzyme to the next. If an enzyme 
reacts by the two-step Michaelis-Menten 
mechanism, V max = k 2 [E t ], where k 2 is rate- 
limiting. However, the number of reaction 
steps and the identity of the rate-limiting 
step(s) can vary from enzyme to enzyme. 



Michaelis-Menten equation, k cat 
tion 6-9 becomes 



^max/[E t ] ; and Equa- 



K m + [S] 



(6-26) 



The constant k cat is a first-order rate constant and hence 
has units of reciprocal time. It is also called the 
turnover number. It is equivalent to the number of 
substrate molecules converted to product in a given unit 
of time on a single enzyme molecule when the enzyme is 
saturated with substrate. The turnover numbers of sev- 
eral enzymes are given in Table 6-7. 



TABLE 6-7 



Enzyme 



I T U rnoverNumbers,UofSomeE n2 ym £ 

Substrate 



^cat ( s l ) 



Catalase 

Carbonic anhydrase 
Acetylcholinesterase 
/3-Lactamase 
Fumarase 

RecA protein (an ATPase) 



H 2 0 2 
HCO3 

Acetylcholine 
Benzylpenicillin 
Fumarate 
ATP 



40,000,000 
400,000 
14,000 
2,000 
800 
0.5 
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Comparing Catalytic Mechanisms and Efficiencies 

The kinetic parameters k cat and K m are useful for the 
study and comparison of different enzymes, whether 
their reaction mechanisms are simple or complex. Each 
enzyme has values of k cat and K m that reflect the cellular 
environment, the concentration of substrate normally 
encountered in vivo by the enzyme, and the chemistry 
of the reaction being catalyzed. 

The parameters k cat and K m also allow us to evalu- 
ate the kinetic efficiency of enzymes, but either param- 
eter alone is insufficient for this task. Two enzymes 
catalyzing different reactions may have the same k cat 
(turnover number) , yet the rates of the uncatalyzed re- 
actions may be different and thus the rate enhance- 
ments brought about by the enzymes may differ greatly. 
Experimentally, the K m for an enzyme tends to be simi- 
lar to the cellular concentration of its substrate. An en- 
zyme that acts on a substrate present at a very low 
concentration in the cell usually has a lower K m than an 
enzyme that acts on a substrate that is more abundant. 

The best way to compare the catalytic efficiencies of 
different enzymes or the turnover of different sub- 
strates by the same enzyme is to compare the ratio 
k cat /K m for the two reactions. This parameter, some- 
times called the specificity constant, is the rate 
constant for the conversion of E + S to E + P. When 
[S] << K mi Equation 6-26 reduces to the form 



V 0 = 



-[E t ][S] 



(6-27) 



V Q in this case depends on the concentration of two re- 
actants, [E t ] and [S]; therefore this is a second-order rate 
equation and the constant k cat /K m is a second-order rate 
constant with units of m~ 1 s~ 1 . There is an upper limit to 
k cat /K m , imposed by the rate at which E and S can dif- 
fuse together in an aqueous solution. This diffusion- 
controlled limit is 10 s to 10 9 m _1 s _1 , and many enzymes 
have a k cat /K m near this range (Table 6-8). Such enzymes 
are said to have achieved catalytic perfection. Note that 
different values of k cat and K m can produce the 
maximum ratio. 



■■I WORKED EXAMPLE 6-1 Determining ff m 

An enzyme is discovered that catalyzes the chemical 
reaction 



SAD 



HAPPY 



A team of motivated researchers sets out to study 
the enzyme, which they call happyase. They find that 
the fc cat for happyase is 600 s" 1 . They carry out several 
experiments. 

When [E t ] = 20 nM and [SAD] = 40 /am, the 
reaction velocity, V 0 , is 9.6 jjlm s" 1 . Calculate K m for 
the substrate SAD. 

Solution: We know k caU [E t ], [S], and V 0 . We want to solve 
for K m . Equation 6-26, in which we substitute fe ca t[E t ] for 
F max in the Michaelis-Menten equation, is most useful 
here. Substituting our known values in Equation 6-26 
allows us to solve for K m . 



^aifEJLSj 



9.6 fM s 1 = 
9.6 /am s" 1 

9.6 JLtM S 1 (K m + 40 flU) 

K m + 40 jum 
K m + 40 fiU 

Km 



(600 s _1 )(0.020 //,m)(40 ijlM) 



K m + 40 ju-M 



480jllM 2 s 1 
K m + 40 //M 

480 jLtM 2 s _1 

480 ^M 2 s _1 
9.6jLtMs _1 

50 /am 

50 fjM — 40 fxM 
10 flM 



Once you have worked with this equation, you will 
recognize shortcuts to solve problems like this. For 
example, one can calculate F max knowing that k cat [E t ] = 
V max (600 s" 1 X 0.020 jam = 12 /am s" 1 in this case). A 



TABLE 6-8 



Enzymes for Which kJK m Is Close to the Diffusion-Controlled Limit (10 s to 1 0 9 m 's 



Enzyme 


Substrate 


^cat 


(M) 


(M-'S" 1 ) 


Acetylcholinesterase 


Acetylcholine 


1.4 x 10 4 


9 X 10" 5 


1.6 x 10 8 


Carbonic anhydrase 


co 2 


1 X 10 6 


1.2 X 10~ 2 


8.3 X 10 7 




HCO; 


4 X 10 5 


2.6 X 10~ 2 


1.5 X 10 7 


Catalase 


H 2 0 2 


4 X 10 7 


1.1 x 10° 


4 X 10 7 


Crotonase 


Crotonyl-CoA 


5.7 X 10 3 


2 X 10" 5 


2.8 x 10 8 


Fumarase 


Fumarate 


8 X 10 2 


5 X 10" 6 


1.6 X 10 8 




Malate 


9 x 10 2 


2.5 X 10" 5 


3.6 x 10 7 


^-Lactamase 


Benzylpenicillin 


2.0 x 10 3 


2 X 10" 5 


1 x 10 8 



Source: Fersht, A. (1999) Structure and Mechanism in Protein Science, p. 166, W. H. Freeman and Company, New York. 
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simple rearrangement of Equation 6-26 by dividing both 
sides by 7 max gives 



V 0 



[S] 



Thus, the ratio V 0 /V max = 9.6 s _1 /12 s" 1 = 
[S]/(K m + [S]). This simplifies the process of solving 
foriC m , giving 0.25[S] or 10 /am. 



(a) Enzyme reaction involving a ternary complex 



Random order 
E ES1S2 



-> E + Pi + P 2 



Ordered S 2 

E + Si < i ESi ES1S2 



E + Pi + P 2 



■ WORKED EXAMPLE 6-2 Determining [S] 

In a separate happyase experiment using [E t ] = 10 nM, 
the reaction velocity, V 0} is measured as 3 s" 1 . What 
is the [S] used in this experiment? 

Solution: Using the same logic as in Worked Example 6-1, 
we see that the t^ max for this enzyme concentration is 
6 /i,M s -1 . Note that the V 0 is exactly half of the V max , 
Recall that K m is by definition equal to the [S] where V 0 = 
%Vmax- Thus, the [S] in this problem must be the same 
as the K m , or 10 fiu. If V 0 were anything other than 
^Knax, it would be simplest to use the expression 
V 0 /V max = [S]/(K m + [S]) to solve for [S]. 



(b) Enzyme reaction in which no ternary complex is formed 

Pi s 2 

E + Si ^ ESi E'Pi ^ E' ^ E'S 2 > E + P 2 

FIGURE 6-13 Common mechanisms for enzyme-catalyzed bisub- 
strate reactions, (a) The enzyme and both substrates come together to 
form a ternary complex. In ordered binding, substrate 1 must bind 
before substrate 2 can bind productively. In random binding, the sub- 
strates can bind in either order, (b) An enzyme-substrate complex 
forms, a product leaves the complex, the altered enzyme forms a sec- 
ond complex with another substrate molecule, and the second product 
leaves, regenerating the enzyme. Substrate 1 may transfer a functional 
group to the enzyme (to form the covalently modified E'), which is 
subsequently transferred to substrate 2. This is called a Ping-Pong or 
double-displacement mechanism. 



Many Enzymes Catalyze Reactions with Two or 
More Substrates 

We have seen how [S] affects the rate of a simple enzy- 
matic reaction (S-»P) with only one substrate mole- 
cule. In most enzymatic reactions, however, two (and 
sometimes more) different substrate molecules bind to 
the enzyme and participate in the reaction. For exam- 
ple, in the reaction catalyzed by hexokinase, ATP and 
glucose are the substrate molecules, and ADP and glu- 
cose 6-phosphate are the products: 

ATP + glucose > ADP + glucose 6-phosphate 

The rates of such bisubstrate reactions can also be 
analyzed by the Michaelis-Menten approach. Hexoki- 
nase has a characteristic K m for each of its substrates 
(Table 6-6). 

Enzymatic reactions with two substrates usually in- 
volve transfer of an atom or a functional group from one 
substrate to the other. These reactions proceed by one 
of several different pathways. In some cases, both sub- 
strates are bound to the enzyme concurrently at some 
point in the course of the reaction, forming a noncova- 
lent ternary complex (Fig. 6-13a); the substrates bind 
in a random sequence or in a specific order. In other 
cases, the first substrate is converted to product and 
dissociates before the second substrate binds, so no ter- 
nary complex is formed. An example of this is the Ping- 
Pong, or double-displacement, mechanism (Fig. 6-13b). 
Steady-state kinetics can often help distinguish among 
these possibilities (Fig. 6-14). 
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FIGURE 6-14 Steady-state kinetic analysis of bisubstrate reactions. In 

these double-reciprocal plots (see Box 6-1), the concentration of sub- 
strate 1 is varied while the concentration of substrate 2 is held con- 
stant This is repeated for several values of [S 2 ], generating several 
separate lines, (a) Intersecting lines indicate that a ternary complex is 
formed in the reaction; (b) parallel lines indicate a Ping-Pong (double- 
displacement) pathway. 
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Pre -Steady State Kinetics Can Provide Evidence for 
Specific Reaction Steps 

We have introduced kinetics as the primary method for 
studying the steps in an enzymatic reaction, and we 
have also outlined the limitations of the most common 
kinetic parameters in providing such information. The 
two most important experimental parameters obtained 
from steady-state kinetics are k cat and k cat /K m . Variation 
in k cat and k cat /K m with changes in pH or temperature 
can provide additional information about steps in a reac- 
tion pathway. In the case of bisubstrate reactions, 
steady-state kinetics can help determine whether a ter- 
nary complex is formed during the reaction (Fig. 6-14). 
A more complete picture generally requires more 
sophisticated kinetic methods that go beyond the scope 
of an introductory text. Here, we briefly introduce one 
of the most important kinetic approaches for studying 
reaction mechanisms, pre-steady state kinetics. 

A complete description of an enzyme-catalyzed reac- 
tion requires direct measurement of the rates of individual 
reaction steps — for example, the association of enzyme 
and substrate to form the ES complex. It is during the 
pre-steady state (see p. 195) that the rates of many reac- 
tion steps can be measured independently and events dur- 
ing reaction of a single substrate molecule can be observed. 
Because the pre-steady state phase is generally very short, 
the experiments often require specialized techniques for 
very rapid mixing and sampling. One objective is to gain a 
complete and quantitative picture of the energy changes 
during the reaction. As we have already noted, reaction 
rates and equilibria are related to the free-energy changes 
during a reaction. Another objective is to measure the rate 
of individual reaction steps. In a number of cases investiga- 
tors have been able to record the rates of every individual 
step in a multistep enzymatic reaction. Some examples of 
the application of pre-steady state kinetics are included in 
the descriptions of specific enzymes in Section 6.4. 

Enzymes Are Subject to Reversible or 
Irreversibie Inhibition 

Enzyme inhibitors are molecules that interfere with catal- 
ysis, slowing or halting enzymatic reactions. Enzymes 
catalyze virtually all cellular processes, so it should not be 
surprising that enzyme inhibitors are among the most im- 
portant pharmaceutical agents known. For example, as- 
pirin (acetylsalicylate) inhibits the enzyme that catalyzes 
the first step in the synthesis of prostaglandins, com- 
pounds involved in many processes, including some that 
produce pain. The study of enzyme inhibitors also has 
provided valuable information about enzyme mechanisms 
and has helped define some metabolic pathways. There 
are two broad classes of enzyme inhibitors: reversible and 
irreversible. 

Reversible Inhibition One common type of re- 
versible inhibition is called competitive (Fig. 6-1 5a). 



(a) Competitive inhibition 

E + S v ES > E + P 




(b) Uncompetitive inhibition 

E + S * ES » E + P 

+ 




(c) Mixed inhibition 

E + S . ES > E + P 

+ + 





FIGURE 6-15 Three types of reversible inhibition, (a) Competitive in- 
hibitors bind to the enzyme's active site; K| is the equilibrium constant 
for inhibitor binding to E. (b) Uncompetitive inhibitors bind at a separate 
site, but bind only to the ES complex; K{ is the equilibrium constant for 
inhibitor binding to ES. (c) Mixed inhibitors bind at a separate site, but 
may bind to either E or ES. 

A competitive inhibitor competes with the substrate 
for the active site of an enzyme. While the inhibitor (I) 
occupies the active site it prevents binding of the sub- 
strate to the enzyme. Many competitive inhibitors are 
structurally similar to the substrate and combine with 
the enzyme to form an EI complex, but without leading 
to catalysis. Even fleeting combinations of this type will 
reduce the efficiency of the enzyme. By taking into ac- 
count the molecular geometry of inhibitors we can reach 
conclusions about which parts of the normal substrate 
bind to the enzyme. Competitive inhibition can be 
analyzed quantitatively by steady-state kinetics. In the 
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presence of a competitive inhibitor, the Michaelis- 
Menten equation (Eqn 6-9) becomes 



V ma JS] 



Where 



1-3 



and 



§(6-28) 



[E][I] 
[EI] 



Equation 6-28 describes the important features of com- 
petitive inhibition. The experimentally determined vari- 
able aK m , the K m observed in the presence of the 
inhibitor, is often called the "apparent" K m . 

Because the inhibitor binds reversibly to the enzyme, 
the competition can be biased to favor the substrate sim- 
ply by adding more substrate. When [S] far exceeds [I], 
the probability that an inhibitor molecule will bind to the 
enzyme is minimized and the reaction exhibits a normal 



V max . However, the [S] at which V 0 = V 2 V maK , the appar- 
ent K m , increases in the presence of inhibitor by the fac- 
tor a. This effect on apparent K m , combined with the 
absence of an effect on F max , is diagnostic of competitive 
inhibition and is readily revealed in a double-reciprocal 
plot (Box 6-2). The equilibrium constant for inhibitor 
binding, K h can be obtained from the same plot. 

A medical therapy based on competition at the 
active site is used to treat patients who have in- 
gested methanol, a solvent found in gas-line antifreeze. 
The liver enzyme alcohol dehydrogenase converts 
methanol to formaldehyde, which is damaging to many 
tissues. Blindness is a common result of methanol 
ingestion, because the eyes are particularly sensitive 
to formaldehyde. Ethanol competes effectively with 
methanol as an alternative substrate for alcohol dehydro- 
genase. The effect of ethanol is much like that of a com- 
petitive inhibitor, with the distinction that ethanol is also 



BOX 6-2 Kinetic Tests for Determining Inhibition Mechanisms 



The double-reciprocal plot (see Box 6-1) offers an easy 
way of determining whether an enzyme inhibitor is 
competitive, uncompetitive, or mixed. Two sets of rate 
experiments are carried out, with the enzyme concen- 
tration held constant in each set. In the first set, [S] 
is also held constant, permitting measurement of the 
effect of increasing inhibitor concentration [I] on the 
initial rate V 0 (not shown). In the second set, [I] is held 
constant but [S] is varied. The results are plotted as 1/V 0 
versus 1/[S]. 

Figure 1 shows a set of double-reciprocal plots, one 
obtained in the absence of inhibitor and two at different 
concentrations of a competitive inhibitor. Increasing [I] 
results in a family of lines with a common intercept on 
the 1/F 0 axis but with different slopes. Because the in- 



tercept on the 1/V 0 axis equals 1/F maxj we know that 
^max is unchanged by the presence of a competitive in- 
hibitor. That is, regardless of the concentration of a 
competitive inhibitor, a sufficiently high substrate con- 
centration will always displace the inhibitor from the en- 
zyme's active site. Above the graph is the rearrangement 
of Equation 6-28 on which the plot is based. The value 
of a can be calculated from the change in slope at any 
given [I]. Knowing [I] and a, we can calculate K T from the 
expression 

[I] 



1 + 



For uncompetitive and mixed inhibition, similar plots of 
rate data give the families of lines shown in Figures 2 and 3. 
Changes in axis intercepts signal changes in F max and K m . 
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a substrate for alcohol dehydrogenase and its concentra- 
tion will decrease over time as the enzyme converts it to 
acetaldehyde. The therapy for methanol poisoning is 
slow intravenous infusion of ethanol, at a rate that main- 
tains a controlled concentration in the bloodstream for 
several hours. This slows the formation of formaldehyde, 
lessening the danger while the kidneys filter out the 
methanol to be excreted harmlessly in the urine. ■ 

Two other types of reversible inhibition, uncompet- 
itive and mixed, though often denned in terms of one- 
substrate enzymes, are in practice observed only with 
enzymes having two or more substrates. An uncompet- 
itive inhibitor (Fig. 6-1 5b) binds at a site distinct from 
the substrate active site and, unlike a competitive 
inhibitor, binds only to the ES complex. In the presence 
of an uncompetitive inhibitor, the Michaelis-Menten 
equation is altered to 



Where 



Vn 



IE 



JS] 



K m + a'[S] 



§(6-29) 



and 



[ES][I] 
[ESI] 



As described by Equation 6-29, at high concentrations of 
substrate, V 0 approaches V max /a'. Thus, an uncompeti- 
tive inhibitor lowers the measured V max . Apparent K m 
also decreases, because the [S] required to reach one-half 
V max decreases by the factor a'. 

A mixed inhibitor (Fig. 6-15c) also binds at a site 
distinct from the substrate active site, but it binds to 
either E or ES. The rate equation describing mixed 
inhibition is 



:\S] 



aK ni + a'-lS] 



§ (6-30) 



where a and a' are denned as above. A mixed inhibitor 
usually affects both K m and V max - The special case of 
a = a', rarely encountered in experiments, classically 
has been defined as noncompetitive inhibition. 
Examine Equation 6-30 to see why a noncompetitive 
inhibitor would affect the V max but not the K m . 

Equation 6-30 is a general expression for the effects 
of reversible inhibitors, simplifying to the expressions 
for competitive and uncompetitive inhibition when a' = 
1.0 or a = 1.0, respectively. From this expression we can 
summarize the effects of inhibitors on individual kinetic 
parameters. For all reversible inhibitors, apparent V max = 
V ma Ja\ because the right side of Equation 6-30 always 
simplifies to V max /a f at sufficiently high substrate 
concentrations. For competitive inhibitors, a* = 1.0 and 
can thus be ignored. Taking this expression for apparent 
F max , we can also derive a general expression for appar- 
ent K m to show how this parameter changes in the pres- 
ence of reversible inhibitors. Apparent K m , as always, 
equals the [S] at which V Q is one-half apparent V max or, 



TABLE 6-9 



Inhibitor type 



Effects of Reversible Inhibitors 
Apparent and Apparent ff t 

Apparent F max Apparent K x 



on 



None 

Competitive 

Uncompetitive 

Mixed 



Knax 



K m 
ctK m 
Kja' 
aKja ' 



more generally, when V 0 = V m J2a\ This condition is 
met when [S] = aKJa\ Thus, apparent K m = aKJot'. 
This expression is simpler when either a or a' is 1.0 (for 
uncompetitive or competitive inhibitors), as summa- 
rized in Table 6-9. 

In practice, uncompetitive and mixed inhibition are 
observed only for enzymes with two or more sub- 
strates—say, Si and S 2 — and are very important in the 
experimental analysis of such enzymes. If an inhibitor 
binds to the site normally occupied by it may act as a 
competitive inhibitor in experiments in which [SJ is var- 
ied. If an inhibitor binds to the site normally occupied by 
S 2) it may act as a mixed or uncompetitive inhibitor of 
The actual inhibition patterns observed depend on 
whether the S r and S 2 -binding events are ordered or 
random, and thus the order in which substrates bind and 
products leave the active site can be determined. Use of 
one of the reaction products as an inhibitor is often par- 
ticularly informative. If only one of two reaction products 
is present, no reverse reaction can take place. However, 
a product generally binds to some part of the active site, 
thus serving as an inhibitor. Enzymologists can use elab- 
orate kinetic studies involving different combinations 
and amounts of products and inhibitors to develop a de- 
tailed picture of the mechanism of a bisubstrate reaction. 

Wm WORKED EXAMPLE 6-3 Effect of Inhibitor on K m 

The researchers working on happyase (see Worked Ex- 
amples 6-1 and 6-2, pp. 199, 200) discover that the 
compound STRESS is a potent competitive inhibitor of 
happyase. Addition of 1 nM STRESS increases the mea- 
sured K m for SAD by a factor of 2. What are the values 
for a and a' under these conditions? 

Solution: Recall that the apparent K mi the K m measured in 
the presence of a competitive inhibitor, is denned as aK m . 
Because K m for SAD increases by a factor of 2 in the pres- 
ence of 1 nM STRESS, the value of a must be 2. The value 
of a' for a competitive inhibitor is 1 by definition. 

Irreversible Inhibition The irreversible inhibitors 

bind covalently with or destroy a functional group on an 
enzyme that is essential for the enzyme's activity, or form 
a particularly stable noncovalent association. Formation 
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FIGURE 6-16 Irreversible inhibition. Reaction of chymotrypsin with 
diisopropylfluorophosphate (DIFP), which modifies Ser 195 , irreversibly 
inhibits the enzyme. This has led to the conclusion that Ser 195 is the 
key active-site Ser residue in chymotrypsin. 

of a covalent link between an irreversible inhibitor and an 
enzyme is common. Irreversible inhibitors are another 
useful tool for studying reaction mechanisms. Amino 
acids with key catalytic functions in the active site can 
sometimes be identified by determining which residue is 
covalently linked to an inhibitor after the enzyme is 
inactivated. An example is shown in Figure 6-16. 

A special class of irreversible inhibitors is the sui- 
cide inactivators. These compounds are relatively un- 
reactive until they bind to the active site of a specific 
enzyme. A suicide inactivator undergoes the first few 
chemical steps of the normal enzymatic reaction, but in- 
stead of being transformed into the normal product, the 
inactivator is converted to a very reactive compound 
that combines irreversibly with the enzyme. These com- 
pounds are also called mechanism-based inactiva- 
tors, because they hijack the normal enzyme reaction 
mechanism to inactivate the enzyme. Suicide inactiva- 
tors play a significant role in rational drug design, a 
modern approach to obtaining new pharmaceutical 
agents in which chemists synthesize novel substrates 
based on knowledge of substrates and reaction mecha- 
nisms. A well-designed suicide inactivator is specific for 
a single enzyme and is unreactive until it is within that 
enzyme's active site, so drugs based on this approach 
can offer the important advantage of few side effects 
(see Box 22-3). Some examples of irreversible in- 
hibitors of medical importance are described at the end 
of Section 6.4. 

Enzyme Activity Depends on pH 

Enzymes have an optimum pH (or pH range) at which 
their activity is maximal (Fig. 6-17); at higher or lower 
pH, activity decreases. This is not surprising. Amino 
acid side chains in the active site may act as weak acids 
and bases with critical functions that depend on their 
maintaining a certain state of ionization, and elsewhere 
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FIGURE 6-17 The pH-activity profiles of two enzymes. These curves are 
constructed from measurements of initial velocities when the reaction is 
carried out in buffers of different pH. Because pH is a logarithmic scale 
reflecting tenfold changes in [H + ], the changes in V 0 are also plotted on 
a logarithmic scale. The pH optimum for the activity of an enzyme is 
generally close to the pH of the environment in which the enzyme is 
normally found. Pepsin, which hydrolyzes certain peptide bonds of pro- 
teins during digestion in the stomach, has a pH optimum of about 1 .6. 
The pH of gastric juice is between 1 and 2. Glucose 6-phosphatase of 
hepatocytes (liver cells), with a pH optimum of about 7.8, is responsible 
for releasing glucose into the blood. The normal pH of the cytosol of 
hepatocytes is about 7.2. 



in the protein ionized side chains may play an essential 
role in the interactions that maintain protein structure. 
Removing a proton from a His residue, for example, 
might eliminate an ionic interaction essential for stabi- 
lizing the active conformation of the enzyme. A less 
common cause of pH sensitivity is titration of a group on 
the substrate. 

The pH range over which an enzyme undergoes 
changes in activity can provide a clue to the type of 
amino acid residue involved (see Table 3-1), A change 
in activity near pH 7.0, for example, often reflects titra- 
tion of a His residue. The effects of pH must be inter- 
preted with some caution, however. In the closely 
packed environment of a protein, the pif a of amino acid 
side chains can be significantly altered. For example, a 
nearby positive charge can lower the p7f a of a Lys 
residue, and a nearby negative charge can increase it. 
Such effects sometimes result in a pif a that is shifted by 
several pH units from its value in the free amino acid. In 
the enzyme acetoacetate decarboxylase, for example, 
one Lys residue has a pK a of 6.6 (compared with 10.5 in 
free lysine) due to electrostatic effects of nearby 
positive charges. 

SUMMARY 6.3 Enzyme Kinetics as an 

Approach to Understanding 
Mechanism 

■ Most enzymes have certain kinetic properties in 
common. When substrate is added to an enzyme, 
the reaction rapidly achieves a steady state in 
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which the rate at which the ES complex forms 
balances the rate at which it breaks down. As [S] 
increases, the steady-state activity of a fixed 
concentration of enzyme increases in a hyperbolic 
fashion to approach a characteristic maximum rate, 
V maxy at which essentially all the enzyme has 
formed a complex with substrate. 

■ The substrate concentration that results in a reac- 
tion rate equal to one-half F max is the Michaelis 
constant K m , which is characteristic for each en- 
zyme acting on a given substrate. The Michaelis- 
Menten equation 

0 K m + [S] 

relates initial velocity to [S] and V max through the 
constant K m . Michaelis-Menten kinetics is also 
called steady-state kinetics. 

■ K m and V max have different meanings for different 
enzymes. The limiting rate of an enzyme-catalyzed 
reaction at saturation is described by the constant 
/c cat , the turnover number. The ratio k CSLt /K m 
provides a good measure of catalytic efficiency. 
The Michaelis-Menten equation is also applicable 
to bisubstrate reactions, which occur by ternary- 
complex or Ping-Pong (double-displacement) 
pathways. 

■ Reversible inhibition of an enzyme may be 
competitive, uncompetitive, or mixed. Competitive 
inhibitors compete with substrate by binding 
reversibly to the active site, but they are not 
transformed by the enzyme. Uncompetitive in- 
hibitors bind only to the ES complex, at a site 
distinct from the active site. Mixed inhibitors bind 
to either E or ES, again at a site distinct from the 
active site. In irreversible inhibition an inhibitor 
binds permanently to an active site by forming 

a covalent bond or a very stable noncovalent 
interaction. 

■ Every enzyme has an optimum pH (or pH range) at 
which it has maximal activity 

6,4 Examples of Enzymatic Reactions 

Thus far we have focused on the general principles of 
catalysis and on introducing some of the kinetic pa- 
rameters used to describe enzyme action. We now 
turn to several examples of specific enzyme reaction 
mechanisms. 

An understanding of the complete mechanism of ac- 
tion of a purified enzyme requires identification of all 
substrates, cofactors, products, and regulators. More- 
over, it requires a knowledge of (1) the temporal se- 
quence in which enzyme-bound reaction intermediates 
form, (2) the structure of each intermediate and each 
transition state, (3) the rates of interconversion 



between intermediates, (4) the structural relationship 
of the enzyme to each intermediate, and (5) the energy 
contributed by all reacting and interacting groups to 
intermediate complexes and transition states. As yet, 
there is probably no enzyme for which we have an 
understanding that meets all these requirements. 

We present here the mechanisms for four enzymes: 
chymotrypsin, hexokinase, enolase, and lysozyme. 
These examples are not intended to cover all possible 
classes of enzyme chemistry. They are chosen in part 
because they are among the best understood enzymes, 
and in part because they clearly illustrate some general 
principles outlined in this chapter. The discussion con- 
centrates on selected principles, along with some key 
experiments that have helped to bring these principles 
into focus. We use the chymotrypsin example to review 
some of the conventions used to depict enzyme mecha- 
nisms. Much mechanistic detail and experimental evi- 
dence is necessarily omitted; no one book could 
completely document the rich experimental history of 
these enzymes. Also absent from these discussions is 
the special contribution of coenzymes to the catalytic 
activity of many enzymes. The function of coenzymes is 
chemically varied, and we describe each as it is encoun- 
tered in Part II. 

The Chymotrypsin Mechanism Involves Acylation and 
Deacylation of a Ser Residue 

Bovine pancreatic chymotrypsin (M r 25,191) is a pro- 
tease, an enzyme that catalyzes the hydrolytic cleavage of 
peptide bonds. This protease is specific for peptide bonds 
adjacent to aromatic amino acid residues (Trp, Phe, Tvr). 
The three-dimensional structure of chymotrypsin is 
shown in Figure 6-18, with functional groups in the 
active site emphasized. The reaction catalyzed by this 
enzyme illustrates the principle of transition-state stabi- 
lization and also provides a classic example of general 
acid-base catalysis and covalent catalysis. 

Chymotrypsin enhances the rate of peptide bond 
hydrolysis by a factor of at least 10 9 . It does not catalyze 
a direct attack of water on the peptide bond; instead, a 
transient covalent acyl-enzyme intermediate is formed. 
The reaction thus has two distinct phases. In the acyla- 
tion phase, the peptide bond is cleaved and an ester 
linkage is formed between the peptide carbonyl carbon 
and the enzyme. In the deacylation phase, the ester 
linkage is hydrolyzed and the nonacylated enzyme re- 
generated. 

The first evidence for a covalent acyl-enzyme inter- 
mediate came from a classic application of pre-steady 
state kinetics. In addition to its action on polypeptides, 
chymotrypsin also catalyzes the hydrolysis of small es- 
ters and amides. These reactions are much slower than 
hydrolysis of peptides because less binding energy is 
available with smaller substrates, and they are therefore 
easier to study. Investigations by B. S. Hartley and B. A. 
Kilby in 1954 found that chymotrypsin hydrolysis of the 
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FIGURE 6-18 Structure of chymotrypsin. (PDB ID 7GCH) (a) A representation of primary structure, show- 
ing disulfide bonds and the amino acid residues crucial to catalysis. The protein consists of three polypep- 
tide chains linked by disulfide bonds. (The numbering of residues in chymotrypsin, with "missing" residues 
14, 15, 147, and 148, is explained in Fig. 6-38.) The active-site amino acid residues are grouped together 
in the three-dimensional structure, (b) A depiction of the enzyme emphasizing its surface. The pocket in 
which the aromatic amino acid side chain of the substrate is bound is shown in green. Key active-site 
residues, including Ser 195 , His 57 , and Asp 102 , are red. The roles of these residues in catalysis are illustrated 
in Figure 6-21 . (c) The polypeptide backbone as a ribbon structure. Disulfide bonds are yellow; the three 
chains are colored as in part (a), (d) A close-up of the active site with a substrate (mostly green) bound. 
Two of the active-site residues, Ser 195 and His 57 (both red), are partly visible. The hydroxyl of Ser 195 attacks 
the carbonyl group of the substrate (the oxygen is purple); the developing negative charge on the oxygen 
is stabilized by the oxyanion hole (amide nitrogens, including one from Ser 195 , in orange), as explained in 
Figure 6-21 . In the substrate, the aromatic amino acid side chain and the amide nitrogen of the peptide 
bond to be cleaved (protruding toward the viewer and projecting the path of the rest of the substrate 
polypeptide chain) are in blue. 



ester p-nitrophenylacetate, as measured by release of p- 
nitrophenol, proceeds with a rapid burst before leveling 
off to a slower rate (Fig. 6-19). By extrapolating back 
to zero time, they concluded that the burst phase corre- 
sponded to just under one molecule of p-nitrophenol re- 
leased for every enzyme molecule present. Hartley and 
Kilby suggested that this reflected a rapid acylation of 
all the enzyme molecules (with release of p-nitrophe- 
nol), with the rate for subsequent turnover of the en- 
zyme limited by a slow deacylation step. Similar results 
have since been obtained with many other enzymes. The 
observation of a burst phase provides yet another exam- 
ple of the use of kinetics to break down a reaction into 
its constituent steps. 



FIGURE 6-19 Pre-steady state kinetic evidence for an acyl-enzyme 
intermediate. The hydrolysis of p-nitrophenylacetate by chymotrypsin 
is measured by release of p-nitrophenol (a colored product). Initially, 
the reaction releases a rapid burst of p-nitrophenol nearly stoichiomet- 
ric with the amount of enzyme present. This reflects the fast acylation 
phase of the reaction. The subsequent rate is slower, because enzyme 
turnover is limited by the rate of the slower deacylation phase. 
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Additional features of the chymotrypsin mechanism 
have been discovered by analyzing the dependence of the 
reaction on pH. The rate of chymotrypsin-catalyzed 
cleavage generally exhibits a bell-shaped pH-rate profile 
(Fig. 6-20). The rates plotted in Figure 6-20a are ob- 
tained at low (subsaturating) substrate concentrations 
and therefore represent k cat /K m (see Eqn 6-27, p. 199). 
The plot can be dissected further by first obtaining the 
maximum rates at each pH, and then plotting k cat alone 
versus pH (Fig. 6-20b); after obtaining the K m at each 
pH, researchers can then plot 1/K m (Fig. 6-20c). Kinetic 
and structural analyses have revealed that the change in 
k eat reflects the ionization state of His 57 . The decline in 
k cat at low pH results from protonation of His 57 (so that it 
cannot extract a proton from Ser 195 in step (T) of the re- 
action; see Fig. 6-21). This rate reduction illustrates the 
importance of general acid and general base catalysis in 
the mechanism for chymotrypsin. The changes in the 
l/K m term reflect the ionization of the a-amino group of 
He 16 (at the amino-terminal end of one of chymotrypsin's 
three polypeptide chains). This group forms a salt bridge 
to Asp 194 , stabilizing the active conformation of the 



FIGURE 6-20 The pH dependence of chymotrypsin-catalyzed reac- 
tions, (a) The rates of chymotrypsin-mediated cleavage produce a bell- 
shaped pH-rate profile with an optimum at pH 8.0. The rate (v) being 
plotted is that at low substrate concentrations and thus reflects the term 
k c JK m . The plot can be broken down to its components by using kinetic 
methods to determine the terms k cat and K m separately at each pH. 
When this is done (b and c), it becomes clear that the transition just 
above pH 7 is due to changes in k cat/ whereas the transition above pH 8.5 
is due to changes in 1/K m . Kinetic and structural studies have shown that 
the transitions illustrated in (b) and (c) reflect the ionization states of the 
His 57 side chain (when substrate is not bound) and the a-amino group of 
lie 16 (at the amino terminus of the B chain), respectively. For optimal 
activity, His 57 must be unprotonated and lie 16 must be protonated. 



enzyme. When this group loses its proton at high pH, the 
salt bridge is eliminated and a conformational change 
closes the hydrophobic pocket where the aromatic amino 
acid side chain of the substrate inserts (Fig. 6-18). Sub- 
strates can no longer bind properly, which is measured 
kinetically as an increase in K m . 

The nucleophile in the acylation phase is the oxygen 
of Ser 195 . (Proteases with a Ser residue that plays this role 
in reaction mechanisms are called serine proteases.) The 
pK a of a Ser hydroxyl group is generally too high for the 
unprotonated form to be present in significant concentra- 
tions at physiological pH. However, in chymotrypsin, 
Ser 195 is linked to His 57 and Asp 102 in a hydrogen-bonding 
network referred to as the catalytic triad. When a pep- 
tide substrate binds to chymotrypsin, a subtle change in 
conformation compresses the hydrogen bond between 
His 57 and Asp 102 , resulting in a stronger interaction, called 
a low-barrier hydrogen bond. This enhanced interaction 
increases the pK a of His 57 from ~7 (for free histidine) to 
>12, allowing the His residue to act as an enhanced gen- 
eral base that can remove the proton from the Ser 195 hy- 
droxyl group. Deprotonation prevents development of a 
very unstable positive charge on the Ser 195 hydroxyl and 
makes the Ser side chain a stronger nucleophile. At later 
reaction stages, His 57 also acts as a proton donor, proto- 
nating the amino group in the displaced portion of the sub- 
strate (the leaving group). 

As the Ser 195 oxygen attacks the carbonyl group 
of the substrate, a very short-lived tetrahedral interme- 
diate is formed in which the carbonyl oxygen acquires a 
negative charge (Fig. 6-21, step (D). This charge, form- 
ing within a pocket on the enzyme called the oxyanion 
hole, is stabilized by hydrogen bonds contributed by 
the amide groups of two peptide bonds in the chy- 
motrypsin backbone. One of these hydrogen bonds 
(contributed by Gly 193 ) is present only in this interme- 
diate and in the transition states for its formation and 
breakdown; it reduces the energy required to reach 
these states. This is an example of the use of binding 
energy in catalysis. 

The role of transition state complementarity in 
enzyme catalysis is further explored in Box 6-3. 
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How to Read Reaction Mechanisms— 
A Refresher Asp™)-c( 
Chemical reaction mechanisms, which 
trace the formation and breakage of 
covalent bonds, are communicated with 
dots and curved arrows, a convention 
known informally as "electron pushing. 1 ' 
A covalent bond consists of a shared 
pair of electrons. Nonbonded electrons 
important to the reaction mechanism 
are designated by dots (— OH). Curved 
arrows ( represent the movement of 
electron pairs. For movement of a single 
electron (as in a free radical reaction), a single- 
headed (fishhook-type) arrow is used Most 
reaction steps involve an unshared electron pair (as in the 
chymotrypsin mechanism). 

Some atoms are more electronegative than others; that is, they 
more strongly attract electrons. The relative electronegativities of 
atoms encountered in this text are F > 0 > N > C ~ S > P « H. For 
example, the two electron pairs making up a C = 0 (carbonyl) 
bond are not shared equally; the carbon is relatively electron- 
deficient as the oxygen draws away the electrons. Many reactions 
involve an electron-rich atom (a nucleophile) reacting with an 
electron-deficient atom (an electrophile). Some common 
nucleophiles and electrophiles in biochemistry are shown at right. 

In general, a reaction mechanism is initiated at an unshared 
electron pair of a nucleophile. In mechanism diagrams, the base 
of the electron-pushing arrow originates near the electron-pair 
dots, and the head of the arrow points directly at the electro- 
philic center being attacked. Where the unshared electron pair 
confers a formal negative charge on the nucleophile, the negative 
charge symbol itself can represent the unshared electron pair 
and serves as the base of the arrow. In the chymotrypsin mech- 
anism, the nucleophilic electron pair in the ES complex between 
steps (D and (g) is provided by the oxygen of the Ser 195 hydroxyl 
group. This electron pair (2 of the 8 valence electrons of the 
hydroxyl oxygen) provides the base of the curved arrow. The 
electrophilic center under attack is the carbonyl carbon of the 
peptide bond to be cleaved. The C, 0, and N atoms have a max- 
imum of 8 valence electrons, and H has a maximum of 2, These 
atoms are occasionally found in unstable states with less than 
their maximum allotment of electrons, but C, 0, and N cannot 
have more than 8. Thus, when the electron pair from chymo- 
trypsin's Ser 195 attacks the substrate's carbonyl carbon, an 
electron pair is displaced from the carbon valence shell (you 
cannot have 5 bonds to carbon!). These electrons move toward 
the more electronegative carbonyl oxygen. The oxygen has 8 
valence electrons both before and after this chemical process, but 
the number shared with the carbon is reduced from 4 to 2, and 
the carbonyl oxygen acquires a negative charge. In the next step, 
the electron pair conferring the negative charge on the oxygen 
moves back to re-form a bond with carbon and reestablish the 
carbonyl linkage. Again, an electron pair must be displaced from 
the carbon, and this time it is the electron pair shared with the 
amino group of the peptide linkage. This breaks the peptide 
bond. The remaining steps follow a similar pattern. 
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When substrate binds, the side 
chain of the residue adjacent to 
the peptide bond to be cleaved 
nestles in a hydrophobic pocket 
on the enzyme, positioning the 
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Interaction of Ser 1 ** 6 and His 67 generates a 
strongly nucleophilic alkoxide ion on Ser 195 ; 
the ion attacks the peptide car bony I group, 
forming a tetrahedral acyl- CO mplex 
enzyme* This is accom- 
panied by formation 
of a short-lived 
negative charge 
on the car bony] 
oxygen of the 
substrate, which 
is stabilized by 
hydrogen bond* , 
ing in the 
oxyanion hole. 



\HIs Bi 

vS 

'HO-fSar 1 * 6 
? 

i c— ch— mh- 



Short-lived 
intermediate* 

(acylation) 



■AA Jf 



Instability of the negative charge on 
the substrate carbonyl oxygen leads to 
collapse of the tetrahedral inter- 
mediate; re- formation of a double bond 
with carbon displaces the bond between 
carbon and the amino group of the 
peptide linkage, breaking the peptide 
bond* The amino leaving 
Hjs^ group is proton ated by 

His 57 , facilitating its 
displacement. 
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MECHANISM FIGURE 6-21 Hydrolytic cleavage of a peptide bond by 
chymotrypsin. The reaction has two phases. In the acylation phase 
(steps to ), formation of a covalent acyl-enzyme intermediate is 
coupled to cleavage of the peptide bond. In the deacylation phase 
(steps to ), deacylation regenerates the free enzyme; this is es- 
sentially the reverse of the acylation phase, with water mirroring, in re- 
verse, the role of the amine component of the substrate. 
3 Chymotrypsin Mechanism 



Short-lived 
intermediate* 

(deacylation) 





His 5 " 



Ser 11 



H-O— C- 
0* 



GIv 



lH,t 



Collapse 
of the 

tetrahedral 
intermediate 
forms the second 
product, a carboxylate anion, 
and displaces Ser 1915 , 




Product 1 

aa„-c-ch-™h 
h a 1 



A-CH-NH-AA n 
0 




H-0' 



Acyl-enzyme 
intermediate 



EEs* 




H-0< 



Gly 1 " 



kCH— NH-AA, 



Ser lfl5 




Gly 193 Bar™ 

Acyl-enzyme 
intermediate 



An incoming water 
molecule is de proton ated 
by general base catalysis, 
generating a strongly 
nucleophilic hydroxide ion. 
Attack of hydroxide on the 
ester linkage of the acyl- 
enzyme generates a second 
tetrahedral intermediate* with 
oxygen in the oxyanion hole 
again taking on a negative charge. 



The tetrahedral intermediate in the chymotrypsin reaction pathway, 
and the second tetrahedral intermediate that forms later, are sometimes 
referred to as transition states, which can lead to confusion. An inter- 
mediate is any chemical species with a finite lifetime, "finite" being de- 
fined as longer than the time required for a molecular vibration (~1 CT 13 
seconds). A transition state is simply the maximum-energy species 
formed on the reaction coordinate and does not have a finite lifetime. 
The tetrahedral intermediates formed in the chymotrypsin reaction 
closely resemble, both energetically and structurally, the transition 
states leading to their formation and breakdown. However, the inter- 
mediate represents a committed stage of completed bond formation, 



whereas the transition state is part of the process of reaction. In the case 
of chymotrypsin, given the close relationship between the intermediate 
and the actual transition state the distinction between them is routinely 
glossed over. Furthermore, the interaction of the negatively charged 
oxygen with the amide nitrogens in the oxyanion hole, often referred to 
as transition-state stabilization, also serves to stabilize the intermediate 
in this case. Not all intermediates are so short-lived that they resemble 
transition states. The chymotrypsin acyl-enzyme intermediate is much 
more stable and more readily detected and studied, and it is never con- 
fused with a transition state. 
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1 0 X 6-3 Evidence for Enzyme -Transition State Complementarity 



The transition state of a reaction is difficult to study be- 
cause it is so short-lived. To understand enzymatic 
catalysis, however, we must understand what occurs 
during this fleeting moment in the course of a reaction. 
Complementarity between an enzyme and the transition 
state is virtually a requirement for catalysis, because the 
energy hill upon which the transition state sits is what 
the enzyme must lower if catalysis is to occur. How can 
we obtain evidence for enzyme-transition state comple- 
mentarity? Fortunately, we have a variety of ap- 
proaches, old and new, to address this problem, each 
providing compelling evidence in support of this general 
principle of enzyme action. 

Structure-Activity Correlations 

If enzymes are complementary to reaction transition 
states, then some functional groups in both the sub- 
strate and the enzyme must interact preferentially in 
the transition state rather than in the ES complex. 
Changing these groups should have little effect on for- 
mation of the ES complex and hence should not affect 
kinetic parameters (the dissociation constant, K d \ or 
sometimes K m , if K d = jf m ) that reflect the E + S^ES 
equilibrium. Changing these same groups should have a 
large effect on the overall rate (fc cat or k cat /K m ~) of the re- 
action, however, because the bound substrate lacks po- 
tential binding interactions needed to lower the 
activation energy. 

An excellent example of this effect is seen in the 
kinetics associated with a series of related substrates 
for the enzyme chymotrypsin (Fig. 1). Chymotrypsin 
normally catalyzes the hydrolysis of peptide bonds 
next to aromatic amino acids. The substrates shown in 



Figure 1 are convenient smaller models for the natural 
substrates (long polypeptides and proteins). The ad- 
ditional chemical groups added in each substrate (A to 
B to C) are shaded. As the table shows, the interaction 
between the enzyme and these added functional 
groups has a minimal effect oniT m (taken here as a re- 
flection of K d ) but a large, positive effect on fc cat and 
k cat /K m . This is what we would expect if the interac- 
tion contributed largely to stabilization of the transi- 
tion state. The results also demonstrate that the rate 
of a reaction can be affected greatly by enzyme-sub- 
strate interactions that are physically remote from the 
covalent bonds that are altered in the enzyme-cat- 
alyzed reaction. Chymotrypsin is described in more 
detail in the text. 

A complementary experimental approach is to 
modify the enzyme, eliminating certain enzyme-sub- 
strate interactions by replacing specific amino acid 
residues through site-directed mutagenesis (see Fig. 
9-11). Results from such experiments again demon- 
strate the importance of binding energy in stabilizing 
the transition state. 

Transition-State Analogs 

Even though transition states cannot be observed di- 
rectly, chemists can often predict the approximate 
structure of a transition state based on accumulated 
knowledge about reaction mechanisms. The transition 
state is by definition transient and so unstable that di- 
rect measurement of the binding interaction between 
this species and the enzyme is impossible. In some 
cases, however, stable molecules can be designed that 
resemble transition states. These are called transition- 
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FIGURE 1 Effects of small struc- 
tural changes in the substrate 
on kinetic parameters for 
chymotrypsin-catalyzed amide 
hydrolysis. 
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state analogs. In principle, they should bind to an 
enzyme more tightly than does the substrate in the ES 
complex, because they should fit the active site better 
(that is, form a greater number of weak interactions) 
than the substrate itself. The idea of transition-state 
analogs was suggested by Pauling in the 1940s, and it 
has been explored using a number of enzymes. These 
experiments have the limitation that a transition-state 
analog cannot perfectly mimic a transition state. Some 
analogs, however, bind an enzyme 10 2 to 10 6 times 
more tightly than does the normal substrate, providing 
good evidence that enzyme active sites are indeed 
complementary to transition states. The same princi- 
ple is used in the pharmaceutical industry to design 
new drugs. The powerful anti-HIV drugs called pro- 
tease inhibitors were designed in part as tight-binding 
transition-state analogs directed at the active site of 
HIV protease. 

Catalytic Antibodies 

If a transition-state analog can be designed for the reac- 
tion S -> P then an antibody that binds tightly to this 
analog might be expected to catalyze S -» P. Antibodies 
(immunoglobulins; see Fig. 5-21) are key components 
of the immune response. When a transition-state analog 
is used as a protein-bound epitope to stimulate the pro- 
duction of antibodies, the antibodies that bind it are po- 
tential catalysts of the corresponding reaction. This use 
of "catalytic antibodies," first suggested by William P. 
Jencks in 1969, has become practical with the develop- 
ment of laboratory techniques to produce quantities of 
identical antibodies that bind one specific antigen (mon- 
oclonal antibodies, p. 173). 

Pioneering work in the laboratories of Richard 
Lerner and Peter Schultz has resulted in the isolation of 
a number of monoclonal antibodies that catalyze the hy- 
drolysis of esters or carbonates (Fig. 2). In these reac- 
tions, the attack by water (OH") on the carbonyl carbon 
produces a tetrahedral transition state in which a partial 
negative charge has developed on the carbonyl oxygen. 
Phosphonate ester compounds mimic the structure and 
charge distribution of this transition state in ester hy- 
drolysis, making them good transition-state analogs; 
phosphate ester compounds are used for carbonate hy- 
drolysis reactions. Antibodies that bind the phospho- 
nate or phosphate compound tightly have been found to 
accelerate the corresponding ester or carbonate hydrol- 
ysis reaction by factors of 10 3 to 10 4 Structural analyses 
of a few of these catalytic antibodies have shown that 
some catalytic amino acid side chains are arranged 
such that they could interact with the substrate in the 
transition state. 



Catalytic antibodies generally do not approach the 
catalytic efficiency of enzymes, but medical and indus- 
trial uses for them are nevertheless emerging. For ex- 
ample, catalytic antibodies designed to degrade cocaine 
are being investigated as a potential aid in the treatment 
of cocaine addiction. 
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FIGURE 2 The expected transition states for ester or carbonate hydroly- 
sis reactions, Phosphonate ester and phosphate ester compounds, re- 
spectively, make good transition-state analogs for these reactions. 
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Hexokinase Undergoes Induced Fit on 
Substrate Binding 

Yeast hexokinase (M T 107,862) is a bisubstrate enzyme 
that catalyzes the reversible reaction 



CH*OH 



CH 2 OPO|" 



HO 




H OH 
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OH 
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ATP and ADP always bind to enzymes as a complex with 
the metal ion Mg 2+ . 

The hydroxyl at C-6 of glucose (to which the y-phos- 
phoryl of ATP is transferred in the hexokinase reaction) 
is similar in chemical reactivity to water, and water freely 
enters the enzyme active site. Yet hexokinase favors the 
reaction with glucose by a factor of 10 6 . The enzyme can 
discriminate between glucose and water because of a 
conformational change in the enzyme when the correct 
substrates binds (Fig. 6-22). Hexokinase thus provides 
a good example of induced fit. When glucose is not 
present, the enzyme is in an inactive conformation with 
the active-site amino acid side chains out of position for 
reaction. When glucose (but not water) and Mg • ATP 
bind, the binding energy derived from this interaction 
induces a conformational change in hexokinase to the 
catalytically active form. 

This model has been reinforced by kinetic studies. 
The five-carbon sugar xylose, stereochemically similar 
to glucose but one carbon shorter, binds to hexokinase 



but in a position where it cannot be phosphorylated. 
Nevertheless, addition of xylose to the reaction mixture 
increases the rate of ATP hydrolysis. Evidently, the 
binding of xylose is sufficient to induce a change in hex- 
okinase to its active conformation, and the enzyme is 
thereby "tricked" into phosphorylating water. The hex- 
okinase reaction also illustrates that enzyme specificity 
is not always a simple matter of binding one compound 
but not another. In the case of hexokinase, specificity is 
observed not in the formation of the ES complex but in 
the relative rates of subsequent catalytic steps. Water is 
not excluded from the active site, but reaction rates 
increase greatly in the presence of the functional 
phosphoryl group acceptor (glucose). 
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Induced fit is only one aspect of the catalytic mech- 
anism of hexokinase — like chymotrypsin, hexokinase 
uses several catalytic strategies. For example, the 
active-site amino acid residues (those brought into posi- 
tion by the conformational change that follows substrate 
binding) participate in general acid-base catalysis and 
transition-state stabilization. 
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MECHANISM FIGURE 6-23 Two-step reaction catalyzed by enolase. 

(a) The mechanism by which enolase converts 2-phosphoglycerate (2- 
PGA) to phosphoenolpyruvate. The carboxyl group of 2-PGA is coordi- 
nated by two magnesium ions at the active site, (b) The substrate, 2-PGA, 



Glu 211 



in relation to the Mg 2+ ions, Lys 345 , and Glu 211 in the enolase active site. 
Nitrogen is shown in blue, phosphorus in orange; hydrogen atoms are 
not shown (PDB ID lONE). 



The Enolase Reaction Mechanism Requires Metal ions 

Another glycolytic enzyme, enolase, catalyzes the re- 
versible dehydration of 2-phosphoglycerate to phos- 
phoenolpyruvate: 



H-C— O—P-O" ; C-O-P-CT + H 2 0 

II II I 

HO-CH 2 0- CH 2 O- 

2-Phosphoglycerate Phosphoenolpyruvate 

Yeast enolase (Af r 93,316) is a dimer with 436 
amino acid residues per subunit. The enolase reaction 
illustrates one type of metal ion catalysis and pro- 
vides an additional example of general acid-base 
catalysis and transition-state stabilization. The reac- 
tion occurs in two steps (Fig. 6-23a). First, Lys 345 
acts as a general base catalyst, abstracting a proton 
from C-2 of 2-phosphoglycerate; then Glu 211 acts as a 
general acid catalyst, donating a proton to the — OH 
leaving group. The proton at C-2 of 2-phosphoglycer- 
ate is not very acidic and thus is not readily removed. 



However, in the enzyme active site, 2-phosphoglycer- 
ate undergoes strong ionic interactions with two 
bound Mg 2+ ions (Fig. 6-23b), making the C-2 proton 
more acidic (lowering the piT a ) and easier to abstract. 
Hydrogen bonding to other active-site amino acid 
residues also contributes to the overall mechanism. 
The various interactions effectively stabilize both the 
enolate intermediate and the transition state preced- 
ing its formation. 

Lysozyme Uses Two Successive Nucleophiiic 
Displacement Reactions 

Lysozyme is a natural antibacterial agent found in tears 
and egg whites. The hen egg white lysozyme (M r 14,296) 
is a monomer with 129 amino acid residues. This was the 
first enzyme to have its three-dimensional structure de- 
tenriined, by David Phillips and colleagues in 1965. The 
structure revealed four stabilizing disulfide bonds and 
a cleft containing the active site (Fig. 6-24a). More 
than five decades of investigations have provided a de- 
tailed picture of the structure and activity of the enzyme, 
and an interesting story of how biochemical science 
progresses. 
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FIGURE 6-24 Hen egg white lysozyme and the reaction it catalyzes. 

(a) Ribbon diagram of the enzyme with the active-site residues Glu 35 and 
Asp 52 shown as blue stick structures and bound substrate shown in red 
(PDB ID 1 LZE). (b) Reaction catalyzed by hen egg white lysozyme. A seg- 
ment of a peptidoglycan polymer is shown, with the lysozyme binding 
sites A through F shaded. The glycosidic C — O bond between sugar 
residues bound to sites D and E is cleaved, as indicated by the red arrow. 
The hydrolytic reaction is shown in the inset, with the fate of the oxygen 
in the H 2 0 traced in red. Mur2Ac is N-acetylmuramic acid; GlcNAc, N- 
acetylglucosamine. RO — represents a lactyl (lactic acid) group; — NAc 
and AcN — > an N-acetyl group (see key). 

The substrate of lysozyme is peptidoglycan, a 
carbohydrate found in many bacterial cell walls (see 
Fig. 20-31). Lysozyme cleaves the (j31— »4) glycosidic 
C — 0 bond (see p. 243) between the two types of 
sugar residue in the molecule, N-acetylmuramic acid 
(Mur2Ac) and AT-acetylglucosamine (GlcNAc), often 
referred to as NAM and NAG, respectively, in the re- 
search literature on enzymology (Fig. 6-24b). Six 
residues of the alternating Mur2Ac and GlcNAc in 
peptidoglycan bind in the active site, in binding sites 
labeled A through F. Model building has shown that 
the lactyl side chain of Mur2Ac cannot be accommo- 
dated in sites C and E, restricting Mur2Ac binding to 
sites B, D, and F. Only one of the bound glycosidic 
bonds is cleaved, that between a Mur2Ac residue in 
site D and a GlcNAc residue in site E. The key cat- 
alytic amino acid residues in the active site are Glu 35 
and Asp 52 (Fig. 6-25a). The reaction is a nucle- 
ophilic substitution, with — OH from water replacing 
the GlcNAc at C-l of Mur2Ac. 

With the active site residues identified and a detailed 
structure of the enzyme available, the path to under- 
standing the reaction mechanism seemed open in the 
1960s. However, definitive evidence for a particular 
mechanism eluded investigators for nearly four decades. 



There are two chemically reasonable mechanisms that 
could generate the observed product of lysozyme-medi- 
ated cleavage of the glycosidic bond. Phillips and col- 
leagues proposed a dissociative (S N l-type) mechanism 
(Fig. 6-25a, left), in which the GlcNAc initially dissoci- 
ates in step (I) to leave behind a glycosyl cation (a car- 
bocation) intermediate. In this mechanism, the departing 
GlcNAc is protonated by general acid catalysis by Glu 35 , 
located in a hydrophobic pocket that gives its carboxyl 
group an unusually high piT a . The carbocation is stabilized 
by resonance involving the adjacent ring oxygen, as well 
as by electrostatic interaction with the negative charge on 
the nearby Asp 52 . In step (2), water attacks at C-l of 
Mur2Ac to yield the product. The alternative mechanism 
(Fig. 6-25a, right) involves two consecutive direct- 
displacement (S N 2-type) steps. In step ®, Asp 52 attacks 
C-l of Mur2Ac to displace the GlcNAc. As in the first 
mechanism, Glu 35 acts as a general acid to protonate the 
departing GlcNAc. In step©, water attacks at C-l of 
Mur2Ac to displace the Asp 52 and generate product. 

The Phillips mechanism (S N 1), was widely accepted 
for more than three decades. However, some controversy 
persisted and tests continued. The scientific method 
sometimes advances an issue slowly, and a truly insightful 
experiment can be difficult to design. Some early 
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MECHANISM FIGURE 6-25 Lysozyme reaction. In this reaction (described 
in the text), the water introduced into the product at C-1 of Mur2Ac is in 
the same configuration as the original glycosidic bond. The reaction is 
thus a molecular substitution with retention of configuration, (a) Two pro- 
posed pathways potentially explain the overall reaction and its properties. 
The S N 1 pathway (left) is the original Phillips mechanism. The S N 2 
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pathway (right) is the mechanism most consistent with current data, (b) A 
surface rendering of the lysozyme active site with the covalent enzyme- 
substrate intermediate shown as a ball-and-stick structure. Side chains of 
active-site residues are shown as ball-and-stick structures protruding from 
ribbons (PDB ID 1H6M). 
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arguments against the Phillips mechanism were suggestive 
but not completely persuasive. For example, the half-life 
of the proposed glycosyl cation was estimated to be 10~ 12 
seconds, just longer than a molecular vibration and not 
long enough for the needed diffusion of other molecules. 
More important, lysozyme is a member of a family of en- 
zymes called "retaining glycosidases," all of which cat- 
alyze reactions in which the product has the same 
anomeric configuration as the substrate (anomeric con- 
figurations of carbohydrates are examined in Chapter 7), 
and all of which are known to have reactive covalent in- 
termediates like that envisioned in the alternative (S N 2) 
pathway. Hence, the Phillips mechanism ran counter to 
experimental findings for closely related enzymes. 

A compelling experiment tipped the scales decid- 
edly in favor of the S N 2 pathway, as reported by 
Stephen Withers and colleagues in 2001. Making use of 
a mutant enzyme (with residue 35 changed from Glu to 
Gin) and artificial substrates, which combined to slow 
the rate of key steps in the reaction, these workers 
were able to stabilize the elusive covalent intermediate. 
This in turn allowed them to observe the intermediate 
directly, using both mass spectrometry and x-ray 
crystallography (Fig. 6-25b). 

Is the lysozyme mechanism now proven? No. A key 
feature of the scientific method, as Albert Einstein once 
summarized it, is "No amount of experimentation can 
ever prove me right; a single experiment can prove me 
wrong." In the case of the lysozyme mechanism, one 
might argue (and some have) that the artificial sub- 
strates, with fluorine substitutions at C-l and C-2, 
that were used to stabilize the covalent intermediate 
might have altered the reaction pathway The highly 
electronegative fluorine could destabilize an already 
electron-deficient oxocarbenium ion in the glycosyl 
cation intermediate that might occur in an S N 1 pathway. 
However, the S N 2 pathway is now the mechanism most 
in concert with available data. 

An Understanding of Enzyme Mechanism Drives Important 
Advances in Medicine 

The drugs used to treat maladies ranging from 
headache to HIV infection are almost always in- 
hibitors of an enzyme. Two examples are explored here: 
the antibiotic penicillin (and its derivatives) and the pro- 
tease inhibitors used to treat HIV infections, all of which 
are irreversible inhibitors. 

Penicillin was discovered in 1928 by Alexander 
Fleming, but it took another 15 years before this rela- 
tively unstable compound was understood well enough 
to use it as a pharmaceutical agent to treat bacterial in- 
fections. Penicillin interferes with the synthesis of pep- 
tidoglycan (described in Chapter 20, Fig. 20-32), the 
major component of the rigid cell wall that protects bac- 
teria from osmotic lysis. Peptidoglycan consists of poly- 
saccharides and peptides cross-linked in several steps 
that include a transpeptidase reaction (Fig. 6-26). It is 
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FIGURE 6-26 The transpeptidase reaction. This reaction, which links 
two peptidoglycan precursors into a larger polymer, is facilitated by an 
active-site Ser and a covalent catalysis mechanism similar to that of chy- 
motrypsin. Note that peptidoglycan is one of the few places in nature 
where D-amino acid residues are found. The active-site Ser attacks the 
carbonyl of the peptide bond between the two D-Ala residues, creating 
a covalent ester linkage between the substrate and the enzyme with re- 
lease of the terminal D-Ala residue. An amino group from the second 
peptidoglycan precursor then attacks the ester linkage, displacing the 
enzyme and cross-linking the two precursors. 



6.4 Examples of Enzymatic Reactions [~21 7^ 



Side chain Thiazolidine ring 



O 



H H 



H I i Y 
0 ^ c - n -ch" ch 3 

^-Lactam COOH 
ring 



R groups < 




CH 2 



Penicillin G 
(benzylpenicillin) 



^0-CH 2 - 



Penicillin V 



HO 




CH- 
NHo 



Amoxicillin 



General structure of penicillins 
(a) 



O H H 

i 1 q fu 

j ' X 



Trans- . 

peptidaae^Ser-OH 



H 

O 




CH CH 3 
COOH Penicillin 



0 

II 



H H 



R C— N-C — C 

H | I 

peptidase — Ser— 0 \\^ H 



*S ^CH 3 

A 

XH CH 3 

COOH 



Stably derivatizedj 
inactive transpeptidase 

(b) 



FIGURE 6-27 Transpeptidase inhibition by 0-Iactam antibiotics, (a) 

/3-Lactam antibiotics feature a five-membered thiazolidine ring fused 
to a four-membered /Mactam ring. The latter ring is strained and in- 
cludes an amide moiety that plays a critical role in the inactivation of 
peptidoglycan synthesis. The R group varies in different penicillins 
Penicillin G was the first to be isolated and remains one of the most 
effective, but it is degraded by stomach acid and must be administered 



by injection. Penicillin V is nearly as effective and is acid stable, so it 
can be administered orally. Amoxicillin has a broad range of effec- 
tiveness, is readily administered orally, and is thus the most widely 
prescribed /3-lactam antibiotic, (b) Attack on the amide moiety of the 
jS-lactam ring by a transpeptidase active-site Ser results in a covalent 
acyl-enzyme product. This is hydrolyzed so slowly that adduct forma- 
tion is practically irreversible, and the transpeptidase is inactivated. 



this reaction that is inhibited by penicillin and related 
compounds (Fig. 6-27a), all of which mimic one con- 
formation of the D-Ala— D-Ala segment of the peptido- 
glycan precursor. The peptide bond in the precursor is 
replaced by a highly reactive jS-lactam ring. When peni- 
cillin binds to the transpeptidase, an active-site Ser at- 
tacks the carbonyl of the /3-lactam ring and generates a 
covalent adduct between penicillin and the enzyme. 
However, the leaving group remains attached because 
it is linked by the remnant of the /3-lactam ring 
(Fig. 6-2 7b). The covalent complex irreversibly inacti- 
vates the enzyme. This, in turn, blocks synthesis of the 



bacterial cell wall, and most bacteria die as the fragile 
inner membrane bursts under osmotic pressure. 

Human use of penicillin and its derivatives has led 
to the evolution of strains of pathogenic bacteria that 
express ^-lactamases (Fig. 6-28a), enzymes that 
cleave /3-lactam antibiotics, rendering them inactive. 
The bacteria thereby become resistant to the antibi- 
otics. The genes for these enzymes have spread rapidly 
through bacterial populations under the selective pres- 
sure imposed by the use (and often overuse) of j8-lac- 
tam antibiotics. Human medicine responded with the 
development of compounds such as clavulanic acid, a 
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FIGURE 6-28 Lactamases and j3- lactamase inhibition, (a) j3- 

Lactamases promote cleavage of the /3-lactam ring in /3-lactam antibi- 
otics, inactivating them, (b) Clavulanic acid is a suicide inhibitor, 
making use of the normal chemical mechanism of /^-lactamases to cre- 
ate a reactive species at the active site. This reactive species is attacked 
by groups in the active site to irreversibly acylate the enzyme. 

suicide inactivator, which irreversibly inactivates the fi- 
lactamases (Fig. 6-28b). Clavulanic acid mimics the 
structure of a /3-lactam antibiotic, and forms a covalent 
adduct with a Ser in the ^-lactamase active site. This 
leads to a rearrangement that creates a much more re- 
active derivative, which is subsequently attacked by an- 
other nucleophile in the active site to irreversibly 
acylate the enzyme and inactivate it. Amoxicillin and 
clavulanic acid are combined in a widely used pharma- 
ceutical formulation with the trade name Augmentin. 
The cycle of chemical warfare between humans and 
bacteria continues unabated. Strains of disease-causing 
bacteria that are resistant to both amoxicillin and 
clavulanic acid (reflecting mutations in ^-lactamase 
that render it unreactive to clavulanic acid) have been 
discovered. The development of new antibiotics prom- 
ises to be a growth industry for the foreseeable future. 

Antiviral agents provide another example of modern 
drug development. The human immunodeficiency virus 
(HIV) is the causative agent of acquired immune defi- 
ciency syndrome, or AIDS. In 2005, an estimated 37 to 45 
million people worldwide were living with HIV infec- 
tions, with 3.9 to 6.6 million new infections that year and 
more than 2.4 million fatalities. AIDS first surfaced as a 
world epidemic in the 1980s; HIV was discovered soon 
after and identified as a retrovirus. Retroviruses pos- 
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sess an RNA genome and an enzyme, reverse transcrip- 
tase, capable of using RNA to direct the synthesis of a 
complementary DNA. Efforts to understand HIV and de- 
velop therapies for HIV infection benefited from 
decades of basic research on other retroviruses. A retro- 
virus such as HIV has a relatively simple life cycle (see 
Fig. 26-33). Its RNA genome is converted to duplex 
DNA in several steps catalyzed by a reverse transcrip- 
tase (described in Chapter 26). The duplex DNA is then 
inserted into a chromosome in the nucleus of the host 
cell by the enzyme integrase (described in Chapter 25). 
The integrated copy of the viral genome can remain dor- 
mant indefinitely. Alternatively, it can be transcribed 
back into RNA, which can then be translated into pro- 
teins to construct new virus particles. Most of the viral 
genes are translated into large polyproteins, which are 
cut by the HIV protease into the individual proteins 
needed to make the virus (see Fig. 26-34). There are 
only three key enzymes in this cycle — the reverse tran- 
scriptase, the integrase, and the protease — which thus 
are the potential drug targets. 
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Aided by general 
base catalysis, water 
attacks the carbonyl 
carbon, generating 
a tetrahedral 
intermediate, 




The tetrahedral 
intermediate 
collapses; the 
amino acid leaving 
group is protonated 
as it is expelled. 




FIGURE 6-29 Mechanism of action of HIV protease. Two active-site 
Asp residues (from different subunits) act as general acid-base catalysts, 



Asp 25 



facilitating the attack of water on the peptide bond. The unstable tetra- 
hedral intermediate in the reaction pathway is highlighted in pink. 



There are four major subclasses of proteases. Ser- 
ine proteases, such as chymo trypsin and trypsin, and 
cysteine proteases (in which Cys serves a catalytic role 
similar to that of Ser in the active site) feature covalent 
enzyme-substrate complexes; aspartyl proteases and 
metalloproteases do not. The HIV protease is an as- 
partyl protease. Two active-site Asp residues facilitate a 
direct attack of water on the peptide bond to be cleaved 
(Fig. 6-29). The initial product of the attack of water 
on the carbonyl group of the peptide bond to be cleaved 
is an unstable tetrahedral intermediate, much as we 
have seen for the chymotrypsin reaction. This interme- 
diate is close in structure and energy to the reaction 
transition state. The drugs that have been developed as 
HIV protease inhibitors form noncovalent complexes 
with the enzyme, but they bind to it so tightly that they 
can be considered irreversible inhibitors. The tight 
binding is derived in part from their design as transi- 
tion-state analogs (see Box 6-3). The success of these 
drugs makes a point worth emphasizing. The catalytic 
principles we have studied in this chapter are not sim- 
ply abstruse ideas to be memorized— their application 
saves lives. 

The HIV protease cleaves peptide bonds between 
Phe and Pro residues most efficiently. The active site 
thus has a pocket to bind aromatic groups next to the 
bond to be cleaved. The structures of several HIV protease 
inhibitors are shown in Figure 6-30. Although the struc- 
tures appear varied, they all share a core structure— a 
main chain with a hydroxy! group positioned next to a 
branch containing a benzyl group. This arrangement tar- 
gets the benzyl group to the aromatic binding pocket. The 
adjacent hydroxyl group mimics the negatively charged 
oxygen in the tetrahedral intermediate in the normal reac- 
tion, providing a transition-state analog. The remainder of 
each inhibitor structure was designed to fit into and bind 
to various crevices along the surface of the enzyme, en- 
hancing overall binding. Availability of these effective 
drugs has vastly increased the lifespan and quality of life of 
millions of people with HIV and AIDS. ■ 




Saquinavir 



FIGURE 6-30 HIV protease inhibitors. The hydroxyl group (red) acts as 
a transition-state analog, mimicking the oxygen of the tetrahedral inter- 
mediate. The adjacent benzyl group (blue) helps to properly position the 
drug in the active site. 
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SUMMARY 6.4 Examples of Enzymatic 
Reactions 

■ Chymotrypsin is a serine protease with a 
well-understood mechanism, featuring general 
acid-base catalysis, covalent catalysis, and 
transition-state stabilization. 

■ Hexokinase provides an excellent example of 
induced fit as a means of using substrate binding 
energy 

■ The enolase reaction proceeds via metal ion catalysis. 

■ Lysozyme makes use of covalent catalysis and 
general acid catalysis as it promotes two successive 
nucleophilic displacement reactions. 

■ Understanding enzyme mechanism allows the 
development of drugs to inhibit enzyme action. 

6.5 Regulatory Enzymes 

In cellular metabolism, groups of enzymes work together 
in sequential pathways to carry out a given metabolic 
process, such as the multireaction breakdown of glucose 
to lactate or the multireaction synthesis of an amino acid 
from simpler precursors. In such enzyme systems, the 
reaction product of one enzyme becomes the substrate 
of the next. 

Most of the enzymes in each metabolic pathway fol- 
low the kinetic patterns we have already described. Each 
pathway, however, includes one or more enzymes that 
have a greater effect on the rate of the overall sequence, 
These regulatory enzymes exhibit increased or de- 
creased catalytic activity in response to certain signals. 
Adjustments in the rate of reactions catalyzed by regula- 
tory enzymes, and therefore in the rate of entire metabolic 
sequences, allow the cell to meet changing needs for en- 
ergy and for biomolecules required in growth and repair. 

In most multienzyme systems, the first enzyme of 
the sequence is a regulatory enzyme. This is an excel- 
lent place to regulate a pathway, because catalysis of 
even the first few reactions of a sequence that leads to 
an unneeded product diverts energy and metabolites 
from more important processes. Other enzymes in the 
sequence may play subtler roles in modulating the flux 
through a pathway, as described in Chapter 15. 

The activities of regulatory enzymes are modulated 
in a variety of ways. Allosteric enzymes function 
through reversible, noncovalent binding of regulatory 
compounds called allosteric modulators or allosteric 
effectors, which are generally small metabolites or 
cofactors. Other enzymes are regulated by reversible co- 
valent modification. Both classes of regulatory en- 
zymes tend to be multisubunit proteins, and in some 
cases the regulatory site(s) and the active site are on sep- 
arate subunits. Metabolic systems have at least two other 
mechanisms of enzyme regulation. Some enzymes are 
stimulated or inhibited when they are bound by separate 



regulatory proteins. Others are activated when peptide 
segments are removed by proteolytic cleavage; unlike ef- 
fector-mediated regulation, regulation by proteolytic 
cleavage is irreversible. Important examples of both 
mechanisms are found in physiological processes such as 
digestion, blood clotting, hormone action, and vision. 

Cell growth and survival depend on efficient use of 
resources, and this efficiency is made possible by reg- 
ulatory enzymes. No single rule governs the occur- 
rence of different types of regulation in different 
systems. To a degree, allosteric (noncovalent) regula- 
tion may permit fine-tuning of metabolic pathways 
that are required continuously but at different levels 
of activity as cellular conditions change. Regulation by 
covalent modification may be all or none — usually the 
case with proteolytic cleavage — or it may allow for 
subtle changes in activity. Several types of regulation 
may occur in a single regulatory enzyme. The remain- 
der of this chapter is devoted to a discussion of these 
methods of enzyme regulation. 

Allosteric Enzymes Undergo Conformational Changes in 
Response to Modulator Binding 

As we saw in Chapter 5, allosteric proteins are those hav- 
ing "other shapes" or conformations induced by the bind- 
ing of modulators. The same concept applies to certain 
regulatory enzymes, as conformational changes induced 
by one or more modulators interconvert more-active and 
less-active forms of the enzyme. The modulators for al- 
losteric enzymes may be inhibitory or stimulatory. Often 
the modulator is the substrate itself; regulatory enzymes 
for which substrate and modulator are identical are called 
homotropic. The effect is similar to that of 0 2 binding to 
hemoglobin (Chapter 5): binding of the ligand — or sub- 
strate, in the case of enzymes — causes conformational 
changes that affect the subsequent activity of other sites 
on the protein. When the modulator is a molecule other 
than the substrate, the enzyme is said to be heterotropic. 
Note that allosteric modulators should not be confused 
with uncompetitive and mixed inhibitors. Although the 
latter bind at a second site on the enzyme, they do not 
necessarily mediate conformational changes between ac- 
tive and inactive forms, and the kinetic effects are distinct. 

The properties of allosteric enzymes are significantly 
different from those of simple nonregulatory enzymes. 
Some of the differences are structural. In addition to active 
sites, allosteric enzymes generally have one or more regu- 
latory, or allosteric, sites for binding the modulator (Fig. 
6-31). Just as an enzyme's active site is specific 
for its substrate, each regulatory site is specific for its mod- 
ulator. Enzymes with several modulators generally have 
different specific binding sites for each. In homotropic en- 
zymes, the active site and regulatory site are the same. 

Allosteric enzymes are generally larger and more 
complex than nonallosteric enzymes. Most have two or 
more subunits. Aspartate transcarbamoylase, which 
catalyzes an early reaction in the biosynthesis of pyrim- 
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FIGURE 6-31 Subunit interactions in an allosteric enzyme, and inter- 
actions with inhibitors and activators. In many allosteric enzymes the 
substrate binding site and the modulator binding site(s) are on different 
subunits, the catalytic (C) and regulatory (R) subunits, respectively. 
Binding of the positive (stimulatory) modulator (M) to its specific site 
on the regulatory subunit is communicated to the catalytic subunit 
through a conformational change. This change renders the catalytic 
subunit active and capable of binding the substrate (S) with higher 
affinity. On dissociation of the modulator from the regulatory subunit, 
the enzyme reverts to its inactive or less active form. 




FIGURE 6-32 Two views of the regulatory enzyme aspartate transcar- 
bamoylase. (Derived from PDB ID 2AT2.)This allosteric regulatory en- 
zyme has two stacked catalytic clusters, each with three catalytic 
polypeptide chains (in shades of blue and purple), and three regulatory 
clusters, each with two regulatory polypeptide chains (in red and yel- 
low). The regulatory clusters form the points of a triangle surrounding 
the catalytic subunits. Binding sites for allosteric modulators are on the 
regulatory subunits. Modulator binding produces large changes in en- 
zyme conformation and activity. The role of this enzyme in nucleotide 
synthesis, and details of its regulation, are discussed in Chapter 22. 



idine nucleotides (see Fig. 22-36), has 12 polypeptide 
chains organized into catalytic and regulatory subunits. 
Figure 6-32 shows the quaternary structure of this 
enzyme, deduced from x-ray analysis. 

In Many Pathways, Regulated Steps Are Catalyzed by 
Allosteric Enzymes 

In some multienzyme systems, the regulatory enzymes are 
specifically inhibited by the end product of the pathway 
whenever the concentration of the end product exceeds 
the cell's requirements. When the regulatory enzyme re- 
action is slowed, subsequent enzymes may operate at dif- 
ferent rates as their substrate pools are depleted. The rate 
of production of the pathway's end product is thereby 
brought into balance with the cell's needs. This type of 
regulation is called feedback inhibition. Buildup of the 
end product ultimately slows the entire pathway. 

One of the first known examples of allosteric feed- 
back inhibition was the bacterial enzyme system that cat- 
alyzes the conversion of L-threonine to L-isoleucine in 
five steps (Fig. 6-33). In this system, the first enzyme, 
threonine dehydratase, is inhibited by isoleucine, the 
product of the last reaction of the series. This is an ex- 
ample of heterotropic allosteric inhibition. Isoleucine is 
quite specific as an inhibitor. No other intermediate in 
this sequence inhibits threonine dehydratase, nor is any 
other enzyme in the sequence inhibited by isoleucine. 
Isoleucine binds not to the active site but to another spe- 
cific site on the enzyme molecule, the regulatory site. 
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FIGURE 6-33 Feedback inhibi- 
tion. The conversion of L-threo- 
nine to L-isoleucine is catalyzed 
by a sequence of five enzymes (E-| 
to E 5 ). Threonine dehydratase (E^ 
is specifically inhibited allosteri- 
cally by L-isoleucine, the end 
product of the sequence, but not 
by any of the four intermediates 
(A to D). Feedback inhibition is 
indicated by the dashed feedback 
line and the & symbol at the 
threon i ne dehydratase reaction 
arrow, a device used throughout 
this book. 
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This binding is noncovalent and readily reversible; if the 
isoleucine concentration decreases, the rate of threonine 
dehydration increases. Thus threonine dehydratase ac- 
tivity responds rapidly and reversibly to fluctuations in 
the cellular concentration of isoleucine. As we shall see 
in Part II of this book, the patterns of regulation in many 
other metabolic pathways are much more complex. 

The Kinetic Properties of Allosteric Enzymes Diverge from 

Michaelis-Menten Behavior 

Allosteric enzymes show relationships between V 0 and 
[S] that differ from Michaelis-Menten kinetics. They do 
exhibit saturation with the substrate when [S] is suffi- 
ciently high, but for some allosteric enzymes, plots of V 0 
versus [S] (Fig. 6-34) produce a sigmoid saturation 
curve, rather than the hyperbolic curve typical of non- 
regulatory enzymes. On the sigmoid saturation curve we 
can find a value of [S] at which V Q is half-maximal, but 
we cannot refer to it with the designation K m , because 
the enzyme does not follow the hyperbolic Michaelis- 
Menten relationship. Instead, the symbol [S] 0 5 or K 0 5 is 
often used to represent the substrate concentration giv- 
ing half-maximal velocity of the reaction catalyzed by an 
allosteric enzyme (Fig. 6-34). 

Sigmoid kinetic behavior generally reflects coopera- 
tive interactions between protein subunits. In other 
words, changes in the structure of one subunit are trans- 
lated into structural changes in adjacent subunits, an ef- 
fect mediated by noncovalent interactions at the interface 
between subunits. The principles are particularly well il- 
lustrated by a nonenzyme: 0 2 binding to hemoglobin. Sig- 
moid kinetic behavior is explained by the concerted and 
sequential models for subunit interactions (see Fig. 5-15). 

Homotropic allosteric enzymes generally are multi- 
subunit proteins and, as noted earlier, the same binding 
site on each subunit functions as both the active site and 
the regulatory site. Most commonly, the substrate acts 
as a positive modulator (an activator), because the sub- 
units act cooperatively: the binding of one molecule of 
substrate to one binding site alters the enzyme's confor- 
mation and enhances the binding of subsequent sub- 
strate molecules. This accounts for the sigmoid rather 
than hyperbolic change in V 0 with increasing [S]. One 
characteristic of sigmoid kinetics is that small changes 
in the concentration of a modulator can be associated 
with large changes in activity As is evident in Figure 
6-34a, a relatively small increase in [S] in the steep part 
of the curve causes a comparatively large increase in V 0 . 

For heterotropic allosteric enzymes, those whose 
modulators are metabolites other than the normal sub- 
strate, it is difficult to generalize about the shape of the 
substrate-saturation curve. An activator may cause the 
curve to become more nearly hyperbolic, with a decrease 
in K 0 5 but no change in V maxi resulting in an increased 
reaction velocity at a fixed substrate concentration (V 0 is 
higher for any value of [S]; Fig. 6-34b, upper curve). 
Other heterotropic allosteric enzymes respond to an 
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FIGURE 6-34 Substrate-activity curves for representative allosteric 
enzymes. Three examples of complex responses of allosteric enzymes 
to their modulators, (a) The sigmoid curve of a homotropic enzyme, in 
which the substrate also serves as a positive (stimulatory) modulator, or 
activator. Note the resemblance to the oxygen-saturation curve of he- 
moglobin (see Fig. 5-1 2). (b) The effects of a positive modulator (+) and 
a negative modulator (-) on an allosteric enzyme in which K 05 is al- 
tered without a change in V max . The central curve shows the substrate- 
activity relationship without a modulator, (c) A less common type of 
modulation, in which V max is altered and K 05 is nearly constant. 

activator by an increase in F max with little change in K 0m5 
(Fig. 6-34c). A negative modulator (an inhibitor) may 
produce a more sigmoid substrate-saturation curve, with 
an increase in K 05 (Fig. 6-34b, lower curve). Het- 
erotropic allosteric enzymes therefore show different 
kinds of responses in their substrate-activity curves, 
because some have inhibitory modulators, some have 
activating modulators, and some have both. 
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Some Enzymes Are Reguiated by Reversible 
Covalent Modification 

In another important class of regulatory enzymes, activ- 
ity is modulated by covalent modification of one or more 
of the amino acid residues in the enzyme molecule. Over 
500 different types of covalent modification have been 
found in proteins. Common modifying groups include 
phosphoryl, acetyl, adenylyl, uridylyl, methyl, amide, 
carboxyl, myristoyl, palmitoyl, prenyl, hydroxyl, sulfate, 
and adenosine diphosphate ribosyl groups (Fig. 6-35). 
There are even entire proteins that are used as special- 
ized modifying groups, including ubiquitin and sumo. 
These varied groups are generally linked to and re- 
moved from a regulated enzyme by separate enzymes. 
When an amino acid residue in an enzyme is modified, a 
novel amino acid with altered properties has effectively 
been introduced into the enzyme. Introduction of a 
charge can alter the local properties of the enzyme and 
induce a change in conformation. Introduction of a 
hydrophobic group can trigger association with a mem- 
brane. The changes are often substantial and can be 
critical to the function of the altered enzyme. 

The variety of enzyme modifications is too great to 
cover in detail, but some examples can be offered. An 
example of an enzyme regulated by methylation is 
the methyl-accepting chemotaxis protein of bacteria. 
This protein is part of a system that permits a bacterium 
to swim toward an attractant (such as a sugar) in 
solution and away from repellent chemicals. The 
methylating agent is 5-adenosylmethionine (adoMet) 
(see Fig. 18-18). Acetylation is a common modification, 
with approximately 80% of the soluble proteins in 
eukaryotes, including many enzymes, acetylated at their 
amino termini. Ubiquitin is added to proteins as a tag 
that predestines them for proteolytic degradation (see 
Fig. 27-47). Ubiquitination can also have a regulatory 
function. Sumo is found attached to many eukaryotic 
nuclear proteins with roles in the regulation of tran- 
scription, chromatin structure, and DNA repair. 

ADP-ribosylation is an especially interesting reac- 
tion, observed in a number of proteins; the ADP-ribose is 
derived from nicotinamide adenine dinucleotide (NAD) 
(see Fig. 8-38). This type of modification occurs for the 
bacterial enzyme dinitrogenase reductase, resulting in 
regulation of the important process of biological nitrogen 
fixation. Diphtheria toxin and cholera toxin are enzymes 
that catalyze the ADP-ribosylation (and inactivation) of 
key cellular enzymes or proteins. 

Phosphorylation is probably the most important 
type of regulatory modification. It is estimated that one- 
third of all proteins in a eukaryotic cell are phosphory- 
lated, and one or (often) many phosphorylation events 
are part of virtually every regulatory process. Some pro- 
teins have only one phosphorylated residue, others have 
several, and a few have dozens of sites for phosphoryla- 
tion. This mode of covalent modification is central to a 
large number of regulatory pathways, and we therefore 
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FIGURE 6-35 Some enzyme modification reactions. 
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discuss it in some detail. It will be discussed at length in 
Chapter 12. 

All of these modifications will be encountered again 
in this text. 

Phosphoryl Groups Affect the Structure and 
Catalytic Activity of Enzymes 

The attachment of phosphoryl groups to specific 
amino acid residues of a protein is catalyzed by pro- 
tein kinases; removal of phosphoryl groups is cat- 
alyzed by protein phosphatases. The addition of a 
phosphoryl group to a Ser, Thr, or Tyr residue intro- 
duces a bulky, charged group into a region that was 
only moderately polar. The oxygen atoms of a phos- 
phoryl group can hydrogen-bond with one or several 
groups in a protein, commonly the amide groups of the 
peptide backbone at the start of an a helix or the 
charged guanidinium group of an Arg residue. The two 
negative charges on a phosphorylated side chain can 
also repel neighboring negatively charged (Asp or 
Glu) residues. When the modified side chain is located 
in a region of an enzyme critical to its three-dimen- 
sional structure, phosphorylation can have dramatic 
effects on enzyme conformation and thus on substrate 
binding and catalysis. 

An important example of enzyme regulation by 
phosphorylation is seen in glycogen phosphorylase (M T 
94,500) of muscle and liver (Chapter 15), which cat- 
alyzes the reaction 

(Glucose)^ + Pj ► (glucose)„_i + glucose 1-phosphate 

Glycogen Shortened 
glycogen 
chain 

The glucose 1-phosphate so formed can be used for ATP 
synthesis in muscle or converted to free glucose in the 
liver. Glycogen phosphorylase occurs in two forms: the 



more active phosphorylase a and the less active phospho- 
rylase b (Fig. 6-36). Phosphorylase a has two subunits, 
each with a specific Ser residue that is phosphorylated at 
its hydroxyl group. These serine phosphate residues are 
required for maximal activity of the enzyme. The phos- 
phoryl groups can be hydrolyticaUy removed by a separate 
enzyme called phosphorylase phosphatase: 

Phosphorylase a + 2H 2 0 > phosphorylase b + 2Pi 

(more active) (less active) 

In this reaction, phosphorylase a is converted to phospho- 
rylase b by the cleavage of two serine phosphate covalent 
bonds, one on each subunit of glycogen phosphorylase. 

Phosphorylase b can in turn be reactivated — cova- 
lently transformed back into active phosphorylase a — 
by another enzyme, phosphorylase kinase, which 
catalyzes the transfer of phosphoryl groups from ATP to 
the hydroxyl groups of the two specific Ser residues in 
phosphorylase b: 

2ATP + phosphorylase b > 2ADP + phosphorylase a 

(less active) (more active) 

The breakdown of glycogen in skeletal muscles and the 
liver is regulated by variations in the ratio of the two 
forms of glycogen phosphorylase. The a and b forms dif- 
fer in their secondary, tertiary, and quaternary struc- 
tures; the active site undergoes changes in structure 
and, consequently, changes in catalytic activity as the 
two forms are interconverted. 

The regulation of glycogen phosphorylase by phos- 
phorylation illustrates the effects on both structure and 
catalytic activity of adding a phosphoryl group. In the 
unphosphorylated state, each subunit of this enzyme is 
folded so as to bring the 20 residues at its amino termi- 
nus, including a number of basic residues, into a region 
containing several acidic amino acids; this produces an 



FIGURE 6-36 Regulation of muscle glycogen phosphorylase activity by 
multiple mechanisms. The activity of glycogen phosphorylase in muscle 
is subjected to a multilevel system of regulation, involving covalent 
modification (phosphorylation), allosteric regulation, and a regulatory 
cascade sensitive to hormonal status that acts on the enzymes involved 
in phosphorylation and dephosphorylation. In the more active form of 
the enzyme, phosphorylase a, specific Ser residues, one on each sub- 
unit, are phosphorylated. Phosphorylase a is converted to the less active 
phosphorylase b by enzymatic loss of these phosphoryl groups, pro- 
moted by phosphoprotein phosphatase 1 (PP1 ). Phosphorylase b can be 
reconverted (reactivated) to phosphorylase a by the action of phospho- 
rylase kinase. The activity of both forms of the enzyme is allosterically 
regulated by an activator (AMP) and by inhibitors (glucose 6-phosphate 
and ATP) that bind to separate sites on the enzyme. The activities of 
phosphorylase kinase and PP1 are also regulated via a short pathway 
that responds to the hormones glucagon and epinephrine. When blood 
sugar levels are low, the pancreas and adrenal glands secrete glucagon 



and epinephrine. Epinephrine binds to its receptor in muscle and some 
other tissues, and activates the enzyme adenylyl cyclase. Glucagon 
plays a similar role, binding to receptors in the liver. This leads to the syn- 
thesis of high levels of the modified nucleotide cyclic AMP (cAMP; see 
p. 298), activating the enzyme cAMP-dependent protein kinase (also 
called protein kinase A or PKA). PKA phosphorylates several target pro- 
teins, among them phosphorylase kinase and phosphoprotein phos- 
phatase inhibitor 1 (PPI-1). The phosphorylated phosphorylase kinase is 
activated and in turn phosphorylates and activates glycogen phosphory- 
lase. At the same time, the phosphorylated PPI-1 interacts with and in- 
hibits PP1. PPI-1 also keeps itself active (phosphorylated) by inhibiting 
phosphoprotein phosphatase 2B (PP2B), the enzyme that dephosphory- 
lates (inactivates) it. In this way, the equilibrium between the a and b 
forms of glycogen phosphorylase is shifted decisively toward the more 
active glycogen phosphorylase a. Note that the two forms of phosphory- 
lase kinase are both activated to a degree by Ca 2+ ion (not shown). This 
pathway is discussed in more detail in Chapters 14, 15, and 23. 
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electrostatic interaction that stabilizes the conformation. 
Phosphorylation of Ser 14 interferes with this interaction, 
forcing the amino-terminal domain out of the acidic envi- 
ronment and into a conformation that allows interaction 
between the ®-Ser and several Arg side chains. In this 
conformation, the enzyme is much more active. 

Phosphorylation of an enzyme can affect catalysis in 
another way: by altering substrate-binding affinity For 
example, when isocitrate dehydrogenase (an enzyme of 
the citric acid cycle; Chapter 16) is phosphorylated, elec- 
trostatic repulsion by the phosphoryl group inhibits the 
binding of citrate (a tricarboxylic acid) at the active site. 

Multiple Phosphorylations Allow Exquisite 
Regulatory Control 

The Ser, Thr, or Tyr residues that are phosphorylated in 
regulated proteins occur within common structural mo- 
tifs, called consensus sequences, that are recognized by 
specific protein kinases (Table 6-10). Some kinases are 
basophilic, preferring to phosphorylate a residue having 
basic neighbors; others have different substrate prefer- 



ences, such as for a residue near a Pro residue. Amino 
acid sequence is not the only important factor in deter- 
mining whether a given residue will be phosphorylated, 
however. Protein folding brings together residues that 
are distant in the primary sequence; the resulting three- 
dimensional structure can determine whether a protein 
kinase has access to a given residue and can recognize it 
as a substrate. Another factor influencing the substrate 
specificity of certain protein kinases is the proximity of 
other phosphorylated residues. 

Regulation by phosphorylation is often complicated. 
Some proteins have consensus sequences recognized by 
several different protein kinases, each of which can 
phosphorylate the protein and alter its enzymatic activ- 
ity. In some cases, phosphorylation is hierarchical: a cer- 
tain residue can be phosphorylated only if a neighboring 
residue has already been phosphorylated. For example, 
glycogen synthase, the enzyme that catalyzes the con- 
densation of glucose monomers to form glycogen 
(Chapter 15), is inactivated by phosphorylation of spe- 
cific Ser residues and is also modulated by at least four 
other protein kinases that phosphorylate four other 
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Consensus Sequences for Protein Kinases 


Protein kinase 


Consensus sequence and phosphorylated residue 


Protein kinase A 


-x-R-[RK]-x-[ST]-B- 


Protein kinase G 


-x-R-[RK]-x-[ST]-x- 


Protein kinase C 


-[RK](2)-x-[ST]-B-[RK](2> 


Protein kinase B 


-x-R-x-[ST]-x-K- 


Ca 2 ^calmodulin kinase I 


-B-x-R-x(2)-[ST]-x(3)-B- 


Ca 2+ /calmodulin kinase II 


-B-x-[RK]-x(2)-[ST]-x(2> 


Myosin light chain kinase (smooth muscle) 


-K(2)-R-x(2)-S-x-B(2)- 


Phosphorylase b kinase 


-K-R-K-Q-I-S-V-R- 


Extracellular signal-regulated kinase (ERK) 


-P-x-[ST]-P(2> 


Cyclin-dependent protein kinase (cdc2) 


-x-[ST]-P-x-[KR]- 


Casein kinase I 


-[SpTp]-x(2)-[ST]-B f 


WCtoCJ-Ll Jvii Lclot; 11 


-x-FSTl-xf2V[ED1-x- 


/3-Adrenergic receptor kinase 


-[DE](n)-[ST]-x(3) 


Rhodopsin kinase 


-x(2)-[ST]-E(?z)- 


Insulin receptor kinase 


-x-E(3yY-M(4yK(2)-S-R-G-D~ -M^T-M-Q-I- 




G-K(3)-L-P-A-T-G-D- } -M-N-M-S-P-V-G-D- 


Epidermal growth factor (EGF) receptor kinase 


-E(4)-Y^F~E-L-V- 



Sources: Pinna, LA. & Ruzzene, M.H. (1996) How do protein kinases recognize their substrates? Biochim. Biophys.Acta 1314, 191-225; Kemp, B.E. & Pearson, R.B. (1990) Protein 
kinase recognition sequence motifs. Trends Biochem. Sci. 15, 342-346; Ken nelly, RJ. & Krebs, E.G. (1991) Consensus sequences as substrate specificity determinants for protein 
kinases and protein phosphatases. J. Biol. Chem. 266, 15,555-15,558. 

♦Shown here are deduced consensus sequences (in roman type) and actual sequences from known substrates (italic). The Ser (S),Thr (T), orTyr (Y) residue that undergoes phosphory- 
lation is in red; all amino acid residues are shown as their one-letter abbreviations (see Table 3-1). x represents any amino acid. B, any hydrophobic amino acid. Sp,Tp, andYp are Ser, 
Thr, and Tyr residues that must already be phosphorylated for the kinase to recognize the site. 

f The best target site has two amino acid residues separating the phosphorylated and target Ser/Thr residues; target sites with one or three intervening residues function at a reduced level. 
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Kinase 


Phosphorylation 
sites 


Degree of 
synthase 
inactivation 


Protein kinase A 


1A, IB, 2, 4 


+ 


Protein kinase G 


1A, IB, 2 


+ 


Protein kinase C 


1A 


+ 


Ca 2+ /calmodulin 
kinase 


IB, 2 


+ 
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2 


+ 


Casein kinase I 


At least nine 
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FIGURE 6-37 Multiple regulatory phosphorylations. The enzyme glyco- 
gen synthase has at least nine separate sites in five designated regions 
susceptible to phosphorylation by one of the cellular protein kinases. 
Thus, regulation of this enzyme is a matter not of binary (on/off) switch- 
ing but of finely tuned modulation of activity over a wide range in 
response to a variety of signals. 



sites in the enzyme (Fig. 6-37). The enzyme is not a 
substrate for glycogen synthase kinase 3, for example, 
until one site has been phosphorylated by casein kinase 
II. Some phosphorylations inhibit glycogen synthase 
more than others, and some combinations of phosphory- 
lations are cumulative. These multiple regulatory phos- 
phorylations provide the potential for extremely subtle 
modulation of enzyme activity. 

To serve as an effective regulatory mechanism, 
phosphorylation must be reversible. In general, phos- 
phoryl groups are added and removed by different en- 
zymes, and the processes can therefore be separately 
regulated. Cells contain a family of phosphoprotein 
phosphatases that hydrolyze specific ®-Ser, ®-Thr, 
and ®-Tyr esters, releasing Pi. The phosphoprotein 
phosphatases we know of thus far act only on a subset of 
phosphoproteins, but they show less substrate 
specificity than protein kinases. 

Some Enzymes and Other Proteins Are Regulated by 
Proteolytic Cleavage of an Enzyme Precursor 

For some enzymes, an inactive precursor called a zymo- 
gen is cleaved to form the active enzyme. Many prote- 
olytic enzymes (proteases) of the stomach and pancreas 
are regulated in this way. Chymotrypsin and trypsin are 
initially synthesized as chymotrypsinogen and trypsino- 
gen (Fig. 6-38). Specific cleavage causes conforma- 
tional changes that expose the enzyme active site. 
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FIGURE 6-38 Activation of zymogens 
by proteolytic cleavage. Shown here is 
the formation of chymotrypsin and 
trypsin from their zymogens, chy- 
motrypsinogen and trypsinogen. The 
bars represent the amino acid se- 
quences of the polypeptide chains, 
with numbers indicating the relative 
positions of the residues (the amino-ter- 
minal residue is number 1). Residues at 
the termini of the polypeptide frag- 
ments generated by cleavage are indi- 
cated below the bars. Note that in the 
final active forms, some numbered 
residues are missing. Recall that the 
three polypeptide chains (A, B, and C) 
of chymotrypsin are linked by disulfide 
bonds (see Fig, 6-1 8), 



Because this type of activation is irreversible, other 
mechanisms are needed to inactivate these enzymes. 
Proteases are inactivated by inhibitor proteins that bind 
very tightly to the enzyme active site. For example, pan- 
creatic trypsin inhibitor (M T 6,000) binds to and inhibits 
trypsin. ax-Antiproteinase (M T 53,000) primarily inhibits 
neutrophil elastase (neutrophils are a type of leukocyte, 
or white blood cell; elastase is a protease acting on 
elastin, a component of some connective tissues) . An in- 
sufficiency of a r antiproteinase, which can be caused by 
exposure to cigarette smoke, has been associated with 
lung damage, including emphysema. 

Proteases are not the only proteins activated by 
proteolysis. In other cases, however, the precursors are 
called not zymogens but, more generally, proproteins 
or proenzymes, as appropriate. For example, the con- 
nective tissue protein collagen is initially synthesized as 
the soluble precursor procollagen. Blood clotting is me- 
diated by a complicated cascade of proteolytic activa- 
tions: fibrin, the protein of blood clots, is produced by 
proteolysis of fibrinogen, its inactive proprotein; the 
protease responsible for this activation is thrombin 
(similar in many respects to chymotrypsin) , which itself 
is produced by proteolysis of a proprotein (in this case a 
zymogen), prothrombin. 

Some Regulatory Enzymes Use Several 
Regulatory Mechanisms 

Glycogen phosphorylase catalyzes the first reaction in 
a pathway that feeds stored glucose into energy-yield- 
ing carbohydrate metabolism (Chapters 14 and 15). 
This is an important metabolic pathway, and its regula- 
tion is correspondingly complex. Although the primary 
regulation of glycogen phosphorylase is through cova- 
lent modification, as outlined in Figure 6-36, glycogen 



phosphorylase is also modulated allosterically by AMP, 
which is an activator of phosphorylase b, and by glu- 
cose 6-phosphate and ATP, both inhibitors. In addition, 
the enzymes that add and remove the phosphoryl 
groups are themselves regulated by — and so the entire 
system is sensitive to — the levels of hormones that 
regulate blood sugar (Fig. 6-36; see also Chapters 15 
and 23). 

Other complex regulatory enzymes are found at key 
metabolic crossroads. Bacterial glutamine synthetase, 
which catalyzes a reaction that introduces reduced ni- 
trogen into cellular metabolism (Chapter 22), is among 
the most complex regulatory enzymes known. It is regu- 
lated allosterically (with at least eight different modula- 
tors); by reversible covalent modification; and by the 
association of other regulatory proteins, a mechanism 
examined in detail when we consider the regulation of 
specific metabolic pathways. 

What is the advantage of such complexity in the reg- 
ulation of enzymatic activity? We began this chapter by 
stressing the central importance of catalysis to the very 
existence of life. The control of catalysis is also critical 
to life. If all possible reactions in a cell were catalyzed si- 
multaneously, macromolecules and metabolites would 
quickly be broken down to much simpler chemical 
forms. Instead, cells catalyze only the reactions they 
need at a given moment. When chemical resources are 
plentiful, cells synthesize and store glucose and other 
metabolites. When chemical resources are scarce, cells 
use these stores to fuel cellular metabolism. Chemical 
energy is used economically, parceled out to various 
metabolic pathways as cellular needs dictate. The avail- 
ability of powerful catalysts, each specific for a given re- 
action, makes the regulation of these reactions possible. 
This in turn gives rise to the complex, highly regulated 
symphony we call life. 



228 I Enzymes 



SUMMARY 6.5 Regulatory Enzymes 

■ The activities of metabolic pathways in cells are 
regulated by control of the activities of certain 
enzymes. 

■ In feedback inhibition, the end product of a 
pathway inhibits the first enzyme of that pathway. 

■ The activity of an allosteric enzyme is adjusted by 
reversible binding of a specific modulator to a 
regulatory site. A modulator may be the substrate 
itself or some other metabolite, and the effect of 
the modulator may be inhibitory or stimulatory. 
The kinetic behavior of allosteric enzymes reflects 
cooperative interactions among enzyme subunits. 

■ Other regulatory enzymes are modulated by 
covalent modification of a specific functional group 
necessary for activity The phosphorylation of 
specific amino acid residues is a particularly 
common way to regulate enzyme activity. 

■ Many proteolytic enzymes are synthesized as 
inactive precursors called zymogens, which are 
activated by cleavage of small peptide fragments. 

■ Enzymes at important metabolic intersections 
may be regulated by complex combinations of 
effectors, allowing coordination of the activities of 
interconnected pathways. 

Key Terms — 

Terms in bold are defined in the glossary. 
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Problems 

1. Keeping the Sweet Taste of Corn The sweet taste of 
freshly picked corn (maize) is due to the high level of sugar in 



the kernels. Store-bought corn (several days after picking) is 
not as sweet, because about 50% of the free sugar is con- 
verted to starch within one day of picking. To preserve the 
sweetness of fresh corn, the husked ears can be immersed in 
boiling water for a few minutes ("blanched") then cooled in 
cold water. Corn processed in this way and stored in a freezer 
maintains its sweetness. What is the biochemical basis for this 
procedure? 

2. Intracellular Concentration of Enzymes To approxi- 
mate the actual concentration of enzymes in a bacterial cell, 
assume that the cell contains equal concentrations of 1,000 
different enzymes in solution in the cytosol and that each pro- 
tein has a molecular weight of 100,000. Assume also that the 
bacterial cell is a cylinder (diameter 1.0 /mi, height 2.0 Aim), 
that the cytosol (specific gravity 1.20) is 20% soluble protein 
by weight, and that the soluble protein consists entirely of en- 
zymes. Calculate the average molar concentration of each 
enzyme in this hypothetical cell. 

3. Rate Enhancement by Urease The enzyme urease en- 
hances the rate of urea hydrolysis at pH 8.0 and 20 °C by a fac- 
tor of 10 14 . If a given quantity of urease can completely 
hydrolyze a given quantity of urea in 5,0 min at 20 °C and pH 
8.0, how long would it take for this amount of urea to be hy- 
drolyzed under the same conditions in the absence of urease? 
Assume that both reactions take place in sterile systems so 
that bacteria cannot attack the urea. 

4. Protection of an Enzyme against Denaturation by 
Heat When enzyme solutions are heated, there is a progres- 
sive loss of catalytic activity over time due to denaturation of 
the enzyme. A solution of the enzyme hexokinase incubated at 
45 °C lost 50% of its activity in 12 min, but when incubated at 
45 °C in the presence of a very large concentration of one of its 
substrates, it lost only 3% of its activity in 12 min. Suggest why 
thermal denaturation of hexokinase was retarded in the pres- 
ence of one of its substrates. 

5. Requirements of Active Sites in Enzymes Carboxypep- 
tidase, which sequentially removes carboxyl-temiinal amino 
acid residues from its peptide substrates, is a single polypeptide 
of 307 amino acids. The two essential catalytic groups in the ac- 
tive site are furnished by Arg 145 and Glu 270 

(a) If the carboxypeptidase chain were a perfect a helix, 
how far apart (in A) would Arg 145 and Glu 270 be? (Hint: See 
Fig. 4-4a.) 

(b) Explain how the two amino acid residues can catalyze 
a reaction occurring in the space of a few angstroms. 

6. Quantitative Assay for Lactate Dehydrogenase The 

muscle enzyme lactate dehydrogenase catalyzes the reaction 



CH 3 -C— COO" + NADH + H + > CH 3 — C-COO" + NAD + 

Pyruvate ^ 

Lactate 

NADH and NAD + are the reduced and oxidized forms, respec- 
tively, of the coenzyme NAD. Solutions of NADH, but not NAD + , 
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absorb light at 340 nm. This property is used to determine 
the concentration of NADH in solution by measuring spec- 
trophotometrically the amount of light absorbed at 340 nm 
by the solution. Explain how these properties of NADH can be 
used to design a quantitative assay for lactate dehydrogenase. 

7. Effect of Enzymes on Reactions Which of the following 
effects would be brought about by any enzyme catalyzing the 
simple reaction 

Ai rpi 
p where K f eq = ^? 

(a) Decreased K' eq ; (b) Increased k^\ (c) Increased K' eq ; (d) In- 
creased AG"; (e) Decreased AG*; (f) More negative AG'°; 
(g) Increased k 2 . 

8. Relation between Reaction Velocity and Substrate 
Concentration: Michaelis-Menten Equation 

(a) At what substrate concentration would an enzyme 
with a k cat of 30.0 s -1 and a K m of 0.0050 M operate at one- 
quarter of its maximum rate? 

(b) Determine the fraction of 7 max that would be obtained at 
the following substrate concentrations [S]: l AK m , 2K m , and lQK m . 

(c) An enzyme that catalyzes the reaction X ^ Y is iso- 
lated from two bacterial species. The enzymes have the same 
7 max , but different K m values for the substrate X. Enzyme A 
has a K m of 2.0 /am, while enzyme B has a K m of 0.5 /am. The 
plot below shows the kinetics of reactions carried out with the 
same concentration of each enzyme and with [X] = 1 fjuM. 
Which curve corresponds to which enzyme? 




9. Applying the Michaelis-Menten Equation I A research 
group discovers a new version of happyase, which they call 
happyase*, that catalyzes the chemical reaction 

HAPPY SAD 

The researchers begin to characterize the enzyme. 

(a) In the first experiment, with [E t ] at 4 nM, they find that 
the 7 max is 1.6 fiM s -1 . Based on this experiment, what is the 
k cat for happyase*? (Include appropriate units.) 

(b) In another experiment, with [E t ] at 1 nM and [HAPPY] 
at 30 pM, the researchers find that V 0 = 300 Ms" 1 . What is 
the measured K m of happyase* for its substrate HAPPY? (In- 
clude appropriate units.) 



(c) Further research shows that the purified happyase* 
used in the first two experiments was actually contaminated 
with a reversible inhibitor called ANGER. When ANGER is care- 
fully removed from the happyase* preparation, and the two ex- 
periments repeated, the measured 7 max in (a) is increased to 
4.8 jjm s -1 , and the measured K m in (b) is now 15 /am. For the 
inhibitor ANGER, calculate the values of a and a'. 

(d) Based on the information given above, what type of in- 
hibitor is ANGER? 

10. Applying the Michaelis-Menten Equation II Another 
enzyme is found that catalyzes the reaction 

A B 

Researchers find that the K m for the substrate A is 4 /oi, and 
the k cat is 20 min -1 . 

(a) In an experiment, [A] = 6 mM, and the initial velocity, 7 0 
was 480 nM min -1 . What was the [E t ] used in the experiment? 

(b) In another experiment, [E t ] = 0.5 fiM, and the measured 
Vq = 5 fjM min -1 -, What was the [A] used in the experiment? 

(c) The compound Z is found to be a very strong compet- 
itive inhibitor of the enzyme, with an a of 10. In an experiment 
with the same [E t ] as in part (a), but a different [A], an amount 
of Z is added that reduces the rate V 0 to 240 nM min -1 . What is 
the [A] in this experiment? 

(d) Based on the kinetic parameters given above, has this 
enzyme evolved to achieve catalytic perfection? Explain your 
answer briefly, using the kinetic parameter(s) that define 
catalytic perfection, 

11. Estimation of F max and K m by Inspection Although 
graphical methods are available for accurate determination of 
the 7 max and K m of an enzyme-catalyzed reaction (see Box 6-1) , 
sometimes these quantities can be quickly estimated by in- 
specting values of V 0 at increasing [S]. Estimate the V max andif m 
of the enzyme-catalyzed reaction for which the following data 
were obtained, 



[S] (M) 


F 0 (/uM/min) 


2.5 X 10 -6 


28 


4.0 X 10" 6 


40 


1 X 10" 5 


70 


2 X 10~ 5 


95 


4 X 10" 5 


112 


1 X 10" 4 


128 


2 X 10" 3 


139 


1 X 10" 2 


140 



12. Properties of an Enzyme of Prostaglandin Synthe- 
sis Prostaglandins are a class of eicosanoids, fatty acid deriv- 
atives with a variety of extremely potent actions on vertebrate 
tissues. They are responsible for producing fever and inflam- 
mation and its associated pain. Prostaglandins are derived 
from the 20-carbon fatty acid arachidonic acid in a reaction 
catalyzed by the enzyme prostaglandin endoperoxide syn- 
thase. This enzyme, a cyclooxygenase, uses oxygen to convert 
arachidonic acid to PGG 2 , the immediate precursor of many 
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different prostaglandins (prostaglandin synthesis is described 
in Chapter 21). 

(a) The kinetic data given below are for the reaction cat- 
alyzed by prostaglandin endoperoxide synthase. Focusing 
here on the first two columns, determine the 7 max and K m of 
the enzyme. 

Rate of formation 



[Arachidonic 
acid] 
(him) 


Rate of formation 
ofPGG 2 
(mM/min) 


ofPGG 2 withlO 
mg/mL ibuprofen 
(mM/min) 


05 


23.5 


16.67 


1.0 


32.2 


25,25 


1.5 


36.9 


30.49 


2.5 


41.8 


37.04 


3.5 


44 0 


38.91 



(b) Ibuprofen is an inhibitor of prostaglandin endoperox- 
ide synthase. By inhibiting the synthesis of prostaglandins, 
ibuprofen reduces inflammation and pain. Using the data in 
the first and third columns of the table, determine the type of 
inhibition that ibuprofen exerts on prostaglandin endoperox- 
ide synthase. 

% 13. Graphical Analysis of F max and K m The following 
experimental data were collected during a study of the cat- 
alytic activity of an intestinal peptidase with the substrate 
glycylglycine: 



Glycylglycine 


+ H 2 0 ► 2 glycine 




Product formed 


[S] (niM) 


(ftmol/min) 


15 


0.21 


2.0 


0 24 


30 


0.28 


4.0 


0 33 


80 


0,40 


16.0 


0.45 



Use graphical analysis (see Box 6-1 and its associated Living 
Graph) to determine the K m and 7 max for this enzyme prepa- 
ration and substrate. 

14. The Eadie-Hofstee Equation One transformation of 
the Michaelis-Menten equation is the Lineweaver-Burk, or 
double-reciprocal, equation. Multiplying both sides of the 
Lineweaver-Burk equation by 7 max and rearranging gives the 
Eadie-Hofstee equation: 

A plot of V 0 versus V 0 /[S] for an enzyme-catalyzed reaction 
is shown below. The blue curve was obtained in the absence 
of inhibitor. Which of the other curves (A, B, or C) shows the 
enzyme activity when a competitive inhibitor is added to the 
reaction mixture? Hint: See Equation 6-30. 




[S] 



15. The Turnover Number of Carbonic Anhydrase Car- 
bonic anhydrase of erythrocytes (M T 30,000) has one of the 
highest turnover numbers we know of. It catalyzes the re- 
versible hydration of C0 2 : 

H 2 0 + C0 2 H 2 C0 3 

This is an important process in the transport of C0 2 from the 
tissues to the lungs. If 10.0 fig of pure carbonic anhydrase cat- 
alyzes the hydration of 0.30 g of C0 2 in 1 min at 37 °C at 7 max , 
what is the turnover number (/c cat ) of carbonic anhydrase (in 
units of min -1 )? 

16. Deriving a Rate Equation for Competitive Inhibi- 
tion The rate equation for an enzyme subject to competitive 
inhibition is 

° aK m + [S] 

Beginning with a new definition of total enzyme as 

[EJ = [E] + [ES] + [EI] 

and the definitions of a and Kj provided in the text, derive the 
rate equation above. Use the derivation of the Michaelis- 
Menten equation as a guide. 

17. Irreversible Inhibition of an Enzyme Many enzymes 
are inhibited irreversibly by heavy metal ions such as Hg 2+ , 
Cu 2+ , or Ag + , which can react with essential sulfhydryl groups 
to form mercaptides: 

Enz-SH + Ag + ► Enz-S-Ag + H + 

The affinity of Ag + for sulfhydryl groups is so great that Ag + 
can be used to titrate — SH groups quantitatively To 10.0 mL 
of a solution containing 1.0 mg/mL of a pure enzyme, an inves- 
tigator added just enough AgN0 3 to completely inactivate the 
enzyme. A total of 0.342 /xmol of AgN0 3 was required. Calcu- 
late the minimum molecular weight of the enzyme. Why does 
the value obtained in this way give only the minimum molec- 
ular weight? 

S9 18. Clinical Application of Differential Enzyme 
Bu8 Inhibition Human blood serum contains a class of 
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enzymes known as acid phosphatases, which hydrolyze biolog- 
ical phosphate esters under slightly acidic conditions (pH 5.0): 



R— O— P— 0- + H 2 0 - — > R— OH + HO— P— O- 
O 0 

Acid phosphatases are produced by erythrocytes, the liver, 
kidney, spleen, and prostate gland. The enzyme of the prostate 
gland is clinically important, because its increased activity in 
the blood can be an indication of prostate cancer. The phos- 
phatase from the prostate gland is strongly inhibited by tar- 
trate ion, but acid phosphatases from other tissues are not. 
How can this information be used to develop a specific proce- 
dure for measuring the activity of the acid phosphatase of the 
prostate gland in human blood serum? 

E5 19. Inhibition of Carbonic Anhydrase by Aceta- 
Kfl zolamide Carbonic anhydrase is strongly inhibited by 
the drug acetazolamide, which is used as a diuretic (i.e., to in- 
crease the production of urine) and to lower excessively high 
pressure in the eye (due to accumulation of intraocular fluid) 
in glaucoma Carbonic anhydrase plays an important role in 
these and other secretory processes, because it participates in 
regulating the pH and bicarbonate content of several body flu- 
ids. The experimental curve of initial reaction velocity (as per- 
centage of V m3LX ) versus [S] for the carbonic anhydrase 
reaction is illustrated below (upper curve). When the experi- 
ment is repeated in the presence of acetazolamide, the lower 
curve is obtained. From an inspection of the curves and your 
knowledge of the kinetic properties of competitive and mixed 
enzyme inhibitors, determine the nature of the inhibition by 
acetazolamide. Explain your reasoning. 



100 r 




0.2 0.4 0.6 0.8 

[S] (mM) 



20. The Effects of Reversible Inhibitors Derive the ex- 
pression for the effect of a reversible inhibitor on observed K m 
(apparent K m = aKJa'*), Start with Equation 6-30 and the 
statement that apparent K m is equivalent to the [S] at which 
V 0 = V max /2a'. 

21. pH Optimum of lysozyme The active site of lysozyme 
contains two amino acid residues essential for catalysis: Glu 35 
and Asp 52 . The pif a values of the carboxyl side chains of these 
residues are 5.9 and 4,5, respectively. What is the ionization 
state (protonated or deprotonated) of each residue at pH 5.2, 



the pH optimum of lysozyme? How can the ionization states of 
these residues explain the pH-activity profile of lysozyme 
shown below? 




2 4 6 8 10 

pH 



H 22. Working with Kinetics Go to the Living Graphs for 
Chapter 6. 

(a) Using the Living Graph for Equation 6-9, create a V 
versus [S] plot. Use 7 max = 100 s _1 , andif m = 10 fxM. How 
much does V 0 increase when [S] is doubled, from 0.2 to 0.4 /am? 
What is V 0 when [S] = 10 /xm? How much does the V 0 increase 
when [S] increases from 100 to 200 /am? Observe how the 
graph changes when the values for 7 max or K m are halved or 
doubled. 

(b) Using the Living Graph for Equation 6-30 and the ki- 
netic parameters in (a), create a plot in which both a and a' 
are 1.0. Now observe how the plot changes when a = 2.0; 
when a' = 3.0; and when a = 2.0 and a' = 3.0. 

(c) Using the Living Graphs for Equation 6-30 and the 
Lineweaver-Burk equation in Box 6-1, create Lineweaver- 
Burk (double-reciprocal) plots for all the cases in (a) and (b) . 
When a = 2.0, does the x intercept move to the right or to the 
left? If a = 2.0 and a' = 3.0, does the x intercept move to the 
right or to the left? 

Data Analysis Problem 

23. Exploring and Engineering Lactate Dehydrogenase 

Examining the structure of an enzyme results in hypotheses 
about the relationship between different amino acids in the 
protein's structure and the protein's function. One way to test 
these hypotheses is to use recombinant DNA technology to 
generate mutant versions of the enzyme and then examine the 
structure and function of these altered forms. The technology 
used to do this is described in Chapter 9, 

One example of this kind of analysis is the work of Clarke 
and colleagues on the enzyme lactate dehydrogenase, pub- 
lished in 1989. Lactate dehydrogenase (LDH) catalyzes the re- 
duction of pyruvate with NADH to form lactate (see Section 
14.3). A schematic of the enzyme's active site is shown below; 
the pyruvate is in the center: 
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Lactate dehydrogenase 
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NH 

I 

HN " 



Glni02 
I 

A 

O NH 2 



H 



CH 3 



HqC— C- 
H 



Thr^ 46 

I 



-OH 



NH 

H J 

|Pyruvate H 




N-(NADH) 



0 



0 

H H 

HN + NH 

V 
I 

NH 

Arg"i 



H CH 3 

I 1 
H3C — C — CH9 

I 



The reaction mechanism is similar to many NADH reduc- 
tions (Fig. 13-24); it is approximately the reverse of steps 2 
and 3 of Figure 14-7. The transition state involves a strongly 
polarized carbonyl group of the pyruvate molecule as shown 
below: 

CH 3 

u 

"0-C + 

I 

c 

(a) A mutant form of LDH in which Arg 109 is replaced with 
Gin shows only 5% of the pyruvate binding and 0.07% of the 
activity of wild-type enzyme. Provide a plausible explanation 
for the effects of this mutation. 



(b) A mutant form of LDH in which Arg 171 is replaced with 
Lys shows only 0.05% of the wild-type level of substrate bind- 
ing. Why is this dramatic effect surprising? 

(c) In the crystal structure of LDH, the guanidinium 
group of Arg 171 and the carboxyl group of pyruvate are aligned 
as shown in a co-planar "forked" configuration. Based on this, 
provide a plausible explanation for the dramatic effect of sub- 
stituting Arg 171 with Lys. 

(d) A mutant form of LDH in which He 250 is replaced with 
Gin shows reduced binding of NADH. Provide a plausible ex- 
planation for this result. 

Clarke and colleagues also set out to engineer a mutant 
version of LDH that would bind and reduce oxaloacetate 
rather than pyruvate. They made a single substitution, replac- 
ing Gin 102 with Arg; the resulting enzyme would reduce ox- 
aloacetate to malate and would no longer reduce pyruvate to 
lactate. They had therefore converted LDH to malate dehy- 
drogenase. 

(e) Sketch the active site of this mutant LDH with oxaloac- 
etate bound. 

(f) Provide a plausible explanation for why this mutant en- 
zyme now "prefers" oxaloacetate instead of pyruvate. 

(g) The authors were surprised that substituting a larger 
amino acid in the active site allowed a larger substrate to bind. 
Provide a plausible explanation for this result. 
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I would feel more optimistic about a bright future for man if he spent 
less time proving that he can outwit Nature and more time tasting her 
sweetness and respecting her seniority. 

— £ B. White, "Coon Tree/' 1977 
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Carbohydrates are the most abundant biomolecules on 
Earth. Each year, photosynthesis converts more than 
100 billion metric tons of C0 2 and H 2 0 into cellulose 
and other plant products. Certain carbohydrates (sugar 
and starch) are a dietary staple in most parts of the world, 
and the oxidation of carbohydrates is the central energy- 
yielding pathway in most nonphotosynthetic cells. Carbo- 
hydrate polymers (also called glycans) serve as structural 
and protective elements in the cell walls of bacteria and 
plants and in the connective tissues of animals. Other car- 
bohydrate polymers lubricate skeletal joints and partici- 
pate in recognition and adhesion between cells. Complex 
carbohydrate polymers covalently attached to proteins or 
lipids act as signals that determine the intracellular loca- 
tion or metabolic fate of these hybrid molecules, called gly- 
coconjugates. This chapter introduces the major classes 
of carbohydrates and glycoconjugates and provides a few 
examples of their many structural and functional roles. 

Carbohydrates are polyhydroxy aldehydes or ketones, 
or substances that yield such compounds on hydrolysis. 
Many, but not all, carbohydrates have the empirical formula 
(CH 2 0) n ; some also contain nitrogen, phosphorus, or sulfur. 

There are three major size classes of carbohydrates: 
monosaccharides, oligosaccharides, and polysaccha- 
rides (the word "saccharide" is derived from the Greek 
sakcharon, meaning "sugar"). Monosaccharides, or 
simple sugars, consist of a single polyhydroxy aldehyde 
or ketone unit. The most abundant monosaccharide in 
nature is the six-carbon sugar D-glucose, sometimes re- 
ferred to as dextrose. Monosaccharides of four or more 
carbons tend to have cyclic structures. 



Oligosaccharides consist of short chains of mono- 
saccharide units, or residues, joined by characteristic 
linkages called glycosidic bonds. The most abundant are 
the disaccharides, with two monosaccharide units. 
Typical is sucrose (cane sugar), which consists of the 
six-carbon sugars D-glucose and D-fructose. All common 
monosaccharides and disaccharides have names ending 
with the suffix "-ose." In cells, most oligosaccharides 
consisting of three or more units do not occur as free en- 
tities but are joined to nonsugar molecules (lipids or 
proteins) in glycoconjugates. 

The polysaccharides are sugar polymers contain- 
ing more than 20 or so monosaccharide units; some have 
hundreds or thousands of units. Some polysaccharides, 
such as cellulose, are linear chains; others, such as 
glycogen, are branched. Both glycogen and cellulose 
consist of recurring units of D-glucose, but they differ in 
the type of glycosidic linkage and consequently have 
strikingly different properties and biological roles. 

7.1 Monosaccharides and Disaccharides 

The simplest of the carbohydrates, the monosaccha- 
rides, are either aldehydes or ketones with two or more 
hydroxyl groups; the six-carbon monosaccharides glu- 
cose and fructose have five hydroxyl groups. Many of the 
carbon atoms to which hydroxyl groups are attached are 
chiral centers, which give rise to the many sugar 
stereoisomers found in nature. We begin by describing 
the families of monosaccharides with backbones of 
three to seven carbons — their structure and stereoiso- 
mer^ forms, and the means of representing their three- 
dimensional structures on paper. We then discuss several 
chemical reactions of the carbonyl groups of monosac- 
charides. One such reaction, the addition of a hydroxyl 
group from within the same molecule, generates cyclic 
forms having four or more backbone carbons (the forms 
that predominate in aqueous solution). This ring closure 
creates a new chiral center, adding further stereochemi- 
cal complexity to this class of compounds. The nomen- 
clature for unambiguously specifying the configuration 
about each carbon atom in a cyclic form and the means 
of representing these structures on paper are therefore 
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described in some detail; this information will be useful 
as we discuss the metabolism of monosaccharides in Part 
II. We also introduce here some important monosaccha- 
ride derivatives encountered in later chapters. 

The Two Families of Monosaccharides Are Aldoses 
and Ketoses 

Monosaccharides are colorless, crystalline solids that are 
freely soluble in water but insoluble in nonpolar solvents. 
Most have a sweet taste. The backbones of common mono- 
saccharides are unbranched carbon chains in which all the 
carbon atoms are linked by single bonds. In the open-chain 
form, one of the carbon atoms is double-bonded to an oxy- 
gen atom to form a carbonyl group; each of the other car- 
bon atoms has a hydroxyl group. If the carbonyl group is at 
an end of the carbon chain (that is, in an aldehyde group) 
the monosaccharide is an aldose; if the carbonyl group is 
at any other position (in a ketone group) the monosaccha- 
ride is a ketose. The simplest monosaccharides are the 
two three-carbon trioses: glyceraldehyde, an aldotriose, 
and dihydroxyacetone, a ketotriose (Fig. 7-la). 

Monosaccharides with four, five, six, and seven car- 
bon atoms in their backbones are called, respectively, tet- 
roses, pentoses, hexoses, and heptoses. There are aldoses 
and ketoses of each of these chain lengths: aldotetroses 
and ketotetroses, aldopentoses and ketopentoses, and so 
on. The hexoses, which include the aldohexose D-glucose 
and the ketohexose D-fructose (Fig. 7-lb), are the most 
common monosaccharides in nature. The aldopentoses d- 
ribose and 2-deoxy-D-ribose (Fig. 7-1 c) are components 
of nucleotides and nucleic acids (Chapter 8). 

Monosaccharides Have Asymmetric Centers 

All the monosaccharides except dihydroxyacetone con- 
tain one or more asymmetric (chiral) carbon atoms and 
thus occur in optically active isomeric forms (pp. 16-17). 
The simplest aldose, glyceraldehyde, contains one chiral 
center (the middle carbon atom) and therefore has two 
different optical isomers, or enantiomers (Fig. 7-2). 

KEY CONVENTION: One of the two enantiomers is, by con- 
vention, designated the d isomer, the other the l isomer. 
As for other biomolecules with chiral centers, the ab- 
solute configurations of sugars are known from x-ray 
crystallography. To represent three-dimensional sugar 
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FIGURE 7-1 Representative monosaccharides, (a) Two trioses, an aldose 
and a ketose. The carbonyl group in each is shaded, (b) Two common 
hexoses. (c) The pentose components of nucleic acids. D-Ribose is a 
component of ribonucleic acid (RNA), and 2-deoxy-D-ribose is a com- 
ponent of deoxyribonucleic acid (DNA). 



structures on paper, we often use Fischer projection 
formulas (Fig. 7-2). In Fischer projection formulas, 
horizontal bonds project out of the plane of the paper, 
toward the reader; vertical bonds project behind the 
plane of the paper, away from the reader. 
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Fischer projection formulas 
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Perspective formulas 

FIGURE 7-2 Three ways to represent the two enantiomers of glycer- 
aldehyde. The enantiomers are mirror images of each other. Ball-and- 
stick models show the actual configuration of molecules. Recall (see 
Fig. 1-17) that in perspective formulas, solid wedge-shaped bonds 
point toward the reader, dashed wedges point away. 
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FIGURE 7-3 Aldoses and ketoses. The series of (a) D-aldoses and 
(b) D-ketoses having from three to six carbon atoms, shown as projec- 
tion formulas. The carbon atoms in red are chiral centers, In all these 
d isomers, the chiral carbon most distant from the carbonyl carbon has 
the same configuration as the chiral carbon in D-glyceraldehyde. The 
sugars named in boxes are the most common in nature; you will en- 
counter these again in this and later chapters. 

In general, a molecule with n chiral centers can have 
2 n stereoisomers. Glyceraldehyde has 2 1 = 2; the aldo- 
hexoses, with four chiral centers, have 2 4 = 16 stereoiso- 
mers. The stereoisomers of monosaccharides of each 
carbon-chain length can be divided into two groups that 
differ in the configuration about the chiral center most 
distant from the carbonyl carbon. Those in which the 
configuration at this reference carbon is the same as 
that of D-glyceraldehyde are designated d isomers, and 
those with the same configuration as L-glyceraldehyde 
are l isomers. When the hydroxyl group on the refer- 
ence carbon is on the right in a projection formula that 
has the carbonyl carbon at the top, the sugar is the d iso- 
mer; when on the left, it is the l isomer. Of the 16 possi- 
ble aldohexoses, eight are d forms and eight are l. Most 
of the hexoses of living organisms are d isomers. 

Figure 7-3 shows the structures of the d stereoiso- 
mers of all the aldoses and ketoses having three to six 
carbon atoms. The carbons of a sugar are numbered be- 
ginning at the end of the chain nearest the carbonyl 
group. Each of the eight D-aldohexoses, which differ in 
the stereochemistry at C-2, C-3, or C-4, has its own 
name: D-glucose, D-galactose, D-mannose, and so forth 
(Fig. 7-3a). The four- and five-carbon ketoses are 
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FIGURE 7-4 Epimers. D-Glucose and two of its epimers are shown as 
projection formulas. Each epimer differs from D-glucose in the config- 
uration at one chiral center (shaded pink). 

designated by inserting "ul" into the name of a corre- 
sponding aldose; for example, D-ribulose is the ketopen- 
tose corresponding to the aldopentose D-ribose. The 
ketohexoses are named otherwise: for example, fruc- 
tose (from the Latin fructus, "fruit"; fruits are one 
source of this sugar) and sorbose (from Sorbus, the 
genus of mountain ash, which has berries rich in the re- 
lated sugar alcohol sorbitol). Two sugars that differ only 
in the configuration around one carbon atom are called 
epimers; D-glucose and D-mannose, which differ only in 
the stereochemistry at C-2, are epimers, as are D-glu- 
cose and D-galactose (which differ at C-4) (Fig. 7-4). 

Some sugars occur naturally in their L form; 
examples are L-arabinose and the L isomers of some 
sugar derivatives that are common components of 
glycoconjugates (Section 7.3). 
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The Common Monosaccharides Have Cyclic Structures 

For simplicity, we have thus far represented the struc- 
tures of aldoses and ketoses as straight-chain mole- 
cules (Figs 7-3, 7-4). In fact, in aqueous solution, 
aldotetroses and all monosaccharides with five or more 
carbon atoms in the backbone occur predominantly as 
cyclic (ring) structures in which the carbonyl group 
has formed a covalent bond with the oxygen of a hy- 
droxyl group along the chain. The formation of these 
ring structures is the result of a general reaction be- 
tween alcohols and aldehydes or ketones to form deriv- 
atives called hemiacetals or hemiketals (Fig. 7-5), 
which contain an additional asymmetric carbon atom 
and thus can exist in two stereoisomeric forms. For ex- 
ample, D-glucose exists in solution as an intramolecular 
hemiacetal in which the free hydroxyl group at C-5 has 
reacted with the aldehydic C-l, rendering the latter 
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FIGURE 7-5 Formation of hemiacetals and hemiketals. An aldehyde 
or ketone can react with an alcohol in a 1 :1 ratio to yield a hemiacetal 
or hemiketal, respectively creating a new chiral center at the carbonyl 
carbon. Substitution of a second alcohol molecule produces an acetal 
or ketal. When the second alcohol is part of another sugar molecule, 
the bond produced is a glycosidic bond (p. 243). 

carbon asymmetric and producing two stereoisomers, 
designated a and /3 (Fig. 7-6). The designation a indi- 
cates that the hydroxyl group at the anomeric center 
is, in a Fischer projection, on the same side as the 
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FIGURE 7-6 Formation of the two cyclic forms of D-glucose. Reac- 
tion between the aldehyde group at C-1 and the hydroxyl group at C-5 
forms a hemiacetal linkage, producing either of two stereoisomers, the 
a and jS anomers, which differ only in the stereochemistry around the 
hemiacetal carbon. The interconversion of a and jS anomers is called 
mutarotation. 
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hydroxyl attached at the farthest chiral center, whereas 
(5 indicates that these hydroxyl groups are on opposite 
sides. These six-membered ring compounds are called 
pyranoses because they resemble the six-membered 
ring compound pyran (Fig. 7-7). The systematic names 
for the two ring forms of D-glucose are a-D-glucopyranose 
and /3-D-glucopyranose. 

Aldoses also exist in cyclic forms having five- 
membered rings, which, because they resemble the five- 
membered ring compound furan, are called furanoses. 
However, the six-membered aldopyranose ring is much 
more stable than the aldofuranose ring and predomi- 
nates in aldohexose and aldopentose solutions. Only al- 
doses having five or more carbon atoms can form 
pyranose rings. 

Isomeric forms of monosaccharides that differ only 
in their configuration about the hemiacetal or hemiketal 
carbon atom are called anomers. The hemiacetal (or 
carbonyl) carbon atom is called the anomeric carbon. 
The a and /3 anomers of D-glucose interconvert in 
aqueous solution by a process called mutarotation 
(Fig. 7-6). Thus, a solution of a-D-glucose and a solution 
of jS-D-glucose eventually form identical equilibrium 
mixtures having identical optical properties. This mix- 
ture consists of about one-third a-D-glucose, two-thirds 
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FIGURE 7-7 Pyranoses and furanoses. The pyranose forms of 
D-glucose and the furanose forms of D-fructose are shown here as Ha- 
worth perspective formulas. The edges of the ring nearest the reader 
are represented by bold lines. Hydroxyl groups below the plane of the 
ring in these Haworth perspectives would appear at the right side of a 
Fischer projection (compare with Fig. 7-6). Pyran and furan are shown 
for comparison. 



j3-D-glucose, and very small amounts of the linear and 
five-membered ring (glucofuranose) forms. 

Ketohexoses also occur in a and jS anomeric forms. 
In these compounds the hydroxyl group at C-5 (or C-6) 
reacts with the keto group at C-2, forming a furanose (or 
pyranose) ring containing a hemiketal linkage (Fig. 7-5). 
D-Fructose readily forms the furanose ring (Fig. 7-7); 
the more common anomer of this sugar in combined 
forms or in derivatives is /3-D-fructofuranose. 

Haworth perspective formulas like those in Fig- 
ure 7-7 are commonly used to show the stereochemistry 
of ring forms of monosaccharides. However, the six- 
membered pyranose ring is not planar, as Haworth per- 
spectives suggest, but tends to assume either of two 
"chair" conformations (Fig. 7-8). Recall from Chapter 1 
(p. 18) that two conformations of a molecule are inter- 
convertible without the breakage of covalent bonds, 
whereas two configurations can be interconverted only 
by breaking a covalent bond. To interconvert a and /3 
configurations, the bond involving the ring oxygen atom 
would have to be broken, but interconversion of the two 
chair forms does not require bond breakage. The specific 
three-dimensional structures of the monosaccharide units 
are important in determining the biological properties and 
functions of some polysaccharides, as we shall see. 
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FIGURE 7-8 Conformational formulas of pyranoses. (a) Two chair 
forms of the pyranose ring. Bonds to substituents and hydrogen atoms 
on the ring carbons may be either axial (ax), projecting parallel to the 
vertical axis through the ring, or equatorial (eq), projecting roughly 
perpendicular to this axis. Two conformers such are these are not readily 
interconvertible without breaking the ring. However, when the mole- 
cule is "stretched" (by atomic force microscopy; see Box 11-1), an 
input of about 46 kj of energy per mole of sugar can force the inter- 
conversion of chair forms. Generally, substituents in the equatorial 
positions are less sterically hindered by neighboring substituents, and 
conformers with bulky substituents in equatorial positions are favored. 
Another conformation, the "boat" (not shown), is seen only in deriva- 
tives with very bulky substituents. (b) The preferred chair conformation 
of a-D-glucopyranose. 
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Organisms Contain a Variety of Hexose Derivatives 

In addition to simple hexoses such as glucose, galactose, 
and mannose, there are a number of sugar derivatives in 
which a hydroxyl group in the parent compound is re- 
placed with another substituent, or a carbon atom is ox- 
idized to a carboxyl group (Fig. 7-9). In glucosamine, 
galactosamine, and mannosamine, the hydroxyl at C-2 
of the parent compound is replaced with an amino 
group. The amino group is nearly always condensed 
with acetic acid, as in TV-acetylglucosamine. This 
glucosamine derivative is part of many structural poly- 
mers, including those of the bacterial cell wall. Bacter- 
ial cell walls also contain a derivative of glucosamine, 
iV-acetylmuramic acid, in which lactic acid (a three- 
carbon carboxylic acid) is ether-linked to the oxygen 
at C-3 of Af-acetylglucosamine. The substitution of a 
hydrogen for the hydroxyl group at C-6 of L-galactose 



or L-mannose produces L-fucose or L-rhamnose, re- 
spectively. L-Fucose is found in the complex oligosac- 
charide components of glycoproteins and glycolipids; 
L-rhamnose is found in plant polysaccharides. 

Oxidation of the carbonyl (aldehyde) carbon of glu- 
cose to the carboxyl level produces gluconic acid; other 
aldoses yield other aldonic acids. Oxidation of the car- 
bon at the other end of the carbon chain — C-6 of glucose, 
galactose, or mannose — forms the corresponding uronic 
acid: glucuronic, galacturonic, or mannuronic acid. Both 
aldonic and uronic acids form stable intramolecular es- 
ters called lactones (Fig. 7-9, lower left). In addition 
to these acidic hexose derivatives, one nine-carbon 
acidic sugar deserves mention: A^-acetylneuraminic acid 
(a sialic acid, but often referred to simply as "sialic acid"), 
a derivative of Af-acetylmannosamine, is a component 
of many glycoproteins and glycolipids in animals. The 
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FIGURE 7-9 Some hexose derivatives important in biology. In amino 
sugars, an — NH 2 group replaces one of the — OH groups in the parent 
hexose. Substitution of — H for —OH produces a deoxy sugar; note 
that the deoxy sugars shown here occur in nature as the l isomers The 
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acidic sugars contain a carboxylate group, which confers a negative 
charge at neutral pH. D-Glucono-S-lactone results from formation of 
an ester linkage between the C-1 carboxylate group and the C-5 (also 
known as the 5 carbon) hydroxyl group of D-gluconate. 
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carboxylic acid groups of the acidic sugar derivatives 
are ionized at pH 7, and the compounds are therefore 
correctly named as the carboxylates — glucuronate, 
galacturonate, and so forth. 

In the synthesis and metabolism of carbohydrates, 
the intermediates are very often not the sugars them- 
selves but their phosphorylated derivatives. Condensa- 
tion of phosphoric acid with one of the hydroxyl groups 
of a sugar forms a phosphate ester, as in glucose 
6-phosphate (Fig. 7-9). Sugar phosphates are relatively 
stable at neutral pH and bear a negative charge. One ef- 
fect of sugar phosphorylation within cells is to trap the 
sugar inside the cell; most cells do not have plasma 
membrane transporters for phosphorylated sugars. 
Phosphorylation also activates sugars for subsequent 
chemical transformation. Several important phosphory- 
lated derivatives of sugars are components of nu- 
cleotides (discussed in the next chapter). 

Monosaccharides Are Reducing Agents 

Monosaccharides can be oxidized by relatively 
mild oxidizing agents such as cupric (Cu 2+ ) ion 
(Fig. 7-10). The carbonyl carbon is oxidized to a car- 
boxyl group. Glucose and other sugars capable of 
reducing cupric ion are called reducing sugars. They 
form enediols, which are converted to aldonic acids and 
then to a complex mixture of 2-, 3-, 4-, and 6-carbon 
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FIGURE 7-10 Sugars as reducing agents. Oxidation of the anomeric 
carbon (and probably the neighboring carbon) of glucose and other 
sugars under alkaline conditions is the basis for Fehling's reaction. The 
cuprous ion (Cu + ) produced forms a red cuprous oxide precipitate. In 
the hemiacetal (ring) form, C-1 of glucose cannot be oxidized by 
complexed Cu 2+ , However, the open-chain form is in equilibrium 
with the ring form, and eventually the oxidation reaction goes to com- 
pletion. The reaction with Cu 2+ is complex, yielding a mixture of prod- 
ucts and reducing 3 mol of Cu 2+ per mole of glucose. 

acids. This is the basis of Fehling's reaction, a qualitative 
test for the presence of reducing sugar. By measuring 
the amount of oxidizing agent reduced by a solution of a 
sugar, it is also possible to estimate the concentration of 
that sugar. For many years this test was used to detect 
and measure elevated glucose levels in blood and urine 
in the diagnosis of diabetes mellitus (Box 7-1). ■ 



BOX 7- 



MEDICINE 



Blood Glucose Measurements in the Diagnosis and 
Treatment of Diabetes 



Glucose is the principal fuel for the brain. When the 
amount of glucose reaching the brain is too low, the 
consequences can be dire: lethargy, coma, permanent 
brain damage, and death (see Fig. 23-25). Animals 
have evolved complex hormonal mechanisms to ensure 
that the concentration of glucose in the blood remains 
high enough (about 5 itiM) to satisfy the brain's needs, 
but not too high, because elevated blood glucose can 
also have serious physiological consequences. 

Individuals with insulin-dependent diabetes melli- 
tus do not produce sufficient insulin, the hormone that 
normally serves to reduce blood glucose concentration, 
and if the diabetes is untreated their blood glucose lev- 
els may rise to severalfold higher than normal. These 
high glucose levels are believed to be at least one cause 
of the serious long-term consequences of untreated dia- 
betes — kidney failure, cardiovascular disease, blind- 
ness, and impaired wound healing — so one goal of 
therapy is to provide just enough insulin (by injection) 
to keep blood glucose levels near normal. To maintain 
the correct balance of exercise, diet, and insulin for the 
individual, blood glucose concentration needs to be 
measured several times a day, and the amount of insulin 
injected adjusted appropriately. 



The concentrations of glucose in blood and urine 
can be determined by a simple assay for reducing sugar, 
such as Fehling's reaction, which for many years was 
used as a diagnostic test for diabetes (Fig. 7-10). Mod- 
ern measurements require just a drop of blood, added 
to a test strip containing the enzyme glucose oxidase 
(Fig. 1); a simple photometer measures the color pro- 
duced when the H 2 0 2 from glucose oxidation reacts with 
a dye, and reads out the blood glucose concentration. 

Because blood glucose levels change with the timing 
of meals and exercise, single-time measurements do not 
necessarily reflect the average blood glucose over hours 
and days, so dangerous increases may go undetected. 
The average glucose concentration can be assessed by 
looking at its effect on hemoglobin, the oxygen-carrying 
protein in erythrocytes (p. 158). Transporters in the ery- 
throcyte membrane equilibrate intracellular and plasma 

D-Glucose + 0 2 > D-Glucono-5-lactone+ H 2 U 2 

FIGURE 1 The glucose oxidase reaction, used in the measurement of 
blood glucose. A second enzyme, a peroxidase, catalyzes the reaction 
of the H 2 0 2 with a colorless compound to produce a colored product, 
which is measured spectrophotometrically. 

(continued on next page) 
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ood Glucose Measurements in the Diagnosis and 
Treatment of Diabetes {continued from previous page) 



glucose concentrations, so hemoglobin is constantly ex- 
posed to glucose at whatever concentration is present in 
the blood. A nonenzymatic reaction occurs between glu- 
cose and primary amino groups in hemoglobin (either 
the amino-terminal Val or the e-amino groups of Lys 
residues; see Fig. 2). The rate of this process is propor- 
tional to the concentration of glucose, so the reaction 
can be used as the basis for estimating the average blood 
glucose level over weeks. The amount of glycated hemo- 
globin (GHB) present at any time reflects the average 
blood glucose concentration over the circulating "life- 
time" of the erythrocyte (about 120 days), although the 
concentration in the last two weeks is the most impor- 
tant in setting the level of GHB. 

The extent of hemoglobin glycation (so named 
to distinguish it from glycosylation, the enzymatic 
transfer of glucose to a protein) is measured clinically 
by extracting hemoglobin from a small sample of blood 
and separating GHB from unmodified hemoglobin elec- 
trophoretically, taking advantage of the charge differ- 
ence resulting from modification of the amino 
group(s). Normal GHB values are about 5% of total he- 
moglobin (corresponding to blood glucose of 120 mg/ 
100 mL). In people with untreated diabetes, how- 
ever, this value may be as high as 13%, indicating an 
average blood glucose level of about 300 mg/100 mL— 
dangerously high. One criterion for success in an indi- 
vidual program of insulin therapy (the timing, 
frequency, and amount of insulin injected) is maintain- 
ing GHB values at about 7%. 

In the hemoglobin glycation reaction, the first step 
(formation of a Schiff base) is followed by a series of re- 
arrangements, oxidations, and dehydrations of the car- 
bohydrate moiety to produce a heterogeneous mixture 
of AGEs, advanced glycation end products. These prod- 
ucts can leave the erythrocyte and form covalent cross- 
links between proteins, interfering with normal protein 
function (Fig. 2). The accumulation of relatively high 
concentrations of AGEs in people with diabetes may, by 
cross-linking critical proteins, cause the damage to the 
kidneys, retinas, and cardiovascular system that charac- 
terize the disease. This pathogenic process is a potential 
target for drug action. 

FIGURE 2 The nonenzymatic reaction of glucose with a primary amino 
group in hemoglobin begins with© formation of a Schiff base, which 
® undergoes the Amadori rearrangement to generate a stable product; 
(3) this ketoamine can further cyclize to yield GHB. @ Subsequent 
reactions generate advanced glycation end products (AGEs), such as 
e-N-carboxymethyllysine and methylglyoxal, compounds that© can 
damage other proteins by cross-linking them, causing pathological 
changes. 
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Disaccharides Contain a Glycosidic Bond 

Disaccharides (such as maltose, lactose, and sucrose) 
consist of two monosaccharides joined covalently by an 
O-glycosidic bond, which is formed when a hydroxyl 
group of one sugar reacts with the anomeric carbon of 
the other (Fig. 7-11). This reaction represents the for- 
mation of an acetal from a hemiacetal (such as glucopy- 
ranose) and an alcohol (a hydroxyl group of the second 
sugar molecule) (Fig. 7-5), and the resulting compound 
is called a glycoside. Glycosidic bonds are readily 
hydrolyzed by acid but resist cleavage by base. Thus 
disaccharides can be hydrolyzed to yield their free 
monosaccharide components by boiling with dilute acid. 
AT-glycosyl bonds join the anomeric carbon of a sugar to 
a nitrogen atom in glycoproteins (see Fig. 7-29) and nu- 
cleotides (see Fig. 8-1). 

The oxidation of a sugar by cupric ion (the reaction 
that defines a reducing sugar) occurs only with the 
linear form, which exists in equilibrium with the cyclic 
form(s). When the anomeric carbon is involved in a gly- 
cosidic bond, that sugar residue cannot take the linear 
form and therefore becomes a nonreducing sugar. In 
describing disaccharides or polysaccharides, the end of a 
chain with a free anomeric carbon (one not involved in a 
glycosidic bond) is commonly called the reducing end. 
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FIGURE 7-11 Formation of maltose. A disaccharide is formed from 
two monosaccharides (here, two molecules of D-glucose) when an 
— OH (alcohol) of one glucose molecule (right) condenses with the in- 
tramolecular hemiacetal of the other glucose molecule (left), with 
elimination of H 2 0 and formation of a glycosidic bond. The reversal of 
this reaction is hydrolysis — attack by H 2 0 on the glycosidic bond The 
maltose molecule, shown here as an illustration, retains a reducing 
hemiacetal at the C-l not involved in the glycosidic bond. Because 
mutarotation interconverts the a and p forms of the hemiacetal, the 
bonds at this position are sometimes depicted with wavy lines, as 
shown here, to indicate that the structure may be either a or p. 



The disaccharide maltose (Fig. 7-11) contains two 
D-glucose residues joined by a glycosidic linkage be- 
tween C-l (the anomeric carbon) of one glucose 
residue and C-4 of the other. Because the disaccharide 
retains a free anomeric carbon (C-l of the glucose 
residue on the right in Fig. 7-11), maltose is a reducing 
sugar. The configuration of the anomeric carbon atom 
in the glycosidic linkage is a. The glucose residue with 
the free anomeric carbon is capable of existing in a- and 
/3-pyranose forms. 

KEY CONVENTION: To name reducing disaccharides such 
as maltose unambiguously, and especially to name more 
complex oligosaccharides, several rules are followed. By 
convention, the name describes the compound written 
with its nonreducing end to the left, and we can "build 
up" the name in the following order. (1) Give the config- 
uration (a or ff) at the anomeric carbon joining the first 
monosaccharide unit (on the left) to the second. (2) 
Name the nonreducing residue; to distinguish five- and 
six-membered ring structures, insert "furano" or 
"pyrano" into the name. (3) Indicate in parentheses the 
two carbon atoms joined by the glycosidic bond, with an 
arrow connecting the two numbers; for example, (1— >4) 
shows that C-l of the first-named sugar residue is joined 
to C-4 of the second. (4) Name the second residue. If 
there is a third residue, describe the second glycosidic 
bond by the same conventions. (To shorten the descrip- 
tion of complex polysaccharides, three-letter abbrevia- 
tions or colored symbols for the monosaccharides 
are often used, as given in Table 7-1.) Following this 
convention for naming oligosaccharides, maltose is a-D- 
glucopyranosyl-(1^4)-D-glucopyranose. Because most 
sugars encountered in this book are the d enantiomers 
and the pyranose form of hexoses predominates, we 
generally use a shortened version of the formal name of 



TABLE 7-1 



Symbols and Abbreviations for 
Common Monosaccharides and Some 
of Their Derivatives 



Abequose 


Abe 


Glucuronic acid 


❖ GlcA 


Arabinose 


Ara 


Galactosamine 


□ GalN 


Fructose 


Fru 


Glucosamine 


H GlcN 


Fucose 


AFuc 


^-Acetylgalactosamine 


□ GalNAc 


Galactose 


O Gal 


AT-Acetylglucosamine 


■ GlcNAc 


Glucose 


• Glc 


Iduronic acid 


^ IdoA 


Mannose 


• Man 


Muramic acid 


Mur 


Rhamnose 


Rha 


A^Acetylmuramic acid 


Mur2Ac 


Ribose 


Rib 


AT-Acetymeuraminic 


♦ NeuSAc 


Xylose 


*Xyl 


acid (a sialic acid) 



Note: In a commonly used convention, hexoses are represented as circles, N-acetylhex- 
osamines as squares, and hexosamines as squares divided diagonally. All sugars with 
the "gluco" configuration are blue, those with the "galacto" configuration are yellow, and 
"manno" sugars are green. Other substituents can be added as needed: sulfate (S), 
phosphate (P), 0-acetyl (OAc), or 0-methyl (Ome). 



[244J Carbohydrates and Glycobiology 



such compounds, giving the configuration of the 
anomeric carbon and naming the carbons joined by 
the glycosidic bond. In this abbreviated nomenclature, 
maltose is Glc(a:l-»4)Glc. ■ 

The disaccharide lactose (Fig. 7-12), which yields 
D-galactose and D-glucose on hydrolysis, occurs naturally 
in milk. The anomeric carbon of the glucose residue is 
available for oxidation, and thus lactose is a reducing di- 
saccharide. Its abbreviated name is Gal(j31-»4)Glc. 
Sucrose (table sugar) is a disaccharide of glucose and 
fructose. It is formed by plants but not by animals. In 
contrast to maltose and lactose, sucrose contains no free 
anomeric carbon atom; the anomeric carbons of both 
monosaccharide units are involved in the glycosidic bond 
(Fig. 7-12). Sucrose is therefore a nonreducing sugar. 
In the abbreviated nomenclature, a double-headed 
arrow connects the symbols specifying the anomeric 
carbons and their configurations. For example, the ab- 
breviated name of sucrose is either Glc(al<^>2/3)Fru or 
Fru(/32<H»la)Glc. Sucrose is a major intermediate prod- 
uct of photosynthesis; in many plants it is the principal 
form in which sugar is transported from the leaves to 
other parts of the plant body. Trehalose, Glc(al«*la)Glc 
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FIGURE 7-12 Some common disaccharides. Like maltose in Figure 
7-11, these are shown as Haworth perspectives. The common name, 
full systematic name, and abbreviation are given for each disaccharide. 
Formal nomenclature for sucrose names glucose as the parent glyco- 
side, although it is typically depicted as shown, with glucose on the left. 



(Fig. 7-12)— a disaccharide of D-glucose that, like su- 
crose, is a nonreducing sugar— is a major constituent of 
the circulating fluid (hemolymph) of insects, serving as 
an energy-storage compound. Fungi also contain tre- 
halose and are used as a commercial source of this sugar. 

SUMMARY 7.1 Monosaccharides and 
Disaccharides 

■ Sugars (also called saccharides) are compounds 
containing an aldehyde or ketone group and two or 
more hydroxyl groups. 

■ Monosaccharides generally contain several chiral 
carbons and therefore exist in a variety of 
stereochemical forms, which may be represented 
on paper as Fischer projections. Epimers are sugars 
that differ in configuration at only one carbon atom. 

■ Monosaccharides commonly form internal 
hemiacetals or hemiketals, in which the aldehyde 
or ketone group joins with a hydroxyl group of the 
same molecule, creating a cyclic structure; this can 
be represented as a Haworth perspective formula. 
The carbon atom originally found in the aldehyde or 
ketone group (the anomeric carbon) can assume 
either of two configurations, a and jS, which are 
interconvertible by mutarotation. In the linear form, 
which is in equilibrium with the cyclic forms, the 
anomeric carbon is easily oxidized. 

■ A hydroxyl group of one monosaccharide can add to 
the anomeric carbon of a second monosaccharide to 
form an acetal. In this disaccharide, the glycosidic 
bond protects the anomeric carbon from oxidation. 

■ Oligosaccharides are short polymers of several 
monosaccharides joined by glycosidic bonds. At 
one end of the chain, the reducing end, is a 
monosaccharide unit with its anomeric carbon 
not involved in a glycosidic bond. 

■ The common nomenclature for di- or oligosaccharides 
specifies the order of monosaccharide units, the 
configuration at each anomeric carbon, and the 
carbon atoms involved in the glycosidic linkage (s). 

7.2 Polysaccharides 

Most carbohydrates found in nature occur as polysac- 
charides, polymers of medium to high molecular weight. 
Polysaccharides, also called glycans, differ from each 
other in the identity of their recurring monosaccharide 
units, in the length of their chains, in the types of bonds 
linking the units, and in the degree of branching. 
Homopolysaccharides contain only a single monomelic 
species; heteropolysaccharides contain two or more 
different kinds (Fig. 7-13). Some homopolysaccharides 
serve as storage forms of monosaccharides that are used 
as fuels; starch and glycogen are homopolysaccharides of 
this type. Other homopolysaccharides (cellulose and 
chitin, for example) serve as structural elements in plant 
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FIGURE 7-13 Homo- and heteropolysaccharides. Polysaccharides 
may be composed of one, two, or several different monosaccharides, 
in straight or branched chains of varying length. 

cell walls and animal exoskeletons. Heteropolysaccha- 
rides provide extracellular support for organisms of all 
kingdoms. For example, the rigid layer of the bacterial 
cell envelope (the peptidoglycan) is composed in part of 
a heteropolysaccharide built from two alternating mono- 
saccharide units. In animal tissues, the extracellular 



space is occupied by several types of heteropolysaccha- 
rides, which form a matrix that holds individual cells to- 
gether and provides protection, shape, and support to 
cells, tissues, and organs. 

Unlike proteins, polysaccharides generally do not 
have defining molecular weights. This difference is a 
consequence of the mechanisms of assembly of the two 
types of polymer. As we shall see in Chapter 27, proteins 
are synthesized on a template (messenger RNA) of de- 
fined sequence and length, by enzymes that follow the 
template exactly. For polysaccharide synthesis there is 
no template; rather, the program for polysaccharide 
synthesis is intrinsic to the enzymes that catalyze the 
polymerization of the monomeric units, and there is no 
specific stopping point in the synthetic process. 

Some Homopolysaccharides Are Stored Forms of Fuel 

The most important storage polysaccharides are starch 
in plant cells and glycogen in animal cells. Both polysac- 
charides occur intracellularly as large clusters or gran- 
ules. Starch and glycogen molecules are heavily 
hydrated, because they have many exposed hydroxyl 
groups available to hydrogen-bond with water. Most 
plant cells have the ability to form starch (see Fig. 20-2), 
and starch storage is especially abundant in tubers 
(underground stems), such as potatoes, and in seeds. 

Starch contains two types of glucose polymer, amy- 
lose and amylopectin (Fig. 7-14). The former consists 
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FIGURE 7-14 Glycogen and starch, (a) A short segment of amylose, a 
linear polymer of D-glucose residues in («1 ^4) linkage, A single chain 
can contain several thousand glucose residues. Amylopectin has 
stretches of similarly linked residues between branch points. Glycogen 
has the same basic structure, but has more branching than amylopectin. 
(b) An (a1 — >6) branch point of glycogen or amylopectin. (c) A cluster of 
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amylose and amylopectin like that believed to occur in starch granules. 
Strands of amylopectin (red) form double-helical structures with each 
other or with amylose strands (blue). Glucose residues at the nonreduc- 
ing ends of the outer branches are removed enzymatically during the 
mobilization of starch for energy production. Glycogen has a similar 
structure but is more highly branched and more compact. 
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of long, unbranched chains of D-glucose residues con- 
nected by (a 1— >4) linkages (as in maltose). Such chains 
vary in molecular weight from a few thousand to more 
than a million. Amylopectin also has a high molecular 
weight (up to 200 million) but unlike amylose is highly 
branched. The glycosidic linkages joining successive 
glucose residues in amylopectin chains are (al->4); the 
branch points (occurring every 24 to 30 residues) are 
(al— >6) linkages. 

Glycogen is the main storage polysaccharide of 
animal cells. Like amylopectin, glycogen is a polymer of 
(al^4)-linked subunits of glucose, with (al^6)-linked 
branches, but glycogen is more extensively branched 
(on average, every 8 to 12 residues) and more compact 
than starch. Glycogen is especially abundant in the liver, 
where it may constitute as much as 7% of the wet 
weight; it is also present in skeletal muscle. In hepato- 
cytes glycogen is found in large granules, which are 
themselves clusters of smaller granules composed of 
single, highly branched glycogen molecules with an 
average molecular weight of several million. Such glyco- 
gen granules also contain, in tightly bound form, the 
enzymes responsible for the synthesis and degradation 
of glycogen. 

Because each branch in glycogen ends with a nonre- 
ducing sugar unit, a glycogen molecule with n branches 
has n + 1 nonreducing ends, but only one reducing end. 
When glycogen is used as an energy source, glucose units 
are removed one at a time from the nonreducing ends. 
Degradative enzymes that act only at nonreducing ends 
can work simultaneously on the many branches, speeding 
the conversion of the polymer to monosaccharides. 

Why not store glucose in its monomeric form? It has 
been calculated that hepatocytes store glycogen equiva- 
lent to a glucose concentration of 0.4 m. The actual 
concentration of glycogen, which is insoluble and con- 
tributes little to the osmolarity of the cytosol, is about 
0.01 hm. If the cytosol contained 0.4 m glucose, the 
osmolarity would be threateningly elevated, leading to 
osmotic entry of water that might rupture the cell (see 
Fig. 2-12). Furthermore, with an intracellular glucose 
concentration of 0.4 m and an external concentration of 
about 5 rtiM (the concentration in the blood of a mam- 
mal), the free-energy change for glucose uptake into 
cells against this very high concentration gradient would 
be prohibitively large. 

Dextrans are bacterial and yeast polysaccharides 
made up of (a l->6) -linked poly-D-glucose; all have 
(al-^3) branches, and some also have (al— >2) or 
(a 1^4) branches. Dental plaque, formed by bacteria 
growing on the surface of teeth, is rich in dextrans. Syn- 
thetic dextrans are used in several commercial products 
(for example, Sephadex) that serve in the fractionation 
of proteins by size-exclusion chromatography (see 
Fig. 3-1 7b). The dextrans in these products are chemi- 
cally cross-linked to form insoluble materials of various 
porosities, admitting macromolecules of various sizes. 



Some Homopolysaccharides Serve Structural Roles 

Cellulose, a fibrous, tough, water-insoluble substance, is 
found in the cell walls of plants, particularly in stalks, 
stems, trunks, and all the woody portions of the plant 
body. Cellulose constitutes much of the mass of wood, 
and cotton is almost pure cellulose. Like amylose, the 
cellulose molecule is a linear, unbranched homopolysac- 
charide, consisting of 10,000 to 15,000 D-glucose units. 
But there is a very important difference: in cellulose the 
glucose residues have the (3 configuration (Fig. 7-15), 
whereas in amylose the glucose is in the a configuration. 
The glucose residues in cellulose are linked by (jSl-^4) 
glycosidic bonds, in contrast to the (al^4) bonds of 
amylose. This difference gives cellulose and amylose 
very different structures and physical properties. 

Glycogen and starch ingested in the diet are hy- 
drolyzed by a-amylases and glycosidases, enzymes in 
saliva and the intestine that break (a 1— »4) glycosidic 
bonds between glucose units. Most animals cannot use 
cellulose as a fuel source, because they lack an enzyme 
to hydrolyze the (/31->4) linkages. Termites readily di- 
gest cellulose (and therefore wood), but only because 
their intestinal tract harbors a symbiotic microorganism, 
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FIGURE 7-15 Cellulose, (a) Two units of a cellulose chain; the D-glu- 
cose residues are in (/31 ->4) linkage. The rigid chair structures can ro- 
tate relative to one another, (b) Scale drawing of segments of two 
parallel cellulose chains, showing the conformation of the D-glucose 
residues and the hydrogen-bond cross-links. In the hexose unit at the 
lower left, all hydrogen atoms are shown; in the other three hexose 
units, the hydrogens attached to carbon have been omitted for clarity, 
as they do not participate in hydrogen bonding. 
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FIGURE 7-16 Cellulose breakdown by wood fungi. A wood fungus 
growing on an oak log. All wood fungi have the enzyme cellulase, 
which breaks the (>31 — »4) glycosidic bonds in cellulose, so that wood 
is a source of metabolizable sugar (glucose) for the fungus. The only 
vertebrates able to use cellulose as food are cattle and other ruminants 
(sheep, goats, camels, giraffes). The extra stomach compartment 
(rumen) of a ruminant teems with bacteria and protists that secrete 
cellulase. 
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Trichonympha, that secretes cellulase, which hy- 
drolyzes the (01^4) linkages. Wood-rot fungi and bac- 
teria also produce cellulase (Fig. 7-16). 

Chitin is a linear homopolysaccharide composed 
of AT-acetylglucosamine residues in (/31— »4) linkage 
(Fig. 7-17). The only chemical difference from cellulose 
is the replacement of the hydroxyl group at C-2 with an 
acetylated amino group. Chitin forms extended fibers 
similar to those of cellulose, and like cellulose cannot 
be digested by vertebrates. Chitin is the principal com- 
ponent of the hard exoskeletons of nearly a million 
species of arthropods — insects, lobsters, and crabs, for 
example — and is probably the second most abundant 
polysaccharide, next to cellulose, in nature; an esti- 
mated 1 billion tons of chitin are produced each year in 
the biosphere! 



FIGURE 7-17 Chitin. (a) A short segment of chitin, a homopolymer of 
N-acetyl-D-glucosamine units in (J31— >4) linkage, (b) A spotted June 
beetle {Pelidnota punctata), showing its surface armor (exoskeleton) 
of chitin. 



Steric Factors and Hydrogen Bonding Influence 
Homopolysaccharide Folding 

The folding of polysaccharides in three dimensions fol- 
lows the same principles as those governing polypeptide 
structure: subunits with a more-or-less rigid structure 
dictated by covalent bonds form three-dimensional 
macromolecular structures that are stabilized by weak 
interactions within or between molecules: hydrogen 
bonds and hydrophobic and van der Waals interactions, 
and, for polymers with charged subunits, electrostatic 
interactions. Because polysaccharides have so many 
hydroxyl groups, hydrogen bonding has an especially 
important influence on their structure. Glycogen, starch, 
and cellulose are composed of pyranoside subunits 
(having six-membered rings), as are the oligosaccha- 
rides of glycoproteins and glycolipids to be discussed 
later. Such molecules can be represented as a series of 
rigid pyranose rings connected by an oxygen atom 
bridging two carbon atoms (the glycosidic bond). There 
is, in principle, free rotation about both C— 0 bonds 
linking the residues (Fig. 7-1 5a), but as in polypeptides 
(see Figs 4-2, 4-8), rotation about each bond is lim- 
ited by steric hindrance by substituents. The three- 
dimensional structures of these molecules can be 
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described in terms of the dihedral angles, <f) and ^ about 
the glycosidic bond (Fig. 7-18), analogous to angles 
<f> and ^ made by the peptide bond (see Fig. 4-2). 

The bulkiness of the pyranose ring and its sub- 
stituents, and electronic effects at the anomeric carbon, 
place constraints on the angles <f> and W\ thus certain 
conformations are much more stable than others, as can 
be shown on a map of energy as a function of <f> and V 
(Fig. 7-19). 

The most stable three-dimensional structure for the 
(al->4)-linked chains of starch and glycogen is a tightly 
coiled helix (Fig. 7-20), stabilized by interchain hydro- 
gen bonds. In amylose (with no branches) this structure 
is regular enough to allow crystallization and thus deter- 
mination of the structure by x-ray diffraction. The aver- 
age plane of each residue along the amylose chain forms 
a 60° angle with the average plane of the preceding 
residue, so the helical structure has six residues per 
turn. For amylose, the core of the helix is of precisely 
the right dimensions to accommodate iodine as complex 
ions (I 3- and I 5 ~), giving an intensely blue complex. This 
interaction is a common qualitative test for amylose. 

For cellulose, the most stable conformation is that in 
which each chair is turned 180° relative to its neighbors, 
yielding a straight, extended chain. All — OH groups are 
available for hydrogen bonding with neighboring chains. 
With several chains lying side by side, a stabilizing network 
of interchain and intrachain hydrogen bonds produces 
straight, stable supramolecular fibers of great tensile 
strength (Fig. 7-15b). This property of cellulose has made 
it a useful substance to civilizations for millennia. Many 
manufactured products, including papyrus, paper, card- 
board, rayon, insulating tiles, and a variety of other useful 
materials, are derived from cellulose. The water content of 
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FIGURE 7-18 Conformation at the glycosidic bonds of cellulose, amy- 
lose, and dextran. The polymers are depicted as rigid pyranose rings 
joined by glycosidic bonds, with free rotation about these bonds. Note 
that in dextran there is also free rotation about the bond between C-5 
and C-6 (torsion angle o) (omega)). 

these materials is low because extensive interchain hydro- 
gen bonding between cellulose molecules satisfies their 
capacity for hydrogen-bond formation. 
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FIGURE 7-19 A map of favored conformations for oligosaccharides and 
polysaccharides. The torsion angles ^ and <t> (see Fig. 7-1 8), which de- 
fine the spatial relationship between adjacent rings, can in principle 
have any value from 0° to 360°. In fact, some of the torsion angles would 
give conformations that are sterically hindered, whereas others give con- 
formations that maximize hydrogen bonding, (a) When the relative en- 
ergy (X) is plotted for each value of <p and % with isoenergy ("same 
energy") contours drawn at intervals of 1 kcal/mol above the minimum 
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energy state, the result is a map of preferred conformations. This is anal- 
ogous to the Ramachandran plot for peptides (see Figs 4-3, 4-8). 
(b) Two energetic extremes for the disaccharide Gal(j81-»3)Gal; these 
values fall on the energy diagram (a) as shown by the red and blue dots. 
The red dot indicates the least favored conformation, the blue dot the 
most favored conformation. The known conformations of the three poly- 
saccharides shown in Figure 7-18 have been determined by x-ray crys- 
tallography, and all fall within the lowest-energy regions of the map. 
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FIGURE 7-20 Starch (amylose). (a) In the most stable conformation, with 
adjacent rigid chairs, the polysaccharide chain is curved, rather than lin- 
ear as in cellulose (see Fig. 7-1 5). (b) A model of a segment of amylose; 
for clarity the hydroxy I groups have been omitted from all but one of the 
glucose residues. Compare the two residues shaded in pink with the 
chemical structures in (a). The conformation of »4) linkages in amy- 
lose, amylopectin, and glycogen causes these polymers to assume tightly 
coiled helical structures. These compact structures produce the dense 
granules of stored starch or glycogen seen in many cells (see Fig. 20-2). 

Bacterial and Algal Ceil Walls Contain 
Structural Heteropolysaccharides 

The rigid component of bacterial cell walls (peptidogly- 
can) is a heteropolymer of alternating (/3 1^4) -linked 
AT-acetylglucosamine and A^acetylmuramic acid residues 
(see Fig. 20-31). The linear polymers lie side by side in 
the cell wall, cross-linked by short peptides, the exact 
structure of which depends on the bacterial species. The 
peptide cross-links weld the polysaccharide chains into a 
strong sheath that envelops the entire cell and prevents 
cellular swelling and lysis due to the osmotic entry of 
water. The enzyme lysozyme kills bacteria by hydrolyzing 
the (/31— »4) glycosidic bond between AT-acetylglucosa- 
mine and Af-acetylmuramic acid (see Fig. 6-24) . 
Lysozyme is notably present in tears, presumably as a de- 
fense against bacterial infections of the eye. It is also pro- 
duced by certain bacterial viruses to ensure their release 
from the host bacterial cell, an essential step of the viral 
infection cycle. Penicillin and related antibiotics kill bacte- 
ria by preventing synthesis of the cross-links, leaving the 
cell wall too weak to resist osmotic lysis (see pp. 216-217). 

Certain marine red algae, including some of the sea- 
weeds, have cell walls that contain agar, a mixture of 
sulfated heteropolysaccharides made up of D-galactose 
and an L-galactose derivative ether-linked between C-3 
and C-6. Agar is a complex mixture of polysaccharides, 
all with the same backbone structure, but substituted 
to varying degrees with sulfate and pyruvate. Agarose 
(M T ~ 150,000) is the agar component with the fewest 
charged groups (sulfates, pyruvates) (Fig. 7-21). The 
remarkable gel-forming property of agarose makes it 
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FIGURE 7-21 Agarose. The repeating unit consists of D-galactose 
— >4)-l inked to 3,6-anhydro-i_-gaiactose (in which an ether bridge 
connects C-3 and C-6). These units are joined by (a1->3) glycosidic 
links to form a polymer 600 to 700 residues long. A small fraction of the 
3,6-anhydrogalactose residues have a sulfate ester at C-2 (as shown here). 

useful in the biochemistry laboratory. When a suspen- 
sion of agarose in water is heated and cooled, the 
agarose forms a double helix: two molecules in parallel 
orientation twist together with a helix repeat of three 
residues; water molecules are trapped in the central 
cavity. These structures in turn associate with each 
other to form a gel — a three-dimensional matrix that 
traps large amounts of water. Agarose gels are used as 
inert supports for the electrophoretic separation of 
nucleic acids, an essential part of the DNA sequencing 
process (p. 292). Agar is also used to form a surface for 
the growth of bacterial colonies. Another commercial 
use of agar is for the capsules in which some vitamins 
and drugs are packaged; the dried agar material dis- 
solves readily in the stomach and is metabolically inert. 

Glycosaminoglycans Are Heteropolysaccharides 
of the Extracellular Matrix 

The extracellular space in the tissues of multicellular 
animals is filled with a gel-like material, the extracellu- 
lar matrix (ECM), also called ground substance, 
which holds the cells together and provides a porous 
pathway for the diffusion of nutrients and oxygen 
to individual cells. The reticular ECM that surrounds fi- 
broblasts and other connective tissue cells is composed 
of an interlocking meshwork of heteropolysaccharides 
and fibrous proteins such as fibrillar collagens, elastin, 
and fibronectin. Basement membrane is a specialized 
ECM that underlies epithelial cells; it consists of special- 
ized collagens, laminin, and heteropolysaccharides. 
These heteropolysaccharides, the glycosaminogly- 
cans, are a family of linear polymers composed of 
repeating disaccharide units (Fig. 7-22). They are 
unique to animals and bacteria and are not found in 
plants. One of the two monosaccharides is always either 
A^-acetylglucosamine or AT-acetylgalactosamine; the 
other is in most cases a uronic acid, usually D-glucuronic 
or L-iduronic acid. Some glycosaminoglycans contain es- 
terified sulfate groups. The combination of sulfate 
groups and the carboxylate groups of the uronic acid 
residues gives glycosaininoglycans a very high density of 
negative charge. To minimize the repulsive forces among 
neighboring charged groups, these molecules assume an 
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extended conformation in solution, forming a rodlike he- 
lix in which the negatively charged carboxylate groups 
occur on alternate sides of the helix (as shown for he- 
parin in Fig. 7-22). The extended rod form also provides 
maximum separation between the negatively charged 
sulfate groups. The specific patterns of sulfated and non- 
sulfated sugar residues in glycosaminoglycans provide 
for specific recognition by a variety of protein ligands 
that bind electrostatically to these molecules. The sul- 
fated glycosaminoglycans are attached to extracellular 
proteins to form proteoglycans (Section 7.3). 



FIGURE 7-22 Repeating units of some common glycosaminoglycans 
of extracellular matrix. The molecules are copolymers of alternating 
uronic acid and amino sugar residues (keratan sulfate is the exception), 
with sulfate esters in any of several positions, except in hyaluronan. The 
ionized carboxylate and sulfate groups (red in the perspective formulas) 
give these polymers their characteristic high negative charge. Therapeu- 
tic heparin contains primarily iduronic acid (IdoA) and a smaller pro- 
portion of glucuronic acid (GlcA, not shown), and is generally highly 
sulfated and heterogeneous in length. The space-filling model shows a 
heparin segment as its solution structure, as determined by NMR spec- 
troscopy (PDB ID 1 HPN). The carbons in the iduronic acid sulfate are 
colored blue; those in glucosamine sulfate are green. Oxygen and 
sulfur atoms are shown in their standard colors of red and yellow, 
respectively. The hydrogen atoms are not shown (for clarity). Heparan 
sulfate (not shown) is similar to heparin but has a higher proportion of 
GlcA and fewer sulfate groups, arranged in a less regular pattern. 



The glycosaminoglycan hyaluronan (hyaluronic 
acid) contains alternating residues of D-glucuronic acid 
and AT-acetylglucosamine (Fig. 7-22). With up to 50,000 
repeats of the basic disaccharide unit, hyaluronan has a 
molecular weight of several million; it forms clear, 
highly viscous solutions that serve as lubricants in the 
synovial fluid of joints and give the vitreous humor of 
the vertebrate eye its jellylike consistency (the Greek 
hyalos means "glass"; hyaluronan can have a glassy or 
translucent appearance). Hyaluronan is also a compo- 
nent of the extracellular matrix of cartilage and ten- 
dons, to which it contributes tensile strength and 
elasticity as a result of its strong interactions with other 
components of the matrix. Hyaluronidase, an enzyme 
secreted by some pathogenic bacteria, can hydrolyze 
the glycosidic linkages of hyaluronan, rendering tissues 
more susceptible to bacterial invasion. In many species, 
a similar enzyme in sperm hydrolyzes an outer gly- 
cosaminoglycan coat around the ovum, allowing sperm 
penetration. 

Other glycosaminoglycans differ from hyaluronan in 
three respects: they are generally much shorter poly- 
mers, they are covalently linked to specific proteins 
(proteoglycans), and one or both monomeric units 
differ from those of hyaluronan. Chondroitin sulfate 
(Greek chondros, "cartilage") contributes to the tensile 
strength of cartilage, tendons, ligaments, and the walls 
of the aorta. Dermatan sulfate (Greek derma, "skin") 
contributes to the pliability of skin and is also present in 
blood vessels and heart valves. In this polymer, many of 
the glucuronate residues present in chondroitin sulfate 
are replaced by their 5-epimer, L-iduronate. 
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Keratan sulfates (Greek keras, "horn") have no uronic 
acid and their sulfate content is variable. They are present 
in cornea, cartilage, bone, and a variety of horny structures 
formed of dead cells: horn, hair, hoofs, nails, and claws. He- 
paran sulfate (Greek hepar, "liver") is produced by all an- 
imal cells and contains variable arrangements of sulfated 
and nonsulfated sugars. The sulfated segments of the chain 
allow it to interact with a large number of proteins, includ- 
ing growth factors and ECM components, as well as various 
enzymes and factors present in plasma. Heparin is a frac- 
tionated form of heparan sulfate derived mostly from 
mast cells (a type of leukocyte). Heparin is a therapeutic 
agent used to inhibit coagulation through its capacity to 
bind the protease inhibitor antithrombin. Heparin binding 
causes antithrombin to bind to and inhibit thrombin, a pro- 
tease essential to blood clotting. The interaction is strongly 
electrostatic; heparin has the highest negative charge den- 
sity of any known biological macromolecule (Fig. 7-23). 
Purified heparin is routinely added to blood samples 
obtained for clinical analysis, and to blood donated for 
transfusion, to prevent clotting. 

Table 7-2 summarizes the composition, properties, 
roles, and occurrence of the polysaccharides described 
in Section 7.2, 




FIGURE 7-23 Interaction between a glycosaminoglycan and its bind- 
ing protein. Fibroblast growth factor 1 (FGF1), its cell surface receptor 
(FGFR), and a short segment of a glycosaminoglycan (heparin) were 
co-crystallized to yield the structure shown here (PDB ID 1E0O). The 
proteins are represented as surface contour images, with color to rep- 
resent surface electrostatic potential: red, predominantly negative 
charge; blue, predominantly positive charge. Heparin is shown in a 
ball-and-stick representation, with the negative charges ( — SO3 and 
— COO") attracted to the positive (blue) surface of the FGF1 protein. 
Heparin was used in this experiment, but the glycosaminoglycan that 
binds FGF1 in vivo is heparan sulfate on the cell surface. 



TABLE 7-2 



Polymer 



Structures and Roles of Some Polysaccharide 



Type 



Repeating unit 1 



Size (number of 
monosaccharide 
units) 



Roles/significance 



Starch 
Amylose 
Amylopectin 



Glycogen 

Cellulose 
Chitin 
Dextran 
Peptidoglycan 

Agarose 

Hyaluronan (a 
glycosamino- 
glycan) 



Energy storage: in plants 



Homo- 
Homo- 



Homo- 

Homo- 

Homo- 

Homo- 

Hetero-; 
peptides 
attached 

Hetero- 

Hetero-; 
acidic 



(al— >4)Glc, linear 
(al_>4)Glc, with 
(al->6)Glc 
branches every 
24-30 residues 

(al->4)Gle, with 
(al->6)Glc 
branches every 
8-12 residues 

(/31->4)Glc 

(/31->4)GlcNAc 

(al->6)Glc, with 
(al— »3) branches 

4)Mur2Ac(j81^4) 
GlcNAc(j81 

3) D-Gal(01-K)3,6- 
anhydro-L-Gal(a 1 

4) GlcA(/31->3) 
GlcNAc(/31 



50-5,000 
Up to 10 6 

Up to 50,000 

Up to 15,000 
Very large 
Wide range 
Very large 

1,000 

Up to 100,000 



Energy storage: in bacteria and animal cells 



Structural: in plants, gives rigidity and 
strength to cell walls 

Structural: in insects, spiders, crustaceans, 

gives rigidity and strength to exoskeletons 
Structural: in bacteria, extracellular adhesive 

Structural: in bacteria, gives rigidity and 
strength to cell envelope 

Structural: in algae, cell wall material 

Structural: in vertebrates, extracellular matrix 
of skin and connective tissue; viscosity 
and lubrication in joints 



*Each polymer is classified as a homopolysaccharide (homo-) or hetero polysaccharide (hetero-). 

^he abbreviated names for the peptidoglycan, agarose, and hyaluronan repeating units indicate that the polymer contains repeats of this disaccha- 
ride unit. For example, in peptidoglycan, the GlcNAc of one disaccharide unit is (/31->4)-linked to the first residue of the next disaccharide unit. 
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SUMMARY 7.2 Polysaccharides 

■ Polysaccharides (glycans) serve as stored fuel 
and as structural components of cell walls and 
extracellular matrix. 

■ The homopolysaccharides starch and glycogen are 
stored fuels in plant, animal, and bacterial cells. 
They consist of D-glucose with (a l-*4) linkages, 
and both contain some branches 

■ The homopolysaccharides cellulose, chitin, and 
dextran serve structural roles. Cellulose, composed 
of (jS 1-^4) -linked D-glucose residues, lends strength 
and rigidity to plant cell walls. Chitin, a polymer of 
(jS 1^4) -linked Af-acetylglucosamine, strengthens 
the exoskeletons of arthropods. Dextran forms an 
adhesive coat around certain bacteria. 

■ Homopolysaccharides fold in three dimensions. The 
chair form of the pyranose ring is essentially rigid, 
so the conformation of the polymers is determined 
by rotation about the bonds from the rings to the 
oxygen atom in the glycosidic linkage. Starch and 
glycogen form helical structures with intrachain 
hydrogen bonding; cellulose and chitin form long, 
straight strands that interact with neighboring 
strands. 

■ Bacterial and algal cell walls are strengthened by 
heteropolysaccharides — peptidoglycan in bacteria, 
agar in red algae The repeating disaccharide in 
peptidoglycan is GlcNAc(/31^>4)Mur2Ac; in sugar, 
it is D-Gal(£l-^4)3,6-anhydro-L-Gal. 

■ Glycosaminoglycans are extracellular 
heteropolysaccharides in which one of the two 
monosaccharide units is a uronic acid (keratan 
sulfate is an exception) and the other an AT-acetylated 
amino sugar. Sulfate esters on some of the hydroxyl 
groups and on the amino group of some glucosamine 
residues in heparin and in heparan sulfate give 
these polymers a high density of negative charge, 
forcing them to assume extended conformations. 
These polymers (hyaluronan, chondroitin sulfate, 
dermatan sulfate, and keratan sulfate) provide 
viscosity, adhesiveness, and tensile strength to the 
extracellular matrix. 

73 Glycoconjugates: Proteoglycans, 
Glycoproteins, and Glycolipids 

In addition to their important roles as stored fuels 
(starch, glycogen, dextran) and as structural materials 
(cellulose, chitin, peptidoglycans), polysaccharides and 
oligosaccharides are information carriers. Some provide 
communication between cells and their extracellular 
surroundings; others label proteins for transport to and 
localization in specific organelles, or for destruction 
when the protein is malformed or superfluous; and oth- 
ers serve as recognition sites for extracellular signal 



molecules (growth factors, for example) or extracellular 
parasites (bacteria or viruses). On almost every eukary- 
otic cell, specific oligosaccharide chains attached to 
components of the plasma membrane form a carbohy- 
drate layer (the glycocalyx), several nanometers thick, 
that serves as an information-rich surface that a cell 
shows to its surroundings. These oligosaccharides are 
central players in cell-cell recognition and adhesion, cell 
migration during development, blood clotting, the im- 
mune response, wound healing, and other cellular 
processes. In most of these cases, the informational car- 
bohydrate is covalently joined to a protein or a lipid to 
form a glycoconjugate, which is the biologically active 
molecule. 

Proteoglycans are macromolecules of the cell 
surface or extracellular matrix in which one or more 
sulfated glycosaminoglycan chains are joined covalently 
to a membrane protein or a secreted protein. The gly- 
cosaminoglycan chain can bind to extracellular proteins 
through electrostatic interactions with the negatively 
charged groups on the polysaccharide. Proteoglycans 
are major components of all extracellular matrices. 

Glycoproteins have one or several oligosaccha- 
rides of varying complexity joined covalently to a pro- 
tein. They are usually found on the outer face of the 
plasma membrane (as part of the glycocalyx), in the 
extracellular matrix, and in the blood. Inside cells they 
are found in specific organelles such as Golgi complexes, 
secretory granules, and lysosomes. The oligosaccharide 
portions of glycoproteins are very heterogeneous and, 
like glycosaminoglycans, they are rich in information, 
forming highly specific sites for recognition and high- 
affinity binding by carbohydrate-binding proteins called 
lectins. Some cytosolic and nuclear proteins can be 
glycosylated as well. 

Glycolipids are membrane sphingolipids in which 
the hydrophilic head groups are oligosaccharides. As in 
glycoproteins, the oligosaccharides act as specific sites 
for recognition by lectins. The brain and neurons are 
rich in glycolipids, which help in nerve conduction and 
myelin formation. Glycolipids also play a role in signal 
transduction in cells. 

Proteoglycans Are Glycosaminoglycan-Containing 
Macromolecules of the Cell Surface and 
Extracellular Matrix 

Mammalian cells can produce 40 types of proteoglycans. 
These molecules act as tissue organizers, and they influ- 
ence various cellular activities, such as growth factor 
activation and adhesion. The basic proteoglycan unit con- 
sists of a "core protein" with covalently attached gly- 
cosaminoglycan (s). The point of attachment is a Ser 
residue, to which the glycosaminoglycan is joined through 
a tetrasaccharide bridge (Fig. 7-24). The Ser residue is 
generally in the sequence -Ser-Gly-X-Gly- (where X 
is any amino acid residue), although not every protein 
with this sequence has an attached glycosaminoglycan. 
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FIGURE 7-24 Proteoglycan structure, showing the tetrasaccharide 
bridge. A typical tetrasaccharide linker (blue) connects a glycosamino- 
glycan — in this case chondroitin 4-sulfate (orange) — to a Ser residue 
(pink) in the core protein. The xylose residue at the reducing end of 
the linker is joined by its anomeric carbon to the hydroxyl of the 
Ser residue. 

Many proteoglycans are secreted into the extracellular 
matrix, but some are integral membrane proteins (see 
Fig. 11-6). For example, the sheet-like extracellular ma- 
trix (basal lamina) that separates organized groups of 
cells from other groups contains a family of core proteins 
(M T 20,000 to 40,000), each with several covalently at- 
tached heparan sulfate chains. There are two major fam- 
ilies of membrane heparan sulfate proteoglycans. 
Syndecans have a single transmembrane domain and an 
extracellular domain bearing three to five chains of he- 
paran sulfate and in some cases chondroitin sulfate 
(Fig. 7-25a). Glypicans are attached to the membrane 
by a lipid anchor, a derivative of the membrane lipid 
phosphatidylinositol (Chapter 11). Both syndecans and 
glypicans can be shed into the extracellular space. A pro- 
tease in the ECM that cuts close to the membrane sur- 
face releases syndecan ectodomains (those domains 
outside the plasma membrane), and a phospholipase 
that breaks the connection to the membrane lipid 
releases glypicans. Numerous chondroitin sulfate and 
dermatan sulfate proteoglycans also exist, some as 
membrane-bound entities, others as secreted products 
in the ECM. 

The glycosaminoglycan chains can bind to a variety 
of extracellular ligands and thereby modulate the lig- 
ands' interaction with specific receptors of the cell sur- 
face. Detailed studies of heparan sulfate demonstrate a 
domain structure that is not random; some domains 
(typically 3 to 8 disaccharide units long) differ from 
neighboring domains in sequence and in ability to bind to 
specific proteins. Highly sulfated domains (called NS 
domains) alternate with domains having unmodified 
GlcNAc and GlcA residues (AT-acetylated, or NA, do- 
mains) (Fig. 7-25b). The exact pattern of sulfation in the 
NS domain depends on the particular proteoglycan; 
given the number of possible modifications of the 
GlcNAc-IdoA dimer, at least 32 different disaccharide 
units are possible. Furthermore, the same core protein 
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FIGURE 7-25 Two families of membrane proteoglycans, (a) Schematic 
diagrams of a syndecan and a glypican in the plasma membrane. Syn- 
decans are held in the membrane by hydrophobic interactions be- 
tween a sequence of nonpolar amino acid residues and plasma 
membrane lipids; they can be released by a single proteolytic cut near 
the membrane surface. In a typical syndecan, the extracellular amino- 
terminal domain is covalently attached (by tetrasaccharide linkers such 
as those in Fig. 7-24) to three heparan sulfate chains and two chon- 
droitin sulfate chains. Glypicans are held in the membrane by a 
covalently attached membrane lipid (GPI anchor; see Chapter 1 1 ), and 
are shed if the lipid-protein bond is cleaved by a phospholipase. All 
glypicans have 14 conserved Cys residues, which form disulfide bonds 
to stabilize the protein moiety and either two or three glycosamino- 
glycan chains attached near the carboxyl terminus, close to the mem- 
brane surface, (b) Along a heparan sulfate chain, regions rich in 
sulfated sugars, the NS domains (green), alternate with regions with 
chiefly unmodified residues of GlcNAc and GlcA, the NA domains 
(gray). One of the NS domains is shown in more detail, revealing a 
high density of modified residues: GlcNS (N-sulfoglucosamine), with a 
sulfate ester at C-6; and both GlcA and IdoA, with a sulfate ester at 
C-2. The exact pattern of sulfation in the NS domain differs among 
proteoglycans. 

can display different heparan sulfate structures when 
synthesized in different cell types. 

The NS domains bind specifically to extracellular 
proteins and signaling molecules to alter their activities. 
The change in activity may result from a conformational 
change in the protein that is induced by the binding 
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(a) Conformational activation 

- Factor Xa 




A conformational change induced in the protein 
antithrombin (AT) on binding a specific 
pentasaccharide NS domain allows its interaction 
with blood clotting factor Xa, preventing clotting. 



(b) Enhanced protein-protein interaction 

" "v-Vr^-"- Thrombin 





Binding of AT and thrombin to two adjacent 
NS domains brings the two proteins into close 
proximity, favoring their interaction, which inhibits 
blood clotting. 



(c) Coreceptor for extracellular ligands 

FGF(ligand) 

I NS domain 



- FGF receptor dimer 



L- Membrane 

NS domains interact with both the fibroblast growth 
factor (FGF) and its receptor, bringing the oligomeric 
complex together and increasing the effectiveness of 
a low concentration of FGF 



(d) Cell surface localization/concentration 

Lipoprotein lipase 

" * I _ 




NS domain 



Membrane 



The high density of negative charges in heparan sulfate 
attracts positively charged lipoprotein lipase 
molecules and holds them by electrostatic and 
sequence-specific interactions with NS domains. 



FIGURE 7-26 Four types of protein interactions with NS domains of heparan sulfate. 



(Fig. 7-26a), or it may be due to the ability of adjacent 
domains of heparan sulfate to bind to two different 
proteins, bringing them into close proximity and en- 
hancing protein-protein interactions (Fig. 7-26b). A 
third general mechanism of action is the binding of ex- 
tracellular signal molecules (growth factors, for exam- 
ple) to heparan sulfate, which increases their local 
concentrations and enhances their interaction with 
growth factor receptors in the cell surface; in this case, 
the heparan sulfate acts as a coreceptor (Fig. 7-26c). 
For example, fibroblast growth factor (FGF), an extra- 
cellular protein signal that stimulates cell division, first 
binds to heparan sulfate moieties of syndecan molecules 
in the target cell's plasma membrane. Syndecan pres- 
ents FGF to the FGF plasma membrane receptor, and 
only then can FGF interact productively with its recep- 
tor to trigger cell division. Finally, in another type of 
mechanism, the NS domains interact — electrostatically 
and otherwise — with a variety of soluble molecules out- 
side the cell, maintaining high local concentrations at 
the cell surface (Fig. 7-26d). 

The importance of correctly synthesizing sulfated 
domains in heparan sulfate is demonstrated in mutant 
("knockout") mice lacking the enzyme that sulfates the 
C-2 hydroxyl of IdoA. These animals are born without 
kidneys and with very severe developmental abnormali- 
ties of the skeleton and eyes. Other studies demonstrate 
that membrane proteoglycans are important in lipopro- 
tein clearance in the liver. There is growing evidence 
that the path taken by developing axons in the nervous 
system, and thus the wiring circuitry, is influenced by 



proteoglycans containing heparan sulfates and chon- 
droitin sulfate, which provide directional cues for axon 
outgrowth. 

Some proteoglycans can form proteoglycan ag- 
gregates, enormous supramolecular assemblies of 
many core proteins all bound to a single molecule of 
hyaluronan. Aggrecan core protein (M T -250,000) has 
multiple chains of chondroitin sulfate and keratan sul- 
fate, joined to Ser residues in the core protein through 
trisaccharide linkers, to give an aggrecan monomer of 
M r -2 X 10 6 . When a hundred or more of these "deco- 
rated" core proteins bind a single, extended molecule of 
hyaluronate (Fig. 7-27), the resulting proteoglycan 
aggregate (M T > 2 X 10 s ) and its associated water of hy- 
dration occupy a volume about equal to that of a bacter- 
ial cell! Aggrecan interacts strongly with collagen in the 
extracellular matrix of cartilage, contributing to the de- 
velopment, tensile strength, and resiliency of this 
connective tissue. 

Interwoven with these enormous extracellular pro- 
teoglycans are fibrous matrix proteins such as collagen, 
elastin, and fibronectin, forming a cross-linked mesh- 
work that gives the whole extracellular matrix strength 
and resilience. Some of these proteins are multiadhe- 
sive, a single protein having binding sites for several dif- 
ferent matrix molecules. Fibronectin, for example, has 
separate domains that bind fibrin, heparan sulfate, colla- 
gen, and a family of plasma membrane proteins called 
integrins that mediate signaling between the cell inte- 
rior and the extracellular matrix (see Fig. 12-28). The 
overall picture of cell-matrix interactions that emerges 
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Hyaluronan 
(up to 50,000 




FIGURE 7-27 Proteoglycan aggregate of the extracellular matrix. 

Schematic drawing of a proteoglycan with many aggrecan molecules. 
One very long molecule of hyaluronan is associated noncovalently with 
about 100 molecules of the core protein aggrecan. Each aggrecan mol- 
ecule contains many covalently bound chondroitin sulfate and keratan 
sulfate chains. Link proteins at the junction between each core protein 
and the hyaluronan backbone mediate the core protein-hyaluronan in- 
teraction. The micrograph shows a single molecule of aggrecan, viewed 
with the atomic force microscope (see Box 11-1). 

(Fig. 7-28) shows an array of interactions between cel- 
lular and extracellular molecules. These interactions 
serve not merely to anchor cells to the extracellular 
matrix but also to provide paths that direct the migration 
of cells in developing tissue and to convey information 
in both directions across the plasma membrane. 




r can 



Plasma membrane 



FIGURE 7-28 Interactions between cells and the extracellular matrix. 

The association between cells and the proteoglycan of the extracellu- 
lar matrix is mediated by a membrane protein (integrin) and by an ex- 
tracellular protein (fibronectin in this example) with binding sites for 
both integrin and the proteoglycan. Note the close association of col- 
lagen fibers with the fibronectin and proteoglycan. 

Glycoproteins Have Covalently Attached Oligosaccharides 

Glycoproteins are carbohydrate-protein conjugates in 
which the glycans are smaller, branched, and more 
structurally diverse than the glycosaminoglycans of 
proteoglycans. The carbohydrate is attached at its 
anomeric carbon through a glycosidic link to the — OH 
of a Ser or Thr residue (O-linked), or through an N-g\y- 
cosyl link to the amide nitrogen of an Asn residue (N- 
linked) (Fig. 7-29). Some glycoproteins have a single 



(a) O-linked 



FIGURE 7-29 Oligosaccharide linkages in glycopro- 
teins, (a) O-linked oligosaccharides have a glyco- 
sidic bond to the hydroxyl group of Ser or Thr residues 
(pink), illustrated here with GalNAc as the sugar at the 
reducing end of the oligosaccharide. One simple 
chain and one complex chain are shown, (b) N-linked 
oligosaccharides have an N-glycosyl bond to the 
amide nitrogen of an Asn residue (green), illustrated 
here with GlcNAc as the terminal sugar. Three com- 
mon types of oligosaccharide chains that are /V-linked 
in glycoproteins are shown. A complete description of 
oligosaccharide structure requires specification of the 
position and stereochemistry (a or jS) of each glyco- 
sidic linkage. 
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oligosaccharide chain, but many have more than one; 
the carbohydrate may constitute from 1% to 70% or 
more of the glycoprotein by mass. Mucins are secreted 
or membrane glycoproteins that can contain large num- 
bers of O-linked oligosaccharide chains. Mucins are 
present in most secretions; they give mucus its charac- 
teristic slipperiness. About half of all proteins of mam- 
mals are glycosylated, and about 1% of all mammalian 
genes encode enzymes involved in the synthesis and 
attachment of these oligosaccharide chains. Sequences 
for the attachment of O-linked chains tend to be rich in 
Gly, Val, and Pro residues. In contrast the attachment of 
AMinked chains depends on the consensus sequence 
N-{P}-[ST] (see Box 3-3 for the conventions on repre- 
senting consensus sequences). As with proteoglycans, 
not all potential sites are used. 

One class of glycoproteins found in the cytoplasm 
and the nucleus is unique in that the glycosylated posi- 
tions in the protein carry only single residues of 
AT-acetylglucosamine, in O-glycosidic linkage to the 
hydroxyl group of Ser side chains. This modification 
is reversible and often occurs on the same Ser residues 
that are phosphorylated at some stage in the protein's 
activity. The two modifications are mutually exclusive, 
and this type of glycosylation may prove to be important 
in the regulation of protein activity. We discuss it in the 
context of protein phosphorylation in Chapter 12. 

As we shall see in Chapter 11, the external surface 
of the plasma membrane has many membrane glycopro- 
teins with arrays of covalently attached oligosaccharides 
of varying complexity. The first well-characterized mem- 
brane glycoprotein was glycophorin A of the erythrocyte 
membrane (see Fig. 11-7). It contains 60% carbohy- 
drate by mass, in the form of 16 oligosaccharide chains 
(totaling 60 to 70 monosaccharide residues) covalently 
attached to amino acid residues near the amino termi- 
nus of the polypeptide chain. Fifteen of the oligosaccha- 
ride chains are O-linked to Ser or Thr residues, and one 
is AMinked to an Asn residue. 

Glycomics is the systematic characterization of all 
of the carbohydrate components of a given cell or tissue, 
including those attached to proteins and to lipids. For 
glycoproteins, this also means determining which pro- 
teins are glycosylated and where in the amino acid se- 
quence each oligosaccharide is attached. This is a 
challenging undertaking, but worthwhile because of the 
potential insights it offers into normal patterns of glyco- 
sylation and the ways in which they are altered during 
development or in genetic diseases or cancer. Current 
methods of characterizing the whole carbohydrate com- 
plement of cells depend heavily on sophisticated appli- 
cation of mass spectroscopy (see Fig. 7-37). 

The structures of a large number of O- and AMinked 
oligosaccharides from a variety of glycoproteins are 
known; Figure 7-29 shows a few typical examples. We 
consider the mechanisms by which specific proteins ac- 
quire specific oligosaccharide moieties in Chapter 27. 



Many of the proteins secreted by eukaryotic cells 
are glycoproteins, including most of the proteins of 
blood. For example, immunoglobulins (antibodies) and 
certain hormones, such as follicle-stimulating hormone, 
luteinizing hormone, and thyroid-stimulating hormone, 
are glycoproteins. Many milk proteins, including lactal- 
bumin, and some of the proteins secreted by the pan- 
creas (such as ribonuclease) are glycosylated, as are 
most of the proteins contained in lysosomes. 

The biological advantages of adding oligosaccha- 
rides to proteins are slowly being uncovered. The very 
hydrophilic clusters of carbohydrate alter the polarity 
and solubility of the proteins with which they are conju- 
gated. Oligosaccharide chains that are attached to newly 
synthesized proteins in the endoplasmic reticulum (ER) 
and elaborated in the Golgi complex serve as destination 
labels and also act in protein quality control, targeting 
misfolded proteins for degradation (see Fig. 27-39). 
When numerous negatively charged oligosaccharide 
chains are clustered in a single region of a protein, the 
charge repulsion among them favors the formation of an 
extended, rodlike structure in that region. The bulki- 
ness and negative charge of oligosaccharide chains also 
protect some proteins from attack by proteolytic en- 
zymes. Beyond these global physical effects on protein 
structure, there are also more specific biological effects 
of oligosaccharide chains in glycoproteins (Section 7.4). 
The importance of normal protein glycosylation is clear 
from the finding of at least 18 different genetic disorders 
of glycosylation in humans, all causing severely defec- 
tive physical or mental development; some of these dis- 
orders are fatal. 

Glycolipids and Lipopolysaccharides 
Are Membrane Components 

Glycoproteins are not the only cellular components that 
bear complex oligosaccharide chains; some lipids, too, 
have covalently bound oligosaccharides. Gangliosides 
are membrane lipids of eukaryotic cells in which the po- 
lar head group, the part of the lipid that forms the outer 
surface of the membrane, is a complex oligosaccharide 
containing a sialic acid (Fig. 7-9) and other monosac- 
charide residues. Some of the oligosaccharide moieties 
of gangliosides, such as those that determine human 
blood groups (see Fig. 10-15), are identical with those 
found in certain glycoproteins, which therefore also 
contribute to blood group type. Like the oligosaccharide 
moieties of glycoproteins, those of membrane lipids are 
generally, perhaps always, found on the outer face of the 
plasma membrane. 

I Lipopolysaccharides are the dominant surface 
BJfl feature of the outer membrane of gram-negative 
bacteria such as Escherichia coli and Salmonella 
typhimurium. These molecules are prime targets of 
the antibodies produced by the vertebrate immune sys- 
tem in response to bacterial infection and are therefore 
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FIGURE 7-30 Bacterial lipopolysaccharides. Schematic diagram of the 
lipopolysaccharide of the outer membrane of Salmonella typhimurium. 
Kdo is 3-deoxy-D-man/io-octulosonic acid (previously called ke- 
todeoxyoctonic acid); Hep is i-glycero-D-manno-heptose; AbeOAc is 
abequose (a 3,6-dideoxyhexose) acetylated on one of its hydroxyls. 
There are six fatty acid residues in the lipid A portion of the molecule 
Different bacterial species have subtly different lipopolysaccharide 
structures, but they have in common a lipid region (lipid A), a core 
oligosaccharide also known as endotoxin, and an "O-specific" chain, 
which is the principal determinant of the serotype (immunological 
reactivity) of the bacterium. The outer membranes of the gram-negative 
bacteria 5. typhimurium and E coli contain so many lipopolysaccharide 
molecules that the cell surface is virtually covered with O-specific 
chains. 

important determinants of the serotype of bacterial 
strains (serotypes are strains that are distinguished on 
the basis of antigenic properties). The lipopolysaccha- 
rides of S. typhimurium contain six fatty acids bound 
to two glucosamine residues, one of which is the point of 
attachment for a complex oligosaccharide (Fig. 7-30). 
E. coli has similar but unique lipopolysaccharides. The 
lipid A portion of the lipopolysaccharides of some bacte- 
ria is called endotoxin; its toxicity to humans and other 
animals is responsible for the dangerously lowered 
blood pressure that occurs in toxic shock syndrome re- 
sulting from gram-negative bacterial infections. ■ 



SUMMARY 7.3 Glycoconjugates: 

Proteoglycans, Glycoproteins, 
and Glycolipids 

■ Proteoglycans are glycoconjugates in which one 
or more large glycans, called sulfated 
glycosaminoglycans (heparan sulfate, chondroitin 
sulfate, dermatan sulfate, or keratan sulfate) are 
covalently attached to a core protein. Bound to 
the outside of the plasma membrane by a 
transmembrane peptide or a covalently attached 
lipid, proteoglycans provide points of adhesion, 
recognition, and information transfer between cells, 
or between the cell and the extracellular matrix. 

m Glycoproteins contain oligosaccharides covalently 
linked to Asp or Ser/Thr residues. The glycans 
are typically branched and smaller than 
glycosaminoglycans. Many cell surface or 
extracellular proteins are glycoproteins, as are 
most secreted proteins. The covalently attached 
oligosaccharides influence the folding and stability 
of the proteins, provide critical information about 
the targeting of newly synthesized proteins, and 
allow for specific recognition by other proteins. 

■ Glycomics is the determination of the full 
complement of sugar-containing molecules in a cell 
or tissue, and the determination of the function of 
each such molecule. 

■ Glycolipids in plants and animals and lipopoly- 
saccharides in bacteria are components of the cell 
envelope with covalently attached oligosaccharide 
chains exposed on the cell's outer surface. 

7.4 Carbohydrates as Informational 
Molecules: The Sugar Code 

Glycobiology, the study of the structure and function of 
glycoconjugates, is one of the most active and exciting 
areas of biochemistry and cell biology. As is becoming 
increasingly clear, cells use specific oligosaccharides to 
encode important information about intracellular tar- 
geting of proteins, cell-cell interactions, cell differentia- 
tion and tissue development, and extracellular signals. 
Our discussion uses just a few examples to illustrate the 
diversity of structure and the range of biological activity 
of the glycoconjugates. In Chapter 20 we discuss the 
biosynthesis of polysaccharides, including peptidogly- 
can; and in Chapter 27, the assembly of oligosaccharide 
chains on glycoproteins. 

Improved methods for the analysis of oligosaccha- 
ride and polysaccharide structure have revealed remark- 
able complexity and diversity in the oligosaccharides of 
glycoproteins and glycolipids. Consider the oligosaccha- 
ride chains in Figure 7-29, typical of those found in 
many glycoproteins. The most complex of those shown 
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contains 14 monosaccharide residues of four different 
kinds, variously linked as (1— >2), (l->3), (l->4), (l->6), 
(2— »3), and (2— »6), some with the a and some with the 
jS configuration. Branched structures, not found in nu- 
cleic acids or proteins, are common in oligosaccharides. 
With the reasonable assumption that 20 different mono- 
saccharide subunits are available for construction of 
oligosaccharides, we can calculate that many billions of 
different hexameric oligosaccharides are possible; this 
compares with 6.4 X 10 7 (20 6 ) different hexapeptides 
possible with the 20 common amino acids, and 4,096 (4 6 ) 
different hexanucleotides with the four nucleotide sub- 
units. If we also allow for variations in oligosaccharides 
resulting from sulfation of one or more residues, the 
number of possible oligosaccharides increases by two 
orders of magnitude. In reality, only a subset of possible 
combinations is found, given the restrictions imposed by 
the biosynthetic enzymes and the availability of precur- 
sors. Nevertheless, the enormously rich structural infor- 
mation in glycans does not merely rival but far surpasses 
that of nucleic acids in the density of information con- 
tained in a molecule of modest size. Each of the oligosac- 
charides represented in Figure 7-29 presents a unique, 
three-dimensional face — a word in the sugar code — 
readable by the proteins that interact with it. 

Lectins Are Proteins That Read the Sugar Code 
and Mediate Many Biological Processes 

Lectins, found in all organisms, are proteins that bind 
carbohydrates with high specificity and with moderate to 
high affinity (Table 7-3). Lectins serve in a wide variety 
of cell-cell recognition, signaling, and adhesion processes 
and in intracellular targeting of newly synthesized pro- 
teins. Plant lectins, abundant in seeds, probably serve as 



deterrents to insects and other predators. In the labora- 
tory, purified plant lectins are useful reagents for detect- 
ing and separating glycans and glycoproteins with 
different oligosaccharide moieties. Here we discuss just a 
few examples of the roles of lectins in animal cells. 

Some peptide hormones that circulate in the blood 
have oligosaccharide moieties that strongly influence 
their circulatory half-life. Luteinizing hormone and thy- 
rotropin (polypeptide hormones produced in the adre- 
nal cortex) have TV-linked oligosaccharides that end 
with the disaccharide GalNAc4S(/31— >4)GlcNAc, which 
is recognized by a lectin (receptor) of hepatocytes. 
(GalNAc4S is AT-acetylgalactosamine sulfated on the 
— OH group at C-4.) Receptor-hormone interaction me- 
diates the uptake and destruction of luteinizing hor- 
mone and thyrotropin, reducing their concentration in 
the blood. Thus the blood levels of these hormones un- 
dergo a periodic rise (due to pulsatile secretion by the 
adrenal cortex) and fall (due to continual destruction 
by hepatocytes). 

The residues of NeuSAc (a sialic acid) situated at 
the ends of the oligosaccharide chains of many plasma 
glycoproteins (Fig. 7-29) protect those proteins from 
uptake and degradation in the liver. For example, ceru- 
loplasmin, a copper-containing serum glycoprotein, has 
several oligosaccharide chains ending in Neu5Ac. The 
mechanism that removes sialic acid residues from serum 
glycoproteins is unclear. It may be due to the activity of 
the enzyme sialidase (also called neuraminidase) pro- 
duced by invading organisms or to a steady, slow release 
by extracellular enzymes. The plasma membrane of he- 
patocytes has lectin molecules (asialoglycoprotein re- 
ceptors; "asialo-" indicating "without sialic acid") that 
specifically bind oligosaccharide chains with galactose 
residues no longer "protected" by a terminal NeuSAc 
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Some Lectins and the Oligosaccharide LigandsThey Bind 



Lectin source and lectin 



Abbreviation 



Ligand(s) 



Plant 

Concanavalin A 

Griffonia simplicifolia lectin 4 
Wheat germ agglutinin 

Ricin 
Animal 

Galectin-1 

Mannose-binding protein A 
Viral 

Influenza virus hemagglutinin 

Polyoma virus protein 1 

Bacterial 

Enterotoxin 

Cholera toxin 



ConA 

GS4 

WGA 



MBP-A 

HA 
VP1 

LT 
CT 



Manal— OCH 3 

Lewis b (Le b ) tetrasaccharide 
Neu5Ac(a2->3)Gal(/31-*4)Glc 
GlcNAc(j61->4)GlcNAc 
Gal(jS1^4)Glc 

Gal(/31^4)Glc 

High-mannose octasaccharide 

NeuSAc 02-»6) Gal( p l->4) Glc 
Neu5Ac(a2->3)Gal(/31-*4)Glc 

Gal 

GM1 pentasaccharide 



Source: Weiss, W.I. & Drickamer, K. (1996) Structural basis of lectin-carbohydrate recognition. Annu. Rev. Biochem. 65, 441-473. 
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residue. Receptor-ceruloplasmin interaction triggers en- 
docytosis and destruction of the ceruloplasmin. 



H 3 C— C 




A^Acetylneuraminic acid (NeuSAc) 
(a sialic acid) 



A similar mechanism is apparently responsible for re- 
moving "old" erythrocytes from the mammalian blood- 
stream. Newly synthesized erythrocytes have several 
membrane glycoproteins with oligosaccharide chains that 
end in Neu5Ac. When the sialic acid residues are removed 
by withdrawing a sample of blood from experimental ani- 
mals, treating it with sialidase in vitro, and reintroducing it 
into the circulation, the treated erythrocytes disappear 
from the bloodstream within a few hours; erythrocytes 
with intact oligosaccharides (withdrawn and reintroduced 
without sialidase treatment) continue to circulate for days. 
Cell surface lectins are important in the develop- 
ment of some human diseases — both human lectins 
and the lectins of infectious agents. Selectins are a family 
of plasma membrane lectins that mediate cell-cell recogni- 
tion and adhesion in a wide range of cellular processes. 
One such process is the movement of immune cells (neu- 
trophils) through the capillary wall, from blood to tissues, 
at sites of infection or inflammation (Fig. 7-31). At an 
infection site, P-selectin on the surface of capillary 
endothelial cells interacts with a specific oligosaccharide 
of the glycoproteins of circulating neutrophils. This inter- 
action slows the neutrophils as they adhere to and roll 
along the endothelial lining of the capillaries. A second 
interaction, between integrin molecules (p. 455) in the 
neutrophil plasma membrane and an adhesion protein on 
the endothelial cell surface, now stops the neutrophil and 
allows it to move through the capillary wall into the 
infected tissues to initiate the immune attack. Two other 
selectins participate in this "lymphocyte homing": E- 
selectin on the endothelial cell and L-selectin on the neu- 
trophil bind their cognate oligosaccharides on the 
neutrophil and endothelial cell, respectively. 

Human selectins mediate the inflammatory re- 
sponses in rheumatoid arthritis, asthma, psoriasis, mul- 
tiple sclerosis, and the rejection of transplanted organs, 
and thus there is great interest in developing drugs that 
inhibit selectin-mediated cell adhesion. Many carcino- 
mas express an antigen normally present only in fetal 
cells (sialyl Lewis x, or sialyl Le x ) that, when shed into 
the circulation, facilitates tumor cell survival and metas- 
tasis. Carbohydrate derivatives that mimic the sialyl 
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FIGURE 7-31 Role of lectin-ligand interactions in lymphocyte move- 
ment to the site of an infection or injury. A neutrophil circulating 
through a capillary is slowed by transient interactions between P-selectin 
molecules in the plasma membrane of the capillary endothelial cells 
and glycoprotein ligands for P-selectin on the neutrophil surface. As it 
interacts with successive P-selectin molecules, the neutrophil rolls 
along the capillary surface. Near a site of inflammation, stronger inter- 
actions between integrin in the capillary surface and its ligand in the 
neutrophil surface lead to tight adhesion. The neutrophil stops rolling 
and, under the influence of signals sent out from the site of inflamma- 
tion, begins extravasation — escape through the capillary wall — as it 
moves toward the site of inflammation. 

Le x portion of sialoglycoproteins or that alter the 
biosynthesis of the oligosaccharide might prove effective 
as selectin-specific drugs for treating chronic inflamma- 
tion or metastatic disease. 

Several animal viruses, including the influenza virus, 
attach to their host cells through interactions with 
oligosaccharides displayed on the host cell surface. The 
lectin of the influenza virus, known as the HA (hemagglu- 
tinin) protein, is essential for viral entry and infection. Af- 
ter the virus has entered a host cell and has been 
replicated, the newly synthesized viral particles bud out of 
the cell, wrapped in a portion of its plasma membrane. A vi- 
ral sialidase (neuraminidase) trims the terminal sialic acid 
residue from the host cell's oligosaccharides, releasing the 
viral particles from their interaction with the cell and pre- 
venting their aggregation with one another. Another 
round of infection can now begin. The antiviral drugs os- 
eltamivir (Tamiflu) and zanamivir (Relenza) (next page) 
are used clinically in the treatment of influenza. These 
drugs are sugar analogs; they inhibit the viral sialidase by 
competing with the host cell's oligosaccharides for binding. 
This prevents the release of viruses from the infected cell 
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and also causes viral particles to aggregate, both of which 
block another cycle of infection. 




COOH 



Oseltamivir Zanamivir 
(Tamiflu) (Relenza) 



Lectins on the surface of the herpes simplex viruses 
HSV-1 and HSV-2 (the causative agents of oral and gen- 
ital herpes, respectively) bind specifically to heparan 
sulfate on the host cell surface as a first step in the in- 
fection cycle; infection requires precisely the right pat- 
tern of sulfation on this polymer. Analogs of heparan 
sulfate that mimic its interaction with the viruses are be- 
ing investigated as possible antiviral drugs, interfering 
with interactions between virus and cell. 

Some microbial pathogens have lectins that mediate 
bacterial adhesion to host cells or the entry of toxin into 
cells. For example, Helicobacter pylori — shown by 
Barry J. Marshall and J. Robin Warren in the 1980s to 
be responsible for most gastric ulcers — adheres to the 
inner surface of the stomach as bacterial membrane 
lectins interact with specific oligosaccharides of mem- 
brane glycoproteins of the gastric epithelial cells 
(Fig. 7-32). Among the binding sites recognized byH. 
pylori is the oligosaccharide Lewis b (Le b ), when it is 
part of the type 0 blood group determinant. This obser- 
vation helps to explain the severalfold greater incidence 
of gastric ulcers in people of blood type O than in those 
of type A or B. Chemically synthesized analogs of the Le b 
oligosaccharide may prove useful in treating this type of 
ulcer. Administered orally, they could prevent bacterial 
adhesion (and thus infection) by competing with the 
gastric glycoproteins for binding to the bacterial lectin. 




FIGURE 7-32 An ulcer in the making. Helicobacter pylori cells adher- 
ing to the gastric surface. This bacterium causes ulcers by interactions 
between a bacterial surface lectin and the Le b oligosaccharide (a blood 
group antigen) of the gastric epithelium, 




Barry J. Marshall J. Robin Warren 



Some of the most devastating of the human parasitic 
diseases, widespread in much of the developing world, 
are caused by eukaryotic microorganisms that display 
unusual surface oligosaccharides, which in some cases 
are known to be protective for the parasites. These or- 
ganisms include the trypanosomes, responsible for 
African sleeping sickness and Chagas disease; Plasmo- 
dium falciparum, the malaria parasite; and Entamoeba 
histolytica, the causative agent of amoebic dysentery. 
The prospect of finding drugs that interfere with the syn- 
thesis of these unusual oligosaccharide chains, and 
therefore with the replication of the parasites, has in- 
spired much recent work on the biosynthetic pathways 
of these oligosaccharides. 

The cholera toxin molecule (produced by the bac- 
terium Vibrio cholerae) triggers diarrhea after entering 
intestinal cells responsible for water absorption from 
the intestine. The toxin attaches to its target cell 
through the oligosaccharide moiety of ganglioside GM1, 
a membrane phospholipid (for the structure of GM1 see 
Box 10-2, Fig. 1), on the surface of intestinal epithelial 
cells. Similarly, the pertussis toxin produced by Borde- 
tella pertussis, the bacterium that causes whooping 
cough, enters target cells only after interacting with a 
host cell oligosaccharide (or perhaps several oligosac- 
charides) bearing a terminal sialic acid residue. Under- 
standing the details of the oligosaccharide-binding sites 
of these toxins (lectins) may allow the development of 
genetically engineered toxin analogs for use in vaccines. 
Toxin analogs engineered to lack the carbohydrate bind- 
ing site would be harmless because they could not bind 
to and enter cells, but they might elicit an immune re- 
sponse that would protect against later exposure to the 
natural toxin. It is also possible to imagine drugs that 
would act by mimicking cell surface oligosaccharides, 
binding to the bacterial lectins or toxins and preventing 
their productive binding to cell surfaces. 

Lectins also act intracellularly An oligosaccharide 
containing mannose 6-phosphate marks newly synthe- 
sized proteins in the Golgi complex for transfer to the 
lysosome (see Fig. 27-39). A common structural feature 
on the surface of these glycoproteins, the signal patch, 
is recognized by an enzyme that phosphorylates (in a 
two-step process) a mannose residue at the terminus 
of an oligosaccharide chain. The resulting mannose 
6-phosphate residue is then recognized by the cation- 
dependent mannose 6-phosphate receptor, a membrane- 
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associated lectin with its mannose phosphate binding 
site on the lumenal side of the Golgi complex. When a 
section of the Golgi complex containing this receptor 
buds off to form a transport vesicle, proteins containing 
mannose phosphate residues are dragged into the form- 
ing bud by interaction of their mannose phosphates with 
the receptor; the vesicle then moves to and fuses with a 
lysosome, depositing its cargo therein. Many, perhaps 
all, of the degradative enzymes (hydrolases) of the lyso- 
some are targeted and delivered by this mechanism. 
Some of the mannose 6-phosphate receptors can cap- 
ture enzymes containing the mannose 6-phosphate 
residue and direct them to the lysosome. This process is 
the basis for "enzyme replacement therapy" to correct 
lysosomal storage disorders in humans. ■ 

Other lectins act in other kinds of protein sorting. 
Any newly synthesized protein in the endoplasmic reticu- 
lum already has a complex oligosaccharide attached, 
which can be bound by either of two ER lectins that are 
also chaperones: calnexin (membrane-bound) or cal- 
reticulin (soluble). These lectins link the new protein 
with an enzyme that brings about rapid disulfide ex- 
change as the protein tries various ways to fold, leading 
eventually to the native conformation. At this point, en- 
zymes in the ER trim the oligosaccharide moiety to a form 
recognized by another lectin, ERGIC53, which draws the 
folded protein (glycoprotein) into the Golgi complex for 
further maturation. If a protein has not folded effectively, 
the oligosaccharide is trimmed to another form, this one 
recognized by a lectin, EDEM, that initiates movement of 
the defectively folded protein into the cytosol, where it 
will be degraded. Thus, protein glycosylation serves in 
the ER as a kind of quality control signal, allowing the cell 
to eliminate improperly folded proteins. (This process is 
described in greater detail in Chapter 27.) 



Lectin-Carbohydrate Interactions Are Highly Specific 
and Often Polyvalent 

In all the functions of lectins described above, and in many 
more known to involve lectin-oligosaccharide interactions, 
it is essential that the oligosaccharide have a unique struc- 
ture, so that recognition by the lectin is highly specific. 
The high density of information in oligosaccharides pro- 
vides a sugar code with an essentially unlimited number of 
unique "words" small enough to be read by a single pro- 
tein. In their carbohydrate-binding sites, lectins have a 
subtle molecular complementarity that allows interaction 
only with their correct carbohydrate cognates. The result 
is an extraordinarily high specificity in these interactions. 
The affinity between an oligosaccharide and each carbo- 
hydrate binding domain (CBD) of a lectin is sometimes 
modest (micromolar to millimolar K d values), but the ef- 
fective affinity is in many cases greatly increased by lectin 
multivalency, in which a single lectin molecule has multi- 
ple CBDs. In a cluster of oligosaccharides — as is com- 
monly found on a membrane surface, for example — each 
oligosaccharide can engage one of the lectin's CBDs, 
strengthening the interaction. When cells express multi- 
ple receptors, the avidity of the interaction can be very 
high, enabling highly cooperative events such as cell at- 
tachment and rolling (see Fig. 7-31). 

X-ray crystallographic studies of the structures of 
several lectin-carbohydrate complexes have provided 
rich details of the lectin-sugar interaction (Fig. 7-33). In 
humans, a family of 1 1 lectins that bind to oligosaccharide 
chains ending in sialic acid residues plays some important 
biological roles. All of these lectins bind sialic acids at /3 
sandwich domains like those found in immunoglobulins 
(Igs; see this motif in the CD8 protein in Fig. 4-21), and 
the proteins are therefore called siglecs 1 to 11 (sialic 




FIGURE 7-33 Details of a lectin-carbohydrate interaction. Structure 
of the bovine mannose 6-phosphate receptor complexed with man- 
nose 6-phosphate (PDB ID 1 M6P). The protein is represented as a sur- 
face contour image, showing the surface as predominantly negatively 
charged (red) or positively charged (blue). Mannose 6-phosphate is 
shown as a stick structure; a manganese ion is shown in violet, (b) An 
enlarged view of the binding site. Mannose 6-phosphate is hydrogen- 



bonded to Arg 111 and coordinated with the manganese ion (shown 
smaller than its van der Waals radius for clarity). Each hydroxyl group 
of mannose is hydrogen-bonded to the protein. The His 105 hydrogen- 
bonded to a phosphate oxygen of mannose 6-phosphate may be the 
residue that, when protonated at low pH, causes the receptor to re- 
lease mannose 6-phosphate into the lysosome. 
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acid-recognizing 7#-superfamily lectins), or sometimes 
sialoadhesins. The interaction of a siglec with sialic acid 
(NeuSAc) involves each of the ring substituents unique to 
NeuSAc: the acetyl group at C-5 undergoes both hydrogen- 
bond and van der Waals interactions with the protein; 
the carboxyl group makes a salt bridge with a conserved 
Arg residue; and the hydroxyls of the glycerol moiety 
hydrogen-bond with the protein. Siglecs regulate activi- 
ties in the immune and nervous systems and in blood 
cell development. Siglec-7, for example, by binding to a 
specific ganglioside (GD3) containing two sialic acid 
residues, suppresses the activity of NK (natural killer) 
cells in the immune system, sparing cells targeted for 
immune destruction from the NK killing activity. The 
elevated GD3 levels in tumors such as malignant 
melanoma and neuroblastoma may be a mechanism for 
evading the protective action of the immune system. 

The structure of the mannose 6-phosphate recep- 
tor/lectin reveals details of its interaction with mannose 
6-phosphate that explain the specificity of the binding 
and the role for a divalent cation in the lectin-sugar 
interaction (Fig. 7-33a). His 105 is hydrogen-bonded to 
one of the oxygen atoms of the phosphate (Fig. 7-33b). 
When the protein tagged with mannose 6-phosphate 
reaches the lysosome (which has a lower internal pH 
than the Golgi complex), the receptor loses its affinity 
for mannose 6-phosphate. Protonation of His 105 may be 
responsible for this change in binding. 

In addition to these very specific interactions, there 
are more general interactions that contribute to the 
binding of many carbohydrates to their lectins. For ex- 
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FIGURE 7-34 Hydrophobic interactions of sugar residues. Sugar units 
such as galactose have a more polar side (the top of the chair as shown 
here, with the ring oxygen and several hydroxyls) that is available to 
hydrogen-bond with the lectin, and a less polar side that can have hy- 
drophobic interactions with nonpolar side chains in the protein, such 
as the indole ring of Trp residues. 

ample, many sugars have a more polar and a less polar 
side (Fig. 7-34); the more polar side hydrogen-bonds 
with the lectin, while the less polar undergoes hydropho- 
bic interactions with nonpolar amino acid residues. The 
sum of all these interactions produces high-affinity bind- 
ing and high specificity of lectins for their carbohydrates. 
This represents a kind of information transfer that is 
clearly central in many processes within and between 
cells. Figure 7-35 summarizes some of the biological in- 
teractions mediated by the sugar code. 
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FIGURE 7-35 Roles of oligosaccharides in recognition 
and adhesion at the cell surface, (a) Oligosaccharides 
with unique structures (represented as strings of hexa- 
gons), components of a variety of glycoproteins or gly- 
col ipids on the outer surface of plasma membranes, 
interact with high specificity and affinity with lectins in 
the extracellular milieu, (b) Viruses that infect animal 
cells, such as the influenza virus, bind to cell surface 
glycoproteins as the first step in infection, (c) Bacterial 
toxins, such as the cholera and pertussis toxins, bind to 
a surface glycolipid before entering a cell, (d) Some bac- 
teria, such as H. pylori, adhere to and then colonize or 
infect animal cells, (e) Selectins (lectins) in the plasma 
membrane of certain cells mediate cell-cell interactions, 
such as those of neutrophils with the endothelial cells of 
the capillary wall at an infection site. (f)The mannose 6- 
phosphate receptor/lectin of the trans Golgi complex 
binds to the oligosaccharide of lysosomal enzymes, tar- 
geting them for transfer into the lysosome. 
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SUMMARY 7.4 Carbohydrates as 

Informational Molecules: 
The Sugar Code 

■ Monosaccharides can be assembled into an almost 
limitless variety of oligosaccharides, which differ in 
the stereochemistry and position of glycosidic 
bonds, the type and orientation of substituent 
groups, and the number and type of branches. 
Glycans are far more information-dense than 
nucleic acids or proteins. 

■ Lectins, proteins with highly specific carbohydrate- 
binding domains, are commonly found on the outer 
surface of cells, where they initiate interaction with 
other cells. In vertebrates, oligosaccharide tags 
"read" by lectins govern the rate of degradation of 
certain peptide hormones, circulating proteins, and 
blood cells. 

■ Bacterial and viral pathogens and some eukaryotic 
parasites adhere to their animal-cell targets 

by the binding of lectins in the pathogens to 
oligosaccharides on the target cell surface. 

■ Intracellular lectins mediate intracellular protein 
targeting to specific organelles or to the secretory 
pathway 

■ X-ray crystallography of lectin-sugar complexes 
shows the detailed complementarity between the 
two molecules, which accounts for the strength and 
specificity of lectin interactions with carbohydrates. 

7.5 Working with Carbohydrates 

The growing appreciation of the importance of oligosac- 
charide structure in biological recognition has been the 
driving force behind the development of methods for an- 
alyzing the structure and stereochemistry of complex 
oligosaccharides. Oligosaccharide analysis is compli- 
cated by the fact that, unlike nucleic acids and proteins, 
oligosaccharides can be branched and are joined by a va- 
riety of linkages. The high charge density of many 
oligosaccharides and polysaccharides, and the relative 
lability of the sulfate esters in glycosaminoglycans, pres- 
ent further difficulties. 

For simple, linear polymers such as amylose, the posi- 
tions of the glycosidic bonds are determined by the classical 
method of exhaustive methylation: treating the intact poly- 
saccharide with methyl iodide in a strongly basic medium to 
convert all free hydroxyls to acid-stable methyl ethers, then 
hydrolyzing the methylated polysaccharide in acid. The 
only free hydroxyls present in the monosaccharide deriva- 
tives so produced are those that were involved in glycosidic 
bonds. To determine the sequence of monosaccharide 
residues, including any branches that are present, exogly- 
cosidases of known specificity are used to remove residues 
one at a time from the nonreducing end(s). The known 
specificity of these exoglycosidases often allows deduction 
of the position and stereochemistry of the linkages. 



For analysis of the oligosaccharide moieties of 
glycoproteins and glycolipids, the oligosaccharides are 
released by purified enzymes — glycosidases that specif- 
ically cleave O- or AT-linked oligosaccharides or lipases 
that remove lipid head groups. Alternatively, O-linked 
glycans can be released from glycoproteins by treat- 
ment with hydrazine. 

The resulting mixtures of carbohydrates are re- 
solved into their individual components by a variety of 
methods (Fig. 7-36), including the same techniques 
used in protein and amino acid separation: fractional 
precipitation by solvents, and ion-exchange and size- 
exclusion chromatography (see Fig. 3-17). Highly puri- 
fied lectins, attached covalently to an insoluble support, 
are commonly used in affinity chromatography of carbo- 
hydrates (see Fig. 3-1 7c). 

Hydrolysis of oligosaccharides and polysaccharides 
in strong acid yields a mixture of monosaccharides, 
which may be identified and quantified by chromato- 
graphic techniques to yield the overall composition of 
the polymer. 

Oligosaccharide analysis relies increasingly on mass 
spectrometry and high-resolution NMR spectroscopy. 
Matrix-assisted laser desorption/ionization mass spec- 
trometry (MALDI MS) and tandem mass spectrometry 
(MS/MS), both described in Box 3-2, are readily appli- 
cable to polar compounds such as oligosaccharides. 
MALDI MS is a very sensitive method for determining 
the mass of a molecular ion (in this case, the entire 
oligosaccharide chain; Fig. 7-37). MS/MS reveals the 
mass of the molecular ion and many of its fragments, 
which are usually the result of breakage of the glycosidic 
bonds. NMR analysis alone (see Box 4-5), especially for 
oligosaccharides of moderate size, can yield much infor- 
mation about sequence, linkage position, and anomeric 
carbon configuration. For example, the structure of the 
heparin segment shown as a space-filling model in 
Figure 7-22 was obtained entirely by NMR spectroscopy. 
Automated procedures and commercial instruments are 
used for the routine determination of oligosaccharide 
structure, but the sequencing of branched oligosaccha- 
rides joined by more than one type of bond remains a far 
more formidable task than determining the linear se- 
quences of proteins and nucleic acids. 

Another important tool in working with carbohy- 
drates is chemical synthesis, which has proved to be a 
powerful approach to understanding the biological func- 
tions of glycosaminoglycans and oligosaccharides. The 
chemistry involved in such syntheses is difficult, but car- 
bohydrate chemists can now synthesize short segments 
of almost any glycosaminoglycan, with correct stereo- 
chemistry, chain length, and sulfation pattern, and 
oligosaccharides significantly more complex than those 
shown in Figure 7-29. Solid-phase oligosaccharide syn- 
thesis is based on the same principles (and has the same 
advantages) as peptide synthesis (see Fig. 3-29), but 
requires a set of tools unique to carbohydrate chemistry: 
blocking groups and activating groups that allow the syn- 
thesis of glycosidic linkages with the correct hydroxyl 
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FIGURE 7-36 Methods of carbohydrate analysis. A carbohydrate purified in the first stage of the analysis 
often requires all four analytical routes for its complete characterization, 
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FIGURE 7-37 Separation and quantification of the oligosaccharides in 
a group of glycoproteins. In this experiment the mixture of proteins ex- 
tracted from kidney tissue was treated to release oligosaccharides from 
glycoproteins, and the oligosaccharides were analyzed by matrix- 
assisted laser desorption/ionization mass spectrometry (MALDI MS). 



Each distinct oligosaccharide produces a peak at its molecular mass, and 
the area under the curve reflects the quantity of that oligosaccharide. The 
most prominent oligosaccharide here (mass 2837.4 u) is composed of 
13 sugar residues; in this sample, other oligosaccharides containing as 
few as 7 and as many as 19 residues are also resolved by this method. 
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group. Highly purified enzymes (glycosyltransf erases) 
should greatly aid in the preparation of pure synthetic 
compounds. Synthetic approaches like this represent a 
current area of great interest since it is difficult to purify 
in adequate quantity defined oligosaccharides from natu- 
ral sources. Glycan microarrays of defined oligosaccha- 
rides, analogous to the DNA arrays described in Chapter 9, 
can be probed with fluorescently tagged lectins to de- 
termine their binding specificity. 



SUMMARY 7.5 



Working with 
Carbohydrates 



Establishing the complete structure of 
oligosaccharides and polysaccharides requires 
determination of linear sequence, branching posi- 
tions, the configuration of each monosaccharide 
unit, and the positions of the glycosidic linkages — a 
more complex problem than protein and nucleic 
acid analysis. 

The structures of oligosaccharides and 
polysaccharides are usually determined by a 
combination of methods: specific enzymatic 
hydrolysis to determine stereochemistry at the 
glycosidic bond and produce smaller fragments for 
further analysis; methylation to locate glycosidic 
bonds; and stepwise degradation to determine 
sequence and configuration of anomeric carbons. 

Mass spectrometry and high-resolution NMR 
spectroscopy, applicable to small samples of 
carbohydrate, yield essential information about 
sequence, configuration at anomeric and other 
carbons, and positions of glycosidic bonds. 

Solid-phase synthetic methods yield defined 
oligosaccharides that are of great value in exploring 
lectin-oligosaccharide interactions and may prove 
clinically useful. 
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1. Sugar Alcohols In the monosaccharide derivatives known 
as sugar alcohols, the carbonyl oxygen is reduced to a hydroxyl 
group. For example, D-glyceraldehyde can be reduced to glycerol. 
However, this sugar alcohol is no longer designated d or l. Why? 

2. Recognizing Epimers Using Figure 7-3, identify the 
epimers of (a) D-allose, (b) D-gulose, and (c) D-ribose at C-2, 
C-3, and C-4. 



3. Melting Points of Monosaccharide Osazone Deriva- 
tives Many carbohydrates react with phenylhydrazine 
(C 6 H 5 NHNH 2 ) to form bright yellow crystalline derivatives 
known as osazones: 
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The melting temperatures of these derivatives are easily de- 
termined and are characteristic for each osazone. This infor- 
mation was used to help identify monosaccharides before the 
development of HPLC or gas-liquid chromatography. Listed 
below are the melting points (MPs) of some aldose-osazone 
derivatives: 



Monosaccharide 


MP of anhydrous 
monosaccharide (°C) 


MP of osazone 
derivative (°C) 


Glucose 


146 


205 


Mannose 


132 


205 


Galactose 


165-168 


201 


Talose 


128-130 


201 



As the table shows, certain pairs of derivatives have the same 
melting points, although the underivatized monosaccharides 
do not. Why do glucose and mannose, and similarly galactose 
and talose, form osazone derivatives with the same melting 
points? 

4. Interconversion of D-Glucose Forms A solution of one 
enantiomer of a given monosaccharide rotates plane-polarized 
light to the left (counterclockwise) and is called the levorota- 
tory isomer, designated (-); the other enantiomer rotates 
plane-polarized light to the same extent but to the right 
(clockwise) and is called the dextrorotatory isomer, desig- 
nated (+). An equimolar mixture of the (+) and (-) forms 
does not rotate plane-polarized light. 

The optical activity of a stereoisomer is expressed quanti- 
tatively by its optical rotation, the number of degrees by 
which plane-polarized light is rotated on passage through a 
given path length of a solution of the compound at a given con- 
centration. The specific rotation [a]{ of an optically active 
compound is denned thus: 



[«n = 



observed optical rotation (°) 



optical path length (dm) X concentration (g/mL) 



The temperature (t) and the wavelength of the light (A) em- 
ployed (usually, as here, the d line of sodium, 589 nm) must be 
specified. 

A freshly prepared solution of a-D-glucose shows a spe- 
cific rotation of +112°. Over time, the rotation of the solution 
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gradually decreases and reaches an equilibrium value corre- 
sponding to [a]l 5 ° c = +52.5°. In contrast, a freshly prepared 
solution of /3-D-glucose has a specific rotation of +19°. The ro- 
tation of this solution increases over time to the same equilib- 
rium value as that shown by the a anomer. 

(a) Draw the Haworth perspective formulas of the a and j3 
forms of D-glucose. What feature distinguishes the two forms? 

(b) Why does the specific rotation of a freshly prepared 
solution of the a form gradually decrease with time? Why do 
solutions of the a and p forms reach the same specific rotation 
at equilibrium? 

(c) Calculate the percentage of each of the two forms of d- 
glucose present at equilibrium. 

5. Configuration and Conformation Which bond(s) in a-D- 
glucose must be broken to change its configuration to /3-d- 
glucose? Which bond(s) to convert D-glucose to D-mannose? 
Which bond(s) to convert one "chair" form of D-glucose to 
the other? 

6. Deoxysugars Is D-2-deoxygalactose the same chemical as 
D-2-deoxyglucose? Explain. 

7. Sugar Structures Describe the common structural fea- 
tures and the differences for each pair: (a) cellulose and glyco- 
gen; (b) D-glucose and D-fructose; (c) maltose and sucrose. 

8. Reducing Sugars Draw the structural formula for a-D- 
glucosyl-(l-*6)-D-mannosamine and circle the part of this 
structure that makes the compound a reducing sugar. 

9. Hemiacetal and Glycosidic Linkages Explain the dif- 
ference between a hemiacetal and a glycoside. 

10. A Taste of Honey The fructose in honey is mainly in the 
/3-D-pyranose form. This is one of the sweetest carbohydrates 
known, about twice as sweet as glucose; the /3-D-furanose form 
of fructose is much less sweet. The sweetness of honey gradu- 
ally decreases at a high temperature. Also, high-fructose corn 
syrup (a commercial product in which much of the glucose in 
corn syrup is converted to fructose) is used for sweetening 
cold but not hot drinks. What chemical property of fructose 
could account for both these observations? 

11. Reducing Disaccharide A disaccharide, which you 
know to be either maltose or sucrose, is treated with Fehling's 
solution, and a red color is formed. Which sugar is it, and how 
do you know? 

I 12. Glucose Oxidase in Determination of Blood 
Glucose The enzyme glucose oxidase isolated from 
the mold Penicillium notatum catalyzes the oxidation of /3-d- 
glucose to D-glucono-6-lactone. This enzyme is highly specific 
for the p anomer of glucose and does not affect the a anomer. 
In spite of this specificity, the reaction catalyzed by glucose 
oxidase is commonly used in a clinical assay for total blood 
glucose— that is, for solutions consisting of a mixture of ft- and 
a-D-glucose. What are the circumstances required to make this 
possible? Aside from allowing the detection of smaller quanti- 
ties of glucose, what advantage does glucose oxidase offer over 
Fehling's reagent for measuring blood glucose? 



13. Invertase "Inverts" Sucrose The hydrolysis of su- 
crose (specific rotation +66 5°) yields an equimolar mixture 
of D-glucose (specific rotation +52.5°) and D-fructose (spe- 
cific rotation -92°). (See Problem 4 for details of specific 
rotation.) 

(a) Suggest a convenient way to determine the rate of hy- 
drolysis of sucrose by an enzyme preparation extracted from 
the lining of the small intestine. 

(b) Explain why, in the food industry, an equimolar mix- 
ture of D-glucose and D-fructose formed by hydrolysis of su- 
crose is called invert sugar. 

(c) The enzyme invertase (now commonly called sucrase) 
is allowed to act on a 10% (0.1 g/mL) solution of sucrose until 
hydrolysis is complete. What will be the observed optical rota- 
tion of the solution in a 10 cm cell? (Ignore a possible small 
contribution from the enzyme.) 

14. Manufacture of Liquid-Filled Chocolates The manu- 
facture of chocolates containing a liquid center is an interest- 
ing application of enzyme engineering. The flavored liquid 
center consists largely of an aqueous solution of sugars rich in 
fructose to provide sweetness. The technical dilemma is the 
following: the chocolate coating must be prepared by pouring 
hot melted chocolate over a solid (or almost solid) core, yet 
the final product must have a liquid, fructose-rich center. Sug- 
gest a way to solve this problem. (Hint: Sucrose is much less 
soluble than a mixture of glucose and fructose.) 

15. Anomers of Sucrose? Lactose exists in two anomeric 
forms, but no anomeric forms of sucrose have been reported. 
Why? 

16. Gentiobiose Gentiobiose (d-G1c(/31->6)d-G1c) is a di- 
saccharide found in some plant glycosides. Draw the structure 
of gentobiose based on its abbreviated name. Is it a reducing 
sugar? Does it undergo mutarotation? 

17. Identifying Reducing Sugars Is Af-acetyl-/3-D-glucos- 
amine (Fig. 7-9) a reducing sugar? What about D-gluconate? Is 
the disaccharide GlcN(al<->la:)Glc a reducing sugar? 

18. Cellulose Digestion Cellulose could provide a widely 
available and cheap form of glucose, but humans cannot digest 
it. Why not? If you were offered a procedure that allowed you 
to acquire this ability, would you accept? Why or why not? 

19. Physical Properties of Cellulose and Glycogen The 

almost pure cellulose obtained from the seed threads of 
Gossypium (cotton) is tough, fibrous, and completely insolu- 
ble in water. In contrast, glycogen obtained from muscle or 
liver disperses readily in hot water to make a turbid solution. 
Despite their markedly different physical properties, both sub- 
stances are (l-»4) -linked D-glucose polymers of comparable 
molecular weight. What structural features of these two poly- 
saccharides underlie their different physical properties? Ex- 
plain the biological advantages of their respective properties. 

20. Dimensions of a Polysaccharide Compare the dimen- 
sions of a molecule of cellulose and a molecule of amylose, 
each of M T 200,000. 



Problems 




21. Growth Rate of Bamboo The stems of bamboo, a trop- 
ical grass, can grow at the phenomenal rate of 0.3 m/day under 
optimal conditions. Given that the stems are composed almost 
entirely of cellulose fibers oriented in the direction of growth, 
calculate the number of sugar residues per second that must 
be added enzymatically to growing cellulose chains to account 
for the growth rate. Each D-glucose unit contributes -0.5 nm 
to the length of a cellulose molecule. 

22. Glycogen as Energy Storage: How Long Can a Game 
Bird Fly? Since ancient times it has been observed that cer- 
tain game birds, such as grouse, quail, and pheasants, are eas- 
ily fatigued. The Greek historian Xenophon wrote, "The 
bustards . . . can be caught if one is quick in starting them up, 
for they will fly only a short distance, like partridges, and soon 
tire; and their flesh is delicious." The flight muscles of game 
birds rely almost entirely on the use of glucose 1 -phosphate 
for energy, in the form of ATP (Chapter 14). The glucose 1- 
phosphate is formed by the breakdown of stored muscle glyco- 
gen, catalyzed by the enzyme glycogen phosphorylase. The 
rate of ATP production is limited by the rate at which glycogen 
can be broken down. During a "panic flight," the game bird's 
rate of glycogen breakdown is quite high, approximately 
120 jamol/min of glucose 1-phosphate produced per gram of 
fresh tissue. Given that the flight muscles usually contain 
about 0.35% glycogen by weight, calculate how long a game 
bird can fly. (Assume the average molecular weight of a 
glucose residue in glycogen is 162 g/mol.) 

23. Relative Stability of Two Conformers Explain why 
the two structures shown in Figure 7-19 are so different in en- 
ergy (stability). Hint: See Figure 1-21. 

24. Volume of Chondroitin Sulfate in Solution One crit- 
ical function of chondroitin sulfate is to act as a lubricant in 
skeletal joints by creating a gel-like medium that is resilient to 
friction and shock. This function seems to be related to a dis- 
tinctive property of chondroitin sulfate: the volume occupied 
by the molecule is much greater in solution than in the dehy- 
drated solid. Why is the volume so much larger in solution? 

■ 25. Heparin Interactions Heparin, a highly nega- 
tively charged glycosaminoglycan, is used clinically as 
an anticoagulant. It acts by binding several plasma proteins, 
including antithrombin III, an inhibitor of blood clotting. The 
1:1 binding of heparin to antithrombin III seems to cause a 
conformational change in the protein that greatly increases its 
ability to inhibit clotting. What amino acid residues of an- 
tithrombin III are likely to interact with heparin? 

26. Permutations of a Trisaccharide Think about how one 
might estimate the number of possible trisaccharides composed 
of A^-acetylglucosamine 4-sulfate (GlcNAc4S) and glucuronic 
acid (GlcA), and draw 10 of them. 

27. Effect of Sialic Acid on SDS Polyacrylamide Gel 
Electrophoresis Suppose you have four forms of a protein, 
all with identical amino acid sequence but containing zero, 
one, two, or three oligosaccharide chains, each ending in a sin- 
gle sialic acid residue. Draw the gel pattern you would expect 



when a mixture of these four glycoproteins is subjected to SDS 
polyacrylamide gel electrophoresis (see Fig. 3-18) and stained 
for protein. Identify any bands in your drawing. 

28. Information Content of Oligosaccharides The 

carbohydrate portion of some glycoproteins may serve as a 
cellular recognition site. To perform this function, the 
oligosaccharide moiety must have the potential to exist in a 
large variety of forms. Which can produce a greater variety of 
structures: oligopeptides composed of five different amino 
acid residues or oligosaccharides composed of five different 
monosaccharide residues? Explain. 

29. Determination of the Extent of Branching in Amy- 
lopectin The amount of branching (number of (al->6) gly- 
coside bonds) in amylopectin can be determined by the 
following procedure. A sample of amylopectin is exhaustively 
methylated — treated with a methylating agent (methyl iodide) 
that replaces the hydrogen of every sugar hydroxyl with a 
methyl group, converting —OH to — OCH 3 . All the glycosidic 
bonds in the treated sample are then hydrolyzed in aqueous 
acid, and the amount of 2,3-di-O-methylglucose so formed is 
determined. 

CH 2 OH 







H 




GCH 3 


H 



H OCH 3 
2,3-Di-O-methylghicose 

(a) Explain the basis of this procedure for determining 
the number of (a 1^6) branch points in amylopectin. What 
happens to the unbranched glucose residues in amylopectin 
during the methylation and hydrolysis procedure? 

(b) A 258 mg sample of amylopectin treated as described 
above yielded 12.4 mg of 2,3-di-O-methylglucose. Determine 
what percentage of the glucose residues in the amylopectin con- 
tained an (al->6) branch. (Assume that the average molecular 
weight of a glucose residue in amylopectin is 162 g/mol.) 

30. Structural Analysis of a Polysaccharide A polysac- 
charide of unknown structure was isolated, subjected to ex- 
haustive methylation, and hydrolyzed. Analysis of the 
products revealed three methylated sugars: 2,3,4-tri-O- 
methyl-D-glucose, 2,4-di-O-methyl-D-glucose, and 2,3,4,6- 
tetra-O-methyl-D-glucose, in the ratio 20:1:1. What is the 
structure of the polysaccharide? 

Data Analysis Problem 

31. Determining the Structure of ABO Blood Group 
Antigens The human ABO blood group system was first dis- 
covered in 1901, and in 1924 this trait was shown to be inherited 
at a single gene locus with three alleles. In 1960, W. T. J. Morgan 
published a paper summarizing what was known at that time 
about the structure of the ABO antigen molecules. When the pa- 
per was published, the complete structures of the A, B, and O 
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antigens were not yet known; this paper is an example of what 
scientific knowledge looks like "in the making." 

In any attempt to determine the structure of an unknown 
biological compound, researchers must deal with two funda- 
mental problems: (1) If you don't know what it is, how do you 
know if it is pure? (2) If you don't know what it is, how do you 
know that your extraction and purification conditions have not 
changed its structure? Morgan addressed problem 1 through 
several methods. One method is described in his paper as ob- 
serving "constant analytical values after fractional solubility 
tests" (p. 312). In this case, "analytical values" are measure- 
ments of chemical composition, melting point, and so forth. 

(a) Based on your understanding of chemical techniques, 
what could Morgan mean by "fractional solubility tests"? 

(b) Why would the analytical values obtained from frac- 
tional solubility tests of a pure substance be constant, and 
those of an impure substance not be constant? 

Morgan addressed problem 2 by using an assay to measure 
the immunological activity of the substance present in differ- 
ent samples. 

(c) Why was it important for Morgan's studies, and espe- 
cially for addressing problem 2, that this activity assay be 
quantitative (measuring a level of activity) rather than sim- 
ply qualitative (measuring only the presence or absence of 
a substance)? 

The structure of the blood group antigens is shown in Fig- 
ure 10-15. In his paper (p. 314), Morgan listed several proper- 
ties of the three antigens, A, B, and 0, that were known at 
that time: 

1. Type B antigen has a higher content of galactose than A 
or 0. 

2. Tvpe A antigen contains more total amino sugars than B 
or 0. 

3 The glucosamine/galactosamine ratio for the A antigen is 
roughly 1.2; for B, it is roughly 2.5. 

(d) Which of these findings is (are) consistent with the 
known structures of the blood group antigens? 

(e) How do you explain the discrepancies between Mor- 
gan's data and the known structures? 

In later work, Morgan and his colleagues used a clever 
technique to obtain structural information about the blood 
group antigens. Enzymes had been found that would specifi- 
cally degrade the antigens. However, these were available only 
as crude enzyme preparations, perhaps containing more than 



one enzyme of unknown specificity. Degradation of the blood 
type antigens by these crude enzymes could be inhibited by 
the addition of particular sugar molecules to the reaction. Only 
sugars found in the blood type antigens would cause this inhi- 
bition. One enzyme preparation, isolated from the protozoan 
Trichomonas foetus, would degrade all three antigens and 
was inhibited by the addition of particular sugars. The results 
of these studies are summarized in the table below, showing 
the percentage of substrate remaining unchanged when the T. 
foetus enzyme acted on the blood group antigens in the pres- 
ence of sugars. 

Unchanged substrate (%) 
Sugar added A antigen B antigen 0 antigen 



Control — no sugar 3 11 

L-Fucose 3 1 100 

D-Fucose 3 11 

L-Galactose 3 13 

D-Galactose 6 100 1 

A^Acetylglucosamine 3 11 

N- Acetylgalactosamine 100 6 1 



For the O antigen, a comparison of the control and 
L-fucose results shows that L-fucose inhibits the degradation 
of the antigen. This is an example of product inhibition, in 
which an excess of reaction product shifts the equilibrium of 
the reaction, preventing further breakdown of substrate. 

(f) Although the O antigen contains galactose, N- 
acetylglucosamine, and ^-acetylgalactosamine, none of these 
sugars inhibited the degradation of this antigen. Based on 
these data, is the enzyme preparation from T. foetus an endo- 
or exoglycosidase? (Endoglycosidases cut bonds between in- 
terior residues; exoglycosidase s remove one residue at a time 
from the end of a polymer.) Explain your reasoning, 

(g) Fucose is also present in the A and B antigens. Based 
on the structure of these antigens, why does fucose fail to pre- 
vent their degradation by the T. foetus enzyme? What struc- 
ture would be produced? 

(h) Which of the results in (f ) and (g) are consistent with 
the structures shown in Figure 10-15? Explain your reasoning. 

Reference 
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A structure this pretty just had to exist, 

—James Watson, The Double Helix, 1968 
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|L I ucleotides have a variety of roles in cellular metab- 
IVI olism. They are the energy currency in metabolic 
I w transactions, the essential chemical links in the re- 
sponse of cells to hormones and other extracellular 
stimuli, and the structural components of an array of en- 
zyme cofactors and metabolic intermediates. And, last 
but certainly not least, they are the constituents of nu- 
cleic acids: deoxyribonucleic acid (DNA) and ribonu- 
cleic acid (RNA), the molecular repositories of genetic 
information. The structure of every protein, and ulti- 
mately of every biomolecule and cellular component, is 
a product of information programmed into the nu- 
cleotide sequence of a cell's nucleic acids. The ability to 
store and transmit genetic information from one gener- 
ation to the next is a fundamental condition for life. 

This chapter provides an overview of the chemical 
nature of the nucleotides and nucleic acids found in 
most cells; a more detailed examination of the function 
of nucleic acids is the focus of Part III of this text. 

8.1 Some Basics 

0 Nucleotides, Building Blocks of Nucleic Acids The amino acid 
sequence of every protein in a cell, and the nucleotide 
sequence of every RNA, is specified by a nucleotide se- 
quence in the cell's DNA. A segment of a DNA molecule 
that contains the information required for the synthesis 
of a functional biological product, whether protein or 
RNA, is referred to as a gene. A cell typically has many 
thousands of genes, and DNA molecules, not surpris- 
ingly, tend to be very large. The storage and transmission 
of biological information are the only known functions 
of DNA. 



RNAs have a broader range of functions, and several 
classes are found in cells. Ribosomal RNAs (rRNAs) 
are components of ribosomes, the complexes that 
carry out the synthesis of proteins. Messenger RNAs 
(mRNAs) are intermediaries, carrying genetic informa- 
tion from one or a few genes to a ribosome, where the 
corresponding proteins can be synthesized. Transfer 
RNAs (tRNAs) are adapter molecules that faithfully 
translate the information in mRNA into a specific se- 
quence of amino acids. In addition to these major classes 
there is a wide variety of RNAs with special functions, 
described in depth in Part III. 

Nucleotides and Nucleic Acids Have Characteristic Bases 
and Pentoses 

Nucleotides have three characteristic components: 
(1) a nitrogenous (nitrogen-containing) base, (2) a 
pentose, and (3) a phosphate (Fig. 8-1). The molecule 
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FIGURE 8-1 Structure of nucleotides, (a) General structure showing 
the numbering convention for the pentose ring. This is a ribonu- 
cleotide. In deoxyribonucleotides the — OH group on the 2' carbon (in 
red) is replaced with — H. (b) The parent compounds of the pyrimidine 
and purine bases of nucleotides and nucleic acids, showing the num- 
bering conventions. 
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without the phosphate group is called a nucleoside. 
The nitrogenous bases are derivatives of two parent 
compounds, pyrimidine and purine. The bases and 
pentoses of the common nucleotides are heterocyclic 
compounds. 

KEY CONVENTION: The carbon and nitrogen atoms in the 
parent structures are conventionally numbered to facil- 
itate the naming and identification of the many deriva- 
tive compounds. The convention for the pentose ring 
follows rules outlined in Chapter 7, but in the pentoses 
of nucleotides and nucleosides the carbon numbers are 
given a prime (') designation to distinguish them from 
the numbered atoms of the nitrogenous bases. ■ 

The base of a nucleotide is joined covalently (at N-l 
of pyrimidines and N-9 of purines) in an 7V-/3-glycosyl 
bond to the 1' carbon of the pentose, and the phosphate 
is esterified to the 5' carbon. The AT-/3-glycosyl bond is 
formed by removal of the elements of water (a hydroxyl 
group from the pentose and hydrogen from the base) , as 
in O-glycosidic bond formation (see Fig. 7-29). 

Both DNA and RNA contain two major purine bases, 
adenine (A) and guanine (G), and two major pyrim- 
idines. In both DNA and RNA one of the pyrimidines is 
cytosine (C), but the second major pyrimidine is not 
the same in both: it is thymine (T) in DNA and uracil 
(U) in RNA. Only rarely does thymine occur in RNA or 
uracil in DNA. The structures of the five major bases are 
shown in Figure 8-2, and the nomenclature of their 
corresponding nucleotides and nucleosides is summa- 
rized in Table 8-1. 

Nucleic acids have two kinds of pentoses. The re- 
curring deoxyribonucleotide units of DNA contain 2'- 
deoxy-D-ribose, and the ribonucleotide units of RNA 
contain D-ribose. In nucleotides, both types of pentoses 
are in their /3-furanose (closed five-membered ring) 
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FIGURE 8-2 Major purine and pyrimidine bases of nucleic acids. 

Some of the common names of these bases reflect the circumstances of 
their discovery. Guanine, for example, was first isolated from guano 
{bird manure), and thymine was first isolated from thymus tissue. 



form. As Figure 8-3 shows, the pentose ring is not pla- 
nar but occurs in one of a variety of conformations gen- 
erally described as "puckered." 

KEY CONVENTION: Although DNA and RNA seem to have 
two distinctions — different pentoses and the presence 
of uracil in RNA and thymine in DNA — it is the pentoses 
that define the identity of a nucleic acid. If the nucleic 
acid contains 2'-deoxy-D-ribose, it is DNA by definition 
even though it may contain some uracil. Similarly, if the 
nucleic acid contains D-ribose, it is RNA regardless of its 
base composition. ■ 



TABLE 8-1 



Base 



Purines 

Adenine 

Guanine 

Pyrimidines 

Cytosine 

Thymine 
Uracil 



Nucleotide and Nucleic Acid Nomenclature 

Nucleoside Nucleotide 



Adenosine 
Deoxyadenosine 

Guanosine 
Deoxyguanosine 

Cytidine 
Deoxycytidine 

Thymidine or deoxythymidine 
Uridine 



Nucleic acid 



Adenylate 
Deoxyadenylate 

Guanylate 
Deoxyguanylate 

Cytidylate 
Deoxycytidylate 

Thymidylate or deoxythymidylate 
Uridylate 



RNA 
DNA 

RNA 
DNA 

RNA 
DNA 

DNA 

RNA 



Note: "Nucleoside" and "nucleotide" are generic terms that include both ribo- and deoxyribo- forms, Also, ribonucleosides and ribonucleotides are here designated simply as nucleo- 
sides and nucleotides (e.g., riboadenosine as adenosine), and deoxyribonucleosides and deoxyribonucleotides as deoxynucleosides and deoxynucleotides (e.g., deoxyriboadenosine 
as deoxyadenosine). Both forms of naming are acceptable, but the shortened names are more commonly used. Thymine is an exception; "ribothymidine" is used to describe its unusual 
occurrence in RNA. 
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(a) 
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FIGURE 8-3 Conformations of ribose. (a) In solution, the straight- 
chain (aldehyde) and ring (/3-furanose) forms of free ribose are in 
equilibrium RNA contains only the ring form, /3-D-ribofuranose. 
Deoxyribose undergoes a similar interconversion in solution, but in 
DNA exists solely as ;8-2'-deoxy-D-ribofuranose. (b) Ribofuranose 

Figure 8-4 gives the structures and names of 
the four major deoxyribonucleotides (deoxyribonu- 
cleoside 5 '-monophosphates), the structural units of 




rings in nucleotides can exist in four different puckered conformations. 
In all cases, four of the five atoms are in a single plane. The fifth atom 
(C-2' or C-3') is on either the same (endo) or the opposite (exo) side of 
the plane relative to the C-5' atom. 

DNAs, and the four major ribonucleotides (ribonu- 
cleoside 5 '-monophosphates), the structural units of 
RNAs. 
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Nucleotide: Adenylate (adenosine 
5 ' -monophosphate) 



Symbols: 
Nucleoside: 



A, AMP 

Adenosine 



Guanylate (guanosine 
5 ' -monophosphate) 

G, GMP 
Guanosine 
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Uridylate (uridine 
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Cytidylate (cytidine 
5 ' -monophosphate) 

C, CMP 
Cytidine 



(b) Ribonucleotides 



FIGURE 8-4 Deoxyribonucleotides and ribonucleotides of nucleic 
acids. All nucleotides are shown in their free form at pH 7.0. The nu- 
cleotide units of DNA (a) are usually symbolized as A, G, T, and C, 
sometimes as dA, dG, dT, and dC; those of RNA (b) as A, G, U, and C. 
In their free form the deoxyribonucleotides are commonly abbreviated 
dAMP, dGMP, dTMP, and dCMP; the ribonucleotides, AMP, GMP, 



UMP, and CMP. For each nucleotide, the more common name is fol- 
lowed by the complete name in parentheses. All abbreviations assume 
that the phosphate group is at the 5' position. The nucleoside portion 
of each molecule is shaded in pink. In this and the following illus- 
trations, the ring carbons are not shown. 
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(a) AT 2 -Methylguanosine 5-Hydroxymethylcytidine 
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FIGURE 8-5 Some minor purine and pyrimidine bases, shown as the 
nucleosides, (a) Minor bases of DNA. 5-Methylcytidine occurs in the 
DNA of animals and higher plants, /V 6 -methyladenosine in bacterial 
DNA, and 5-hydroxymethylcytidine in the DNA of bacteria infected 
with certain bacteriophages, (b) Some minor bases of tRNAs. Inosine 
contains the base hypoxanthine. Note that pseudouridine, like uridine, 
contains uracil; they are distinct in the point of attachment to the 
ribose— in uridine, uracil is attached through N-1, the usual attach- 
ment point for pyrimidines; in pseudouridine, through C-5. 

Although nucleotides bearing the major purines and 
pyrimidines are most common, both DNA and RNA also 
contain some minor bases (Fig. 8-5). In DNA the most 
common of these are methylated forms of the major 
bases; in some viral DNAs, certain bases may be hydrox- 
ymethylated or glucosylated. Altered or unusual bases 
in DNA molecules often have roles in regulating or pro- 
tecting the genetic information. Minor bases of many 
types are also found in RNAs, especially in tRNAs (see 
Fig. 8-25 and Fig. 26-23). 

KEY CONVENTION: The nomenclature for the minor bases 
can be confusing. Like the major bases, many have com- 
mon names — hypoxanthine, for example, shown as its 
nucleoside inosine in Figure 8-5. When an atom in the 
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FIGURE 8-6 Some adenosine monophosphates. Adenosine 2 -mono- 
phosphate, 3 '-monophosphate, and 2 ',3 '-cyclic monophosphate are formed 
by enzymatic and alkaline hydrolysis of RNA. 



purine or pyrimidine ring is substituted, the usual con- 
vention (used here) is simply to indicate the ring posi- 
tion of the substituent by its number — for example, 
5-methylcytosine, 7-methylguanine, and 5-hydroxy- 
methylcytosine (shown as the nucleosides in Fig. 8-5). 
The element to which the substituent is attached (N, C, 
0) is not identified. The convention changes when the 
substituted atom is exocyclic (not within the ring struc- 
ture), in which case the type of atom is identified and 
the ring position to which it is attached is denoted with 
a superscript. The amino nitrogen attached to C-6 of 
adenine is N & ; similarly, the carbonyl oxygen and amino 
nitrogen at C-6 and C-2 of guanine are O 6 and N 2 , re- 
spectively. Examples of this nomenclature are N 6 - 
methyladenosine and Af 2 -methylguanosine (Fig. 8-5). ■ 

Cells also contain nucleotides with phosphate groups 
in positions other than on the 5' carbon (Fig. 8-6). 
Ribonucleoside 2 ',3 '-cyclic monophosphates are 

isolatable intermediates, and ribonucleoside 3'- 
monophosphates are end products of the hydrolysis of 
RNA by certain ribonucleases. Other variations are 
adenosine 3 ' , 5 ' -cyclic monophosphate (c AMP) and 
guanosine 3',5'-qyclic monophosphate (cGMP), consid- 
ered at the end of this chapter. 

Phosphodiester Bonds Link Successive Nucleotides in 
Nucleic Acids 

The successive nucleotides of both DNA and RNA are co- 
valently linked through phosphate-group "bridges," in 
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which the 5 '-phosphate group of one nucleotide unit is 
joined to the 3'-hydroxyl group of the next nucleotide, 
creating a phosphodiester linkage (Fig. 8-7), Thus the 
covalent backbones of nucleic acids consist of alternating 
phosphate and pentose residues, and the nitrogenous 
bases may be regarded as side groups joined to the back- 
bone at regular intervals. The backbones of both DNA and 
RNA are hydrophilic. The hydroxyl groups of the sugar 
residues form hydrogen bonds with water. The phosphate 
groups, with a pK a near 0, are completely ionized and neg- 
atively charged at pH 7, and the negative charges are gen- 
erally neutralized by ionic interactions with positive 
charges on proteins, metal ions, and polyamines. 

KEY CONVENTION: All the phosphodiester linkages in DNA 
and RNA have the same orientation along the chain 
(Fig. 8-7), giving each linear nucleic acid strand a spe- 
cific polarity and distinct 5' and 3' ends. By definition, 
the 5' end lacks a nucleotide at the 5' position and the 
3' end lacks a nucleotide at the 3' position. Other 
groups (most often one or more phosphates) may be 
present on one or both ends. The 5' to 3' orientation of 
a strand of nucleic acid refers to the ends of the strand, 
not the orientation of the individual phosphodiester 
bonds linking its constituent nucleotides. ■ 

The covalent backbone of DNA and RNA is subject 
to slow, nonenzymatic hydrolysis of the phosphodiester 
bonds. In the test tube, RNA is hydrolyzed rapidly under 
alkaline conditions, but DNA is not; the 2'-hydroxyl 
groups in RNA (absent in DNA) are directly involved in 
the process. Cyclic 2',3'-monophosphate nucleotides 
are the first products of the action of alkali on RNA and 
are rapidly hydrolyzed further to yield a mixture of 2'- 
and 3'-nucleoside monophosphates (Fig. 8-8). 
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FIGURE 8-7 Phosphodiester linkages in the covalent backbone of 
DNA and RNA. The phosphodiester bonds (one of which is shaded in 
the DNA) link successive nucleotide units. The backbone of alternating 
pentose and phosphate groups in both types of nucleic acid is highly 
polar. The 5' end of the macromolecule lacks a nucleotide at the 5' 
position, and the 3' end lacks a nucleotide at the 3' position. 
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FIGURE 8-8 Hydrolysis of RNA under alkaline 
conditions. The 2' hydroxyl acts as a nucleophile in 
an intramolecular displacement. The 2',3'-cyclic 
monophosphate derivative is further hydrolyzed to a 
mixture of 2'- and 3 '-monophosphates. DNA, which 
lacks 2' hydroxyls, is stable under similar conditions. 
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The nucleotide sequences of nucleic acids can be 
represented schematically, as illustrated below by a seg- 
ment of DNA with five nucleotide units. The phosphate 
groups are symbolized by (©, and each deoxyribose is 
symbolized by a vertical line, from C-l ' at the top to C-5' 
at the bottom (but keep in mind that the sugar is always 
in its closed-ring /3-furanose form in nucleic acids). The 
connecting lines between nucleotides (which pass 
through ®) are drawn diagonally from the middle (C-3') 
of the deoxyribose of one nucleotide to the bottom 
(C-5') of the next. 

A C G T A 



5 r End 




Some simpler representations of this pentadeoxyri- 
bonucleotide are pA-C-G-T-A OH , pApCpGpTpA, and 
pACGTA. 

KEY CONVENTION: The sequence of a single strand of nu- 
cleic acid is always written with the 5' end at the left and 
the 3' end at the right— that is, in the 5' 3' direction. ■ 

A short nucleic acid is referred to as an oligonu- 
cleotide. The definition of "short" is somewhat arbi- 
trary, but polymers containing 50 or fewer nucleotides 
are generally called oligonucleotides. A longer nucleic 
acid is called a polynucleotide. 

The Properties of Nucleotide Bases Affect the Three- 
Dimensional Structure of Nucleic Adds 

Free pyrimidines and purines are weakly basic com- 
pounds and thus are called bases. The purines and 
pyrimidines common in DNA and RNA are aromatic 
molecules (Fig. 8-2), a property with important conse- 




H H 
Lactam Lactim Double lactim 

Uracil 



FIGURE 8-9 Tautomeric forms of uracil. The lactam form predomi- 
nates at pH 7.0; the other forms become more prominent as pH 
decreases. The other free pyrimidines and the free purines also have 
tautomeric forms, but they are more rarely encountered. 

quences for the structure, electron distribution, and 
light absorption of nucleic acids. Electron derealization 
among atoms in the ring gives most of the bonds partial 
double-bond character. One result is that pyrimidines 
are planar molecules and purines are very nearly planar, 
with a slight pucker. Free pyrimidine and purine bases 
may exist in two or more tautomeric forms depending 
on the pH. Uracil, for example, occurs in lactam, lactim, 
and double lactim forms (Fig. 8-9). The structures 
shown in Figure 8-2 are the tautomers that predominate 
at pH 7.0. All nucleotide bases absorb UV light, and nu- 
cleic acids are characterized by a strong absorption at 
wavelengths near 260 nm (Fig. 8-10). 

The purine and pyrimidine bases are hydropho- 
bic and relatively insoluble in water at the near-neu- 
tral pH of the cell. At acidic or alkaline pH the bases 
become charged and their solubility in water in- 
creases. Hydrophobic stacking interactions in which 
two or more bases are positioned with the planes of 
their rings parallel (like a stack of coins) are one of 
two important modes of interaction between bases in 
nucleic acids. The stacking also involves a combina- 
tion of van der Waals and dipole-dipole interactions 
between the bases. Base stacking helps to minimize 
contact of the bases with water, and base-stacking in- 
teractions are very important in stabilizing the three- 



FIGURE 8-1 0 Absorption spectra of the com- 
mon nucleotides. The spectra are shown as 
the variation in molar extinction coefficient 
with wavelength. The molar extinction coeffi- 
cients at 260 nm and pH 7.0 <e 260 ) are listed 
in the table. The spectra of corresponding 
ribonucleotides and deoxyribonucleotides, 
as well as the nucleosides, are essentially 
identical. For mixtures of nucleotides, a 
wavelength of 260 nm (dashed vertical line) 
is used for absorption measurements. 
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dimensional structure of nucleic acids, as described 
later. 

The functional groups of pyrimidines and purines 
are ring nitrogens, carbonyl groups, and exocyclic 
amino groups. Hydrogen bonds involving the amino 
and carbonyl groups are the most important mode of 
interaction between two (and occasionally three or 
four) complementary strands of nucleic acid. The most 
common hydrogen-bonding patterns are those defined 
by James D. Watson and Francis Crick in 1953, in 
which A bonds specifically to T (or U) and G bonds to 
C (Fig. 8-11). These two types of base pairs pre- 
dominate in double-stranded DNA and RNA, and the 
tautomers shown in Figure 8-2 are responsible for 
these patterns. It is this specific pairing of bases that 
permits the duplication of genetic information, as we 
shall discuss later in this chapter. 




James D. Watson Francis Crick, 

1916-2004 



FIGURE 8-11 Hydrogen-bonding patterns in the base pairs defined 
by Watson and Crick. Here as elsewhere, hydrogen bonds are repre- 
sented by three blue lines. 

SUMMARY 8.1 Some Basics 

■ A nucleotide consists of a nitrogenous base (purine 
or pyrimidine), a pentose sugar, and one or more 
phosphate groups. Nucleic acids are polymers of 
nucleotides, joined together by phosphodiester 
linkages between the 5'-hydroxyl group of one 
pentose and the 3'-hydroxyl group of the next. 

■ There are two types of nucleic acid: RNA and DNA. 
The nucleotides in RNA contain ribose, and the 
common pyrimidine bases are uracil and cytosine. 
In DNA, the nucleotides contain 2'-deoxyribose, 
and the common pyrimidine bases are thymine and 
cytosine. The primary purines are adenine and 
guanine in both RNA and DNA. 

8.2 Nucleic Acid Structure 

The discovery of the structure of DNA by Watson and 
Crick in 1953 was a momentous event in science, an 
event that gave rise to entirely new disciplines and influ- 
enced the course of many established ones. In this sec- 
tion we focus on DNA structure, some of the events that 
led to its discovery, and more recent refinements in our 
understanding of DNA. RNA structure is also introduced. 

As in the case of protein structure (Chapter 4), it is 
sometimes useful to describe nucleic acid structure in 
terms of hierarchical levels of complexity (primary, sec- 
ondary, tertiary). The primary structure of a nucleic 
acid is its covalent structure and nucleotide sequence. 
Any regular, stable structure taken up by some or all of 
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the nucleotides in a nucleic acid can be referred to as 
secondary structure. All structures considered in the 
remainder of this chapter fall under the heading of sec- 
ondary structure. The complex folding of large chromo- 
somes within eukaryotic chromatin and bacterial 
nucleoids is generally considered tertiary structure; 
this is discussed in Chapter 24. 

DNA Is a Double Helix That Stores Genetic Information 

DNA was first isolated and characterized by Friedrich 
Miescher in 1868. He called the phosphorus-containing 
substance "nuclein." Not until the 1940s, with the work 
of Oswald T. Avery, Colin MacLeod, and Maclyn McCarty, 
was there any compelling evidence that DNA was the ge- 
netic material. Avery and his colleagues found that DNA 
extracted from a virulent (disease-causing) strain of the 
bacterium Streptococcus pneumoniae and injected 
into a nonvirulent strain of the same bacterium trans- 
formed the nonvirulent strain into a virulent strain. They 
concluded that the DNA from the virulent strain carried 
the genetic information for virulence. Then in 1952, ex- 
periments by Alfred D. Hershey and Martha Chase, in 
which they studied the infection of bacterial cells by a 
virus (bacteriophage) with radioactively labeled DNA or 
protein, removed any remaining doubt that DNA, not 
protein, carried the genetic information. 

Another important clue to the structure of DNA 
came from the work of Erwin Chargaff and his col- 
leagues in the late 1940s. They found that the four nu- 
cleotide bases of DNA occur in different ratios in the 
DNAs of different organisms and that the amounts of 
certain bases are closely related. These data, collected 
from DNAs of a great many different species, led Char- 
gaff to the following conclusions: 

1 . The base composition of DNA generally varies from 
one species to another. 

2. DNA specimens isolated from different tissues of 
the same species have the same base composition. 

3. The base composition of DNA in a given species 
does not change with an organism's age, nutritional 
state, or changing environment. 

4. In all cellular DNAs, regardless of the species, the 
number of adenosine residues is equal to the 
number of thymidine residues (that is, A = T), and 
the number of guanosine residues is equal to the 
number of cytidine residues (G = C). From these 
relationships it follows that the sum of the purine 
residues equals the sum of the pyrimidine residues; 
that is, A + G = T + C. 

These quantitative relationships, sometimes called 
"Chargaff's rules," were confirmed by many subsequent 
researchers. They were a key to establishing the three- 
dimensional structure of DNA and yielded clues to how 
genetic information is encoded in DNA and passed from 
one generation to the next. 




FIGURE 8-12 X-ray diffraction pattern of DNA. The spots forming a 
cross in the center denote a helical structure. The heavy bands at the 
left and right arise from the recurring bases. 

To shed more light on the structure of DNA, Ros- 
alind Franklin and Maurice Wilkins used the powerful 
method of x-ray diffraction (see Box 4-5) to analyze 
DNA fibers. They showed in the early 1950s that DNA 
produces a characteristic x-ray diffraction pattern 
(Fig. 8-12). From this pattern it was deduced that 
DNA molecules are helical with two periodicities along 
their long axis, a primary one of 3.4 A and a secondary 
one of 34 A. The problem then was to formulate a 
three-dimensional model of the DNA molecule that 
could account not only for the x-ray diffraction data 
but also for the specific A = T and G = C base equiva- 
lences discovered by Chargaff and for the other chem- 
ical properties of DNA. 




Rosalind Franklin, Maurice Wilkins, 
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James Watson and Francis Crick relied on this accu- 
mulated information about DNA to set about deducing its 
structure. In 1953 they postulated a three-dimensional 
model of DNA structure that accounted for all the avail- 
able data. It consists of two helical DNA chains wound 
around the same axis to form a right-handed double helix 
(see Box 4-1 for an explanation of the right- or left- 
handed sense of a helical structure). The hydrophilic 
backbones of alternating deoxyribose and phosphate 
groups are on the outside of the double helix, facing the 
surrounding water. The furanose ring of each deoxyri- 
bose is in the C-2' endo conformation. The purine and 
pyrimidine bases of both strands are stacked inside the 
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FIGURE 8-13 Watson-Crick model for the structure of DNA. The 

original model proposed by Watson and Crick had 1 0 base pairs, or 34 
A (3.4 nm), per turn of the helix; subsequent measurements revealed 
10.5 base pairs, or 36 A (3.6 nm), per turn, (a) Schematic representa- 
tion, showing dimensions of the helix, (b) Stick representation showing 
the backbone and stacking of the bases, (c) Space-filling model. 

double helix, with their hydrophobic and nearly planar 
ring structures very close together and perpendicular to 
the long axis. The offset pairing of the two strands cre- 
ates a major groove and minor groove on the surface 
of the duplex (Fig. 8-13). Each nucleotide base of one 
strand is paired in the same plane with a base of the 
other strand. Watson and Crick found that the hydrogen- 
bonded base pairs illustrated in Figure 8-11, G with C 
and A with T, are those that fit best within the structure, 
providing a rationale for Chargaff 's rule that in any DNA, 
G = C and A = T. It is important to note that three hy- 
drogen bonds can form between G and C, symbolized 
G=C, but only two can form between A and T, symbol- 
ized A=T. This is one reason for the finding that separa- 
tion of paired DNA strands is more difficult the higher 
the ratio of G=C to A=T base pairs. Other pairings of 
bases tend (to varying degrees) to destabilize the dou- 
ble-helical structure. 

When Watson and Crick constructed their model, 
they had to decide at the outset whether the strands of 
DNA should be parallel or antiparallel — whether 
their 3',5'-phosphodiester bonds should run in the same 
or opposite directions. An antiparallel orientation pro- 
duced the most convincing model, and later work with 
DNA polymerases (Chapter 25) provided experimental 
evidence that the strands are indeed antiparallel, a find- 
ing ultimately confirmed by x-ray analysis. 

To account for the periodicities observed in the x-ray 
diffraction patterns of DNA fibers, Watson and Crick ma- 
nipulated molecular models to arrive at a structure in 
which the vertically stacked bases inside the double helix 



would be 3.4 A apart; the secondary repeat distance of 
about 34 A was accounted for by the presence of 10 base 
pairs in each complete turn of the double helix. In aque- 
ous solution the structure differs slightly from that in 
fibers, having 10.5 base pairs per helical turn (Fig. 8-13). 

As Figure 8-14 shows, the two antiparallel polynu- 
cleotide chains of double-helical DNA are not identical 
in either base sequence or composition. Instead they are 
complementary to each other. Wherever adenine oc- 
curs in one chain, thymine is found in the other; simi- 
larly, wherever guanine occurs in one chain, cytosine is 
found in the other. 

The DNA double helix, or duplex, is held together 
by two forces, as described earlier: hydrogen bonding 
between complementary base pairs (Fig. 8-11) and 
base-stacking interactions. The complementarity be- 
tween the DNA strands is attributable to the hydrogen 
bonding between base pairs. The base-stacking interac- 
tions, which are largely nonspecific with respect to the 
identity of the stacked bases, make the major contribu- 
tion to the stability of the double helix. 

The important features of the double-helical model of 
DNA structure are supported by much chemical and bio- 
logical evidence. Moreover, the model immediately sug- 
gested a mechanism for the transmission of genetic 
information. The essential feature of the model is the 
complementarity of the two DNA strands. As Watson and 
Crick were able to see, well before confirmatory data be- 
came available, this structure could logically be replicated 
by (1) separating the two strands and (2) synthesizing a 
complementary strand for each. Because nucleotides in 
each new strand are joined in a sequence specified by the 
base-pairing rules stated above, each preexisting strand 
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FIGURE 8-14 Complementarity of 
strands in the DNA double helix. 

The complementary antiparallel 
strands of DNA follow the pairing 
rules proposed by Watson and Crick. 
The base-paired antiparallel strands 
differ in base composition: the left 
strand has the composition A 3 T 2 G^ 
C 3 ; the right, A 2 T 3 G 3 Q. They also 
differ in sequence when each chain is 
read in the 5' -» 3' direction. Note 
the base equivalences: A = T and G 
= C in the duplex. 
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FIGURE 8-15 Replication of DNA as suggested by Watson and Crick. 

The preexisting or "parent" strands become separated, and each is the 
template for biosynthesis of a complementary "daughter" strand (in pink). 

functions as a template to guide the synthesis of one com- 
plementary strand (Fig. 8-15). These expectations 
were experimentally confirmed, inaugurating a revolution 
in our understanding of biological inheritance. 

WORKED EXAMPLE 8-1 Base Pairing in DNA 

In samples of DNA isolated from two unidentified 
species of bacteria, X and Y, adenine makes up 32% and 
17%, respectively, of the total bases. What relative pro- 
portions of adenine, guanine, thymine, and cytosine 
would you expect to find in the two DNA samples? What 
assumptions have you made? One of these species was 
isolated from a hot spring (64 °C). Which species is most 
likely the thermophilic bacterium, and why? 

Solution: For any double-helical DNA, A = T and G = C. 
The DNA from species X has 32% A and therefore must 
contain 32% T. This accounts for 64% of the bases and 
leaves 36% as G=C pairs: 18% G and 18% C. The sam- 
ple from species Y, with 17% A, must contain 17% T, ac- 
counting for 34% of the base pairs. The remaining 66% 
of the bases are thus equally distributed as 33% G and 
33% C. This calculation is based on the assumption that 
both DNA molecules are double-stranded. 

The higher the G + C content of a DNA molecule, 
the higher the melting temperature. Species Y, having 
the DNA with the higher G + C content (66%), most 
likely is the thermophilic bacterium; its DNA has a 
higher melting temperature and thus is more stable at 
the temperature of the hot spring. 



DNA Can Occur in Different Three-Dimensional 
Forms 

DNA is a remarkably flexible molecule. Considerable ro- 
tation is possible around several types of bonds in the 
sugar-phosphate (phosphodeoxyribose) backbone, and 
thermal fluctuation can produce bending, stretching, 
and unpairing (melting) of the strands. Many significant 
deviations from the Watson-Crick DNA structure are 
found in cellular DNA, some or all of which may be im- 
portant in DNA metabolism. These structural variations 
generally do not affect the key properties of DNA de- 
fined by Watson and Crick: strand complementarity, an- 
tiparallel strands, and the requirement for A=T and 
G=C base pairs. 

Structural variation in DNA reflects three things: 
the different possible conformations of the deoxyri- 
bose, rotation about the contiguous bonds that make 
up the phosphodeoxyribose backbone (Fig. 8-16a), 
and free rotation about the C-l'-JV-glycosyl bond (Fig. 
8-16b). Because of steric constraints, purines in 
purine nucleotides are restricted to two stable confor- 
mations with respect to deoxyribose, called syn and 
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FIGURE 8-16 Structural variation in DNA. (a) The conformation of a 
nucleotide in DNA is affected by rotation about seven different bonds. 
Six of the bonds rotate freely. The limited rotation about bond 4 gives 
rise to ring pucker, in which one of the atoms in the five-membered fu- 
ranose ring is out of the plane described by the other four. This confor- 
mation is endo or exo, depending on whether the atom is displaced to 
the same side of the plane as C-5' or to the opposite side (see Fig. 8-3b). 
(b) For purine bases in nucleotides, only two conformations with 
respect to the attached ribose units are sterically permitted, anti or syn. 
Pyrimidines generally occur in the anti conformation. 
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anti (Fig. 8-16b). Pyrimidines are generally restricted 
to the anti conformation because of steric interference 
between the sugar and the carbonyl oxygen at C-2 of 
the pyrimidine. 

The Watson-Crick structure is also referred to as 
B-form DNA, or B-DNA. The B form is the most stable 
structure for a random-sequence DNA molecule under 
physiological conditions and is therefore the standard 
point of reference in any study of the properties of 
DNA. Two structural variants that have been well char- 
acterized in crystal structures are the A and Z forms. 
These three DNA conformations are shown in Figure 
8-17, with a summary of their properties. The A form is 
favored in many solutions that are relatively devoid of 
water. The DNA is still arranged in a right-handed dou- 
ble helix, but the helix is wider and the number of base 
pairs per helical turn is 11, rather than 10.5 as in B- 
DNA. The plane of the base pairs in A-DNA is tilted 
about 20° with respect to the helix axis. These struc- 
tural changes deepen the major groove while making 
the minor groove shallower. The reagents used to pro- 
mote crystallization of DNA tend to dehydrate it, and 
thus most short DNA molecules tend to crystallize in 
the A form. 

Z-form DNA is a more radical departure from the 
B structure; the most obvious distinction is the left- 
handed helical rotation. There are 12 base pairs per 
helical turn, and the structure appears more slender 
and elongated. The DNA backbone takes on a zigzag 
appearance. Certain nucleotide sequences fold into 
left-handed Z helices much more readily than others. 
Prominent examples are sequences in which pyrim- 



idines alternate with purines, especially alternating C 
and G or 5-methyl-C and G residues. To form the left- 
handed helix in Z-DNA, the purine residues flip to the 
syn conformation, alternating with pyrimidines in the 
anti conformation. The major groove is barely appar- 
ent in Z-DNA, and the minor groove is narrow and 
deep. 

Whether A-DNA occurs in cells is uncertain, but 
there is evidence for some short stretches (tracts) of 
Z-DNA in both bacteria and eukaryotes. These Z- 
DNA tracts may play a role (as yet undefined) in reg- 
ulating the expression of some genes or in genetic 
recombination. 

Certain DNA Sequences Adopt Unusual 
Structures 

Other sequence-dependent structural variations found 
in larger chromosomes may affect the function and me- 
tabolism of the DNA segments in their immediate vicin- 
ity. For example, bends occur in the DNA helix 
wherever four or more adenosine residues appear se- 
quentially in one strand. Six adenosines in a row pro- 
duce a bend of about 18°. The bending observed with 
this and other sequences may be important in the bind- 
ing of some proteins to DNA. 

A rather common type of DNA sequence is a palin- 
drome. A palindrome is a word, phrase, or sentence that 
is spelled identically read either forward or backward; 
two examples are ROTATOR and NURSES RUN. The 
term is applied to regions of DNA with inverted repeats 
of base sequence having twofold symmetry over two 
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FIGURE 8-17 Comparison of A, B, and Z forms of DNA. Each struc- 
ture shown here has 36 base pairs. The bases are shown in gray, the 
phosphate atoms in yellow, and the riboses and phosphate oxygens in 
blue. Blue is the color used to represent DNA strands in later chapters. 
The table summarizes some properties of the three forms of DNA. 
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FIGURE 8-18 Palindromes and mirror repeats. Palindromes are se- 
quences of double-stranded nucleic acids with twofold symmetry. In 
order to superimpose one repeat (shaded sequence) on the other, it 
must be rotated 1 80° about the horizontal axis then 1 80° about the ver- 
tical axis, as shown by the colored arrows. A mirror repeat, on the 
other hand, has a symmetric sequence within each strand, Superim- 
posing one repeat on the other requires only a single 180° rotation 
about the vertical axis 

strands of DNA (Fig. 8-18). Such sequences are self- 
complementary within each strand and therefore have 
the potential to form hairpin or cruciform (cross- 
shaped) structures (Fig. 8-19), When the inverted re- 
peat occurs within each individual strand of the DNA, the 
sequence is called a mirror repeat. Mirror repeats do 
not have complementary sequences within the same 
strand and cannot form hairpin or cruciform structures. 
Sequences of these types are found in virtually every 
large DNA molecule and can encompass a few base pairs 
or thousands. The extent to which palindromes occur 
as cruciforms in cells is not known, although some cruci- 
form structures have been demonstrated in vivo in Es- 
cherichia coli. Self-complementary sequences cause 
isolated single strands of DNA (or RNA) in solution to 
fold into complex structures containing multiple hairpins. 

Several unusual DNA structures involve three or 
even four DNA strands. Nucleotides participating in a 
Watson-Crick base pair (Fig. 8-11) can form additional 
hydrogen bonds, particularly with functional groups ar- 
rayed in the major groove. For example, a cytidine 
residue (if protonated) can pair with the guanosine 
residue of a G=C nucleotide pair (Fig. 8-20); a thymi- 
dine can pair with the adenosine of an A=T pair. The 
N-7, O 6 , and TV 6 of purines, the atoms that participate in 
the hydrogen bonding of triplex DNA, are often referred 
to as Hoogsteen positions, and the non- Watson-Crick 
pairing is called Hoogsteen pairing, after Karst Hoog- 
steen, who in 1963 first recognized the potential for these 
unusual pairings. Hoogsteen pairing allows the formation 
of triplex DNAs. The triplexes shown in Figure 8-20 (a, 
b) are most stable at low pH because the C=G • C + 
triplet requires a protonated cytosine. In the triplex, the 
pK a of this cytosine is >7.5, altered from its normal value 
of 4.2. The triplexes also form most readily within long 
sequences containing only pyrimidines or only purines in 
a given strand. Some triplex DNAs contain two pyrimi- 
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FIGURE 8-19 Hairpins and cruciforms. Palindromic DNA (or RNA) 
sequences can form alternative structures with intrastrand base pair- 
ing, (a) When only a single DNA (or RNA) strand is involved, the struc- 
ture is called a hairpin, (b) When both strands of a duplex DNA are 
involved, it is called a cruciform. Blue shading highlights asymmetric 
sequences that can pair with the complementary sequence either in 
the same strand or in the complementary strand. 



dine strands and one purine strand; others contain two 
purine strands and one pyrimidine strand. 

Four DNA strands can also pair to form a tetraplex 
(quadruplex), but this occurs readily only for DNA se- 
quences with a very high proportion of guanosine 
residues (Fig. 8-20c, d). The guanosine tetraplex, or 
G tetraplex, is quite stable over a wide range of condi- 
tions. The orientation of strands in the tetraplex can 
vary as shown in Figure 8-20e. 

In the DNA of living cells, sites recognized by many 
sequence-specific DNA-binding proteins (Chapter 28) 
are arranged as palindromes, and polypyrimidine or poly- 
purine sequences that can form triple helices are found 
within regions involved in the regulation of expression 
of some eukaryotic genes. In principle, synthetic DNA 
strands designed to pair with these sequences to form 
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FIGURE 8-20 DNA structures containing three or four DNA strands. 

(a) Base-pairing patterns in one well-characterized form of triplex 
DNA. The Hoogsteen pair in each case is shown in red. (b) Triple- 
helical DNA containing two pyrimidine strands (poly(Q) and one 
purine strand (poly{G)) (derived from PDB ID 1 BCE). The dark blue and 
light blue strands are antiparallel and paired by normal Watson-Crick 
base-pairing patterns. The third (all-pyrimidine) strand (purple) is par- 
allel to the purine strand and paired through non-Watson-Crick hydro- 
gen bonds. The triplex is viewed end-on, with five triplets shown. Only 
the triplet closest to the viewer is colored, (c) Base-pairing pattern in 
the guanosine tetraplex structure, (d) Two successive tetraplets from a 
G tetraplex structure, viewed end-on with the one closest to the viewer 
in color, (e) Possible variants in the orientation of strands in a G 
tetraplex, 

triplex DNA could disrupt gene expression. This ap- 
proach to controlling cellular metabolism is of commer- 
cial interest for its potential application in medicine and 
agriculture. 

Messenger RNAs Code for Polypeptide Chains 

We now turn our attention to the expression of the ge- 
netic information that DNA contains. RNA, the second 
major form of nucleic acid in cells, has many functions. 
In gene expression, RNA acts as an intermediary by us- 
ing the information encoded in DNA to specify the 
amino acid sequence of a functional protein. 

Given that the DNA of eukaryotes is largely con- 
fined to the nucleus whereas protein synthesis occurs 
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on ribosomes in the cytoplasm, some molecule other 
than DNA must carry the genetic message from the 
nucleus to the cytoplasm. As early as the 1950s, RNA 
was considered the logical candidate: RNA is found in 
both the nucleus and the cytoplasm, and an increase in 
protein synthesis is accompanied by an increase in the 
amount of cytoplasmic RNA and an increase in its rate 
of turnover. These and other observations led several 
researchers to suggest that RNA carries genetic infor- 
mation from DNA to the protein biosynthetic machin- 
ery of the ribosome. In 1961 Frangois Jacob and 
Jacques Monod presented a unified (and essentially 
correct) picture of many aspects of this process. They 
proposed the name "messenger RNA" (mRNA) for that 
portion of the total cellular RNA carrying the genetic 



284 Nucleotides and Nucleic Acids 



Gene 

(a) Monocistronic 

Gene 1 Gene 2 Gene 3 

(b) Polycistronic 

FIGURE 8-21 Bacterial mRNA. Schematic diagrams show (a) mono- 
cistronic and (b) polycistronic mRNAs of bacteria. Red segments rep- 
resent RNA coding for a gene product; gray segments represent 
noncoding RNA. In the polycistronic transcript, noncoding RNA sepa- 
rates the three genes. 



information from DNA to the ribosomes, where the 
messengers provide the templates that specify amino 
acid sequences in polypeptide chains. Although mRNAs 
from different genes can vary greatly in length, the 
mRNAs from a particular gene generally have a denned 
size. The process of forming mRNA on a DNA template 
is known as transcription. 

In bacteria and archaea, a single mRNA molecule 
may code for one or several polypeptide chains. If it car- 
ries the code for only one polypeptide, the mRNA is 
monocistronic; if it codes for two or more different 
polypeptides, the mRNA is polycistronic. In eukary- 
otes, most mRNAs are monocistronic. (For the purposes 
of this discussion, "cistron" refers to a gene. The term it- 
self has historical roots in the science of genetics, and its 
formal genetic definition is beyond the scope of this 
text.) The minimum length of an mRNA is set by the 
length of the polypeptide chain for which it codes. For 
example, a polypeptide chain of 100 amino acid residues 
requires an RNA coding sequence of at least 300 
nucleotides, because each amino acid is coded by a 
nucleotide triplet (this and other details of protein 
synthesis are discussed in Chapter 27). However, 
mRNAs transcribed from DNA are always somewhat 
longer than the length needed simply to code for a 
polypeptide sequence (or sequences). The additional, 
noncoding RNA includes sequences that regulate pro- 
tein synthesis. Figure 8-21 summarizes the general 
structure of bacterial mRNAs. 

Many RNAs Have More Complex 
Three-Dimensional Structures 

Messenger RNA is only one of several classes of cellular 
RNA. Transfer RNAs are adapter molecules in protein 
synthesis; covalently linked to an amino acid at one end, 
they pair with the mRNA in such a way that amino acids 
are joined to a growing polypeptide in the correct 
sequence. Ribosomal RNAs are components of ribo- 
somes. There is also a wide variety of special-function 
RNAs, including some (called ribozymes) that have en- 
zymatic activity All the RNAs are considered in detail in 
Chapter 26. The diverse and often complex functions of 



these RNAs reflect a diversity of structure much richer 
than that observed in DNA molecules. 

The product of transcription of DNA is always 
single-stranded RNA. The single strand tends to assume 
a right-handed helical conformation dominated by base- 
stacking interactions (Fig. 8-22), which are stronger 
between two purines than between a purine and pyrim- 
idine or between two pyrimidines. The purine-purine in- 
teraction is so strong that a pyrimidine separating two 
purines is often displaced from the stacking pattern so 
that the purines can interact. Any self-complementary 
sequences in the molecule produce more complex 
structures. RNA can base-pair with complementary re- 
gions of either RNA or DNA. Base pairing matches the 
pattern for DNA: G pairs with C and A pairs with U (or 
with the occasional T residue in some RNAs). One dif- 
ference is that base pairing between G and U residues — 
unusual in DNA— is fairly common in RNA (see 
Fig. 8-24). The paired strands in RNA or RNA-DNA 
duplexes are antiparallel, as in DNA. 

RNA has no simple, regular secondary structure 
that serves as a reference point, as does the double he- 
lix for DNA. The three-dimensional structures of many 
RNAs, like those of proteins, are complex and unique. 
Weak interactions, especially base-stacking interac- 
tions, help stabilize RNA structures, just as they do in 
DNA. Where complementary sequences are present, 
the predominant double-stranded structure is an A- 
form right-handed double helix. Z-form helices have 
been made in the laboratory (under very high-salt or 




FIGURE 8-22 Typical right-handed stacking pattern of single-stranded 
RNA. The bases are shown in gray, the phosphate atoms in yellow, and 
the riboses and phosphate oxygens in green. Green is used to represent 
RNA strands in succeeding chapters, just as blue is used for DNA. 
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high-temperature conditions). The B form of RNA has 
not been observed. Breaks in the regular A-form helix 
caused by mismatched or unmatched bases in one or 
both strands are common and result in bulges or inter- 
nal loops (Fig. 8-23). Hairpin loops form between 
nearby self-complementary sequences. The potential 
for base-paired helical structures in many RNAs is ex- 
tensive (Fig. 8-24), and the resulting hairpins are the 
most common type of secondary structure in RNA. Spe- 
cific short base sequences (such as UUCG) are often 
found at the ends of RNA hairpins and are known to 
form particularly tight and stable loops. Such se- 
quences may act as starting points for the folding of an 
RNA molecule into its precise three-dimensional struc- 
ture. Other contributions are made by hydrogen bonds 
that are not part of standard Watson-Crick base pairs. 
For example, the 2'-hydroxyl group of ribose can hy- 
drogen-bond with other groups. Some of these proper- 
ties are evident in the structure of the phenylalanine 
transfer RNA of yeast — the tRNA responsible for insert- 
ing Phe residues into polypeptides — and in two RNA 
enzymes, or ribozymes, whose functions, like those of 



Single 
strands 





Hairpin double helix 
(b) 

FIGURE 8-23 Secondary structure of RNAs. (a) Bulge, internal loop, 
and hairpin loop, (b) The paired regions generally have an A-form 
right-handed helix, as shown for a hairpin. 



FIGURE 8-24 Base-paired helical structures in an RNA. Shown here 
is the possible secondary structure of the M1 RNA component of the 
enzyme RNase P of E. coli, with many hairpins. RNase P, which also 
contains a protein component (not shown), functions in the processing 
of transfer RNAs (see Fig. 26-27). The two brackets indicate addi- 
tional complementary sequences that may be paired in the three- 
dimensional structure. The blue dots indicate non-Watson-Crick G=U 
base pairs (boxed inset). Note that G=U base pairs are allowed only 
when presynthesized strands of RNA fold up or anneal with each other. 
There are no RNA polymerases (the enzymes that synthesize RNAs on 
a DNA template) that insert a U opposite a template G, or vice versa, 
during RNA synthesis. 
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FIGURE 8-25 Three-dimensional structure in RNA. (a) Three- 
dimensional structure of phenylalanine tRNA of yeast (PDB ID 1TRA). 
Some unusual base-pairing patterns found in this tRNA are shown. Note 
also the involvement of the oxygen of a ribose phosphodiester bond in 
one hydrogen-bonding arrangement, and a ribose 2'-hydroxyl group in 
another (both in red), (b) A hammerhead ribozyme (so named because 
the secondary structure at the active site looks like the head of a ham- 
mer), derived from certain plant viruses (derived from PDB ID 1MME). 



Ribozymes, or RNA enzymes, catalyze a variety of reactions, prima- 
rily in RNA metabolism and protein synthesis. The complex three- 
dimensional structures of these RNAs reflect the complexity inherent 
in catalysis, as described for protein enzymes in Chapter 6. (c) A seg- 
ment of mRNA known as an intron, from the ciliated protozoan 
Tetrahymena thermophila (derived from PDB ID 1GRZ). This intron (a 
ribozyme) catalyzes its own excision from between exons in an mRNA 
strand (discussed in Chapter 26). 



protein enzymes, depend on their three-dimensional 
structures (Fig. 8-25). 

The analysis of RNA structure and the relationship 
between structure and function is an emerging field of 
inquiry that has many of the same complexities as the 
analysis of protein structure. The importance of under- 
standing RNA structure grows as we become increas- 
ingly aware of the large number of functional roles for 
RNA molecules. 



SUMMARY 8.2 Nucleic Acid Structure 

■ Many lines of evidence show that DNA bears 
genetic information. In particular, the 
Avery-MacLeod-McCarty experiment showed that 
DNA isolated from one bacterial strain can enter 
and transform the cells of another strain, endowing 
it with some of the inheritable characteristics of the 
donor. The Hershey-Chase experiment showed that 



the DNA of a bacterial virus, but not its protein 
coat, carries the genetic message for replication of 
the virus in a host cell. 

■ Putting together the available data, Watson and 
Crick postulated that native DNA consists of two 
antiparallel chains in a right-handed double-helical 
arrangement. Complementary base pairs, A=T and 
G=C, are formed by hydrogen bonding within the 
helix. The base pairs are stacked perpendicular to 
the long axis of the double helix, 3.4 A apart, with 
10.5 base pairs per turn. 

■ DNA can exist in several structural forms. Two 
variations of the Watson-Crick form, or B-DNA, are 
A- and Z-DNA. Some sequence-dependent structural 
variations cause bends in the DNA molecule. DNA 
strands with appropriate sequences can form hairpin/ 
cruciform structures or triplex or tetraplex DNA. 

■ Messenger RNA transfers genetic information from 
DNA to ribosomes for protein synthesis. Transfer 
RNA and ribosomal RNA are also involved in 
protein synthesis. RNA can be structurally 
complex; single RNA strands can fold into hairpins, 
double-stranded regions, or complex loops. 

8.3 Nucleic Acid Chemistry 

The role of DNA as a repository of genetic information 
depends in part on its inherent stability. The chemical 
transformations that do occur are generally very slow in 
the absence of an enzyme catalyst. The long-term stor- 
age of information without alteration is so important to a 
cell, however, that even very slow reactions that alter 
DNA structure can be physiologically significant. 
Processes such as carcinogenesis and aging may be inti- 
mately linked to slowly accumulating, irreversible alter- 
ations of DNA. Other, nondestructive alterations also 
occur and are essential to function, such as the strand 
separation that must precede DNA replication or tran- 
scription. In addition to providing insights into physio- 
logical processes, our understanding of nucleic acid 
chemistry has given us a powerful array of technologies 
that have applications in molecular biology, medicine, 
and forensic science. We now examine the chemical 
properties of DNA and some of these technologies. 

Double-Helical DNA and RNA Can Be Denatured 

Solutions of carefully isolated, native DNA are highly 
viscous at pH 7.0 and room temperature (25 °C). When 
such a solution is subjected to extremes of pH or to tem- 
peratures above 80 °C, its viscosity decreases sharply, 
indicating that the DNA has undergone a physical 
change. Just as heat and extremes of pH denature glob- 
ular proteins, they also cause denaturation, or melting, 
of double-helical DNA. Disruption of the hydrogen 
bonds between paired bases and of base stacking causes 
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unwinding of the double helix to form two single 
strands, completely separate from each other along the 
entire length or part of the length (partial denaturation) 
of the molecule. No covalent bonds in the DNA are 
broken (Fig. 8-26). 

Renaturation of a DNA molecule is a rapid one-step 
process, as long as a double-helical segment of a dozen 
or more residues still unites the two strands. When the 
temperature or pH is returned to the range in which 
most organisms live, the unwound segments of the two 
strands spontaneously rewind, or anneal, to yield the 
intact duplex (Fig. 8-26). However, if the two strands 
are completely separated, renaturation occurs in two 
steps. In the first, relatively slow step, the two strands 
"find" each other by random collisions and form a short 
segment of complementary double helix. The second 
step is much faster: the remaining unpaired bases 
successively come into register as base pairs, and the 
two strands "zipper" themselves together to form the 
double helix. 

The close interaction between stacked bases in a 
nucleic acid has the effect of decreasing its absorption 
of UV light relative to that of a solution with the same 
concentration of free nucleotides, and the absorption 
is decreased further when two complementary nucleic 
acid strands are paired. This is called the hypochromic 
effect. Denaturation of a double-stranded nucleic acid 
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FIGURE 8-26 Reversible denaturation and annealing (renaturation) of 
DNA. 
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produces the opposite result: an increase in absorp- 
tion called the hyperchromic effect. The transition 
from double-stranded DNA to the single-stranded, de- 
natured form can thus be detected by monitoring UV 
absorption at 260 nm. 

Viral or bacterial DNA molecules in solution dena- 
ture when they are heated slowly (Fig. 8-27). Each 
species of DNA has a characteristic denaturation tem- 
perature, or melting point (t m \ formally, the temperature 
at which half the DNA is present as separated single 
strands): the higher its content of G=C base pairs, 
the higher the melting point of the DNA. This is because 
G=C base pairs, with three hydrogen bonds, require 
more heat energy to dissociate than A=T base pairs. 
Thus the melting point of a DNA molecule, determined 
under fixed conditions of pH and ionic strength, can 
yield an estimate of its base composition. If denaturation 
conditions are carefully controlled, regions that are rich 



(a) 




FIGURE 8-27 Heat denaturation of DNA. (a) The denaturation, or 
melting, curves of two DNA specimens. The temperature at the mid- 
point of the transition (t m ) is the melting point; it depends on pH and 
ionic strength and on the size and base composition of the DNA. (b) 
Relationship between t m and the G + C content of a DNA. 
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FIGURE 8-28 Partially denatured DNA. This DNA was partially dena- 
tured, then fixed to prevent renaturation during sample preparation. 
The shadowing method used to visualize the DNA in this electron mi- 
crograph increases its diameter approximately fivefold and obliterates 
most details of the helix. However, length measurements can be ob- 
tained, and single-stranded regions are readily distinguishable from 
double-stranded regions. The arrows point to some single-stranded 
bubbles where denaturation has occurred. The regions that denature 
are highly reproducible and are rich in A=T base pairs. 



in A=T base pairs will specifically denature while most 
of the DNA remains double-stranded. Such denatured 
regions (called bubbles) can be visualized with electron 
microscopy (Fig. 8-28). Note that in the strand sepa- 
ration of DNA that occurs in vivo during processes such 
as DNA replication and transcription, the sites where 
these processes are initiated are often rich in A=T base 
pairs, as we shall see. 

Duplexes of two RNA strands or one RNA strand 
and one DNA strand (RNA-DNA hybrids) can also be 
denatured. Notably, RNA duplexes are more stable than 
DNA duplexes. At neutral pH, denaturation of a double- 
helical RNA often requires temperatures 20 °C or more 
higher than those required for denaturation of a DNA 
molecule with a comparable sequence. The stability of 
an RNA-DNA hybrid is generally intermediate between 
that of RNA and that of DNA. The physical basis for 
these differences in thermal stability is not known. 

Nucleic Acids from Different Species Can Form Hybrids 

The ability of two complementary DNA strands to pair 
with one another can be used to detect similar DNA 
sequences in two different species or within the gen- 
ome of a single species. If duplex DNAs isolated from 
human cells and from mouse cells are completely de- 
natured by heating, then mixed and kept at about 25 °C 
below their t m for many hours, much of the DNA will 
anneal. The rate of DNA annealing is affected by tem- 
perature, the length and concentration of the DNA frag- 
ments being annealed, the concentration of salts in the 
reaction mixture, and properties of the sequence itself 



(e.g., complexity and G=C content). Temperature is 
especially important. If the temperature is too low, 
short sequences with coincidental similarity from dis- 
tant, heterologous parts of the DNA molecules will an- 
neal unproductively and interfere with the more 
general alignment of complementary DNA strands. 
Temperatures that are too high will favor denaturation. 
Most of the reannealing occurs between complemen- 
tary mouse DNA strands to form mouse duplex DNA; 
similarly, most human DNA strands anneal with com- 
plementary human DNA strands. However, some 
strands of the mouse DNA will associate with human 
DNA strands to yield hybrid duplexes, in which seg- 
ments of a mouse DNA strand form base-paired regions 
with segments of a human DNA strand (Fig. 8-29). 
This reflects a common evolutionary heritage; different 
organisms generally have many proteins and RNAs with 
similar functions and, often, similar structures. In many 
cases, the DNAs encoding these proteins and RNAs 
have similar sequences. The closer the evolutionary re- 
lationship between two species, the more extensively 
their DNAs will hybridize. For example, human DNA 
hybridizes much more extensively with mouse DNA 
than with DNA from yeast. 

The hybridization of DNA strands from different 
sources forms the basis for a powerful set of techniques 
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FIGURE 8-29 DNA hybridization. Two DNA samples to be compared 
are completely denatured by heating. When the two solutions are 
mixed and slowly cooled, DNA strands of each sample associate with 
their normal complementary partner and anneal to form duplexes. If 
the two DNAs have significant sequence similarity they also tend to 
form partial duplexes or hybrids with each other: the greater the 
sequence similarity between the two DNAs, the greater the number of 
hybrids formed. Hybrid formation can be measured in several ways. 
One of the DNAs is usually labeled with a radioactive isotope to sim- 
plify the measurements. 
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essential to the practice of modern molecular genetics. 
A specific DNA sequence or gene can be detected in the 
presence of many other sequences if one already has an 
appropriate complementary DNA strand (usually la- 
beled in some way) to hybridize with it (Chapter 9). The 
complementary DNA can be from a different species or 
from the same species, or it can be synthesized chemi- 
cally in the laboratory using techniques described later 
in this chapter. Hybridization techniques can be varied 
to detect a specific RNA rather than DNA. The isolation 
and identification of specific genes and RNAs rely on 
these hybridization techniques. Applications of this 
technology make possible the identification of an indi- 
vidual on the basis of a single hair left at the scene of a 
crime or the prediction of the onset of a disease decades 
before symptoms appear (see Box 9-1). 

Nucleotides and Nucleic Acids Undergo 
Nonenzymatic Transformations 

Purines and pyrimidines, along with the nu- 
cleotides of which they are a part, undergo 
spontaneous alterations in their covalent structure. 
The rate of these reactions is generally very slow, but 
they are physiologically significant because of the cell's 
very low tolerance for alterations in its genetic infor- 
mation. Alterations in DNA structure that produce per- 
manent changes in the genetic information encoded 
therein are called mutations, and much evidence sug- 
gests an intimate link between the accumulation of 
mutations in an individual organism and the processes 
of aging and carcinogenesis. 

Several nucleotide bases undergo spontaneous 
loss of their exocyclic amino groups (deamination) 
(Fig. 8-30a). For example, under typical cellular 
conditions, deamination of cytosine (in DNA) to uracil 
occurs in about one of every 10 7 cytidine residues in 
24 hours. This corresponds to about 100 spontaneous 
events per day, on average, in a mammalian cell. Deami- 
nation of adenine and guanine occurs at about l/100th 
this rate. 

The slow cytosine deamination reaction seems in- 
nocuous enough, but is almost certainly the reason why 
DNA contains thymine rather than uracil. The product 
of cytosine deamination (uracil) is readily recognized as 
foreign in DNA and is removed by a repair system 
(Chapter 25). If DNA normally contained uracil, recog- 
nition of uracils resulting from cytosine deamination 
would be more difficult, and unrepaired uracils would 
lead to permanent sequence changes as they were 
paired with adenines during replication. Cytosine deam- 
ination would gradually lead to a decrease in G=C base 
pairs and an increase in A=U base pairs in the DNA of 
all cells. Over the millennia, cytosine deamination could 
eliminate G=C base pairs and the genetic code that de- 
pends on them. Establishing thymine as one of the four 
bases in DNA may well have been one of the crucial 
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FIGURE 8-30 Some well-characterized nonenzymatic reactions 
of nucleotides, (a) Deamination reactions. Only the base is shown, 
(b) Depurination, in which a purine is lost by hydrolysis of the N-(S- 
glycosyl bond. Loss of pyrimidines via a similar reaction occurs, but 
much more slowly. The resulting lesion, in which the deoxyribose is 
present but the base is not, is called an abasic site or an AP site 
{apurinic site or, rarely, apyrimidinic site). The deoxyribose remaining 
after depurination is readily converted from the jS-furanose to the alde- 
hyde form (see Fig. 8-3) Further nonenzymatic reactions are illus- 
trated in Figures 8-31 and 8-32, 



turning points in evolution, making the long-term stor- 
age of genetic information possible. 

Another important reaction in deoxyribonucleotides 
is the hydrolysis of the TV-jS-glycosyl bond between the 
base and the pentose, to create a DNA lesion called an 
AP (apurinic, apyrimidinic) site or abasic site (Fig. 
8-30b). This occurs at a higher rate for purines than for 
pyrimidines. As many as one in 10 5 purines (10,000 per 
mammalian cell) are lost from DNA every 24 hours un- 
der typical cellular conditions. Depurination of ribonu- 
cleotides and RNA is much slower and generally is 
not considered physiologically significant. In the test 
tube, loss of purines can be accelerated by dilute acid. 
Incubation of DNA at pH 3 causes selective removal 
of the purine bases, resulting in a derivative called 
apurinic acid. 

Other reactions are promoted by radiation. UV light 
induces the condensation of two ethylene groups to 
form a cyclobutane ring. In the cell, the same reaction 
between adjacent pyrimidine bases in nucleic acids 
forms cyclobutane pyrimidine dimers. This happens 
most frequently between adjacent thymidine residues 



on the same DNA strand (Fig. 8-31). A second type of 
pyrimidine dimer, called a 6-4 photoproduct, is also 
formed during UV irradiation. Ionizing radiation (x rays 
and gamma rays) can cause ring opening and fragmen- 
tation of bases as well as breaks in the covalent back- 
bone of nucleic acids. 

Virtually all forms of life are exposed to energy-rich 
radiation capable of causing chemical changes in DNA. 
Near-UV radiation (with wavelengths of 200 to 400 nm), 
which makes up a significant portion of the solar spec- 
trum, is known to cause pyrimidine dimer formation and 
other chemical changes in the DNA of bacteria and of 
human skin cells. We are subject to a constant field of 
ionizing radiation in the form of cosmic rays, which can 
penetrate deep into the earth, as well as radiation emit- 
ted from radioactive elements, such as radium, pluto- 
nium, uranium, radon, 14 C, and 3 H. X rays used in 
medical and dental examinations and in radiation ther- 
apy of cancer and other diseases are another form of 
ionizing radiation. It is estimated that UV and ionizing 
radiations are responsible for about 10% of all DNA 
damage caused by environmental agents. 
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FIGURE 8-31 Formation of pyrimidine dimers induced by UV light. 

(a) One type of reaction (on the left) results in the formation of a cy- 
clobutyl ring involving C-5 and C-6 of adjacent pyrimidine residues. 
An alternative reaction (on the right) results in a 6-4 photoproduct, 



with a linkage between C-6 of one pyrimidine and C-4 of its neighbor, 
(b) Formation of a cyclobutane pyrimidine dimer introduces a bend or 
kink into the DNA. 



DNA also may be damaged by reactive chemicals in- 
troduced into the environment as products of industrial 
activity Such products may not be injurious per se but 
may be metabolized by cells into forms that are. There 
are two prominent classes of such agents (Fig. 8-32): 
(1) deaminating agents, particularly nitrous acid (HN0 2 ) 
or compounds that can be metabolized to nitrous acid or 
nitrites, and (2) alkylating agents. 

Nitrous acid, formed from organic precursors 
such as nitrosamines and from nitrite and nitrate 



salts, is a potent accelerator of the deamination of 
bases. Bisulfite has similar effects. Both agents are 
used as preservatives in processed foods to prevent 
the growth of toxic bacteria. They do not seem to in- 
crease cancer risks significantly when used in this 
way, perhaps because they are used in small amounts 
and make only a minor contribution to the overall 
levels of DNA damage. (The potential health risk 
from food spoilage if these preservatives were not 
used is much greater.) 



FIGURE 8-32 Chemical agents that cause 
DNA damage, (a) Precursors of nitrous acid, 
which promotes deamination reactions, 
(b) Alkylating agents. 
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Alkylating agents can alter certain bases of DNA. 
For example, the highly reactive chemical dimethylsul- 
fate (Fig. 8-32b) can methylate a guanine to yield O 6 - 
methylguanine, which cannot base-pair with cytosine. 
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Many similar reactions are brought about by alkylating 
agents normally present in cells, such as S-adenosyl- 
methionine. 

The most important source of mutagenic alterations 
in DNA is oxidative damage. Excited-oxygen species 
such as hydrogen peroxide, hydroxyl radicals, and su- 
peroxide radicals arise during irradiation or as a byprod- 
uct of aerobic metabolism. Of these species, the hydroxyl 
radicals are responsible for most oxidative DNA damage. 
Cells have an elaborate defense system to destroy reac- 
tive oxygen species, including enzymes such as catalase 
and superoxide dismutase that convert reactive oxygen 
species to harmless products. A fraction of these oxi- 
dants inevitably escape cellular defenses, however, and 
damage to DNA occurs through any of a large, complex 
group of reactions ranging from oxidation of deoxyribose 
and base moieties to strand breaks. Accurate estimates 
for the extent of this damage are not yet available, but 
every day the DNA of each human cell is subjected to 
thousands of damaging oxidative reactions. 

This is merely a sampling of the best-understood re- 
actions that damage DNA. Many carcinogenic compounds 
in food, water, or air exert their cancer-causing effects by 
modifying bases in DNA. Nevertheless, the integrity of 
DNA as a polymer is better maintained than that of either 
RNA or protein, because DNA is the only macromolecule 
that has the benefit of biochemical repair systems. These 
repair processes (described in Chapter 25) greatly lessen 
the impact of damage to DNA. ■ 

Some Bases of DNA Are Methylated 

Certain nucleotide bases in DNA molecules are 
enzymatically methylated. Adenine and cytosine are 
methylated more often than guanine and thymine. 
Methylation is generally confined to certain sequences or 
regions of a DNA molecule. In some cases the function of 
methylation is well understood; in others the function re- 
mains unclear. All known DNA methylases use S-adeno- 
sylmethionine as a methyl group donor (Fig. 8-32b). 



E. coli has two prominent methylation systems. One 
serves as part of a defense mechanism that helps the 
cell to distinguish its DNA from foreign DNA by mark- 
ing its own DNA with methyl groups and destroying 
(foreign) DNA without the methyl groups (this is 
known as a restriction-modification system; see p. 305). 
The other system methylates adenosine residues within 
the sequence (5')GATC(3') to A/^-methyladenosine 
(Fig. 8-5a). This is mediated by the Dam (DNA adenine 
methylation) methylase, a component of a system that 
repairs mismatched base pairs formed occasionally 
during DNA replication (see Fig. 25-22). 

In eukaryotic cells, about 5% of cytidine residues in 
DNA are methylated to 5-methylcytidine (Fig. 8-5a). 
Methylation is most common at CpG sequences, produc- 
ing methyl-CpG symmetrically on both strands of the 
DNA. The extent of methylation of CpG sequences varies 
by molecular region in large eukaryotic DNA molecules. 

The Sequences of Long DNA Strands Can Be Determined 

In its capacity as a repository of information, a DNA mol- 
ecule's most important property is its nucleotide se- 
quence. Until the late 1970s, determining the sequence 
of a nucleic acid containing even five or ten nucleotides 
was very laborious. The development of two new tech- 
niques in 1977, one by Alan Maxam and Walter Gilbert 
and the other by Frederick Sanger, made possible the 
sequencing of larger DNA molecules with an ease 
unimagined just a few years before. The techniques de- 
pend on an improved understanding of nucleotide 
chemistry and DNA metabolism, and on electrophoretic 
methods for separating DNA strands differing in size by 
only one nucleotide. Electrophoresis of DNA is similar 
to that of proteins (see Fig. 3-18). Polyacrylamide is 
often used as the gel matrix in work with short DNA 
molecules (up to a few hundred nucleotides); agarose is 
generally used for longer pieces of DNA. 

In both Sanger and Maxam-Gilbert sequencing, the 
general principle is to reduce the DNA to four sets of la- 
beled fragments. The reaction producing each set is 
base-specific, so the lengths of the fragments corre- 
spond to positions in the DNA sequence where a certain 
base occurs. For example, for an oligonucleotide with 
the sequence pAATCGACT, labeled at the 5' end (the 
left end), a reaction that breaks the DNA after each C 
residue will generate two labeled fragments: a four- 
nucleotide and a seven-nucleotide fragment; a reaction 
that breaks the DNA after each G will produce only one 
labeled, five-nucleotide fragment. Because the frag- 
ments are radioactively labeled at their 5' ends, only the 
fragment to the 5' side of the break is visualized. The 
fragment sizes correspond to the relative positions of C 
and G residues in the sequence. When the sets of frag- 
ments corresponding to each of the four bases are elec- 
trophoretically separated side by side, they produce a 
ladder of bands from which the sequence can be read 
directly (Fig. 8-33). We illustrate only the Sanger 
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FIGURE 8-33 DNA sequencing by the Sanger method. This method 
makes use of the mechanism of DNA synthesis by DNA polymerases 
(Chapter 25). (a) DNA polymerases require both a primer (a short 
oligonucleotide strand), to which nucleotides are added, and a tem- 
plate strand to guide selection of each new nucleotide. In cells, the 3'- 
hydroxyl group of the primer reacts with an incoming deoxynucleoside 
triphosphate (dNTP) to form a new phosphodiester bond, (b) The 
Sanger sequencing procedure uses dideoxynucleoside triphosphate 
(ddNTP) analogs to interrupt DNA synthesis. (The Sanger method is also 
known as the dideoxy method.) When a ddNTP is inserted in place of 
a dNTP, strand elongation is halted after the analog is added, because it 
lacks the 3'-hydroxyl group needed for the next step. (c)The DNA to be 
sequenced is used as the template strand, and a short primer, radioac- 
tively or fluorescently labeled, is annealed to it. By addition of small 
amounts of a single ddNTP, for example ddCTP, to an otherwise nor- 
mal reaction system, the synthesized strands will be prematurely ter- 
minated at some locations where dC normally occurs. Given the 
excess of dCTP over ddCTP, the chance that the analog will be incor- 
porated whenever a dC is to be added is small. However, ddCTP is 
present in sufficient amounts to ensure that each new strand has a high 
probability of acquiring at least one ddC at some point during synthe- 
sis. The result is a solution containing a mixture of labeled fragments, 
each ending with a C residue. Each C residue in the sequence gener- 
ates a set of fragments of a particular length, such that the different- 
sized fragments, separated by electrophoresis, reveal the location of C 
residues. This procedure is repeated separately for each of the four 
ddNTPs, and the sequence can be read directly from an autoradi- 
ogram of the gel. Because shorter DNA fragments migrate faster, the 
fragments near the bottom of the gel represent the nucleotide positions 
closest to the primer (the 5' end), and the sequence is read (in the 5' — > 
3' direction) from bottom to top. Note that the sequence obtained is 
that of the strand complementary to the strand being analyzed. 
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method, because it has proved to be technically easier 
and is in more widespread use. It requires the enzymatic 
synthesis of a DNA strand complementary to the strand 
under analysis, using a radioactively labeled "primer" 
and dideoxynucleotides. 

DNA sequencing is now automated by a variation of 
Sanger's sequencing method in which the dideoxynu- 
cleotides used for each reaction are labeled with a dif- 
ferently colored fluorescent tag (Fi^ f . 8-34). With this 
technology, researchers can sequence DNA molecules 
containing thousands of nucleotides in a few hours. The 
entire genomes of more than a thousand organisms have 
now been sequenced in this way (see Table 1-2), and 
many very large DNA-sequencing projects have been 
completed or are in progress. For example, in the Human 
Genome Project, researchers have sequenced all 3.2 bil- 
lion base pairs of the DNA in a human cell (Chapter 9). 
|| Dideoxy Sequencing of DNA 

The Chemical Synthesis of DNA Has Been 
Automated 

Also important in nucleic acid chemistry is the rapid 
and accurate synthesis of short oligonucleotides of 
known sequence. The methods were pioneered by H. 
Gobind Khorana and his colleagues in the 1970s. Re- 
finements by Robert Letsinger and Marvin Caruthers 
led to the chemistry now in widest use, called the phos- 
phoramidite method (Fig. 8-35). The synthesis is car- 
ried out with the growing strand attached to a solid 
support, using principles similar to those used by Mer- 
rifield for peptide synthesis (see Fig. 3-29), and is read- 
ily automated. The efficiency of each addition step is 
very high, allowing the routine synthesis of polymers 
containing 70 or 80 nucleotides and, in some laborato- 
ries, much longer strands. The availability of relatively 
inexpensive DNA polymers with predesigned sequences 
is having a powerful impact on all areas of biochemistry 
(Chapter 9). 



FIGURE 8-34 Strategy for automating DNA sequencing reactions. 

Each dideoxynucleotide used in the Sanger method can be linked to a 
fluorescent molecule that gives all the fragments terminating in that 
nucleotide a particular color. All four labeled ddNTPs are added to a 
single tube. The resulting colored DNA fragments are then separated 
by size in a single electrophoretic gel contained in a capillary tube (a 
refinement of gel electrophoresis that allows for faster separations). All 
fragments of a given length migrate through the capillary gel in a sin- 
gle peak, and the color associated with each peak is detected using a 
laser beam. The DNA sequence is read by determining the sequence of 
colors in the peaks as they pass the detector. This information is fed di- 
rectly to a computer, which determines the sequence. 
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FIGURE 8-35 Chemical synthesis of DNA by the phosphoramidite 
method. Automated DNA synthesis is conceptually similar to the syn- 
thesis of polypeptides on a solid support. The oligonucleotide is built up 
on the solid support (silica), one nucleotide at a time, in a repeated se- 
ries of chemical reactions with suitably protected nucleotide precursors. 
© The first nucleoside (which will be the 3' end) is attached to the 
silica support at the 3' hydroxyl (through a linking group, R) and is pro- 
tected at the 5' hydroxyl with an acid-labile dimethoxytrityl group 
(DMT), The reactive groups on all bases are also chemically protected. 
@ The protecting DMT group is removed by washing the column with 
acid (the DMT group is colored, so this reaction can be followed spec- 
trophotometrically). @ The next nucleotide has a reactive phospho- 
ramidite at its 3' position: a trivalent phosphite (as opposed to the more 
oxidized pentavalent phosphate normally present in nucleic acids) with 
one linked oxygen replaced by an amino group or substituted amine. In 
the common variant shown, one of the phosphoramidite oxygens is 
bonded to the deoxyribose, the other is protected by a cyanoethyl group, 




and the third position is occupied by a readily displaced diisopropyl- 
amino group. Reaction with the immobilized nucleotide forms a 5', 3' 
linkage, and the diisopropylamino group is eliminated. In step ©, the 
phosphite linkage is oxidized with iodine to produce a phosphotriester 
linkage. Reactions @ through @ are repeated until all nucleotides are 
added. At each step, excess nucleotide is removed before addition of the 
next nucleotide. In steps ® and © the remaining protecting groups on 
the bases and the phosphates are removed, and in ©the oligonu- 
cleotide is separated from the solid support and purified. The chemical 
synthesis of RNA is somewhat more complicated because of the need to 
protect the 2' hydroxyl of ribose without adversely affecting the reactiv- 
ity of the 3' hydroxyl. 
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SUMMARY 8.3 Nucleic Acid Chemistry 

Native DNA undergoes reversible unwinding and 
separation of strands (melting) on heating or at 
extremes of pH. DNAs rich in G=C pairs have 
higher melting points than DNAs rich in A=T pairs. 

■ Denatured single-stranded DNAs from two species 
can form a hybrid duplex, the degree of hybridization 
depending on the extent of sequence similarity. 
Hybridization is the basis for important techniques 
used to study and isolate specific genes and RNAs. 

m DNA is a relatively stable polymer. Spontaneous 
reactions such as deamination of certain bases, 
hydrolysis of base-sugar Af-glycosyl bonds, 
radiation-induced formation of pyrimidine dimers, 
and oxidative damage occur at very low rates, yet 
are important because of a cell's very low tolerance 
for changes in genetic material. 

■ DNA sequences can be determined and DNA 
polymers synthesized with simple, automated 
protocols involving chemical and enzymatic methods. 



8.4 Other Functions of Nucleotides 

In addition to their roles as the subunits of nucleic acids, 
nucleotides have a variety of other functions in every 
cell: as energy carriers, components of enzyme cofac- 
tors, and chemical messengers. 

Nucleotides Carry Chemical Energy in Ceils 

The phosphate group covalently linked at the 5' 
hydroxyl of a ribonucleotide may have one or two 
additional phosphates attached. The resulting mole- 
cules are referred to as nucleoside mono-, di-, and 
triphosphates ( Fig. 8-36). Starting from the ribose, the 
three phosphates are generally labeled a, /3, and y. 
Hydrolysis of nucleoside triphosphates provides the 
chemical energy to drive many cellular reactions. 
Adenosine 5 '-triphosphate, ATP, is by far the most 
widely used for this purpose, but UTP, GTP, and CTP are 
also used in some reactions. Nucleoside triphosphates 
also serve as the activated precursors of DNA and RNA 
synthesis, as described in Chapters 25 and 26. 

The energy released by hydrolysis of ATP and the 
other nucleoside triphosphates is accounted for by the 
structure of the triphosphate group. The bond between 
the ribose and the a phosphate is an ester linkage. 
The and jS,y linkages are phosphoanhydrides 
(Fig. 8-37). Hydrolysis of the ester linkage yields about 
14 kJ/mol under standard conditions, whereas hydroly- 
sis of each anhydride bond yields about 30 kJ/mol. ATP 
hydrolysis often plays an important thermodynamic role 
in biosynthesis. When coupled to a reaction with a posi- 
tive free-energy change, ATP hydrolysis shifts the equi- 
librium of the overall process to favor product formation 
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FIGURE 8-36 Nucleoside phosphates. General structure of the nucle- 
oside 5 '-mono-, di-, and triphosphates (NMPs, NDPs, and NTPs) and 
their standard abbreviations. In the deoxyribonucleoside phosphates 
(dNMPs, dNDPs, and dNTPs), the pentose is 2'-deoxy-D-ribose. 
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FIGURE 8-37 The phosphate ester and phosphoanhydride bonds of 
ATP. Hydrolysis of an anhydride bond yields more energy than hy- 
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ester are shown for comparison 
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(recall the relationship between equilibrium constant and 
free-energy change described by Eqn 6-3 on p. 188). 

Adenine Nucleotides Are Components of Many 
Enzyme Cofactors 

A variety of enzyme cofactors serving a wide range of 
chemical functions include adenosine as part of their 
structure (Fig. 8-38). They are unrelated structurally 
except for the presence of adenosine. In none of these 
cofactors does the adenosine portion participate di- 
rectly in the primary function, but removal of adenosine 
generally results in a drastic reduction of cofactor activ- 
ities. For example, removal of the adenine nucleotide 
(3'-phosphoadenosine diphosphate) from acetoacetyl- 
CoA, the coenzyme A derivative of acetoacetate, re- 
duces its reactivity as a substrate for /3-ketoacyl-CoA 



transferase (an enzyme of lipid metabolism) by a factor 
of 10 6 . Although this requirement for adenosine has not 
been investigated in detail, it must involve the binding 
energy between enzyme and substrate (or cofactor) 
that is used both in catalysis and in stabilizing the initial 
enzyme-substrate complex (Chapter 6). In the case of 
/3-ketoacyl-CoA transferase, the nucleotide moiety of 
coenzyme A seems to be a binding "handle" that helps to 
pull the substrate (acetoacetyl-CoA) into the active site. 
Similar roles may be found for the nucleoside portion of 
other nucleotide cofactors. 

Why is adenosine, rather than some other large mol- 
ecule, used in these structures? The answer here may 
involve a form of evolutionary economy. Adenosine is 
certainly not unique in the amount of potential binding 
energy it can contribute. The importance of adenosine 
probably lies not so much in some special chemical 
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FIGURE 8-38 Some coenzymes containing adenosine. The adenosine 
portion is shaded in light red. Coenzyme A (CoA) functions in acyl 
group transfer reactions; the acyl group (such as the acetyl or ace- 
toacetyl group) is attached to the CoA through a thioester linkage to the 
/3-mercaptoethylamine moiety. NAD + functions in hydride transfers, 
and FAD, the active form of vitamin B 2 (riboflavin), in electron transfers. 
Another coenzyme incorporating adenosine is 5'-deoxyadenosylcobal- 
amin, the active form of vitamin B 12 (see Box 1 7-2), which participates 
in intramolecular group transfers between adjacent carbons. 
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characteristic as in the evolutionary advantage of using 
one compound for multiple roles. Once ATP became the 
universal source of chemical energy, systems developed 
to synthesize ATP in greater abundance than the other 
nucleotides; because it is abundant, it becomes the logi- 
cal choice for incorporation into a wide variety of struc- 
tures. The economy extends to protein structure. A 
single protein domain that binds adenosine can be used 
in different enzymes. Such a domain, called a nu- 
cleotide-binding fold, is found in many enzymes that 
bind ATP and nucleotide cof actors. 

Some Nucleotides Are Regulatory Molecules 

Cells respond to their environment by taking cues from 
hormones or other external chemical signals. The inter- 
action of these extracellular chemical signals ("first mes- 
sengers") with receptors on the cell surface often leads to 
the production of second messengers inside the cell, 
which in turn leads to adaptive changes in the cell interior 
(Chapter 12). Often, the second messenger is a nu- 
cleotide (Fig. 8-39). One of the most common is 
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adenosine 3',5'-cyclic monophosphate (cyclic AMP, 
or cAMP), formed from ATP in a reaction catalyzed by 
adenylyl cyclase, an enzyme associated with the inner 
face of the plasma membrane. Cyclic AMP serves regula- 
tory functions in virtually every cell outside the plant 
kingdom. Guanosine 3 ',5 '-cyclic monophosphate (cGMP) 
occurs in many cells and also has regulatory functions. 

Another regulatory nucleotide, ppGpp (Fig. 8-39), is 
produced in bacteria in response to a slowdown in protein 
synthesis during amino acid starvation. This nucleotide 
inhibits the synthesis of the rRNA and tRNA molecules 
(see Fig. 28-24) needed for protein synthesis, preventing 
the unnecessary production of nucleic acids. 



SUMMARY 8.4 



Other Functions of 
Nucleotides 



■ ATP is the central carrier of chemical energy in 
cells. The presence of an adenosine moiety in a 
variety of enzyme cofactors may be related to 
binding-energy requirements. 

■ Cyclic AMP, formed from ATP in a reaction 
catalyzed by adenylyl cyclase, is a common second 
messenger produced in response to hormones and 
other chemical signals. 
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Problems 

1. Nucleotide Structure Which positions in the purine ring 
of a purine nucleotide in DNA have the potential to form hy- 
drogen bonds but are not involved in Watson-Crick base 
pairing? 



2. Base Sequence of Complementary DNA Strands One 

strand of a double-helical DNA has the sequence (5')GCG- 
CAATATTTCTCAAAATATTGCGC (3 ') ■ Write the base sequence 
of the complementary strand. What special type of sequence is 
contained in this DNA segment? Does the double-stranded DNA 
have the potential to form any alternative structures? 

3. DNA of the Human Body Calculate the weight in grams 
of a double-helical DNA molecule stretching from the Earth to 
the moon (-320,000 km). The DNA double helix weighs about 
1 x 10" 18 g per 1,000 nucleotide pairs; each base pair extends 
3.4 A. For an interesting comparison, your body contains 
about 0.5 g of DNA! 

4. DNA Bending Assume that a poly(A) tract five base pairs 
long produces a 20° bend in a DNA strand. Calculate the total 
(net) bend produced in a DNA if the center base pairs (the 
third of five) of two successive (dA) 5 tracts are located (a) 10 
base pairs apart; (b) 15 base pairs apart. Assume 10 base pairs 
per turn in the DNA double helix. 

5. Distinction between DNA Structure and RNA Struc- 
ture Hairpins may form at palindromic sequences in single 
strands of either RNA or DNA. How is the helical structure of 
a long and fully base-paired (except at the end) hairpin in RNA 
different from that of a similar hairpin in DNA? 

6. Nucleotide Chemistry The cells of many eukaryotic or- 
ganisms have highly specialized systems that specifically re- 
pair G-T mismatches in DNA. The mismatch is repaired to 
form a G^C (not A=T) base pair. This G-T mismatch repair 
mechanism occurs in addition to a more general system that 
repairs virtually all mismatches. Can you suggest why cells 
might require a specialized system to repair G-T mismatches? 

7. Spontaneous DNA Damage Hydrolysis of the A^-glycosyl 
bond between deoxyribose and a purine in DNA creates an AP 
site. An AP site generates a thermodynamic destabilization 
greater than that created by any DNA mismatched base pair. 
This effect is not completely understood. Examine the struc- 
ture of an AP site (see Fig. 8-33b) and describe some chemical 
consequences of base loss. 

8. Nucleic Acid Structure Explain why the absorption of 
UV light by double-stranded DNA increases (the hyper- 
chromic effect) when the DNA is denatured. 

9. Determination of Protein Concentration in a Solu- 
tion Containing Proteins and Nucleic Acids The concen- 
tration of protein or nucleic acid in a solution containing both 
can be estimated by using their different light absorption prop- 
erties: proteins absorb most strongly at 280 ran and nucleic 
acids at 260 ran. Estimates of their respective concentrations 
in a mixture can be made by measuring the absorbance (A) of 
the solution at 280 and 260 nm and using the table on page 300, 
which gives #280/260, the ratio of absorbances at 280 and 260 nm; 
the percentage of total mass that is nucleic acid; and a factor, F, 
that corrects the^so reading and gives a more accurate protein 
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estimate. The protein concentration (in mg/mL) = F X A 2 &o 
(assuming the cuvette is 1 cm wide). Calculate the protein 
concentration in a solution of^4 2 so = 0.69 and A 2 q 0 = 0.94. 





Proportion of 
niirlHr arid 


F 


1.75 


0.00 


1.116 


1.63 


0.25 


1 081 


1.52 


0.50 


1.054 


1.40 


0.75 


1.023 


1.36 


1.00 


0.994 


1 30 


1.25 


0.970 


1.25 


1 50 


0.944 


1.16 


2.00 


0.899 


1.09 


2 50 


0.852 


1.03 


3.00 


0 814 


0.979 


3 50 


0 776 


0.939 


4 00 


0.743 


0.874 


5 00 


0 682 


0.846 


5.50 


0.656 


0.822 


6.00 


0.632 


0.804 


6 50 


0.607 


0 784 


7.00 


0.585 


0.767 


7.50 


0.565 


0.753 


8.00 


0.545 


0 730 


9.00 


0.508 


0 705 


10.00 


0.478 


0.671 


12 00 


0.422 


0.644 


14 00 


0.377 


0.615 


17.00 


0.322 


0.595 


20.00 


0.278 



10. Solubility of the Components of DNA Draw the fol- 
lowing structures and rate their relative solubilities in water 
(most soluble to least soluble): deoxyribose, guanine, phos- 
phate. How are these solubilities consistent with the three- 
dimensional structure of double-stranded DNA? 

11. Sanger Sequencing Logic In the Sanger (dideoxy) 
method for DNA sequencing, a small amount of a dideoxynu- 
cleotide triphosphate— say, ddCTP— is added to the sequencing 
reaction along with a larger amount of the corresponding dCTP. 
What result would be observed if the dCTP were omitted? 

12. DNA Sequencing The following DNA fragment was se- 
quenced by the Sanger method. The red asterisk indicates a 
fluorescent label. 

*5' 3'-OH 

3' ATTACGCAAGGACATTAGAC— 5' 

A sample of the DNA was reacted with DNA polymerase and 
each of the nucleotide mixtures (in an appropriate buffer) 
listed below. Dideoxynucleotides (ddNTPs) were added in 
relatively small amounts 

1. dATP, dTTP, dCTP, dGTP, ddTTP 

2. dATP, dTTP, dCTP, dGTP, ddGTP 



3. dATP, dCTP, dGTP, ddTTP 

4. dATP, dTTP, dCTP, dGTP 

The resulting DNA was separated by electrophoresis on an 
agarose gel, and the fluorescent bands on the gel were located. 
The band pattern resulting from nucleotide mixture 1 is shown 
below. Assuming that all mixtures were run on the same gel, 
what did the remaining lanes of the gel look like? 




13. Snake Venom Phosphodiesterase An exonuclease is an 
enzyme that sequentially cleaves nucleotides from the end of a 
polynucleotide strand. Snake venom phosphodiesterase, 
which hydrolyzes nucleotides from the 3' end of any oligonu- 
cleotide with a free 3'-hydroxyl group, cleaves between the 3' 
hydroxyl of the ribose or deoxyribose and the phosphoryl 
group of the next nucleotide. It acts on single-stranded DNA 
or RNA and has no base specificity This enzyme was used in 
sequence determination experiments before the development 
of modern nucleic acid sequencing techniques. What are the 
products of partial digestion by snake venom phosphodi- 
esterase of an oligonucleotide with the following sequence? 

( 5 ' ) GC GC C AUUGC ( 3 ' ) — O H 

14. Preserving DNA in Bacterial Endospores Bacterial 
endospores form when the environment is no longer con- 
ducive to active cell metabolism. The soil bacterium Bacillus 
subtilis, for example, begins the process of sporulation when 
one or more nutrients are depleted. The end product is a 
small, metabolically dormant structure that can survive almost 
indefinitely with no detectable metabolism. Spores have mech- 
anisms to prevent accumulation of potentially lethal mutations 
in their DNA over periods of dormancy that can exceed 1,000 
years. B subtilis spores are much more resistant than are the 
organism's growing cells to heat, UV radiation, and oxidizing 
agents, all of which promote mutations. 

(a) One factor that prevents potential DNA damage in 
spores is their greatly decreased water content. How would 
this affect some types of mutations? 

(b) Endospores have a category of proteins called small 
acid-soluble proteins (SASPs) that bind to their DNA, pre- 
venting formation of cyclobutane-type dimers. What causes 
cyclobutane dimers, and why do bacterial endospores need 
mechanisms to prevent their formation? 
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15. Oligonucleotide Synthesis In the scheme of Figure 
8-35, each new base to be added to the growing oligonu- 
cleotide is modified so that its 3' hydroxyl is activated and 
the 5' hydroxyl has a dimethoxytrityl (DMT) group at- 
tached. What is the function of the DMT group on the in- 
coming base? 

Biochemistry on the Internet 

16. The Structure of DNA Elucidation of the three- 
dimensional structure of DNA helped researchers understand 
how this molecule conveys information that can be faithfully 
replicated from one generation to the next. To see the second- 
ary structure of double-stranded DNA, go to the Protein Data 
Bank website (www.rcsb.org). Use the PDB identifiers listed 
below to retrieve the structure summaries for the two forms 
of DNA. Open the structures using Jmol (linked under the 
Display Options), and use the controls in the Jmol menu 
(accessed with a control-click or by clicking on the Jmol logo 
in the lower right corner of the image screen) to complete the 
following exercises. Refer to the Jmol help links as needed. 

(a) Obtain the file for 141D, a highly conserved, repeated 
DNA sequence from the end of the HIV-1 (the virus that 
causes AIDS) genome Display the molecule as a ball-and-stick 
structure (in the control menu, choose Select > All, then Ren- 
der > Scheme > Ball and Stick) . Identify the sugar-phosphate 
backbone for each strand of the DNA duplex. Locate and 
identify individual bases. Identify the 5' end of each strand 
Locate the major and minor grooves. Is this a right- or left- 
handed helix? 

(b) Obtain the file for 145D, a DNA with the Z conforma- 
tion. Display the molecule as a ball-and-stick structure. 
Identify the sugar-phosphate backbone for each strand of the 
DNA duplex. Is this a right- or left-handed helix? 

(c) To fully appreciate the secondary structure of DNA, 
view the molecules in stereo. On the control menu, Select > 
All, then Render > Stereographic > Cross-eyed or Wall- 
eyed, You will see two images of the DNA molecule. Sit with 
your nose approximately 10 inches from the monitor and fo- 
cus on the tip of your nose (cross-eyed) or the opposite 
edges of the screen (wall-eyed). In the background you 
should see three images of the DNA helix. Shift your focus 
to the middle image, which should appear three-dimen- 
sional. (Note that only one of the two authors can make this 
work.) 

Data Analysis Problem 

17. Chargaff's Studies of DNA Structure The chapter 
section "DNA Is a Double Helix that Stores Genetic Informa- 
tion" includes a summary of the main findings of Erwin Char- 
gaff and his coworkers, listed as four conclusions ("Chargaff's 
rules"; p. 278). In this problem, you will examine the data 
Chargaff collected in support of these conclusions, 

In one paper, Chargaff (1950) described his analytical 
methods and some early results. Briefly, he treated DNA 



samples with acid to remove the bases, separated the bases by 
paper chromatography, and measured the amount of each base 
with UV spectroscopy. His results are shown in the three tables 
below The molar ratio is the ratio of the number of moles of 
each base in the sample to the number of moles of phosphate in 
the sample — this gives the fraction of the total number of bases 
represented by each particular base. The recovery is the sum 
of all four bases (the sum of the molar ratios); full recovery of 
all bases in the DNA would give a recovery of 1.0. 



Molar ratios in ox DNA 







Thymus 




Spleen 


Liver 


Base 


Prep. 1 Prep. 2 Prep. 3 


Prep. 1 Prep. 2 


Prep. 1 


Adenine 


0.26 


0.28 


0.30 


0.25 


0.26 


0.26 


Guanine 


0.21 


0.24 


0.22 


0.20 


0.21 


0.20 


Cytosine 


0.16 


0 18 


0.17 


0.15 


0.17 




Thymine 


0.25 


0.24 


0.25 


0.24 


0.24 




Recovery 


o. 88 


0.94 


0 94 


0.84 


0.88 





Molar ratios in human DNA 
Sperm Thymus Liver 
Base Prep. 1 Prep. 2 Prep. 1 Normal Carcinoma 



Adenine 


0.29 


0.27 


0.28 


0.27 


0.27 


Guanine 


0.18 


0.17 


0.19 


0.19 


0 18 


Cytosine 


0.18 


0.18 


0.16 




0.15 


Thymine 


0 31 


0.30 


0.28 




0.27 


Recovery 


0.96 


0.92 


0.91 




0 87 



Molar ratios in DNA of microorganisms 

Yeast Avian tubercle bacilli 



Base 


Prep. 1 


Prep. 2 


Prep. 1 


Adenine 


0.24 


0.30 


0.12 


Guanine 


0.14 


0 18 


0.28 


Cytosine 


0.13 


0.15 


0.26 


Thymine 


0.25 


0.29 


0.11 


Recovery 


0. 76 


0.92 


0.77 



(a) Based on these data, Chargaff concluded that "no dif- 
ferences in composition have so far been found in DNA from 
different tissues of the same species." This corresponds to 
conclusion 2 in this chapter. However, a skeptic looking at the 
data above might say, "They certainly look different to me!" If 
you were Chargaff, how would you use the data to convince 
the skeptic to change her mind? 

(b) The base composition of DNA from normal and can- 
cerous liver cells (hepatocarcinoma) was not distinguishably 
different. Would you expect Chargaff's technique to be capable 
of detecting a difference between the DNA of normal and can- 
cerous cells? Explain your reasoning. 

As you might expect, Chargaff's data were not completely 
convincing. He went on to improve his techniques, as described 
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in a later paper (Chargaff, 1951), in which he reported molar 
ratios of bases in DNA from a variety of organisms: 



Source 


A:G 


T:C 


A:T 


G:C 


Purine iDvriinidiiie 


Ox 


1.29 


1.43 


1.04 


1.00 


1.1 


Human 


1.56 


1.75 


1.00 


1.00 


1.0 


Hen 


1.45 


1.29 


1.06 


0.91 


0.99 


Salmon 


1.43 


1.43 


1.02 


1.02 


1.02 


Wheat 


1.22 


1.18 


1.00 


0.97 


0.99 


Yeast 


1 67 


1.92 


1.03 


1.20 


1.0 


Haemophilus 












lift flu P'Yi yctc 












typec 


1.74 


1.54 


1.07 


0.91 


10 


E. coli K-12 


1.05 


0.95 


1.09 


0.99 


1.0 


Avian tubercle 












bacillus 


0.4 


0.4 


1.09 


1.08 


1.1 


Serratia 












marcescens 


0.7 


0.7 


0.95 


0.86 


0.9 


Bacillus schatz 


0.7 


0.6 


1.12 


0.89 


1.0 



(c) According to Chargaff, as stated in conclusion 1 in this 
chapter, "The base composition of DNA generally varies from 
one species to another." Provide an argument, based on the 
data presented so far, that supports this conclusion. 

(d) According to conclusion 4, "In all cellular DNAs, re- 
gardless of the species ...A + G = T + C." Provide an argu- 
ment, based on the data presented so far, that supports this 
conclusion. 

Part of Chargaff s intent was to disprove the "tetranucleotide 
hypothesis"; this was the idea that DNA was a monotonous 



tetranucleotide polymer (AGCT) n and therefore not capable of 
containing sequence information. Although the data presented 
above show that DNA cannot be simply a tetranucleotide— if so, 
all samples would have molar ratios of 0.25 for each base — it was 
still possible that the DNA from different organisms was a slightly 
more complex, but still monotonous, repeating sequence. 

To address this issue, Chargaff took DNA from wheat 
germ and treated it with the enzyme deoxyribonuclease for 
different time intervals. At each time interval, some of the 
DNA was converted to small fragments; the remaining, larger 
fragments he called the "core." In the table below, the "19% 
core" corresponds to the larger fragments left behind when 
81% of the DNA was degraded; the "8% core" corresponds to 
the larger fragments left after 92% degradation. 



Base 


Intact DNA 


19% Core 


8% Core 


Adenine 


0.27 


0.33 


0.35 


Guanine 


0.22 


0.20 


0.20 


Cytosine 


0.22 


0.16 


0.14 


Thymine 


0.27 


0.26 


0.23 


Recovery 


0.98 


0.95 


0.92 



(e) How would you use these data to argue that wheat 
germ DNA is not a monotonous repeating sequence? 
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Of all the natural systems, living matter is the one which, in the face 
of great transformations, preserves inscribed in its organization the 
largest amount of its own past history. 

— Emile Zuckerkandi and Linus Pauling, 
article in Journal of Theoretical Biology, 7965 
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We now turn to a technology that is fundamental 
to the advance of modern biological sciences, 
defining present and future biochemical fron- 
tiers and illustrating many important principles of bio- 
chemistry. Elucidation of the laws governing enzymatic 
catalysis, macromolecular structure, cellular metabo- 
lism, and information pathways allows research to be di- 
rected at increasingly complex biochemical processes. 
Cell division, immunity, embryogenesis, vision, taste, 
oncogenesis, cognition — all are orchestrated in an elab- 
orate symphony of molecular and macromolecular 
interactions that we are now beginning to understand 
with increasing clarity. The real implications of the bio- 
chemical journey begun in the nineteenth century are 
found in the ever-increasing power to ana- 
lyze and alter living systems. 

To understand a complex biological 
process, a biochemist isolates and studies 
the individual components in vitro, then 
pieces together the parts to get a coherent 
picture of the overall process. A major 
source of molecular insights is the cell's 
own information archive, its DNA. The 
sheer size of chromosomes, however, pre- 
sents an enormous challenge: how does one 
find and study a particular gene among the Paul Berg 



tens of thousands of genes nested in the billions of base 
pairs of a mammalian genome? Solutions began to 
emerge in the 1970s. 

Decades of advances by thousands of scientists 
working in genetics, biochemistry, cell biology, and 
physical chemistry came together in the laboratories of 
Paul Berg, Herbert Boyer, and Stanley Cohen to yield 
techniques for locating, isolating, preparing, and 
studying small segments of DNA derived from much 
larger chromosomes. Techniques for DNA cloning 
paved the way to the modern fields of genomics and 1 
proteomics, the study of genes and proteins on the 
scale of whole cells and organisms. These new methods 
are transforming basic research, agriculture, medicine, 
ecology, forensics, and many other fields, while occa- 
sionally presenting society with difficult choices and 
ethical dilemmas. 

We begin this chapter with an outline of the funda- 
mental biochemical principles of the now-classic disci- 
pline of DNA cloning. Next, we illustrate the range of 
applications and the potential of a range of newer tech- 
nologies, with a broad emphasis on modern advances in 
genomics and proteomics. 




Herbert Boyer Stanley N. Cohen 
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9.1 DNA Cloning: The Basics 

A clone is an identical copy. This term originally applied 
to cells of a single type, isolated and allowed to repro- 
duce to create a population of identical cells. DNA 
cloning involves separating a specific gene or DNA seg- 
ment from a larger chromosome, attaching it to a small 
molecule of carrier DNA, and then replicating this mod- 
ified DNA thousands or millions of times through both 
an increase in host cell number and the creation of mul- 
tiple copies of the cloned DNA in each cell. The result is 
selective amplification of a particular gene or DNA seg- 
ment. Cloning of DNA from any organism entails five 
general procedures: 

1. Cutting DNA at precise locations. Sequence- 
specific endonucleases (restriction endonucleases) 
provide the necessary molecular scissors. 

2. Selecting a small molecule of DNA capable of 
self-replication These DNAs are called cloning 
vectors (a vector is a delivery agent). They are 
typically plasmids or viral DNAs. 

3. Joining two DNA fragments covalently, The 
enzyme DNA ligase links the cloning vector and 
DNA to be cloned. Composite DNA molecules 
comprising covalently linked segments from two or 
more sources are called recombinant DNAs. 

4. Moving recombinant DNA from the test tube to a 
host cell that will provide the enzymatic machinery 
for DNA replication. 

5. Selecting or identifying host cells that contain 
recombinant DNA. 

The methods used to accomplish these and related tasks 
are collectively referred to as recombinant DNA tech- 
nology or, more informally, genetic engineering. 

Much of our initial discussion will focus on DNA 
cloning in the bacterium Escherichia coli, the first or- 
ganism used for recombinant DNA work and still the 
most common host cell. E. coli has many advantages: its 
DNA metabolism (like many other of its biochemical 
processes) is well understood; many naturally occurring 
cloning vectors associated with E. coli, such as plasmids 
and bacteriophages (bacterial viruses; also called 
phages), are well characterized; and techniques are 
available for moving DNA expeditiously from one bacte- 
rial cell to another. The principles discussed here are 
broadly applicable to DNA cloning in other organisms, a 
topic discussed more fully later in the chapter. 

Restriction Endonucleases and DNA Ligase Yield 
Recombinant DNA 

Particularly important to recombinant DNA technology 
is a set of enzymes (Table 9-1) made available through 
decades of research on nucleic acid metabolism. Two 
classes of enzymes lie at the heart of the classic approach 
to generating and propagating a recombinant DNA mole- 
cule (Fig. 9-1). First, restriction endonucleases 
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chromosome with a 
restriction endonuclease. 
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FIGURE 9-1 Schematic illustration of DNA cloning. A cloning vector 
and eukaryotic chromosomes are separately cleaved with the same re- 
striction endonuclease. The fragments to be cloned are then ligated to 
the cloning vector The resulting recombinant DNA (only one recom- 
binant vector is shown here) is introduced into a host cell where it can 
be propagated (cloned). Note that this drawing is not to scale: the size 
of the E. coli chromosome relative to that of a typical cloning vector 
(such as a plasmid) is much greater than depicted here. 



(also called restriction enzymes) recognize and cleave 
DNA at specific sequences (recognition sequences or re- 
striction sites) to generate a set of smaller fragments. 
Second, the DNA fragment to be cloned is joined to a 
suitable cloning vector by using DNA ligases to link the 
DNA molecules together. The recombinant vector is then 
introduced into a host cell, which amplifies the fragment 
in the course of many generations of cell division. 

Restriction endonucleases are found in a wide range 
of bacterial species. Werner Arber discovered in the 
early 1960s that their biological function is to recognize 
and cleave foreign DNA (the DNA of an infecting virus, 
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Some Enzymes Used in Recombinant DNA Technology 

Enzyme ( s ) Function 



Type II restriction endonucleases 
DNA ligase 

DNA polymerase I (E. coli) 
Reverse transcriptase 
Polynucleotide kinase 
Terminal transferase 
Exonuclease III 
Bacteriophage \ exonuclease 
Alkaline phosphatase 



Cleave DNAs at specific base sequences 
Joins two DNA molecules or fragments 

Fills gaps in duplexes by stepwise addition of nucleotides to 3' ends 
Makes a DNA copy of an RNA molecule 

Adds a phosphate to the 5'-OH end of a polynucleotide to label it or permit ligation 

Adds homopolymer tails to the 3' -OH ends of a linear duplex 

Removes nucleotide residues from the 3' ends of a DNA strand 

Removes nucleotides from the 5' ends of a duplex to expose single-stranded 3' ends 

Removes terminal phosphates from either the 5' or 3' end (or both) 



for example) ; such DNA is said to be restricted. In the 
host cell's DNA, the sequence that would be recognized 
by its own restriction endonuclease is protected from di- 
gestion by methylation of the DNA, catalyzed by a spe- 
cific DNA methylase. The restriction endonuclease and 
the corresponding methylase are sometimes referred to 
as a restriction-modification system. 

There are three types of restriction endonucleases, 
designated I, II, and III. Types I and III are generally 
large, multisubunit complexes containing both the en- 
donuclease and methylase activities. Type I restriction 
endonucleases cleave DNA at random sites that can be 
more than 1,000 base pairs (bp) from the recognition 
sequence. Type III restriction endonucleases cleave the 
DNA about 25 bp from the recognition sequence. Both 



types move along the DNA in a reaction that requires the 
energy of ATP. Type II restriction endonucleases, 

first isolated by Hamilton Smith in 1970, are simpler, re- 
quire no ATP, and cleave the DNA within the recognition 
sequence itself. The extraordinary utility of this group of 
restriction endonucleases was demonstrated by Daniel 
Nathans, who first used them to develop novel methods 
for mapping and analyzing genes and genomes. 

Thousands of restriction endonucleases have been 
discovered in different bacterial species, and more than 
100 different DNA sequences are recognized by one or 
more of these enzymes. The recognition sequences are 
usually 4 to 6 bp long and palindromic (see Fig. 8-18). 
Table 9-2 lists sequences recognized by a few type II 
restriction endonucleases. 



TABLE 9-2 



BamHl 



Recognition Sequences for Some Type II Restriction Endonucleases 



Clal 



EcoRl 



EcoRV 



Haelll 



(5') GGATCC (3') 
CCTAGG 

(5') A TC GAT (3') 
TAGCTA 

(5') GAATTC (30 
CTTAAG 

i 

(50 GAT AT C (30 
CT AT AG 
t 

(50 G GC C (30 
CCGG 



Hindlll 



Notl 



Pstl 



Pvull 



mini 



I 

(50 AAGCTT(30 
TTCGAA 
T 

i 

(50GCGGCCGC(30 
CGCCGGCG 
t 

(50 CTGC AG (30 
GACGTC 

i 

(5') C AGCTG (3') 
GTCGAC 
t 

I 

(5') GACNNNGTC (3') 
CTGNNNC AG 
t 



Arrows indicate the phosphodiester bonds cleaved by each restriction endonuclease. Asterisks indicate bases that are methylated by the 
corresponding methylase (where known). N denotes any base. Note that the name of each enzyme consists of a three-letter abbreviation 
(in italics) of the bacterial species from which it is derived, sometimes followed by a strain designation and Roman numerals to distinguish 
different restriction endonucleases isolated from the same bacterial species. Thus BamHl is the first (I) restriction endonuclease characterized 
from Bacillus amyloliquefaciens, strain H. 
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Some restriction endonucleases make staggered 
cuts on the two DNA strands, leaving two to four nu- 
cleotides of one strand unpaired at each resulting end. 
These unpaired strands are referred to as sticky ends 
(Fig. 9-2a), because they can base-pair with each 
other or with complementary sticky ends of other DNA 
fragments. Other restriction endonucleases cleave both 
strands of DNA at the opposing phosphodiester bonds, 
leaving no unpaired bases on the ends, often called 
blunt ends (Fig. 9-2b). 

The average size of the DNA fragments produced by 
cleaving genomic DNA with a restriction endonuclease 
depends on the frequency with which a particular re- 
striction site occurs in the DNA molecule; this in turn 
depends largely on the size of the recognition sequence. 



In a DNA molecule with a random sequence in which all 
four nucleotides were equally abundant, a 6 bp se- 
quence recognized by a restriction endonuclease such 
asBamHI would occur on average once every 4 6 (4,096) 
bp, assuming the DNA had a 50% G=C content. En- 
zymes that recognize a 4 bp sequence would produce 
smaller DNA fragments from a random-sequence DNA 
molecule; a recognition sequence of this size would be 
expected to occur about once every 4 4 (256) bp. In nat- 
ural DNA molecules, particular recognition sequences 
tend to occur less frequently than this because nu- 
cleotide sequences in DNA are not random and the four 
nucleotides are not equally abundant. In laboratory ex- 
periments, the average size of the fragments produced 
by restriction endonuclease cleavage of a large DNA 
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FIGURE 9-2 Cleavage of DNA molecules by restriction endonucleases. 

Restriction endonucleases recognize and cleave only specific se- 
quences, leaving either (a) sticky ends (with protruding single strands) or 
(b) blunt ends. Fragments can be ligated to other DNAs, such as the 
cleaved cloning vector {a plasmid) shown here. This reaction is facili- 
tated by the annealing of complementary sticky ends. Ligation is less ef- 
ficient for DNA fragments with blunt ends than for those with 
complementary sticky ends, and DNA fragments with different (non- 
complementary) sticky ends generally are not ligated. (c) A synthetic 
DNA fragment with recognition sequences for several restriction en- 
donucleases can be inserted into a plasmid that has been cleaved by a 
restriction endonuclease. The insert is called a linker; an insert with 
multiple restriction sites is called a polylinker. j) Restriction En- 
donucleases 
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can be increased by simply terminating the reaction be- 
fore completion; the result is called a partial digest. 
Fragment size can also be increased by using a special 
class of endonucleases called homing endonucleases 
(see Fig. 26-38). These recognize and cleave much 
longer DNA sequences (14 to 20 bp). 

Once a DNA molecule has been cleaved into frag- 
ments, a particular fragment of known size can be en- 
riched by agarose or acrylamide gel electrophoresis or 
by HPLC (pp. 89, 88). For a typical mammalian genome, 
however, cleavage by a restriction endonuclease usually 
yields too many different DNA fragments to permit con- 
venient isolation of a particular fragment. A common in- 
termediate step in the cloning of a specific gene or DNA 
segment is the construction of a DNA library (described 
in Section 9.2). 

After the target DNA fragment is isolated, DNA ligase 
can be used to join it to a similarly digested cloning vec- 
tor — that is, a vector digested by the same restriction en- 
donuclease; a fragment generated by EcoRI, for example, 
generally will not link to a fragment generated by BamHl. 
As described in more detail in Chapter 25 (see Fig. 
25-17), DNA ligase catalyzes the formation of new phos- 
phodiester bonds in a reaction that uses ATP or a similar 
cofactor. The base pairing of complementary sticky ends 
greatly facilitates the ligation reaction (Fig. 9-2a). Blunt 
ends can also be ligated, albeit less efficiently Researchers 
can create new DNA sequences by inserting synthetic 
DNA fragments (called linkers) between the ends that 
are being ligated. Inserted DNA fragments with multiple 
recognition sequences for restriction endonucleases (of- 
ten useful later as points for inserting additional DNA by 
cleavage and ligation) are called polylinkers (Fig. 9-2c). 

The effectiveness of sticky ends in selectively join- 
ing two DNA fragments was apparent in the earliest 
recombinant DNA experiments. Before restriction en- 
donucleases were widely available, some workers found 
they could generate sticky ends by the combined action 
of the bacteriophage X exonuclease and terminal trans- 
ferase (Table 9-1). The fragments to be joined were 
given complementary homopolymeric tails. Peter Lobban 
and Dale Kaiser used this method in 1971 in the first ex- 
periments to join naturally occurring DNA fragments. 
Similar methods were used soon after in the laboratory 
of Paul Berg to join DNA segments from simian virus 40 
(SV40) to DNA derived from bacteriophage X, thereby 
creating the first recombinant DNA molecule with DNA 
segments from different species. 

Cloning Vectors Allow Amplification 
of Inserted DNA Segments 

The principles that govern the delivery of recombinant 
DNA in clonable form to a host cell, and its subsequent 
amplification in the host, are well illustrated by consid- 
ering three popular cloning vectors commonly used in 
experiments with E, coli — plasmids, bacteriophages, 
and bacterial artificial chromosomes — and a vector used 
to clone large DNA segments in yeast. 



Plasmids Plasmids are circular DNA molecules that 
replicate separately from the host chromosome. Natu- 
rally occurring bacterial plasmids range in size from 
5,000 to 400,000 bp. They can be introduced into bacte- 
rial cells by a process called transformation. The cells 
(generally E. coli) and plasmid DNA are incubated to- 
gether at 0 °C in a calcium chloride solution, then sub- 
jected to a shock by rapidly shifting the temperature to 
37 to 43 °C. For reasons not well understood, some of 
the cells treated in this way take up the plasmid DNA. 
Some species of bacteria, such asAcinetobacter baylyi, 
are naturally competent for DNA uptake and do not re- 
quire the calcium chloride treatment. In an alternative 
method, cells incubated with the plasmid DNA are sub- 
jected to a high-voltage pulse. This approach, called 
electroporation, transiently renders the bacterial 
membrane permeable to large molecules. 

Regardless of the approach, few cells actually take 
up the plasmid DNA, so a method is needed to select 
those that do. The usual strategy is to use a plasmid that 
includes a gene that the host cell requires for growth 
under specific conditions, such as a gene that confers 
resistance to an antibiotic. Only cells transformed by the 
recombinant plasmid can grow in the presence of that 
antibiotic, making any cell that contains the plasmid 
"selectable" under those growth conditions. Such a gene 
is called a selectable marker. 

Investigators have developed many different plas- 
mid vectors suitable for cloning by modifying naturally 
occurring plasmids. The now classic E. coli plasmid 
pBR322 offers a good example of the features useful in a 
cloning vector (Fig. 9-3). 



EcoRI 




FIGURE 9-3 The constructed E. coli plasmid pBR322. Note the 
location of some important restriction sites — for P$t\, EcoR\, tfamHI, 
Sail, and PvuW; ampicillin- and tetracycline-resistance genes; and the 
replication origin (ori). Constructed in 1977, this was one of the early 
plasmids designed expressly for cloning in E. coli. 
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Cells that grow on tetracycline but not on tetracycline + 
ampicillin contain recombinant plasmids with disrupted 
ampicillin resistance, hence the foreign DNA. Cells with pBR322 
without foreign DNA retain ampicillin resistance and grow on 
both plates. 

FIGURE 9-4 Use of pBR322 to clone foreign DNA in E. coli and 
identify cells containing it T 0 Plasmid Cloning 



Important pBR322 features include: 

1. An origin of replication, ori, a sequence where 
replication is initiated by cellular enzymes 
(Chapter 25). This sequence is required to 
propagate the plasmid and maintain it at a level 
of 10 to 20 copies per cell. 

2. Two genes that confer resistance to different 
antibiotics (tet R , amp R ), allowing the identification 
of cells that contain the intact plasmid or a 
recombinant version of the plasmid (Fig. 9-4). 

3. Several unique recognition sequences (Pstl, 
EcoRl, BamWl, Sail, Pvull) that are targets for 
different restriction endonucleases, providing 
sites where the plasmid can later be cut to insert 
foreign DNA. 

4. Small size (4,361 bp), which facilitates entry of the 
plasmid into cells and the biochemical manipulation 
of the DNA. 

Transformation of typical bacterial cells with purified 
DNA (never a very efficient process) becomes less suc- 
cessful as plasmid size increases, and it is difficult to 
clone DNA segments longer than about 15,000 bp when 
plasmids are used as the vector. 

Bacteriophages Bacteriophage X has a very efficient 
mechanism for delivering its 48,502 bp of DNA into a 
bacterium, and it can be used as a vector to clone some- 
what larger DNA segments (Fig. 9-5). Two key fea- 
tures contribute to its utility: 

1 . About one-third of the X genome is nonessential 
and can be replaced with foreign DNA. 

2. DNA is packaged into infectious phage particles 
only if it is between 40,000 and 53,000 bp long, a 
constraint that can be used to ensure packaging of 
recombinant DNA only. 

Researchers have developed bacteriophage X vec- 
tors that can be readily cleaved into three pieces, two 
of which contain essential genes but which together 
are only about 30,000 bp long. The third piece, "filler" 
DNA, is discarded when the vector is to be used for 
cloning, and additional DNA is inserted between the 
two essential segments to generate ligated DNA mole- 
cules long enough to produce viable phage particles. In 
effect, the packaging mechanism selects for recombi- 
nant viral DNAs. 

Bacteriophage X vectors permit the cloning of DNA 
fragments of up to 23,000 bp. Once the bacteriophage X 
fragments are ligated to foreign DNA fragments of suit- 
able size, the resulting recombinant DNAs can be pack- 
aged into phage particles by adding them to crude 
bacterial cell extracts that contain all the proteins 
needed to assemble a complete phage. This is called in 
vitro packaging (Fig. 9-5). All viable phage particles 
will contain a foreign DNA fragment. The subsequent 
transmission of the recombinant DNA into E. coli cells is 
highly efficient. 
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FIGURE 9-5 Bacteriophage cloning vectors. Recombinant DNA meth- 
ods are used to modify the bacteriophage \ genome, removing the 
genes not needed for phage production and replacing them with 
"filler" DNA to make the phage DNA large enough for packaging into 
phage particles. As shown here, the filler is replaced with foreign DNA 
in cloning experiments. Recombinants are packaged into viable phage 
particles in vitro only if they include an appropriately sized foreign 
DNA fragment as well as both of the essential X DNA end fragments. 

FIGURE 9-6 Bacterial artificial chromosomes (BACs) as cloning 
vectors. The vector is a relatively simple plasmid, with a replication ori- 
gin (ori) that directs replication. The par genes, derived from a type of 
plasmid called an F plasmid, assist in the even distribution of plasmids 
to daughter cells at cell division. This increases the likelihood of each 
daughter cell carrying one copy of the plasmid, even when few copies 
are present. The low number of copies is useful in cloning large seg- 
ments of DNA because it limits the opportunities for unwanted recom- 
bination reactions that can unpredictably alter large cloned DNAs over 
time. The BAC includes selectable markers. A lacZ gene (required for 
the production of the enzyme /3-galactosidase) is situated in the cloning 
region such that it is inactivated by cloned DNA inserts. Introduction of 
recombinant BACs into cells by electroporation is promoted by the use 
of cells with an altered (more porous) cell wall. Recombinant DNAs are 
screened for resistance to the antibiotic chloramphenicol (Cm R ). Plates 
also contain a substrate for jS-galactosidase that yields a colored prod- 
uct. Colonies with active /3-galactosidase and hence no DNA insert in 
the BAC vector turn blue; colonies without /3-galactosidase activity — 
and thus with the desired DNA inserts — are white. 



Bacterial Artificial Chromosomes (BACs) Bacte- 
rial artificial chromosomes are simply plasmids designed 
for the cloning of very long segments (typically 100,000 
to 300,000 bp) of DNA (Fig. 9-6). They generally in- 
clude selectable markers such as resistance to the an- 
tibiotic chloramphenicol (Cm R ), as well as a very stable 
origin of replication (ori) that maintains the plasmid at 
one or two copies per cell. DNA fragments of several 
hundred thousand base pairs are cloned into the BAC 
vector. The large circular DNAs are then introduced into 
host bacteria by electroporation. These procedures use 
host bacteria with mutations that compromise the struc- 
ture of their cell wall, permitting the uptake of the large 
DNA molecules. 
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Yeast Artificial Chromosomes (YACs) E. coli cells 
are by no means the only hosts for genetic engineering. 
Yeasts are particularly convenient eukaryotic organisms 
for this work. As with E. coli, yeast genetics is a well- 
developed discipline. The genome of the most com- 
monly used yeast, Saccharomyces cerevisiae, contains 
only 14 X 10 6 bp (a simple genome by eukaryotic 
standards, less than four times the size of the E. coli 
chromosome), and its entire sequence is known. Yeast is 
also very easy to maintain and grow on a large scale in 
the laboratory. Plasmid vectors have been constructed 
for yeast, employing the same principles that govern the 
use of E. coli vectors described above. Convenient 
methods are now available for moving DNA into and out 
of yeast cells, facilitating the study of many aspects of 
eukaryotic cell biochemistry. Some recombinant plas- 
mids incorporate multiple replication origins and other 
elements that allow them to be used in more than one 
species (for example, yeast or E. coli). Plasmids that 
can be propagated in cells of two or more different 
species are called shuttle vectors. 

Research with large genomes and the associated 
need for high-capacity cloning vectors led to the devel- 
opment of yeast artificial chromosomes (YACS; 
Fig. 9-7) , YAC vectors contain all the elements needed 
to maintain a eukaryotic chromosome in the yeast nu- 
cleus: a yeast origin of replication, two selectable mark- 
ers, and specialized sequences (derived from the 
telomeres and centromere, regions of the chromosome 
discussed in Chapter 24) needed for stability and proper 
segregation of the chromosomes at cell division. Before 
being used in cloning, the vector is propagated as a cir- 
cular bacterial plasmid. Cleavage with a restriction en- 
donuclease (BamUl in Fig. 9-7) removes a length of 
DNA between two telomere sequences (TEL), leaving 
the telomeres at the ends of the linearized DNA. Cleav- 
age at another internal site (EcoRl in Fig. 9-7) divides 
the vector into two DNA segments, referred to as vector 
arms, each with a different selectable marker. 

The genomic DNA is prepared by partial digestion 
with restriction endonucleases (EcoRl in Fig. 9-7) to ob- 
tain a suitable fragment size. Genomic fragments are then 
separated by pulsed field gel electrophoresis, a varia- 
tion of gel electrophoresis (see Fig. 3-18) that allows the 
separation of very large DNA segments. The DNA 
fragments of appropriate size (up to about 2 X 10 6 bp) 
are mixed with the prepared vector arms and ligated. The 
ligation mixture is then used to transform treated yeast 
cells with very large DNA molecules. Culture on a 
medium that requires the presence of both selectable 
marker genes ensures the growth of only those yeast cells 
that contain an artificial chromosome with a large insert 
sandwiched between the two vector arms (Fig. 9-7). The 
stability of YAC clones increases with size (up to a point). 
Those with inserts of more than 150,000 bp are nearly as 
stable as normal cellular chromosomes, whereas those 
with inserts of less than 100,000 bp are gradually lost dur- 
ing mitosis (so generally there are no yeast cell clones 



CEN 



Selectable 
marker X / / 




Selectable 
marker Y 




BamBl digestion creates 
linear chromosome with 
telomeric ends 



TEL 



on CEN 



EcoRl 



TEL 



l EcoRl digestion 
i! creates two arms 



TEL 



X ori CEN 



TEL 



Left arm has 
selectable marker X 



Right arm has 
selectable marker Y 



**• s J v Fragments of genomic 



Ligate 



DNA generated by light 
digestion with EcoRl 



YAC 
Transform 




Yeast 
spheroplast 



Yeast with 
YAC clone 



FIGURE 9-7 Construction of a yeast artificial chromosome (YAC). A 

YAC vector includes an origin of replication (or/), a centromere {CEN), 
two telomeres (TEL), and selectable markers (X and Y). Digestion with 
BamHl and fcoRI generates two separate DNA arms, each with a 
telomeric end and one selectable marker. A large segment of DNA 
(e g , up to 2 x 10 6 bp from the human genome) is ligated to the two 
arms to create a yeast artificial chromosome. The YAC transforms yeast 
cells (prepared by removal of the cell wall to form spheroplasts), and 
the cells are selected for X and Y; the surviving cells propagate the 
DNA insert. 



carrying only the two vector ends ligated together or with 
only short inserts). YACs that lack a telomere at either 
end are rapidly degraded. 

Specific DNA Sequences Are Detectable by Hybridization 

DNA hybridization, a process outlined in Chapter 8 (see 
Fig. 8-29), is the most common sequence-based process 
for detecting a particular gene or segment of nucleic 
acid. There are many variations of the basic method, 
most making use of a labeled (such as radioactive) DNA 
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FIGURE 9-8 Use of hybridization to identify a clone with a particular 
DNA segment. The radioactive DNA probe hybridizes to complemen- 
tary DNA and is revealed by autoradiography. Once the labeled 
colonies have been identified, the corresponding colonies on the orig- 
inal agar plate can be used as a source of cloned DNA for further study. 



or RNA fragment, known as a probe, complementary to 
the DNA being sought. In one classic approach to detect 
a particular DNA sequence within a DNA library (a col- 
lection of DNA clones), nitrocellulose paper is pressed 
onto an agar plate containing many individual bacterial 
colonies from the library, each colony with a different 
recombinant DNA. Some cells from each colony adhere 
to the paper, forming a replica of the plate. The paper is 
treated with alkali to disrupt the cells and denature the 
DNA within, which remains bound to the region of the 
paper around the colony from which it came. Added ra- 
dioactive DNA probe anneals only to its complementary 
DNA. After any unannealed probe DNA is washed away, 
the hybridized DNA can be detected by autoradiography 
(Fig. 9-8). 

A common limiting step in detecting and cloning a 
gene is the generation of a complementary strand of 
nucleic acid to use as a probe. The origin of a probe de- 
pends on what is known about the gene under investiga- 
tion. Sometimes a homologous gene cloned from 
another species makes a suitable probe. Or, if the pro- 
tein product of a gene has been purified, probes can be 
designed and synthesized by working backward from 
the amino acid sequence, deducing the DNA sequence 
that would code for it (Fig. 9-9). Now, researchers typ- 
ically obtain the necessary DNA sequence information 
from sequence databases that detail the structure of 
millions of genes from a wide range of organisms. 



FIGURE 9-9 Probe to detect the gene for a protein of known amino 
acid sequence. Because more than one DNA sequence can code for 
any given amino acid sequence, the genetic code is said to be "degen- 
erate." (As described in Chapter 27, an amino acid is coded for by a set 
of three nucleotides called a codon. Most amino acids have two or 
more codons; see Fig. 27-7.) Thus the correct DNA sequence for a 
known amino acid sequence cannot be known in advance. The probe 
is designed to be complementary to a region of the gene with minimal 
degeneracy, that is, a region with the fewest possible codons for the 
amino acids — two codons at most in the example shown here. 
Oligonucleotides are synthesized with selectively randomized 
sequences, so that they contain either of the two possible nucleotides 
at each position of potential degeneracy (shaded in pink). The oligonu- 
cleotide shown here represents a mixture of eight different sequences: 
one of the eight will complement the gene perfectly, and all eight will 
match at least 1 7 of the 20 positions. 
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Expression of Cloned Genes Produces 
Large Quantities of Protein 

Frequently it is the product of the cloned gene, rather 
than the gene itself, that is of primary interest- 
particularly when the protein has commercial, therapeu- 
tic, or research value. With an increased understanding 
of the fundamentals of DNA, RNA, and protein metabo- 
lism and their regulation inE. coli, investigators can now 
manipulate cells to express cloned genes in order to 
study their protein products. 

Most eukaryotic genes lack the DNA sequence 
elements — such as promoters, sequences that instruct 
RNA polymerase where to bind— required for their 
expression in E. coli cells, so bacterial regulatory se- 
quences for transcription and translation must be in- 
serted at appropriate positions relative to the eukaryotic 
gene in the vector DNA. (Promoters, regulatory se- 
quences, and other aspects of the regulation of gene ex- 
pression are discussed in Chapter 28.) In some cases 
cloned genes are so efficiently expressed that their 
protein product represents 10% or more of the cellular 
protein; they are said to be overexpressed. At these con- 
centrations some foreign proteins can kill an£. coli cell, 
so gene expression must be limited to the few hours be- 
fore the planned harvest of the cells. 

Cloning vectors with the transcription and transla- 
tion signals needed for the regulated expression of a 
cloned gene are often called expression vectors. The 
rate of expression of the cloned gene is controlled by re- 
placing the gene's own promoter and regulatory se- 
quences with more efficient and convenient versions 
supplied by the vector. Generally, a well-characterized 
promoter and its regulatory elements are positioned 
near several unique restriction sites for cloning, so that 
genes inserted at the restriction sites will be expressed 
from the regulated promoter element (Fig. 9-10). 
Some of these vectors incorporate other features, such 
as a bacterial ribosome binding site to enhance transla- 
tion of the mRNA derived from the gene, or a transcrip- 
tion termination sequence. 

Genes can similarly be cloned and expressed in eu- 
karyotic cells, with various species of yeast as the usual 
hosts. A eukaryotic host can sometimes promote post- 
translational modifications (changes in protein structure 
made after synthesis on the ribosomes) that might be re- 
quired for the function of a cloned eukaryotic protein. 

Alterations in Cloned Genes Produce Modified Proteins 

Cloning techniques can be used not only to overproduce 
proteins but to produce protein products subtly altered 
from their native forms. Specific amino acids may be re- 
placed individually by site-directed mutagenesis. 
This powerful approach to studying protein structure 
and function changes the amino acid sequence of a pro- 
tein by altering the DNA sequence of the cloned gene. 
If appropriate restriction sites flank the sequence to be 
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FIGURE 9-10 DNA sequences in a typical E. coli expression vector. 

The gene to be expressed is inserted into one of the restriction sites in 
the polylinker, near the promoter (P), with the end encoding the amino 
terminus proximal to the promoter. The promoter allows efficient 
transcription of the inserted gene, and the transcription termination se- 
quence sometimes improves the amount and stability of the mRNA 
produced. The operator (O) permits regulation by means of a repressor 
that binds to it (Chapter 28). The ribosome binding site provides se- 
quence signals needed for efficient translation of the mRNA derived 
from the gene. The selectable marker allows the selection of cells con- 
taining the recombinant DNA, 

altered, researchers can simply remove a DNA segment 
and replace it with a synthetic one that is identical to the 
original except for the desired change (Fig. 9-1 la). 
When suitably located restriction sites are not present, 
an approach called oligonucleotide-directed muta- 
genesis (Fig. 9-1 lb) can create a specific DNA se- 
quence change. A short synthetic DNA strand with a 
specific base change is annealed to a single-stranded 
copy of the cloned gene within a suitable vector. The 
mismatch of a single base pair in 15 to 20 bp does not 
prevent annealing if it is done at an appropriate temper- 
ature. The annealed strand serves as a primer for the 
synthesis of a strand complementary to the plasmid vec- 
tor. This slightly mismatched duplex recombinant plas- 
mid is then used to transform bacteria, where the 
mismatch is repaired by cellular DNA repair enzymes 
(Chapter 25). About half of the repair events will re- 
move and replace the altered base and restore the gene 
to its original sequence; the other half will remove and 
replace the normal base, retaining the desired muta- 
tion. Transformants are screened (often by sequencing 
their plasmid DNA) until a bacterial colony containing a 
plasmid with the altered sequence is found. 

Changes can also be introduced that involve more 
than one base pair. Large parts of a gene can be deleted 
by cutting out a segment with restriction endonucleases 
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FIGURE 9-1 1 Two approaches to site-directed mutagenesis, (a) A syn- 
thetic DNA segment replaces a DNA fragment that has been removed 
by cleavage with a restriction endonuclease. (b) A synthetic oligonu- 
cleotide with a desired sequence change at one position is hybridized 
to a single-stranded copy of the gene to be altered. This acts as primer 
for synthesis of a duplex DNA (with one mismatch), which is then used 
to transform cells. Cellular DNA repair systems will convert about 50% 
of the mismatches to reflect the desired sequence change. 



and ligating the remaining portions to form a smaller 
gene. Parts of two different genes can be ligated to 
create new combinations. The product of such a fused 
gene is called a fusion protein. 

Researchers now have ingenious methods to bring 
about virtually any genetic alteration in vitro. Reintro- 



duction of the altered DNA into the cell permits investi- 
gation of the consequences of the alteration. Site- 
directed mutagenesis has greatly facilitated research on 
proteins by allowing investigators to make specific 
changes in the primary structure of a protein and to ex- 
amine the effects of these changes on the folding, three- 
dimensional structure, and activity of the protein. 

Terminal Tags Provide Binding Sites 
for Affinity Purification 

Mfinity chromatography is one of the most efficient meth- 
ods available for protein purification (see Fig. 3-1 7c). 
Unfortunately, there are many proteins for which there 
is no known ligand that can be conveniently immobilized 
on a chromatographic medium. The use of fusion pro- 
teins has made it possible to purify almost any protein 
by affinity chromatography. 

First, the gene encoding the target protein is fused 
to a gene encoding a peptide or protein that binds to a 
known ligand with high affinity and specificity. The pep- 
tide or protein used for this purpose, which may be at- 
tached at either the amino or carboxyl terminus, is 
called a terminal tag or (more often) simply a tag. Some 
proteins and peptides commonly used as tags are listed 
in Table 9-3 along with their ligands. 

The general procedure is illustrated by the attach- 
ment of a tag consisting of glutathiones-transferase 
(GST). GST is a small enzyme (M T 26,000) that binds 
tightly and specifically to the molecule glutathione 
(Fig 9-12). If the GST gene sequence is fused to a tar- 
get gene, the fusion protein acquires the capacity to 
bind glutathione. The fusion protein is expressed in a 
bacterial or other host organism, and a crude extract is 
prepared. A column is filled with a porous matrix con- 
sisting of the ligand (in this case, glutathione) immobi- 
lized to microscopic beads of a stable polymer such as 
cross-linked agarose. As the crude extract percolates 
through this column matrix, the fusion protein becomes 
immobilized by binding to the glutathione. The other 
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Gene for target protein 



Glutathione 
(GSH) 



Glutathione-S-transferase (GST) 

FIGURE 9-12 The use of tagged proteins in protein purification. The 

use of a GST tag is illustrated, (a) GIutathione-5-transferase (GST) is a 
small enzyme (depicted here by the purple icon) that binds glutathione 
(a glutamate residue to which a Cys-Gly dipeptide is attached at the 
carboxyl carbon of the Glu side chain, hence the abbreviation GSH). 
(b) The GST tag is fused to the carboxyl terminus of the target protein 
by genetic engineering. The tagged protein is expressed in host cells, 
and is present in the crude extract when the cells are lysed. The extract 
is subjected to chromatography on a column containing a medium 
with immobilized glutathione. The GST-tagged protein binds to the 
glutathione, retarding its migration through the column, while the 
other proteins wash through rapidly. The tagged protein is subse- 
quently eluted from the column with a solution containing elevated 
salt concentration or free glutathione, 



proteins in the extract are washed through the column 
and discarded. The interaction between GST and glu- 
tathione is strong but noncovalent, allowing the fusion 
protein to be gently eluted from the column using a solu- 
tion containing either a higher concentration of salts or 
free glutathione to compete with the immobilized ligand 
for GST binding. Fusion proteins can often be obtained 
with good yield and high purity in this way. In some com- 
mercially available systems, the tag can be partially or 
completely removed from the purified fusion protein us- 
ing a protease that cleaves a sequence near the junction 
between the target protein and its fused tag. 

A short tag with widespread application consists of a 
simple sequence of six or more histidine residues. These 
histidine tags or His tags, as they are more commonly 
known, bind tightly and specifically to nickel ions. Chro- 
matography media with immobilized Ni 2+ can then be 
used to efficiently separate His-tagged proteins from oth- 
ers in an extract. Larger tags, such as maltose-binding 
protein, can enhance solubility and compensate for lack 
of stability in target proteins, allowing the purification of 
proteins that cannot be purified by other methods. 

Tag technology is powerful and convenient, and has 
been used successfully in thousands of published stud- 
ies. However, one must be wary when using tagged pro- 
teins in experiments. Terminal tags are not inert. Even 
very small tags can affect the properties of the proteins 
to which they are attached and thus affect experimental 
results. Activity may be affected even when tags are re- 
moved by proteases, if one or a few extra amino acid 
residues remain associated with the target protein. Re- 
sults obtained from tagged proteins should always be 
evaluated with the aid of well-designed controls to as- 
sess the effect of the tag on protein function. 
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SUMMARY 9.1 DNA CloningrThe Basics 

■ DNA cloning and genetic engineering involve the 
cleavage of DNA and assembly of DNA segments in 
new combinations — recombinant DNA. 

■ Cloning entails cutting DNA into fragments with 
enzymes; selecting and possibly modifying a 
fragment of interest; inserting the DNA fragment 
into a suitable cloning vector; transferring the 
vector with the DNA insert into a host cell for 
replication; and identifying and selecting cells that 
contain the DNA fragment. 

■ Key enzymes in gene cloning include restriction 
endonucleases (especially the type II enzymes) and 
DNA ligase. 
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■ Cloning vectors include plasmids, bacteriophages, 
and, for the longest DNA inserts, bacterial artificial 
chromosomes (BACs) and yeast artificial 
chromosomes (YACs). 

■ Cells containing particular DNA sequences can be 
identified by DNA hybridization methods. 

■ Genetic engineering techniques manipulate cells to 
express and/or alter cloned genes. 

■ Proteins or peptides can be attached to a protein 
of interest by altering its cloned gene, creating a 
fusion protein. The additional peptide segments 
can be used to detect the protein, or to purify it 
using convenient affinity chromatography methods. 

9.2 From Genes to Genomes 

The modern science of genomics now permits the 
study of DNA on a cellular scale, from individual genes 
to the entire genetic complement of an organism — its 
genome. Genomic databases are growing rapidly, as one 
sequencing milestone is superseded by the next. 
Biology in the twenty-first century will move forward 
with the aid of informational resources undreamed of 
only a few years ago. We now turn to a consideration of 
some of the technologies fueling these advances. 

DNA Libraries Provide Specialized Catalogs of 
Genetic Information 

A DNA library is a collection of DNA clones, gathered to- 
gether as a source of DNA for sequencing, gene discov- 
ery, or gene function studies. The library can take a 
variety of forms, depending on the source of the DNA. 
Among the largest types of DNA library is a genomic 
library, produced when the complete genome of a par- 
ticular organism is cleaved into thousands of fragments, 
and all the fragments are cloned by insertion into a 
cloning vector. 

The first step in preparing a genomic library is par- 
tial digestion of the DNA by restriction endonucleases, 
such that any given sequence will appear in fragments of 
a range of sizes — a range that is compatible with the 
cloning vector and ensures that virtually all sequences 
are represented among the clones in the library. Frag- 
ments that are too large or too small for cloning are re- 
moved by centrifugation or electrophoresis. The cloning 
vector, such as a BAC or YAC plasmid, is cleaved with 
the same restriction endonuclease and ligated to the 
genomic DNA fragments. The ligated DNA mixture is 
then used to transform bacterial or yeast cells to pro- 
duce a library of cell types, each type harboring a differ- 
ent recombinant DNA molecule. Ideally, all the DNA in 
the genome under study will be represented in the 
library. Each transformed bacterium or yeast cell grows 
into a colony, or "clone," of identical cells, each cell 
bearing the same recombinant plasmid, one of many 
represented in the overall library. 



Using hybridization methods, researchers can order 
individual clones in a library by identifying clones with 
overlapping sequences. A set of overlapping clones rep- 
resents a catalog for a long contiguous segment of a 
genome, often referred to as a contig (Fig. 9-13). Pre- 
viously studied sequences or entire genes can be located 
within the library using hybridization methods to deter- 
mine which library clones harbor the known sequence. 
If the sequence has already been mapped on a chromo- 
some, investigators can determine the location (in the 
genome) of the cloned DNA and any contig of which it is 
a part. A well-characterized library may contain thou- 
sands of long contigs, all assigned to and ordered on par- 
ticular chromosomes to form a detailed physical map. 
The known sequences within the library (each called a 
sequence-tagged site, or STS) can provide landmarks 
for genomic sequencing projects. 

As more and more genome sequences become avail- 
able, the utility of genomic libraries is diminishing and 
investigators are constructing more specialized libraries 
designed to study gene function. An example is a library 
that includes only those genes that are expressed — that 
is, are transcribed into RNA — in a given organism or 
even in certain cells or tissues. Such a library focuses 
on those portions of a genome relevant to the function 
of a tissue or cell type. The researcher first extracts 
mRNA from an organism or from specific cells of an 
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FIGURE 9-13 Ordering of the clones in a DNA library. Shown here is 
a segment of a chromosome from a hypothetical organism X, with 
markers A through Q representing sequence-tagged sites (STSs — DNA 
segments of known sequence, including known genes). Below the 
chromosome is an array of ordered BAC clones, numbered 1 to 9. Or- 
dering the clones on the genetic map is a many-stage process. The 
presence or absence of an STS on an individual clone can be deter- 
mined by hybridization— for example, by probing each clone with 
DNA complementary to the STS. Once the STSs on each BAC clone 
are identified, the clones (and the STSs themselves, if their location is 
not yet known) can be ordered on the map. For example, compare 
clones 3, 4, and 5. Marker £ (blue) is found on all three clones; F (red) 
on clones 4 and 5, but not on 3; and C (green) only on clone 5. This 
indicates that the order of the sites is E, F, C. The clones partially 
overlap and their order must be 3, 4, 5. The resulting ordered series of 
clones is called a contig. 
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organism and then prepares complementary DNAs 

(cDNAs) from the RNA in a multistep reaction cat- 
alyzed by the enzyme reverse transcriptase (Fig. 9-14). 
The resulting double-stranded DNA fragments are then 
inserted into a suitable vector and cloned, creating a 
population of clones called a cDNA library. The search 
for a particular gene is made easier by focusing on a 
cDNA library generated from the mRNAs of a cell known 
to express that gene. For example, if we wished to clone 
globin genes, we could first generate a cDNA library 
from erythrocyte precursor cells, in which about half the 
mRNAs code for globins. To aid in the mapping of large 
genomes, cDNAs in a library can be partially sequenced 
at random to produce a useful type of STS called an ex- 
pressed sequence tag (EST). ESTs, ranging in size 
from a few dozen to several hundred base pairs, can be 



positioned within the larger genome map, providing 
markers for expressed genes. Hundreds of thousands of 
ESTs were included in the detailed physical maps used 
as a guide to sequencing the human genome. 

A cDNA library can be made even more specialized 
by cloning cDNAs or cDNA fragments into a vector that 
fuses each cDNA sequence with the sequence for a 
marker, or reporter gene; the fused genes form a 
"reporter construct." Two useful markers are the genes 
for green fluorescent protein and epitope tags. A target 
gene fused with a gene for green fluorescent pro- 
tein (GFP) generates a fusion protein that is highly 
fluorescent — it literally lights up (Fig. 9-15a). GFP, 
derived from the jellyfish Aequorea victoria, has a 
^-barrel structure, with a fluorophore in the center of 
the barrel (see Box 12-3, p. 434). The fluorophore is 
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FIGURE 9-14 Construction of a cDNA library from mRNA. A cell's 
mRNA includes transcripts from thousands of genes, and the cDNAs 
generated are correspondingly heterogeneous. The duplex DNA 
produced by this method is inserted into an appropriate cloning vector. 
Reverse transcriptase can synthesize DNA on an RNA or a DNA 
template (see Fig. 26-33). 
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FIGURE 9-15 Specialized DNA libraries, (a) Cloning of cDNA next to 
a gene for green fluorescent protein (GFP) creates a reporter construct, 
RNA transcription proceeds through the gene of interest (insert DNA) 
and the reporter gene, and the mRNA transcript is then expressed as a 
fusion protein. The GFP part of the protein is visible in the fluorescence 
microscope. The photograph shows a nematode worm containing a 
GFP fusion protein expressed only in the four "touch" neurons that run 
the length of its body. 0 Reporter Constructs (b) If the cDNA is 
cloned next to a gene for an epitope tag, the resulting fusion protein 
can be precipitated by antibodies to the epitope. Any other proteins 
that interact with the tagged protein also precipitate, helping to eluci- 
date protein-protein interactions. 



derived from a rearrangement and oxidation of several 
amino acid residues in an autocatalytic reaction that 
requires only molecular oxygen (see Box 12-3, Fig. 3). 
Thus the protein is readily cloned in an active form in 
almost any cell. Just a few molecules of this protein can 
be observed microscopically, allowing the study of its 
location and movements in a cell. Careful protein engi- 
neering has generated mutant forms of GFP with a 
range of different colors and other properties (bright- 
ness , stability), and related proteins have recently 
been isolated from other species. 

An epitope tag is a short protein sequence that is 
bound tightly by a well-characterized monoclonal anti- 
body (p. 173). The tagged protein can be specifically 
precipitated from a crude protein extract by interac- 
tion with the antibody (Fig. 9-1 5b). If any other pro- 
teins bind to the tagged protein, those will precipitate 
as well, providing information about protein-protein 
interactions in a cell. The diversity and utility of special- 
ized DNA libraries (and tagged proteins) are growing 
every year. 

The Polymerase Chain Reaction Amplifies 
Specific DNA Sequences 

The Human Genome Project, along with the many asso- 
ciated efforts to sequence the genomes of organisms of 
every type, is providing unprecedented access to gene 
sequence information. This in turn is simplifying the 
process of cloning individual genes for more detailed 
biochemical analysis. If we know the sequence of at least 
the flanking parts of a DNA segment to be cloned, we 
can hugely amplify the number of copies of that DNA 
segment, using the polymerase chain reaction 
(PCR), a process conceived by Kary Mullis in 1983. The 
amplified DNA can be cloned directly or used in a 
variety of analytical procedures. 

The PCR procedure has an elegant simplicity. Two 
synthetic oligonucleotides are prepared, complemen- 
tary to sequences on opposite strands of the target DNA 
at positions defining the ends of the segment to be am- 
plified. The oligonucleotides serve as replication 
primers that can be extended by DNA polymerase. The 
3' ends of the hybridized probes are oriented toward 
each other and positioned to prime DNA synthesis 
across the desired DNA segment (Fig. 9-16). (DNA 
polymerases synthesize DNA strands from deoxyribonu- 
cleotides, using a DNA template, as described in 
Chapter 25.) Isolated DNA containing the segment to be 
amplified is heated briefly to denature it, and then 
cooled in the presence of a large excess of the synthetic 
oligonucleotide primers. The four deoxynucleoside 
triphosphates are then added, and the primed DNA seg- 
ment is replicated selectively. The cycle of heating, cool- 
ing, and replication is repeated 25 or 30 times over a few 
hours in an automated process, amplifying the DNA 
segment between the primers until it can be readily 
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analyzed or cloned. PCR uses a heat-stable DNA poly- 
merase, such as the Taq polymerase (derived from a 
bacterium that lives at 90 °C), which remains active af- 
ter every heating step and does not have to be replen- 
ished. Careful design of the primers used for PCR, such 
as including restriction endonuclease cleavage sites, can 
facilitate the subsequent cloning of the amplified DNA 
(Fig. 9-16b). 

This technology is highly sensitive: PCR can detect 
and amplify as little as one DNA molecule in almost any 
type of sample. Although DNA degrades over time 
(p. 289), PCR has allowed successful cloning of DNA 
from samples more than 40,000 years old. Investigators 
have used the technique to clone DNA fragments from 
the mummified remains of humans and extinct animals 
such as the woolly mammoth, creating the new fields of 
molecular archaeology and molecular paleontology. DNA 
from burial sites has been amplified by PCR and used to 
trace ancient human migrations. Epidemiologists can 
use PCR-enhanced DNA samples from human remains 
to trace the evolution of human pathogenic viruses. In 
addition to its usefulness for cloning DNA, PCR is a 
potent tool in forensic medicine (Box 9-1). It is also be- 
ing used for detection of viral infections before they 
cause symptoms and for prenatal diagnosis of a wide 
array of genetic diseases. 

The PCR method is also important in advancing the 
goal of whole genome sequencing. For example, the 
mapping of expressed sequence tags to particular chro- 
mosomes often involves amplification of the EST by 
PCR, followed by hybridization of the amplified DNA to 
clones in an ordered library. Investigators found many 
other applications of PCR in the Human Genome Pro- 
ject, to which we now turn. 

Genome Sequences Provide the Ultimate 
Genetic Libraries 

The genome is the ultimate source of information about 
an organism, and there is no genome we are more inter- 
ested in than our own. Less than 10 years after the de- 
velopment of practical DNA sequencing methods, 
serious discussions began about the prospects for se- 
quencing the entire 3 billion base pairs of the human 
genome. The international Human Genome Project got 
underway with substantial funding in the late 1980s. 
The effort eventually included significant contributions 
from 20 sequencing centers distributed among six na- 
tions: the United States, Great Britain, Japan, France, 
China, and Germany. General coordination was provided 
by the Office of Genome Research at the National Insti- 
tutes of Health, led first by James Watson and after 1992 
by Francis Collins. At the outset, the task of sequencing 
a 3 X 10 9 bp genome seemed to be a titanic job, but it 
gradually yielded to advances in technology. The com- 
pleted sequence of the human genome was published in 
April 2003, several years ahead of schedule. 



[~31 s] DNA-Based Information Technologies 



3 
5 



3' 
5'c 



Region of target DNA 
to be amplified 



3'c 



3'c 



(D Heat to separate 
strands. 

(2) Add synthetic oligo- 
nucleotide primers; cool. 



® Add thermostable DNA 
polymerase to catalyze 
5' -» 3' DNA synthesis. 



■ 7 r 












. r ' 


! i 1 ] 



Repeat steps (T) and (2). 



FIGURE 9-16 Amplification of a DNA segment by the polymerase 
chain reaction, (a) The PCR procedure has three steps. DNA strands 
are 0 separated by heating, then @ annealed to an excess of short 
synthetic DNA primers (blue) that flank the region to be amplified; 
(5) new DNA is synthesized by polymerization. The three steps are re- 
peated for 25 or 30 cycles. The thermostable DNA polymerase Taql 
(from Thermus aquaticus, a bacterial species that grows in hot springs) 
is not denatured by the heating steps, (b) DNA amplified by PCR can 
be cloned. The primers can include noncomplementary ends that have 
a site for cleavage by a restriction endonuclease. Although these parts 
of the primers do not anneal to the target DNA, the PCR process in- 
corporates them into the DNA that is amplified. Cleavage of the ampli- 
fied fragments at these sites creates sticky ends, used in ligation of the 
amplified DNA to a cloning vector. 0 Polymerase Chain Reaction 
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BOX 9-1 A Potent Weapon in 



Forensic Median 



e 



Traditionally, one of the most accurate methods for plac- 
ing an individual at the scene of a crime has been a 
fingerprint. With the advent of recombinant DNA tech- 
nology, a more powerful tool is now available: DNA 
fingerprinting (also called DNA typing or DNA profil- 
ing). The method was first described by English geneti- 
cist Alec Jeffreys in 1985. 

DNA fingerprinting is based on sequence poly- 
morphisms, slight sequence differences between indi- 
viduals, 1 bp in every 1,000 bp, on average. Each 
difference from the prototype human genome sequence 
(the first one obtained) occurs in some fraction of the 
human population; every individual has some differ- 
ences. Some of the sequence changes affect recognition 
sites for restriction enzymes, resulting in variation in the 
size of DNA fragments produced by digestion with a 
particular restriction enzyme. These variations are re- 
striction fragment length polymorphisms (RFLPs). 
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Another type of sequence variation, and the one now 
used most commonly in DNA typing, involves short 
tandem repeats (STRs). 

The detection of RFLPs relies on a specialized hy- 
bridization procedure called Southern blotting (Fig. 
1). DNA fragments from digestion of genomic DNA by 
restriction endonucleases are separated by size elec- 
trophoretically, denatured by soaking the agarose gel in 
alkali, and then blotted onto a nylon membrane to re- 
produce the distribution of fragments in the gel. The 
membrane is immersed in a solution containing a ra- 
dioactively labeled DNA probe. A probe for a sequence 
that is repeated several times in the human genome 
generally identifies a few of the thousands of DNA 
fragments generated when the human genome is di- 
gested with a restriction endonuclease. Autoradiogra- 
phy reveals the fragments to which the probe 
hybridizes, as in Figure 1. The method is very accurate 

{continued on next page) 
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FIGURE 1 The Southern blot procedure, as applied to RFLP DNA fin- 
gerprinting. Southern blotting (used for many purposes in molecular 
biology) was named after Jeremy Southern, who developed the tech- 
nique. In this example of a forensic application, the DNA from a se- 
men sample obtained from a rape and murder victim was compared 
with DNA samples from the victim and two suspects. Each sample 
was cleaved into fragments and separated by gel electrophoresis. 



Radioactive DNA probes were used to identify a small subset of frag- 
ments that contained sequences complementary to the probe. The 
sizes of the identified fragments varied from one individual to the 
next, as seen here in the different patterns for the three individuals 
(victim and two suspects) tested. One suspect's DNA exhibits a 
banding pattern identical to that of the semen sample taken from 
the victim. 
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lox 9-1 A Potent Weapon in Forensic Medicine [continued from previous page) 



and was first used in court cases in the late 1980s. How- 
ever, it requires a large sample of undegraded DNA 
(>25 ng). That amount of DNA is often not available at 
a crime scene or disaster site. 

The requirement for more-sensitive DNA typing 
methods led to a focus on the polymerase chain reaction 
(PCR; see Fig. 9-16), and on STRs. An STR locus is a 
short DNA sequence, repeated many times in tandem at 
a particular location in a chromosome; most commonly, 
the repeated sequences are 4 bp long. The STR loci that 
are most useful for DNA typing are quite short, from 4 to 
50 repeats long (16 to 200 total base pairs for tetranu- 
cleotide repeats), and have multiple length variants in 
the human population. More than 20,000 tetranu- 
cleotide STR loci have been characterized in the human 
genome. More than a million STRs of all types may be 
present in the human genome, accounting for about 3% 
of all human DNA. 

The polymerase chain reaction is readily applied to 
STR analysis, and the focus of forensic scientists 
changed from RFLPs to STRs as the promise of in- 
creased sensitivity became apparent in the early 1990s. 
The DNA sequences flanking STRs are unique to each 
type of STR and identical (except for very rare muta- 
tions) in all humans. PCR primers are targeted to this 
flanking DNA, and designed to amplify the DNA across 



the STR (Fig. 2a). The length of the PCR product then 
reflects the length of the STR in that sample. Since 
each human inherits one chromosome from each 
parent, the STR lengths on the two chromosomes are 
often different, generating two signals from one indi- 
vidual. If multiple STR loci are analyzed, a profile can 
be generated that is essentially unique to a particular 
individual. PCR amplification allows investigators to 
obtain DNA fingerprints from less than 1 ng of partially 
degraded DNA, an amount that can be obtained from a 
single hair follicle, a drop of blood, a small semen sam- 
ple on a bed sheet, or samples that might be months or 
even many years old. 

Successful forensic use of STR analysis required 
standardization. The first forensic STR standard was 
established in the United Kingdom in 1995. The U.S. 
standard, called the COmbined DNA Index System 
(CODIS), was established in 1998. The CODIS system 
is based on 13 well-studied STR loci (Table 1), which 
must be present in any DNA typing experiment carried 
out in the United States. The amelogenin gene is also 
used as a marker. This gene, present on the human 
sex chromosomes, has slightly different flanking DNA 
on the X and Y chromosomes. PCR amplification across 
the amelogenin gene thus generates different-size 
products that can reveal the sex of the DNA donor. 



[ TABLE 1 


Properties of 1 


:he Loci Used for the CODIS Database 




Locus 


Chromosome 


Repeat motif 


Repeat length 
(range)* 


Number of 
alleles seen* 


CSF1PO 


5 


TAGA 


5-16 


20 


FGA 


4 


CTTT 


12.2-51.2 


80 


TH01 


11 


TCAT 


3-14 


20 


TPOX 


2 


GAAT 


4-16 


15 


VWA 


12 


[TCTG][TCTA] 


10-25 


28 


D3S1358 


3 


[TCTG][TCTA] 


8-21 


24 


D5S818 


5 


AGAT 


7-18 


15 


D7S820 


7 


GATA 


5-16 


30 


D8S1179 


8 


[TCTA][TCTG] 


7-20 


17 


D13S317 


13 


TATC 


5-16 


17 


D16S539 


16 


GATA 


5-16 


19 


D18S51 


18 


AGAA 


7-39.2 


51 


D21S11 


21 


[TCTA][TCTG] 


12-41.2 


82 


Amelogenin 


X,Y 


Not applicable 







Source: Adapted from Butler, J.M. (2005) Forensic DNA Typing, 2nd edn, Academic Press, San Diego, p. 96. 

♦Repeat lengths observed in the human population. Partial or imperfect repeats can be included in some alleles. 

dumber of different alleles observed to date in the human population. Careful analysis of a locus in many individuals is a prerequisite to its 
use in forensic DNA typing. 
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The CODIS database contained 2.8 million samples 
prior to 2006, and is linked to all 50 United States. As 
of mid-2005, it had assisted more than 25,000 forensic 
investigations. 

Convenient kits have been developed commercially 
that allow the amplification of 16 or more STR loci in 
one test tube. These "multiplex" STR kits (Fig. 2b) have 
PCR primers unique to each locus. Each primer is care- 
fully designed to avoid hybridization to any other primer 
in the kit and to generate PCR products of different sizes 
so as to spread out the signals from the different loci dur- 
ing electrophoresis. The primers are linked to colored 
dyes to help distinguish the different PCR products. The 
most widely used kits now include the 13 CODIS loci, 
amelogenin, and two additional loci used by law enforce- 
ment agencies elsewhere in the world (16 total). The kits 
are very precise in establishing human identity When 
good DNA profiles are obtained, the chance of an acci- 
dental match between two individuals in the human pop- 
ulation is less than 1 in 10 18 (quintillion) . 

DNA typing has been used to both convict and ac- 
quit suspects and, in other cases, to establish paternity 
with an extraordinary degree of certainty. The impact of 
these procedures on court cases will continue to grow as 
standards are improved and as international DNA typing 
databases grow. Even very old mysteries can be solved: 
in 1996, DNA fingerprinting helped to confirm the iden- 
tification of the bones of the last Russian czar and his 
family, who were assassinated in 1918. 
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FIGURE 2 PCR analysis of an STR locus, (a) PCR primers are designed 
to amplify a segment of DNA that includes the repeat region. One of 
the two primers is linked to a fluorescent dye (green circle). The two 
chromosomes in an individual may have different alleles, with differ- 
ent numbers of repeats at the same locus. The PCR thus can generate 
two products of slightly different size. These are subjected to elec- 
trophoresis on a very thin polyacrylamide gel in a capillary tube. The 
resulting fluorescent bands are scanned by a laser to generate a series 
of peaks that, when combined with appropriate size marker controls, 
accurately reveals the size of each PCR fragment and thus the length of 



the STR in the corresponding allele, (b) Combining the PCR products 
from multiple loci generates a multiplex pattern such as the one shown 
(in this case, a pattern with 16 loci, called a 16-plex, generated using 
a commercial STR analysis kit). This requires multiple sets of PCR 
primers, with one set targeted to each locus. To help distinguish the 
loci, the dyes linked to the PCR primers are of several different colors. 
In addition, the PCR primers for a given locus are designed to produce 
products in a size range as distinct as possible from that generated by 
the primers targeted to other loci. 
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This advance was the product of a carefully planned 
international effort spanning 14 years. Research teams 
first generated a detailed physical map of the human 
genome, with clones derived from each chromosome 
organized into a series of long contigs (Fig. 9-17). Each 
contig contained landmarks in the form of STSs at a dis- 
tance of every 100,000 bp or less. The genome thus 
mapped could be divided up between the international 
sequencing centers, each center sequencing the mapped 
BAC or YAC clones corresponding to its particular seg- 
ments of the genome. Because many of the clones were 
more than 100,000 bp long, and modern sequencing tech- 
niques can resolve only 600 to 750 bp of sequence at a 
time, each clone had to be sequenced in pieces. The se- 
quencing strategy used a shotgun approach, in which re- 
searchers used powerful new automated sequencers to 
sequence random segments of a given clone, then assem- 
bled the sequence of the entire clone by computerized 
identification of overlaps. The number of clone pieces se- 
quenced was determined statistically so that the entire 
length of the clone was sequenced four to six times on av- 
erage. The sequenced DNA was then made available in a 
database covering the entire genome. Construction of the 
physical map was a time-consuming task, and its progress 
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FIGURE 9-17 The Human Genome Project strategy. Clones isolated 
from a genomic library were ordered into a detailed physical map, 
then individual clones were sequenced by shotgun sequencing proto- 
cols. The strategy used by the commercial sequencing effort eliminated 
the step of creating the physical map and sequenced the entire genome 
by shotgun cloning. 



was followed in annual reports in major journals through- 
out the 1990s — by the end of which the map was largely 
in place. Completion of the entire sequencing project was 
initially projected for the year 2005, but circumstances 
and technology intervened to accelerate the process. 

A competing commercial effort to sequence the hu- 
man genome was initiated by the newly established Cel- 
era Corporation in 1997. Led by J. Craig Venter, the 
Celera group made use of a different strategy called 
"whole genome shotgun sequencing," which eliminates 
the step of assembling a physical map of the genome. In- 
stead, teams sequenced DNA segments from through- 
out the genome at random. The sequenced segments 
were ordered by the computerized identification of se- 
quence overlaps (with some reference to the public pro- 
ject's detailed physical map). At the outset of the 
Human Genome Project, shotgun sequencing on this 
scale had been deemed impractical. However, advances 
in computer software and sequencing automation had 
made the approach feasible by 1997. The ensuing race 
between the private and public sequencing efforts sub- 
stantially advanced the timeline for completion of the 
project. Publication of the draft human genome se- 
quence in 2001 was followed by two years of follow-up 
work to eliminate nearly a thousand discontinuities and 
to provide high-quality sequence data that are contigu- 
ous throughout the genome. 




Francis S. Collins 



Craig Venter 



The Human Genome Project marks the culmination 
of twentieth-century biology and promises a vastly 
changed scientific landscape for the new century. The 
human genome is only part of the story, as the genomes 
of many other species are also being (or have been) se- 
quenced, including the yeasts Saccharomyces cere- 
visiae (completed in 1996) and Schizosaccharomyces 
pombe (2002), the nematode Caenorhabditis elegans 
(1998), the fruit fly Drosophila melanogaster (2000), 
the plant Arabidopsis thaliana (2000), the mouse Mus 
musculus (2002), and hundreds of bacterial and ar- 
chaeal species (Fig. 9-18). Most of the early efforts 
were focused on species commonly used in laboratories. 
However, the technology continues to improve, and the 
complete genomic sequences of over 1,200 organisms of 
all types will be available by the time this book is pub- 
lished. Broad efforts to map genes, attempts to identify 
new proteins and disease genes, and many other initia- 
tives are underway. 
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FIGURE 9-18 Genomic sequencing timeline. Discussions in the mid- 
1980s led to initiation of the Human Genome Project in 1989. 
Preparatory work, including extensive mapping to provide genome 
landmarks, occupied much of the 1990s. Separate projects were 
launched to sequence the genomes of other organisms important to re- 
search. The sequencing efforts completed to date include many bacte- 
rial species (such as Haemophilus influenzae), yeast (5 cerevisiae), 
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nematode worms (e.g., C elegans), insects (D, melanogaster and Apis 
mellifera), plants (A thaliana and Oryza sativa L), rodents (Mus muscu- 
lus and Rattus norvegicus), primates {Homo sapiens and Pan 
troglodytes), and some nasty human pathogens (e.g., Trichomonas 
vaginalis). Each genome project has a website that serves as a central 
repository for the latest data. 



The result is a database with the potential not only 
to fuel rapid advances in biology but to change the way 
that humans think about themselves. Early insights pro- 
vided by the human genome sequence range from the 
intriguing to the profound. We are not as complicated as 
we thought. Decades-old estimates that humans pos- 
sessed about 100,000 genes within the approximately 
3.2 X 10 9 bp in the human genome have been sup- 
planted by the discovery that we have only 25,000 to 
30,000 genes. This is perhaps three times more genes 
than a fruit fly (with 20,000) and twice as many as a 
nematode worm (23,000). Although humans evolved 
relatively recently, the human genome is very old. Of 
1,278 protein families identified in one early screen, only 
94 were unique to vertebrates. However, while we 
share many protein domain types with plants, worms, 
and flies, we use these domains in more complex 
arrangements. Alternative modes of gene expression 
(Chapter 26) allow the production of more than one 
protein from a single gene— a process that humans and 
other vertebrates engage in more than do bacteria, 
worms, or any other forms of life. This allows for 
greater complexity in the proteins generated from our 
gene complement. 

We now know that only 1.1% to 1.4% of our DNA ac- 
tually encodes proteins (Fig. 9-19). More than 50% of 
our genome consists of short, repeated sequences, the 
vast majority of which — about 45% of our genome in 
all — come from transposons, short movable DNA se- 
quences that are molecular parasites (Chapter 25). 
Many of the transposons have been there a long time, 
now altered so that they can no longer move to new ge- 
nomic locations. Others are still actively moving at low 
frequencies, helping to make the genome an ever- 
dynamic and evolving entity. At least a few transposons 
have been co-opted by their host and appear to serve 
useful cellular functions. 

What does all this information tell us about how 
much one human differs from another? Within the hu- 
man population are millions of single-base differences, 



called single nucleotide polymorphisms, or SNPs 

(pronounced "snips"). Each human differs from the 
next by about 1 bp in every 1,000 bp. From these small 
genetic differences arises the human variety we are all 
aware of — differences in hair color, eyesight, allergies to 
medication, foot size, and even (to some unknown de- 
gree) behavior. Some of the SNPs are linked to particu- 
lar human populations and can provide important 
information about human migrations that occurred 
thousands of years ago and about our more distant evo- 
lutionary past. 

What does this information tell us about what 
makes us human? The sequence of our closest biologi- 
cal relative, the chimpanzee, might offer some clues. 
The human and chimpanzee genomes differ by only 
1.2% at the level of base pairs, and the differences are 
less than that in genes encoding proteins. This value 
sounds small, but in these large genomes it translates 
into 35 million base pair changes, another 5 million 
short insertions or deletions, and a substantial number 
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FIGURE 9-19 Snapshot of the human genome. The chart shows the 
proportions of our genome made up of various types of sequences. 
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of larger genomic rearrangements. Sorting out which of 
these genomic distinctions are relevant to features that 
are uniquely human is a daunting task. The analysis of 
primate genomes with an eye to understanding human 
biochemistry and evolution has much potential, but the 
task remains in its infancy. 

As spectacular as these advances are, genome se- 
quencing itself is easy compared with what comes next — 
the effort to understand all the information in each 
genome. The genome sequences being added monthly to 
international databases are roadmaps, parts of which are 
written in a language we do not yet understand. However, 
they have great utility in catalyzing the discovery of new 
proteins and processes affecting every aspect of bio- 
chemistry, as will become apparent in chapters to come. 

SUMMARY 9.2 From Genes to Genomes 

■ The science of genomics broadly encompasses the 
study of genomes and their gene content. 

■ Genomic DNA segments can be organized in 
libraries — such as genomic libraries and cDNA 
libraries — with a wide range of designs and 
purposes. 

■ The polymerase chain reaction (PCR) can be used 
to amplify selected DNA segments from a DNA 
library or an entire genome. 

■ In international cooperative research efforts, the 
genomes of many organisms, including that of 
humans, have been sequenced in their entirety and 
are now available in public databases. 

93 From Genomes to Proteomes 

A gene is not simply a DNA sequence; it is information 
that is converted to a useful product — a protein or func- 
tional RNA molecule — when and if needed by the cell. 
The first and most obvious step in exploring a large se- 
quenced genome is to catalog the products of the genes 
within that genome. Genes that encode RNA as their fi- 
nal product are somewhat harder to identify than are 
protein-encoding genes, and even the latter can be very 
difficult to spot in a vertebrate genome. The explosion of 
DNA sequence information has also revealed a sobering 
truth. Despite many years of biochemical advances, 
there are still thousands of proteins in every eukaryotic 
cell (and quite a few in bacteria and archaea) that we 
know nothing about. These proteins may have functions 
in processes not yet discovered, or may contribute in 
unexpected ways to processes we think we understand, 
In addition, the genomic sequences tell us nothing about 
the three-dimensional structure of proteins or how pro- 
teins are modified after they are synthesized. The pro- 
teins, with their myriad critical functions in every cell, 
are now becoming the focus of new strategies for whole 
cell biochemistry. 



The complement of proteins expressed by a genome 
is called its proteome, a term that first appeared in 
the research literature in 1995. This concept rapidly 
evolved into a separate field of investigation, called pro- 
teomics. The problem addressed by proteomics research 
is straightforward, although the solution is not. Each 
genome presents us with thousands of genes encoding 
proteins, and ideally we want to know the structure and 
function of all those proteins. Given that many proteins 
offer surprises even after years of study, the investigation 
of an entire proteome is a daunting enterprise. Simply 
discovering the function of new proteins requires inten- 
sive work. Biochemists can now apply shortcuts in the 
form of a broad array of new and updated technologies. 

Protein function can be described on three levels. 
Phenotypic function describes the effects of a protein on 
the entire organism. For example, the loss of the protein 
may lead to slower growth of the organism, an altered de- 
velopment pattern, or even death. Cellular function is a 
description of the network of interactions engaged in by a 
protein at the cellular level. Interactions with other pro- 
teins in the cell can help define the kinds of metabolic 
processes in which the protein participates. Finally, mo- 
lecular function refers to the precise biochemical activity 
of a protein, including details such as the reactions an en- 
zyme catalyzes or the ligands a receptor binds. 

For several genomes, such as those of the yeast 
Saccharomyces cerevisiae and the plant Arabidopsis 
thaliana, a massive effort is underway to inactivate 
each gene by genetic engineering and to investigate the 
effect on the organism. If the growth patterns or other 
properties of the organism change (or if it does not grow 
at all), this provides information on the phenotypic 
function of the protein product of the gene. 

There are three other main paths to investigating 
protein function: (1) sequence and structural compar- 
isons with genes and proteins of known function, (2) de- 
termination of when and where a gene is expressed, and 
(3) investigation of the interactions of the protein with 
other proteins. We discuss each of these approaches 
in turn. 

Many of the same technologies that help elucidate 
the function of one protein can be applied to many pro- 
teins at once. The developing field of systems biology 
studies the multitude of biochemical changes in a cell, 
including the changes in the cellular protein population, 
as a function of environmental or genetic stresses. We 
identify approaches applicable to systems biology as 
they are discussed. 

Sequence or Structural Relationships Provide 
Information on Protein Function 

The rapid accumulation of genome sequence information 
has greatly advanced our understanding of evolution (see 
Section 3.4, p. 102) . Another important reason to sequence 
many genomes is to provide a database that can be used to 
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FIGURE 9-20 Synteny in the mouse and human genomes. Large seg- 
ments of the mouse and human genomes have closely related genes 
aligned in the same order on chromosomes, a relationship called syn- 
teny. This diagram shows segments of human chromosome 9 and 
mouse chromosome 2. The genes in these segments exhibit a very high 
degree of homology as well as the same gene order. The different let- 
tering schemes for the gene names reflect different naming conven- 
tions in the two organisms. 

assign gene functions by genome comparisons, an enter- 
prise referred to as comparative genomics. Some- 
times a newly discovered gene is related by sequence 
homologies to a gene previously studied in another or 
the same species, and its function can be entirely or 
partly defined by that relationship. Genes that occur in 
different species but possess a clear sequence and func- 
tional relationship to each other are called orthologs. 
Genes similarly related to each other within a single 
species are called paralogs (see p. 34). If the function 
of a gene has been characterized for one species, this in- 
formation can be used to at least tentatively assign gene 
function to the ortholog found in the second species. 
The correlation is easiest to make when comparing 
genomes from relatively closely related species, such as 
mouse and human, although many clearly orthologous 
genes have been identified in species as distant as bac- 
teria and humans. Sometimes even the order of genes 
on a chromosome is conserved over large segments of 
the genomes of closely related species (Fig. 9-20). 
Conserved gene order, called synteny, provides addi- 
tional evidence for an orthologous relationship between 
genes at identical locations within the related segments. 

Alternatively, certain sequences associated with 
particular structural motifs (Chapter 4) may be identi- 
fied within a protein. The presence of a structural motif 
may suggest that it, say, catalyzes ATP hydrolysis, binds 
to DNA, or forms a complex with zinc ions, helping to 
define molecular function. These relationships are de- 
termined with the aid of increasingly sophisticated com- 
puter programs, limited only by the current information 
on gene and protein structure and our capacity to asso- 
ciate sequences with particular structural motifs. 

To further the assignment of function based on struc- 
tural relationships, a large-scale structural proteomics 



project has been initiated. The goal is to crystallize and 
determine the structure of as many proteins and protein 
domains as possible, in many cases with little or no exist- 
ing information about protein function. The project has 
been assisted by the automation of some of the tedious 
steps of protein crystallization (see Box 4-5). As these 
structures are revealed, they will be made available in the 
structural databases described in Chapter 4. The effort 
should help define the extent of variation in structural 
motifs. When a newly discovered protein is found to have 
structural folds that are clearly related to motifs with 
known functions in the databases, this information can 
suggest a molecular function for the protein. 

Cellular Expression Patterns Can Reveal 
the Cellular Function of a Gene 

In every newly sequenced genome, researchers find 
genes that encode proteins with no evident structural 
relationships to known genes or proteins. In these cases, 
other approaches must be used to generate information 
about gene function. Determining which tissues a gene 
is expressed in, or what circumstances trigger the ap- 
pearance of the gene product, can provide valuable 
clues. Many different approaches have been developed 
to study these patterns. 

TWo-Dimensional Gel Electrophoresis As shown 
in Figure 3-21, two-dimensional gel electrophoresis 
allows the separation and display of up to 1,000 different 
proteins on a single gel. Mass spectrometry (see 
Box 3-2) can then be used to partially sequence individ- 
ual protein spots and assign each to a gene. The appear- 
ance and nonappearance (or disappearance) of particular 
protein spots in samples from different tissues, from 
similar tissues at different stages of development, or 
from tissues treated in ways that simulate a variety of 
biological conditions can help define cellular function. 

Because many proteins are displayed at once in 
these gels, the technique is also applicable to systems 
biology. For example, a bacterial pathogen may evolve 
so as to become resistant to one or more antibiotics. The 
pattern of protein expression in that bacterial strain is 
likely to change, and multiple proteins may be affected. 

DNA Microarrays Major refinements of the technol- 
ogy underlying DNA libraries, PCR, and hybridization 
have come together in the development of DNA mi- 
croarrays (sometimes called DNA chips), which allow 
the rapid and simultaneous screening of many thou- 
sands of genes. DNA segments from known genes, a few 
dozen to hundreds of nucleotides long, are amplified by 
PCR and placed on a solid surface, using robotic devices 
that accurately deposit nanoliter quantities of DNA so- 
lution. Up to a million such spots are deposited in a pre- 
designed array on a surface area of just a few square 
centimeters. An alternative strategy is to synthesize 
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DNA directly on the solid surface, using photolithogra- 
phy (Fig. 9-21). Once the chip is constructed, it can be 
probed with mRNAs or cDNAs from a particular cell 
type or cell culture to identify the genes being ex- 
pressed in those cells. 

A microarray can answer such questions as which 
genes are expressed at a given stage in the development 
of an organism. The total complement of mRNA is iso- 
lated from cells at two different stages of development 
and converted to cDNA, using reverse transcriptase and 
fluorescently labeled deoxynucleotides. The fluorescent 
cDNAs are then mixed and used as probes, each 
hybridizing to complementary sequences on the mi- 
croarray. In Figure 9-22, for example, the labeled nu- 
cleotides used to make the cDNA for each sample 
fluoresce in two different colors. The cDNA from the 
two samples is mixed and used to probe the microarray. 
Spots that fluoresce green represent mRNAs more 
abundant at the single-cell stage; those that fluoresce 
red represent sequences more abundant later in devel- 
opment. The mRNAs that are equally abundant at both 
stages of development fluoresce yellow. By using a mix- 
ture of two samples to measure relative rather than ab- 
solute abundance of sequences, the method corrects for 
variations in the amounts of DNA originally deposited in 
each spot on the grid and other possible inconsistencies 
among spots in the microarray The spots that fluoresce 
provide a snapshot of all the genes being expressed in 
the cells at the moment they were harvested — gene ex- 
pression examined on a genome-wide scale. For a gene 



FIGURE 9-21 Photolithography. This technique for preparing a DNA 
microarray makes use of nucleotide precursors that are activated by 
light, joining one nucleotide to the next in a photoreaction (as opposed 
to the chemical process illustrated in Fig 8-35). A computer is pro- 
grammed with the oligonucleotide sequences to be synthesized at 
each point on a solid surface. The reactive groups on that surface are 
initially rendered inactive because of attached photoactive blocking 
groups (•), A screen covering the surface is opened over the areas pro- 
grammed to receive a particular nucleotide. A flash of light eliminates 
the blocking group in the uncovered spots, and then a solution with a 
particular nucleotide (e.g., * •), activated to react at its 3 '-hydroxy I 
group (*), is washed over the surface A blocking group on the 5'- 
hydroxyl group of the nucleotide prevents unwanted reactions, and the 
nucleotide becomes linked to the surface in the illuminated spots via 
its 3'-hydroxy! group. The screen is then replaced with another screen 
that selectively illuminates only spots programmed to receive a C; light 
removes the 5' blocking groups of previously bound nucleotides, and 
*G» is then added to link G to those spots. The surface is washed suc- 
cessively with solutions containing each remaining type of activated 
nucleotide (*C«, *T«), using selective screening and light to ensure that 
the correct nucleotides are added at each spot in the correct sequence. 
This continues until the required sequences are built up on each spot 
on the surface. Many polymers with the same sequence are generated 
on each spot, not just the single polymer shown. The surface itself has 
thousands of spots, each with different sequences (see Fig. 9-22); this 
array shows just four spots, to illustrate the strategy. 
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FIGURE 9-22 DNA microarray. A microarray can be prepared from 
any known DNA sequence, from any source, generated by chemical 
synthesis or by PCR, The DNA is positioned on a solid surface (usually 
specially treated glass slides) with the aid of a robotic device capable 
of depositing very small (nanoliter) drops in precise patterns. UV light 
cross-links the DNA to the glass slides Once the DNA is attached to 
the surface, the microarray can be probed with other fluorescently la- 
beled nucleic acids, Here, mRNA samples are collected from cells at 
two different stages in the development of a frog, The cDNA probes are 
made with nucleotides that fluoresce in different colors for each sam- 
ple; a mixture of the cDNAs is used to probe the microarray. Green spots 
represent mRNAs more abundant at the single-cell stage; red spots, 
sequences more abundant later in development. The yellow spots 
indicate approximately equal abundance at both stages, f Synthe- 
sizing an Oligonucleotide Array 




of unknown function, the time and circumstances of its 
expression can provide important clues about its role in 
the cell. 

An example of this technique is illustrated in 
Figure 9-23, showing the dramatic results this tech- 
nique can produce. Segments from each of the more 
than 6,000 genes in the completely sequenced yeast 
genome were separately amplified by PCR, and each 
segment was deposited in a defined pattern to create 
the illustrated microarray In a sense, this array provides 
a snapshot of the entire yeast genome. 

Microarrays are an invaluable tool of systems biol- 
ogy, allowing researchers to examine changes in gene 
expression on a cellular scale. The research focus may 
be one gene, or the entire genome. The same technol- 
ogy is also used to examine changes in DNA, such as ge- 
netic changes induced by natural selection or simple 
variation in a population. If a bacterial population has a 
new phenotype that signals the presence of one or 
more mutations, the mutations can often be identified 
quickly using microarrays. A microarray containing the 
wild-type sequence information is probed with DNA 
from the mutant cells. Hybridization will be reduced 




FIGURE 9-23 Enlarged image of a DNA microarray. Each glowing spot 
in this microarray contains DNA from one of the 6,200 genes of the 
yeast (5 cerevisiae) genome, with every gene represented in the array. 
The microarray has been probed with fluorescently labeled nucleic 
acid derived from the mRNAs obtained (1) when the cells were grow- 
ing normally in culture and (2) five hours after the cells began to form 
spores, The green spots represent genes expressed at higher levels 
during normal growth; the red spots, genes expressed at higher levels 
during sporulation.The yellow spots represent genes that do not change 
their levels of expression during sporulation. This image is enlarged; the 
microarray actually measures only 1.8 X 1.8 cm. ■ Screening 
Oligonucleotide Array for Patterns of Gene Expression 
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wherever the mutant cell DNA and the wild-type DNA 
do not match, producing a signal at the relevant spots. 
More detailed sequencing can then reveal the exact 
changes at each genomic region affected. This ap- 
proach is providing new tools with which to track issues 
of medical interest, such as the development of new vi- 
ral strains and the evolution of antibiotic resistance in 
bacterial pathogens. 

The medical applications of microarrays already 
have an important role in cancer treatment. 
There are many different types of human cancer, and tu- 
mors — even within a particular tissue — can vary greatly 
in their growth rate, tendency to metastasize, and re- 
sponse to various therapies. It is often impossible to dif- 
ferentiate tumor types based solely on appearance. 
However, the cells making up a tumor exhibit character- 
istic patterns of gene expression, called a transcriptional 
profile, which often differ greatly from one tumor to the 
next. These can provide a kind of tumor fingerprint. Re- 
cent progress in the diagnosis and treatment of breast 
cancer provides a vivid example. Broad clinical studies 
over the past decade have used microarrays to develop 
transcriptional profiles of many thousands of breast can- 
cers. Treatment protocols have been tracked, and suc- 
cesses and failures carefully documented. Particular 
genes and groups of genes are gradually being identified 
that, when expressed at higher levels, and in certain 
combinations, serve as prognostic predictors. The result 
is a growing database of correlations that allows the use 
of transcriptional profiles to develop prognoses and se- 
lect the most beneficial therapies. These tools are be- 
coming widespread in oncology clinics, and their value 
to oncologists and patients will only increase with time 
and experience. ■ 

Protein Chips Proteins, too, can be immobilized on a 
solid surface and used to help define the presence or ab- 
sence of other proteins in a sample. For example, re- 
searchers prepare an array of antibodies to particular 
proteins by immobilizing them as individual spots on a 
solid surface. A sample of proteins is added, and if the 
protein that binds any of the antibodies is present in the 
sample, it can be detected by a solid-state form of the 
ELISA assay (see Fig. 5-2 6b). Many other types and ap- 
plications of protein chips are being developed. This is 
another technology that can be applied to one protein, 
or all of the proteins within a biological system. 

Detection of Protein-Protein Interactions Helps to 
Define Cellular and Molecular Function 

A key to defining the function of a particular protein is to 
determine what it binds to. In the case of protein- 
protein interactions, the association of a protein of un- 
known function with one whose function is known can 
provide a useful and compelling "guilt by association." 
The techniques used in this effort are quite varied. 



Species 

Protein 1 2 3 4 

PI + + 
P2 + 

P3 + + + 

P4 + 

P5 + 

P6 + + + 

P7 + + + 



FIGURE 9-24 Use of comparative genomics to identify functionally 
related genes. One use of comparative genomics is to prepare phylo- 
genetic profiles in order to identify genes that always appear together 
in a genome. This example shows a comparison of genes from four or- 
ganisms, but in practice, computer searches can look at dozens of 
species. The designations P1 , P2, and so forth refer to proteins encoded 
by each species. This technique does not require homologous proteins. 
In this example, because proteins P3 and P6 always appear together in 
a genome they may be functionally related. Further testing would be 
needed to confirm this inference. 

Comparisons of Genome Composition Although 
not evidence of direct association, the mere presence of 
combinations of genes in particular genomes can hint at 
protein function. We can simply search the genomic 
databases for particular genes, then determine what 
other genes are present in the same genomes (Fig. 
9-24). When two genes always appear together in a 
genome, it suggests that the proteins they encode may 
be functionally related. Such correlations are most useful 
if the function of at least one of the proteins is known. 

Purification of Protein Complexes With the construc- 
tion of cDNA libraries in which each gene is contiguous 
with (fused to) an epitope tag, workers can immunoprecip- 
itate the protein product of a gene by using the antibody 
that binds to the epitope (Fig. 9-1 5b). If the tagged protein 
is expressed in cells, other proteins that bind to it may also 
be precipitated with it. Identification of the associated pro- 
teins reveals some of the protein-protein interactions of 
the tagged protein. There are many variations of this 
process. For example, a crude extract of cells that express 
a similarly tagged protein is added to a column containing 
immobilized antibody. The tagged protein binds to the an- 
tibody, and proteins that interact with the tagged protein 
are sometimes also retained on the column. The connec- 
tion between the protein and the tag is cleaved with a spe- 
cific protease, and the protein complexes are eluted from 
the column and analyzed. Researchers can use these meth- 
ods to define complex networks of interactions within a 
cell. Many of the terminal tags listed in Table 9-3 can be 
used in similar chromatographic protocols, employing the 
affinity of a tag for a particular ligand to identify proteins 
that bind to a particular tagged protein. 
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FIGURE 9-25 The yeast two-hybrid system, (a) In this system for de- 
tecting protein-protein interactions, the aim is to bring together the 
DNA-binding domain and the activation domain of the yeast Cal4 pro- 
tein through the interaction of two proteins, X andY, to which each do- 
main is fused. This interaction is accompanied by the expression of a 
reporter gene, (b) The two fusions are created in separate yeast strains, 
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which are then mated. The mated mixture is plated on a medium on 
which the yeast cannot survive unless the reporter gene is expressed. 
Thus, all surviving colonies have interacting protein fusion pairs. Se- 
quencing of the fusion proteins in the survivors reveals which proteins 
are interacting. Yeast Two-Hybrid Systems 



Yeast T\vo-Hybrid Analysis A sophisticated genetic 
approach to defining protein-protein interactions is 
based on the properties of the Gal4 protein (Gal4p), 
which activates transcription of certain genes in yeast 
(see Fig. 28-31). Gal4p has two domains, one that binds 
to a specific DNA sequence and another that activates 
the RNA polymerase that synthesizes mRNA from an ad- 
jacent reporter gene. The domains are stable when sepa- 
rated, but activation of the RNA polymerase requires 
interaction with the activation domain, which in turn re- 
quires positioning by the DNA-binding domain. Hence, 
the domains must be brought together to function 
correctly (Fig. 9-25a). 

In this method, the protein-coding regions of genes 
to be analyzed are fused to the coding sequences of ei- 
ther the DNA-binding domain or the activation domain of 
Gal4p, and the resulting genes express a series of fusion 
proteins. If a protein fused to the DNA-binding domain 
interacts with a protein fused to the activation domain, 
transcription is activated. The reporter gene transcribed 
by this activation is generally one that yields a protein 
required for growth, or is an enzyme that catalyzes a 
reaction with a colored product. Thus, when grown on 
the proper medium, cells that contain such a pair of in- 
teracting proteins are easily distinguished from those 
that do not. Typically, many genes are fused to the Gal4p 
DNA-binding domain gene in one yeast strain, and many 
other genes are fused to the Gal4p activation domain 
gene in another yeast strain, then the yeast strains are 



mated and individual diploid cells grown into colonies 
(Fig. 9-25b). This allows for large-scale screening for 
proteins that interact in the cell. 

All these techniques provide important clues to pro- 
tein function. However, they do not replace classical bio- 
chemistry. They simply provide researchers with an 
expedited entree into important new biological prob- 
lems. In the end, a detailed functional understanding 
of any new protein requires traditional biochemical 
analyses — such as were used for the many well-studied 
proteins described in this text. When paired with the 
simultaneously evolving tools of biochemistry and 
molecular biology, genomics and proteomics are speed- 
ing the discovery not only of new proteins but of new 
biological processes and mechanisms. 

SUMMARY 9.3 From Genomes to Proteomes 

■ A proteome is the complement of proteins produced 
by a cell's genome. The new field of proteomics 
encompasses an effort to catalog and determine the 
functions of all the proteins in a cell. Integrated 
research to investigate multiple proteins or other 
macromolecules in a cell is sometimes called 
systems biology. 

■ One of the most effective ways to determine the 
function of a new gene is by comparative genomics, 
the search of databases for genes with similar 
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sequences. Paralogs and orthologs are proteins 
(and their genes) with clear functional and 
sequence relationships in the same or in different 
species. In some cases, the presence of a gene in 
combination with certain other genes, observed as 
a pattern in several genomes, can point toward a 
possible function. 

■ Cellular proteomes can be displayed by two- 
dimensional gel electrophoresis and explored with 
the aid of mass spectrometry. 

■ The cellular function of a protein can sometimes 
be inferred by determining when and where its 
gene is expressed. Researchers use DNA 
microarrays (chips) and protein chips to explore 
gene expression at the cellular level. 

■ Several new techniques, including comparative 
genomics, immunoprecipitation, and yeast two- 
hybrid analysis, can identify protein-protein 
interactions. These interactions provide important 
clues to protein function. 

9.4 Genome Alterations and New 
Products of Biotechnology 

We don't need to look far to find practical applications 
for the new biotechnologies or to find new opportuni- 
ties for breakthroughs in basic research. Herein lie both 
the promise and the challenge of genomics. We not only 
can understand genomes, we can change them. This is 
perhaps the ultimate manifestation of the new tech- 
nologies. We will enhance our capacity to engineer or- 
ganisms and produce new pharmaceutical agents and, 
as a consequence, will improve human nutrition and 
health. This promise can be realized only if practical 
safeguards are in place to ensure responsible applica- 
tion of these techniques. 

A Bacterial Plant Parasite Aids Cloning in Plants 

The introduction of recombinant DNA into plants has 
enormous implications for agriculture, making possible 
the alteration of the nutritional profile or yield of crops 
or their resistance to environmental stresses, such as in- 
sect pests, diseases, cold, salinity, and drought. Fertile 
plants of some species may be generated from a single 
transformed cell, so that an introduced gene passes to 
progeny through the seeds. 

As yet, researchers have not found any naturally oc- 
curring plant cell plasmids to facilitate cloning in plants, 
so the biggest technical challenge is getting DNA into 
plant cells. An important and adaptable ally in this effort 
is the soil bacterium Agrobacterium tumefaciens. This 
bacterium can invade plants at the site of a wound, 
transform nearby cells, and induce them to form a tumor 
called a crown gall. Agrobacterium contains the large 
(200,000 bp) Ti plasmid (Fig. 9~26a). When the 
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FIGURE 9-26 Transfer of DNA to plant cells by a bacterial parasite. 

(a) The Ti (tumor-inducing) plasmid of Agrobacterium tumefaciens. 

(b) Wounded plant cells produce and release the phenolic compound 
acetosyringone, When Agrobacterium detects this compound, the viru- 
lence (vir) genes on theTi plasmid are expressed. The vir genes encode en- 
zymes needed to introduce theT DNA segment of theTi plasmid into the 
genome of nearby plant cells. A single-stranded copy of theT DNA is syn- 
thesized and transferred to the plant cell, where it is converted to duplex 
DNA and integrated into a plant cell chromosome. TheT DNA encodes 
enzymes that synthesize both plant growth hormones and opines (see Fig. 
9-27); the latter compounds are metabolized (as a nutrient source) only by 
Agrobacterium. Expression of theT DNA genes by transformed plant cells 
thus leads to both aberrant plant cell growth (tumor formation) and the di- 
version of plant cell nutrients to the invading bacteria. 

bacterium is in contact with a damaged plant cell, a 
23,000 bp segment of the Ti plasmid called T DNA is 
transferred from the plasmid and integrated at a random 
position in one of the plant cell chromosomes (Fig. 
9-26b). The transfer of T DNA from Agrobacterium to 
the plant cell chromosome depends on two 25 bp re- 
peats that flank the T DNA and on the products of the 
virulence (vir) genes on the Ti plasmid (Fig. 9-26a). 

The T DNA encodes enzymes that convert plant 
metabolites to two classes of compounds that benefit the 
bacterium (Fig. 9-27). The first group consists of plant 
growth hormones (auxins and cytokinins) that stimulate 
growth of the transformed plant cells to form the crown 
gall tumor. The second constitutes a series of unusual 
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FIGURE 9-27 Metabolites produced in Agrobacterium-lniected plant 
cells. Auxins and cytokinins are plant growth hormones. The most 
common auxin, indoleacetate, is derived from tryptophan. Cytokinins 
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are adenine derivatives. Opines generally are derived from amino acid 
precursors; at least 14 different opines are produced by enzymes en- 
coded by theTi plasmids of different Agrobacterium species, 



amino acids called opines, which serve as a food source for 
the bacterium. The opines are produced in high concentra- 
tions in the tumor cells and secreted to the surroundings, 
where they can be metabolized only by Agrobacterium, 
using enzymes encoded elsewhere on the Ti plasmid. The 
bacterium thereby diverts plant resources by converting 
them to a form that benefits only itself. 

This rare example of DNA transfer from a bacterium 
to a eukaryotic cell is a natural genetic engineering 
process — one that researchers can harness to transfer 
recombinant DNA (instead of T DNA) to the plant 
genome. A common cloning strategy employs an 
Agrobacterium with two different recombinant plas- 
mids. The first is a Ti plasmid from which the T DNA seg- 
ment has been removed in the laboratory (Fig. 9-28a). 



FIGURE 9-28 A two-plasmid strategy to create a recombinant plant. 

(a) One plasmid is a modified Ti plasmid that contains the vir genes but 
lacks T DNA. (b) The other plasmid contains a segment of DNA that 
bears both a foreign gene (the gene of interest, e.g., the gene for the in- 
secticidal protein described in Fig. 9-30) and an antibiotic-resistance 
element (here, kanamycin resistance), flanked by the two 25 bp repeats 
ofT DNA that are required for transfer of the plasmid genes to the plant 
chromosome. This plasmid also contains the replication origin needed 
for propagation in Agrobacterium. 

When bacteria invade at the site of a wound (the edge of the cut 
leaf), the vir genes on the first plasmid mediate transfer into the plant 
genome of the segment of the second plasmid that is flanked by the 25 
bp repeats. Leaf segments are placed on an agar dish that contains both 
kanamycin and appropriate levels of plant growth hormones, and new 
plants are generated from segments with the transformed cells. Non- 
transformed cells are killed by the kanamycin. The foreign gene and the 
antibiotic-resistance element are normally transferred together, so plant 
cells that grow in this medium generally contain the foreign gene. 
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The second is an Agrobacterium-E. coli shuttle vector 
in which the 25 bp repeats of the T DNA flank a foreign 
gene that the researcher wants to introduce into the 
plant cell, along with a selectable marker such as resist- 
ance to the antibiotic kanamycin (Fig. 9-28b). The engi- 
neered Agrobacterium is used to infect a leaf, but crown 
galls are not formed because the T DNA genes for the 
auxin, cytokinin, and opine biosynthetic enzymes are ab- 
sent from both plasmids. Instead, the vir gene products 
from the altered Ti plasmid direct the transformation of 
the plant cells by the foreign gene — the gene flanked by 
the T DNA 25 bp repeats in the second plasmid. The 
transformed plant cells can be selected by growth on 
agar plates that contain kanamycin, and addition of 
growth hormones induces the formation of new plants 
that contain the foreign gene in every cell. 

The successful transfer of recombinant DNA into 
plants was vividly illustrated by an experiment in which the 
luciferase gene from fireflies was introduced into the cells 
of a tobacco plant (Fig. 9-29) — a favorite plant for trans- 
formation experiments because its cells are particularly 
easy to transform with Agrobacterium. The potential of 
this technology is not limited to the production of glow-in- 
the-dark plants, of course. The same approach has been 
used to produce crop plants that are resistant to herbi- 
cides, plant viruses, and insect pests (Fig. 9-30). Poten- 
tial benefits include increased yields and less need for 
environmentally harmful agricultural chemicals. 

Biotechnology can introduce new traits into a plant 
much faster than traditional methods of plant breeding. 
A prominent example is the development of soybeans 
that are resistant to the general herbicide glyphosate 




FIGURE 9-29 A tobacco plant expressing the gene for firefly lu- 
ciferase. Light was produced after the plant was watered with a solu- 
tion containing luciferin, the substrate for the light-producing 
luciferase enzyme (see Box 13-1). Don't expect glow-in-the-dark or- 
namental plants at your local plant nursery anytime soon. The light is 
actually quite weak; this photograph required a 24-hour exposure. The 
real point — that this technology allows the introduction of new traits 
into plants — is nevertheless elegantly made. 




FIGURE 9-30 Tomato plants engineered to be resistant to insect 
larvae. Two tomato plants were exposed to equal numbers of moth 
larvae. The plant on the left has not been genetically altered. The plant 
on the right expresses a gene for a protein toxin derived from the bac- 
terium Bacillus thuringiensis. This protein, introduced by a protocol 
similar to that depicted in Figure 9-28, is toxic to the larvae of some 
moth species while being harmless to humans and other organisms. 
Insect resistance has also been genetically engineered in cotton and 
other plants. 

(the active ingredient in the product RoundUp). 
Glyphosate breaks down in the environment more rap- 
idly than many other herbicides (glyphosate-sensitive 
plants can be planted in a treated area after as little as 
48 hours), and its use on fields does not generally lead to 
contamination of groundwater or carryover from one 
year to the next. A field of glyphosate-resistant soy- 
beans can be treated once with glyphosate during a 
summer growing season to eliminate essentially all 
weeds in the field, while leaving the soybeans unaffected 
(Fig. 9-31). Potential pitfalls of the technology such 
as the evolution of glyphosate-resistant weeds or the 
escape of difficult-to-control recombinant plants, re- 
main a concern of researchers and the public. 

Manipulation of Animal Ceil Genomes Provides 
Information on Chromosome Structure and 
Gene Expression 

The transformation of animal cells by foreign genetic ma- 
terial offers an important mechanism for expanding our 
knowledge of the structure and function of animal 
genomes, as well as for the generation of animals with 
new traits. This potential has stimulated intensive re- 
search into more-sophisticated means of cloning animals. 

Most work of this kind requires a source of cells into 
which DNA can be introduced. Although intact tissues 
are often difficult to maintain and manipulate in vitro, 
many types of animal cells can be isolated and grown in 
the laboratory if their growth requirements are carefully 
met. Cells derived from a particular animal tissue and 
grown under appropriate tissue culture conditions can 
maintain their differentiated properties (for example, a 
hepatocyte (liver cell) remains a hepatocyte) for weeks 
or even months. 
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FIGURE 9-31 Glyphosate-resistant soybean plants. The photographs 
show two areas of a soybean field in Wisconsin, (a) Without glyphosate 
treatment this part of the field is overrun with weeds, (b) Glyphosate- 
resistant soybean plants thrive in the glyphosate-treated section of the 
field Glyphosate breaks down rapidly in the environment. Agricultural 
use of engineered plants such as these proceeds only after consider- 
able deliberation, balancing the extraordinary promise of the technol- 
ogy with the need to select new traits with care. Both science and 
society as a whole have a stake in ensuring that the use of the resultant 
plants has no adverse impact on the environment or on human health. 



No suitable plasmidlike vector is available for intro- 
ducing DNA into an animal cell, so transformation usually 
requires the integration of the DNA into a host-cell chro- 
mosome. The efficient delivery of DNA to a cell nucleus 
and integration of this DNA into a chromosome without 
disrupting any critical genes remain the major technical 
problems in the genetic engineering of animal cells. 

Available methods for carrying DNA into an animal 
cell vary in efficiency and convenience. Some success 
has been achieved with spontaneous uptake of DNA or 



electroporation, techniques roughly comparable to the 
common methods used to transform bacteria. They are 
inefficient in animal cells, however, transforming only 1 
in 100 to 10,000 cells. Microinjection — the injection of 
DNA directly into a nucleus, using a very fine needle — 
has a high success rate for skilled practitioners, but the 
total number of cells that can be treated is small, be- 
cause each must be injected individually. 

The most efficient and widely used methods for 
transforming animal cells rely on liposomes or viral 
vectors. Liposomes are small vesicles consisting of a lipid 
bilayer that encloses an aqueous compartment (see Fig. 
11-4). Liposomes that enclose a recombinant DNA mol- 
ecule can be fused with the membranes of target cells to 
deliver DNA into the cell. The DNA sometimes reaches 
the nucleus, where it can integrate into a chromosome 
(mostly at random locations). Viral vectors are even 
more efficient at delivering DNA. Animal viruses have ef- 
fective mechanisms for introducing their nucleic acids 
into cells, and several types also have mechanisms to in- 
tegrate their DNA into a host-cell chromosome. Some of 
these, such as retroviruses (see Fig. 26-33) and aden- 
oviruses, have been modified to serve as viral vectors to 
introduce foreign DNA into mammalian cells. 

The work on retroviral vectors illustrates some of the 
strategies being used (Fig. 9-32). When an engineered 
retrovirus enters a cell, its RNA genome is transcribed to 
DNA by reverse transcriptase and then integrated into the 
host genome by the enzyme viral integrase. Special re- 
gions of DNA are required for this procedure: long termi- 
nal repeat (LTR) sequences to integrate retroviral DNA 
into the host chromosome and the ip (psi) sequence to 
package the viral RNA in viral particles (see Fig. 26-34). 

The gag, pol, and env genes of the retroviral genome, 
required for retroviral replication and assembly of viral 
particles, can be replaced with foreign DNA. To assemble 
viruses that contain the recombinant genetic information, 
researchers must introduce the DNA into cultured cells 
that are simultaneously infected with a "helper virus" that 
has the genes to produce viral particles but lacks the i/j se- 
quence required for packaging. Thus the recombinant 
DNA can be transcribed and its RNA packaged into viral 
particles. These particles can act as vectors to introduce 
the recombinant RNA into target cells. Viral reverse tran- 
scriptase and integrase enzymes (produced by the helper 
virus) are also packaged in the viral particle and intro- 
duced into the target cells. Once the engineered viral 
genome is inside a cell, these enzymes create a DNA copy 
of the recombinant viral RNA genome and integrate it into 
a host chromosome. The integrated recombinant DNA 
then becomes a permanent part of the target cell's chro- 
mosome and is replicated with the chromosome at every 
cell division. The cell itself is not endangered by the inte- 
grated viral DNA, because the recombinant virus lacks the 
genes needed to produce RNA copies of its genome and 
package them into new viral particles. The use of recom- 
binant retroviruses is often the best method for introduc- 
ing DNA into large numbers of mammalian cells. 
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FIGURE 9-32 Use of retroviral vectors in mammalian cell cloning. A 

typical retroviral genome (somewhat simplified here), engineered to 
carry a foreign gene (pink), is added to a host-cell tissue culture. The 
helper virus (not shown) lacks the packaging sequence, ip f so its RNA 
transcripts cannot be packaged into viral particles, but it provides the 
gag, pol, and env gene products needed to package the engineered 
retrovirus into functional viral particles. This enables the foreign gene 
in the recombinant retroviral genome to be introduced efficiently into 
the target cells. 



Each type of virus has different attributes, so sev- 
eral classes of animal viruses are being engineered as 
vectors to transform mammalian cells. Adenoviruses, for 
example, lack a mechanism for integrating DNA into a 



chromosome. Recombinant DNA introduced via an ade- 
noviral vector is therefore expressed for only a short 
time and then destroyed. This can be useful if the objec- 
tive is transient expression of a gene. 

Transformation of animal cells by any of the above 
techniques has its problems. Introduced DNA is gener- 
ally integrated into chromosomes at random locations. 
Even when the foreign DNA contains a sequence similar 
to a sequence in a host chromosome, allowing targeting 
to that position, nonhomologous integrants still outnum- 
ber the targeted ones by several orders of magnitude. If 
these integration events disrupt essential genes, they 
can sometimes alter cellular functions (most animal 
cells are diploid or polyploid, however, so an integration 
usually leaves at least one unaffected copy of any given 
gene). A particularly poor outcome would involve an 
integration event that inadvertently activated a gene 
that stimulated cell division, potentially creating a 
cancer cell. Although such an event was once thought to 
be rare, recent trials suggest it is a significant hazard 
(Box 9-2). Finally, the site of an integration can deter- 
mine the level of expression of the integrated gene, 
because integrants are not transcribed equally well 
everywhere in the genome. 

Despite these challenges, the transformation of an- 
imal cells has been used extensively to study chromo- 
some structure and the function, regulation, and 
expression of genes. The successful introduction of re- 
combinant DNA into an animal can be illustrated by an 
experiment that permanently altered an easily observ- 
able inheritable physical trait. Microinjection of DNA 
into the nuclei of fertilized mouse eggs can produce 
efficient transformation (chromosomal integration). 
When the injected eggs are introduced into a female 
mouse and allowed to develop, the new gene is often 
expressed in some of the newborn mice. Those in which 
the germ line has been altered can be identified by test- 
ing their offspring. By careful breeding of these mice, 
researchers can establish a transgenic mouse line in 
which all the mice are homozygous for the new gene or 
genes. Transgenic mice have now been produced with a 
wide range of genetic variations, including many rele- 
vant to human diseases and their control, pointing the 
way to human gene therapy (Box 9-2). A very similar 
approach is used to generate mice in which a particular 
gene has been inactivated ("knockout mice"), a way of 
establishing the function of the inactivated gene. This 
technology was used to introduce into mice defects 
in genes that help to control obesity (see Fig. 23-34). 
9 Creating a Transgenic Mouse 

The capacity to transform the genomes of animals is 
not limited to mice. The zebrafish, a common tropical 
aquarium fish, is also an important model organism for 
researchers. New strains of zebrafish, with their 
genomes altered so that they express one of the variants 
of the green fluorescent protein that fluoresce in the dif- 
ferent parts of the light spectrum (see Fig. 9-15), may 
be found in a pet store near you (Fig. 9-33). 
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FIGURE 9-33 Cloning in vertebrates. Genes for several variants of green 
fluorescent protein have been introduced into different strains of 
zebrafish, making each of them literally glow in the dark. Each variant 
GFP fluoresces in a different part of the light spectrum/ making the fish 
expressing it glow in a particular color (red, green, or yellow). 



New Technologies Promise to Expedite the 
Discovery of New Pharmaceuticals 

It is difficult to summarize all the ways in which 
genomics and proteomics might affect the devel- 
opment of pharmaceutical agents, but a few examples il- 
lustrate the potential. Hypertension, congestive heart 
failure, hypercholesterolemia, and obesity are treated 
by pharmaceutical drugs that alter human physiology. 
Therapies are arrived at by identifying an enzyme or re- 
ceptor involved in the process and discovering an in- 
hibitor that interferes with its action. Proteomics will 
play an increasing role in identifying such potential drug 
targets. For example, the most potent vasoconstrictor 
known is the peptide hormone urotensin II. First discov- 
ered in fish spinal fluid, urotensin II is a small cyclic pep- 
tide, with 11 amino acid residues in humans and 12 or 13 



The Human Genome and Human Gene Therapy 



As biotechnology gained momentum in the 1980s, a rational 
approach to the treatment of genetic diseases became in- 
creasingly attractive. In principle, DNA can be introduced 
into human cells to correct inherited genetic deficiencies. 
Genetic correction may even be targeted to a specific tissue 
by inoculating an individual with a genetically engineered, 
tissue-specific virus carrying a payload of DNA to be incor- 
porated into deficient cells. The goal is entrancing, but the 
research path is strewn with impediments. 

Altering chromosomal DNA entails substantial 
risk — a risk that cannot be quantified in the early stages 
of discovery. Consequently, early efforts at human gene 
therapy were directed at only a small subset of genetic 
diseases. Panels of scientists and ethicists developed a 
list of several conditions that should be satisfied to jus- 
tify the risk involved, including the following. (1) The 
genetic defect must be a well-characterized, single-gene 
disorder. (2) Both the mutant and the normal gene must 
be cloned and sequenced. (3) In the absence of a tech- 
nique for eliminating the existing mutant gene, the func- 
tional gene must function well in the presence of the 
mutant gene. (4) Finally, and most important, the risks 
inherent in a new technology must be outweighed by the 
seriousness of the disease. Protocols for human clinical 
trials were submitted by scientists in several nations and 
reviewed for scientific rigor and ethical compliance by 
carefully selected advisory panels in each country; then 
human trials commenced. 

Early targets of gene therapy included cancer and 
genetic diseases affecting the immune system. Immu- 
nity is mediated by leukocytes (white blood cells) of 
several different types, all arising from undifferentiated 
stem cells in the bone marrow. These cells divide 
quickly and have special metabolic requirements. Dif- 
ferentiation can become blocked in several ways, re- 
sulting in a condition called severe combined immune 



deficiency (SCID). One form of SCID results from ge- 
netically inherited defects in the gene encoding adeno- 
sine deaminase (ADA) , an enzyme involved in 
nucleotide biosynthesis (discussed in Chapter 22). An- 
other form of SCID arises from a defect in a cell-surface 
receptor protein that binds chemical signals called cy- 
tokines, which trigger differentiation. In both cases, the 
progenitor stem cells cannot differentiate into the ma- 
ture immune system cells, such as T and B lymphocytes 
(see p. 170). Children with these rare human diseases 
are highly susceptible to bacterial and viral infections, 
and often suffer from a range of related physiological 
and neurological problems. In the absence of an effec- 
tive therapy, the children must be confined in a sterile 
environment. About 20% of these children have a hu- 
man leukocyte antigen (HLA)-identical sibling who can 
serve as a bone marrow transplant donor, a procedure 
that can cure the disease. The remaining children need 
a different approach. 

The first human gene therapy trial was carried out at 
the National Institutes of Health in Bethesda, Maryland, 
in 1990. The patient was a four-year-old girl crippled by 
ADA deficiency. Bone marrow cells from the child were 
transformed with an engineered retrovirus containing a 
functional ADA gene; when the alteration of cells is 
done in this way — in the laboratory rather than in the 
living patient — the procedure is said to be done ex vivo. 
The treated cells were reintroduced into the patient's 
marrow. Four years later, the child was leading a normal 
life, going to school, and even testifying about her expe- 
riences before Congress. However, her recovery cannot 
be uniquely attributed to gene therapy. Before the gene 
therapy clinical trials began, researchers had developed 
a new treatment for ADA deficiency, in which synthetic 
ADA was administered in a complex with polyethylene 

(continued on next page) 
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glycol (PEG). For many ADA-SCID patients, injection of 
the ADA-PEG complex allowed some immune system 
development, with weight gain and reduced infection, 
although not full immune reconstitution. The new gene 
therapy was risky, and withdrawing the inoculation 
treatment from patients in the gene therapy trial was 
judged unethical. So trial participants received both 
treatments at once, making it unclear which treatment 
was primarily responsible for the positive clinical out- 
come. Nevertheless, the clinical trial provided important 
information: it was feasible to transfer genes ex vivo to 
large numbers of leukocytes, and cells bearing the trans- 
ferred gene were still detectable years after treatment, 
suggesting that long-term correction was possible. In 
addition, the risk associated with use of the retroviral 
vectors appeared to be low. 

Through the 1990s, hundreds of human gene ther- 
apy clinical trials were carried out, targeting a variety of 
genetic diseases, but the results in most cases were 
discouraging. One major impediment proved to be the 
inefficiency of introducing new genes into cells. Trans- 
formation failed in many cells, and the number of trans- 
formed cells often proved insufficient to reverse the 
disorder. In the ADA trials, achieving a sufficient popu- 
lation of transformed cells was particularly difficult, be- 
cause of the ongoing ADA-PEG therapy Normally, stem 
cells with the correct ADA gene would have a growth 
advantage over the untreated cells, expanding their 
population and gradually predominating in the bone 
marrow. However, the injections of ADA-PEG in the 
same patients allowed the untransformed (ADA-defi- 
cient) cells to live and develop, and the transformed 
cells did not have the needed growth advantage to ex- 
pand their population at the expense of the others. 

A gene therapy trial initiated in 1999 was successful 
in correcting a form of SCID caused by defective cy- 
tokine receptors (in particular a subunit called yc), as 
reported in 2000 by physician researchers in France, 
Italy, and Britain. These researchers introduced the cor- 
rected gene for the yc cytokine-receptor subunit into 
CD34 + cells. (The stem cells that give rise to immune 
system cells have a protein called CD34 on their surface; 
these cells can be separated from other bone marrow 
cells by antibodies to CD34.) The transformed cells 
were placed back into the patients' bone marrow. In this 
trial, introduction of the corrected gene clearly con- 
ferred a growth advantage over the untreated cells. A 
functioning immune system was detected in nine of 
eleven patients within 6 to 12 weeks, and levels of ma- 
ture immune system T lymphocytes reached the levels 
found in age-matched control subjects (who did not 
have SCID) within 6 to 8 months. Immune system func- 
tion was restored, and nearly 4 years later (mid-2003) 
most of the children were leading normal lives. This pro- 



vided dramatic confirmation that human gene therapy 
could cure a serious genetic disease. 

In 2003 came a setback. Two of the patients who had 
received cells with the correct cytokine receptor gene 
developed a severe form of leukemia. During the gene 
therapy treatment in both cases, one of the introduced 
retroviruses had by chance inserted itself into a chromo- 
some of one CD34 + cell, resulting in abnormally high ex- 
pression of a gene called LMO-2. The affected cell 
differentiated into an immune system T cell, and the ele- 
vated expression of LMO-2 led to uncontrolled growth of 
the cell, giving rise to the leukemia. Both patients were 
successfully treated with chemotherapy, and no other 
patients developed problems. However, the incident 
shows that early worries about the risk associated with 
retroviral vectors were well founded. After a review of 
the gene therapy trial protocols, including consultations 
with ethicists and parents of children affected by these 
diseases, further gene therapy trials are still planned for 
children who are not candidates for bone marrow trans- 
plants. The reason is simple enough. The potential bene- 
fit to the children with these debilitating conditions has 
been judged to outweigh the demonstrated risk. Devel- 
opment of new viral vectors appears to be the most im- 
portant prerequisite to more effective therapies. 

Human gene therapy is not limited to genetic dis- 
eases. Cancer cells are being targeted by delivering genes 
for proteins that might destroy the cell or restore the nor- 
mal control of cell division. Immune system cells associ- 
ated with tumors, called tumor-infiltrating lymphocytes, 
can be genetically modified to produce tumor necrosis fac- 
tor (TNF; see Fig. 12-51). When these lymphocytes are 
taken from a cancer patient, modified, and reintroduced, 
the engineered cells target the tumor, and the TNF they 
produce causes tumor shrinkage. AIDS may also be treat- 
able with gene therapy; DNA that encodes an RNA mole- 
cule complementary to a vital HIV mRNA could be 
introduced into immune system cells (the targets of HIV). 
The RNA transcribed from the introduced DNA would 
pair with the HIV mRNA, preventing its translation and in- 
terfering with the virus's life cycle. Alternatively, a gene 
could be introduced that encodes an inactive form of one 
subunit of a multisubunit HIV enzyme; with one nonfunc- 
tional subunit, the entire enzyme might be inactivated. 

Our growing understanding of the human genome 
and the genetic basis for some diseases brings the prom- 
ise of early diagnosis and constructive intervention. As 
the early results demonstrate, however, the road to ef- 
fective therapies will be a long one, with many detours. 
We need to learn more about cellular metabolism, more 
about how genes interact, and more about how to man- 
age the dangers. The prospect of vanquishing life- 
destroying genetic defects and other debilitating 
diseases provides the motivation to press on. 
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in some other organisms. The vasoconstriction it in- 
duces can cause or exacerbate hypertension, congestive 
heart failure, and coronary artery disease. Some of the 
methods described in Section 9.3 for elucidating 
protein-protein interactions have been used to demon- 
strate that urotensin II is bound by a G protein-coupled 
receptor called GPR14. As we shall see in Chapter 12, 
G proteins play an important role in many signaling 
pathways. However, GPR14 was an "orphan" receptor, in 
that human genome sequencing had identified it as a 
G protein-coupled receptor, but with no known func- 
tion. The association of urotensin II with GPR14 now 
makes the latter protein a key target for drug therapies 
aimed at interfering with the action of urotensin II. 

Glu-Thr-Pro-Asp-Cys-S-S-Cys-Val 
/ \ 
Phe Tyr 

^Trp-Lys' 
Urotensin II 

Another objective of medical research is to identify 
new agents that can treat the diseases caused by human 
pathogens. This now means identifying enzymatic tar- 
gets in microbial pathogens that can be inactivated with 
a new drug. The ideal microbial target enzyme should be 
(1) essential to the pathogen cell's survival, (2) well- 
conserved among a wide range of pathogens, and (3) ab- 
sent or significantly different in humans. The task of 
identifying metabolic processes that are critical to mi- 
croorganisms but absent in humans is made much easier 



by comparative genomics, augmented by the functional 
information available from genomics and proteomics. ■ 

Recombinant DNA Technology Yields New 
Products and Challenges 

The products of recombinant DNA technology range from 
proteins to engineered organisms. The technology can 
produce large amounts of commercially useful proteins, 
can design microorganisms for special tasks, and can engi- 
neer plants or animals with traits that are useful in agri- 
culture or medicine. Some products of this technology 
have been approved for consumer or professional use, and 
many more are in development. Genetic engineering has 
been transformed over a few years from a promising new 
technology to a multibillion-dollar industry, with much of 
the growth occurring in the pharmaceutical industry. 
Some major classes of products are listed in Table 9-4. 

Erythropoietin is typical of recent breakthrough 
products. This protein hormone (M T 51,000) 
stimulates erythrocyte production. People with diseases 
that compromise kidney function often have a defi- 
ciency of this protein, resulting in anemia. Erythropoi- 
etin produced by recombinant DNA technology can be 
used to treat these individuals, reducing the need for re- 
peated blood transfusions. ■ 

Other applications of this technology continue to 
emerge. Enzymes produced by recombinant DNA tech- 
nology are already used in the production of detergents, 
sugars, and cheese. Engineered proteins are being used as 



TABLE 9-4 



Some 



Product category 



Recombinant DMA Products in Medicine 
Examples/uses 



Anticoagulants 
Blood factors 

Colony-stimulating factors 

Erythropoietin 
Growth factors 
Human growth hormone 
Human insulin 
Interferons 
Interleukins 

Monoclonal antibodies 

Superoxide dismutase 
Vaccines 



Tissue plasminogen activator (TPA); activates plasmin, an enzyme involved in dissolving 

clots; effective in treating heart attack patients. 
Factor VIII; promotes clotting; it is deficient in hemophiliacs; treatment with factor VIII 

produced by recombinant DNA technology eliminates infection risks associated with 

blood transfusions. 

Immune system growth factors that stimulate leukocyte production; treatment of immune 

deficiencies and infections. 
Stimulates erythrocyte production; treatment of anemia in patients with kidney disease. 
Stimulate differentiation and growth of various cell types; promote wound healing. 
Treatment of dwarfism. 
Treatment of diabetes. 

Interfere with viral reproduction; used to treat some cancers. 

Activate and stimulate different classes of leukocytes; possible uses in treatment of 

wounds, HIV infection, cancer, and immune deficiencies. 
Extraordinary binding specificity is used in: diagnostic tests; targeted transport of drugs, 

toxins, or radioactive compounds to tumors as a cancer therapy; many other 

applications. 

Prevents tissue damage from reactive oxygen species when tissues briefly deprived of 0 2 

during surgery suddenly have blood flow restored. 
Proteins derived from viral coats are as effective in "priming" an immune system as is the 

killed virus more traditionally used for vaccines, and are safer; first developed was the 

vaccine for hepatitis B. 
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food additives to supplement nutrition, flavor, and fra- 
grance. Microorganisms are being engineered with altered 
or entirely novel metabolic pathways to extract oil and 
minerals from ground deposits, to digest oil spills, to 
detoxify hazardous waste dumps and sewage, or to pro- 
duce ethanol as fuel. Engineered plants with improved 
resistance to drought, frost, pests, and disease are in- 
creasing crop yields and reducing the need for agricultural 
chemicals. Complete animals can be cloned by moving an 
entire nucleus and all of its genetic material to a prepared 
egg from which the nucleus has been removed. 

The extraordinary promise of modern biotechnology 
does not come without controversy. The cloning of 
mammals challenges societal mores and may be accom- 
panied by serious deficiencies in the health and longevity 
of the cloned animal. If useful pharmaceutical agents can 
be produced, so can toxins suitable for biological war- 
fare. The potential for hazards posed by the release of 
engineered plants and other organisms into the bios- 
phere continues to be monitored carefully. The full range 
of the long-term consequences of this technology for our 
species and for the global environment is impossible to 
foresee, but will certainly demand our increasing under- 
standing of both cellular metabolism and ecology. 

SUMMARY 9.4 Genome Alterations and 
New Products of 
Biotechnology 

■ Advances in whole genome sequencing and genetic 
engineering methods are enhancing our ability to 
modify genomes in all species. 

■ Cloning in plants, which makes use of the Ti plasmid 
vector from Agrobacterium, allows the introduction 
of new plant traits. 

■ In animal cloning, researchers introduce foreign 
DNA primarily with the use of viral vectors or 
microinjection. These techniques can produce 
transgenic animals and provide new methods for 
human gene therapy. 

■ The use of genomics and proteomics in basic and 
pharmaceutical research is greatly advancing the 
discovery of new drugs. Biotechnology is also 
generating an ever-expanding range of other 
products and technologies. 
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Problems — 

1. Cloning When joining two or more DNA fragments, a 
researcher can adjust the sequence at the junction in a variety 
of subtle ways, as seen in the following exercises. 

(a) Draw the structure of each end of a linear DNA frag- 
ment produced by an EcoRl restriction digest (include those 
sequences remaining from the EcoRl recognition sequence). 

(b) Draw the structure resulting from the reaction of this 
end sequence with DNA polymerase I and the four deoxynu- 
cleoside triphosphates (see Fig. 8-33). 

(c) Draw the sequence produced at the junction that 
arises if two ends with the structure derived in (b) are ligated 
(see Fig. 25-17). 

(d) Draw the structure produced if the structure derived 
in (a) is treated with a nuclease that degrades only single- 
stranded DNA. 

(e) Draw the sequence of the junction produced if an end 
with structure (b) is ligated to an end with structure (d). 

(f) Draw the structure of the end of a linear DNA fragment 
that was produced by uPvull restriction digest (include those 
sequences remaining from the Pvull recognition sequence). 

(g) Draw the sequence of the junction produced if an end 
with structure (b) is ligated to an end with structure (f). 

(h) Suppose you can synthesize a short duplex DNA frag- 
ment with any sequence you desire. With this synthetic frag- 
ment and the procedures described in (a) through (g), design 
a protocol that would remove an EcoRl restriction site from a 
DNA molecule and incorporate a new BamHl restriction site at 
approximately the same location. (See Fig. 9-2.) 

(i) Design four different short synthetic double-stranded 
DNA fragments that would permit ligation of structure 
(a) with a DNA fragment produced by a Pstl restriction digest. 
In one of these fragments, design the sequence so that the fi- 
nal junction contains the recognition sequences for both 
EcoRl and Pstl In the second and third fragments, design the 
sequence so that the junction contains only the EcoRl and only 
the Pstl recognition sequence, respectively. Design the se- 
quence of the fourth fragment so that neither the EcoRl nor 
the Pstl sequence appears in the junction. 

2. Selecting for Recombinant Plasmids When cloning a for- 
eign DNA fragment into a plasmid, it is often useful to insert the 
fragment at a site that interrupts a selectable marker (such as 
the tetracycline-resistance gene of pBR322). The loss of function 
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of the interrupted gene can be used to identify clones containing 
recombinant plasmids with foreign DNA. With a bacteriophage \ 
vector it is not necessary to do this, yet one can easily distinguish 
vectors that incorporate large foreign DNA fragments from those 
that do not. How are these recombinant vectors identified? 

3. DNA Cloning The plasmid cloning vector pBR322 (see 
Fig. 9-3) is cleaved with the restriction endonuclease Pstl. An 
isolated DNA fragment from a eukaryotic genome (also pro- 
duced by Pstl cleavage) is added to the prepared vector and 
ligated. The mixture of ligated DNAs is then used to transform 
bacteria, and plasmid-containing bacteria are selected by 
growth in the presence of tetracycline. 

(a) In addition to the desired recombinant plasmid, what 
other types of plasmids might be found among the trans- 
formed bacteria that are tetracycline resistant? How can the 
types be distinguished? 

(b) The cloned DNA fragment is 1,000 bp long and has an 
EcoRl site 250 bp from one end. Three different recombinant 
plasmids are cleaved with EcoRl and analyzed by gel elec- 
trophoresis, giving the patterns shown. What does each pat- 
tern say about the cloned DNA? Note that in pBR322, the Pstl 
and EcoRl restriction sites are about 750 bp apart. The entire 
plasmid with no cloned insert is 4,361 bp. Size markers in lane 
4 have the number of nucleotides noted. 



■f. 

! 



Nucleotide 
length 

5,000 



3,000 



1,500 

1,000 
750 
500 
250 



4. Identifying the Gene for a Protein with a Known 
Amino Acid Sequence Using Figure 27-7 to translate the 
genetic code, design a DNA probe that would allow you to 
identify the gene for a protein with the following amino- 
terminal amino acid sequence. The probe should be 18 to 20 
nucleotides long, a size that provides adequate specificity if 
there is sufficient homology between the probe and the gene. 

H 3 N + -Ala-Pro-Met-Thr-Trp-TVr-Cys-Met- 

Asp-Trp-Ile-Ala-Gly-Gly-Pro~Trp-Phe-Arg- 

Lys-Asn-Thr-Lys- 

5. Designing a Diagnostic Test for a Genetic Disease 

Huntington's disease (HD) is an inherited neurodegenerative dis- 
order, characterized by the gradual, irreversible impairment of 
psychological, motor, and cognitive functions. Symptoms typi- 
cally appear in middle age, but onset can occur at almost any age. 



The course of the disease can last 15 to 20 years. The molecular ba- 
sis of the disease is becoming better understood. The genetic 
mutation underlying HD has been traced to a gene encoding a 
protein (M T 350,000) of unknown function. In individuals who will 
not develop HD, a region of the gene that encodes the amino ter- 
minus of the protein has a sequence of CAG codons (for gluta- 
mine) that is repeated 6 to 39 times in succession. In individuals 
with adult-onset HD, this codon is typically repeated 40 to 55 
times. In individuals with childhood-onset HD, this codon is re- 
peated more than 70 times. The length of this simple trinu- 
cleotide repeat indicates whether an individual will develop HD, 
and at approximately what age the first symptoms will occur. 

A small portion of the amino -terminal coding sequence of 
the 3,143-codon HD gene is given below. The nucleotide se- 
quence of the DNA is shown in black, the amino acid sequence 
corresponding to the gene is shown in blue, and the CAG re- 
peat is shaded. Using Figure 27-7 to translate the genetic 
code, outline a PCR-based test for HD that could be carried 
out using a blood sample. Assume the PCR primer must be 25 
nucleotides long. By convention, unless otherwise specified a 
DNA sequence encoding a protein is displayed with the coding 
strand (the sequence identical to the mRNA transcribed from 
the gene) on top such that it is read 5' to 3', left to right. 

307 ATGGCGACCCTGGAAAAGCTGATGAAGGCCTTCGAGTCCCTCAAGTCCTTC 

1 \l \ i 1 K K 1. M K A F K SI K S i- 

358 CAGCAGTTCCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAG 

18 Q t> 1 QQQQQQQQQQQQQQ 

409 C AGC AGCAGCAGCAGC AGCAGCAAO AGCCGCC ACCGC CGCCGCC GC CGC CG 
35 Q Q Q Q Q Q Q Q Q I* i> \> i> |> p p p 

460 CCGCCTCCTCAGCTTCCTCAGCCGCCGCCG 

52 p p p q i p q p p P 

Source: The Huntington's Disease Collaborative Research Group. (1993) A novel gene 
containing a trinucleotide repeat that is expanded and unstable on Huntington's 
disease chromosomes. Cell 72, 971-983. 

6. Using PCR to Detect Circular DNA Molecules In a 

species of ciliated protist, a segment of genomic DNA is some- 
times deleted. The deletion is a genetically programmed reac- 
tion associated with cellular mating. A researcher proposes 
that the DNA is deleted in a type of recombination called site- 
specific recombination, with the DNA on either end of the seg- 
ment joined together and the deleted DNA ending up as a 
circular DNA reaction product. 




reaction 




Suggest how the researcher might use the polymerase chain 
reaction (PCR) to detect the presence of the circular form of 
the deleted DNA in an extract of the protist. 

7. Glowing Plants When grown in ordinary garden soil and 
watered normally, a plant engineered to express green fluores- 
cent protein (see Fig. 9-1 5a) will glow in the dark, whereas a 
plant engineered to express firefly luciferase (see Fig. 9-29) 
will not. Explain these observations. 
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8. RFLP Analysis for Paternity Testing DNA fingerprinting 
and RFLP analysis are often used to test for paternity. A child in- 
herits chromosomes from both mother and father, so DNA from 
a child displays restriction fragments derived from each parent. 
In the gel shown here, which child, if any, can be excluded as be- 
ing the biological offspring of the putative father? Explain your 
reasoning. Lane M is the sample from the mother, F from the pu- 
tative father, and CI, C2, and C3 from the children. 



]c2f|c3jp 



— 



9. Mapping a Chromosome Segment A group of overlap- 
ping clones, designated A through F, is isolated from one 
region of a chromosome. Each of the clones is separately 
cleaved by a restriction enzyme and the pieces resolved by 
agarose gel electrophoresis, with the results shown in the fig- 
ure below. There are nine different restriction fragments in 
this chromosomal region, with a subset appearing in each 
clone. Using this information, deduce the order of the restric- 
tion fragments in the chromosome. 



Overlapping clones 



i 
i 

m 



l 

2 

-3 

4 
5 

-6 



- 7 

-a 

-9 



Nine 

restriction 
fragments 



10. Cloning in Plants The strategy outlined in Figure 9-28 
employs Agrobacterium cells that contain two separate plas- 
mids. Suggest why the sequences on the two plasmids are not 
combined on one plasmid. 

11. DNA Fingerprinting and RFLP Analysis DNA is ex- 
tracted from the blood cells of two different humans, individuals 
1 and 2. In separate experiments, the DNA from each individual 



is cleaved by restriction endonucleases A, B, and C, and the 
fragments separated by electrophoresis. A hypothetical map of 
a 10,000 bp segment of a human chromosome is shown (1 kbp 
= 1,000 bp). Individual 2 has point mutations that eliminate re- 
striction recognition sites B* and C*. You probe the gel with a ra- 
dioactive oligonucleotide complementary to the indicated 
sequence and expose a piece of x-ray film to the gel. Indicate 
where you would expect to see bands on the film. The lanes of 
the gel are marked in the accompanying diagram. 
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12. Use of Photolithography to Make a DNA Microarray 

Figure 9-21 shows the first steps in the process of making a 
DNA microarray, or DNA chip, using photolithography. Describe 
the remaining steps needed to obtain the desired sequences (a 
different four-nucleotide sequence on each of the four spots) 
shown in the first panel of the figure. After each step, give the 
resulting nucleotide sequence attached at each spot. 

13. Cloning in Mammals The retroviral vectors described 
in Figure 9-32 make possible the efficient integration of for- 
eign DNA into a mammalian genome. Explain how these vec- 
tors, which lack genes for replication and viral packaging (gag, 
pol, env\ are assembled into infectious viral particles. Sug- 
gest why it is important that these vectors lack the replication 
and packaging genes. 



Data Analysis Problem 



14. ifincll: The First Restriction Endonuclease Discov- 
ery of the first restriction endonuclease to be of practical use 
was reported in two papers published in 1970. In the first pa- 
per, Smith and Wilcox described the isolation of an enzyme 
that cleaved double-stranded DNA. They initially demon- 
strated the enzyme's nuclease activity by measuring the 
decrease in viscosity of DNA samples treated with the emyme. 
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(a) Why does treatment with a nuclease decrease the vis- 
cosity of a solution of DNA? 

The authors determined whether the enzyme was an 
endo- or an exonuclease by treating 32 P-labeled DNA with the 
enzyme, then adding trichloroacetic acid (TCA). Under the 
conditions used in their experiment, single nucleotides would 
be TCA-soluble and oligonucleotides would precipitate. 

(b) No TCA-soluble 32 P-labeled material formed on treat- 
ment of 32 P-labeled DNA with the nuclease. Based on this find- 
ing, is the enzyme an endo- or exonuclease? Explain your 
reasoning. 

When a polynucleotide is cleaved, the phosphate usually 
is not removed but remains attached to the 5 ; or 3' end of the 
resulting DNA fragment. Smith and Wilcox determined the 
location of the phosphate on the fragment formed by the 
nuclease in the following steps: 

1. Treat unlabeled DNA with the nuclease. 

2. Treat a sample (A) of the product with y- 32 P-labeled ATP 
and polynucleotide kinase (which can attach the 
y-phosphate of ATP to a 5' OH but not to a 5' phosphate 
or to a 3' OH or 3' phosphate). Measure the amount of 
32 P incorporated into the DNA. 

3. Treat another sample (B) of the product of step 1 with 
alkaline phosphatase (which removes phosphate groups 
from free 5' and 3' ends), followed by polynucleotide 
kinase and y- 32 P-labeled ATP Measure the amount of 32 P 
incorporated into the DNA. 

(c) Smith and Wilcox found that sample A had 136 
counts/min of 32 P; sample B had 3,740 counts/min. Did the nu- 
clease cleavage leave the phosphate on the 5' or the 3' end of 
the DNA fragments? Explain your reasoning. 

(d) Treatment of bacteriophage T7 DNA with the nuclease 
gave approximately 40 specific fragments of various lengths. 
How is this result consistent with the enzyme's recognizing a 
specific sequence in the DNA as opposed to making random 
double-strand breaks? 



At this point, there were two possibilities for the site- 
specific cleavage: the cleavage occurred either (1) at the site 
of recognition or (2) near the site of recognition but not within 
the sequence recognized. To address this issue, Kelly and 
Smith determined the sequence of the 5' ends of the DNA frag- 
ments generated by the nuclease, in the following steps: 

1 . Treat phage T7 DNA with the enzyme. 

2. Treat the resulting fragments with alkaline phosphatase 
to remove the 5' phosphates. 

3. Treat the dephosphorylated fragments with polynu- 
cleotide kinase and y- 32 P-labeled ATP to label the 5' ends. 

4. Treat the labeled molecules with DNases to break them 
into a mixture of mono-, di-, and trinucleotides. 

5. Determine the sequence of the labeled mono-, di-, and 
trinucleotides by comparing them with oligonucleotides 
of known sequence on thin-layer chromatography. 

The labeled products were identified as follows: mononu- 
cleotides: A and G; dinucleotides: 5'-pApA-3' and 5'-pGpA-3'; 
trinucleotides: 5'-pApApC-3' and 5'-pGpApC-3'. 

(e) Which model of cleavage is consistent with these re- 
sults? Explain your reasoning- 
Kelly and Smith went on to determine the sequence of the 

3' ends of the fragments. They found a mixture of 5'-pTpC-3' 
and 5'-pTpT-3\ They did not determine the sequence of any 
trinucleotides at the 3' end. 

(f ) Based on these data, what is the recognition sequence 
for the nuclease and where in the sequence is the DNA back- 
bone cleaved? Use Table 9-2 as a model for your answer. 
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The fatty substance, separated from the salifiable bases, was dissolved 
in boiling alcohol. On cooling, it was obtained crystallized and very 
pure, and in this state It was examined. As it has not been hitherto 
described . . J purpose to call it margarine, from the Creek word sig- 
nifying pearl, because one of its characters is to have the appearance 
of mother of pearl, which it communicates to several of the combina- 
tions of which it forms with the salifiable bases, 

—Michel-Eugene ChevreuL article m Philosophical Magazine, 1814 
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Biological lipids are a chemically diverse group of 
compounds, the common and defining feature of 
which is their insolubility in water. The biological 
functions of the lipids are as diverse as their chemistry. 
Fats and oils are the principal stored forms of energy in 
many organisms. Phospholipids and sterols are major 
structural elements of biological membranes. Other 
lipids, although present in relatively small quantities, 
play crucial roles as enzyme cofactors, electron carriers, 
light-absorbing pigments, hydrophobic anchors for pro- 
teins, "chaperones" to help membrane proteins fold, 
emulsifying agents in the digestive tract, hormones, and 
intracellular messengers. This chapter introduces repre- 
sentative lipids of each type, organized according to 
their functional roles, with emphasis on their chemical 
structure and physical properties. Although we follow a 
functional organization for our discussion, the literally 
thousands of different lipids can also be organized into 
eight general categories of chemical structure (see 
Table 10-3). We discuss the energy-yielding oxidation of 
lipids in Chapter 17 and their synthesis in Chapter 21. 



10.1 Storage Lipids 

The fats and oils used almost universally as stored forms 
of energy in living organisms are derivatives of fatty 
acids. The fatty acids are hydrocarbon derivatives, at 
about the same low oxidation state (that is, as highly 
reduced) as the hydrocarbons in fossil fuels. The cellu- 
lar oxidation of fatty acids (to C0 2 and H 2 0), like the 
controlled, rapid burning of fossil fuels in internal com- 
bustion engines, is highly exergonic. 



We introduce here the structures and nomenclature 
of the fatty acids most commonly found in living organ- 
isms. Two types of fatty acid-containing compounds, tri- 
acylglycerols and waxes, are described to illustrate the 
diversity of structure and physical properties in this 
family of compounds. 

Fatty Acids Are Hydrocarbon Derivatives 

Fatty acids are carboxylic acids with hydrocarbon 
chains ranging from 4 to 36 carbons long (C 4 to C 36 ). In 
some fatty acids, this chain is unbranched and fully sat- 
urated (contains no double bonds); in others the chain 
contains one or more double bonds (Table 10-1). A few 
contain three-carbon rings, hydroxyl groups, or methyl- 
group branches. 

KEY CONVENTION: A simplified nomenclature for un- 
branched fatty acids specifies the chain length and num- 
ber of double bonds, separated by a colon (Fig. 10-la); 




(a) 18:1(A 9 ) ds-9-Octadecenoic acid 




(b) 20:5(A 5 ' 8 ' 11 ' 14 ' 17 ) Eicosapentaenoic acid (EPA), 
an omega-3 fatty acid 

FIGURE 10-1 Two conventions for naming fatty acids, (a) Standard 
nomenclature assigns the number 1 to the carboxyl carbon (C-1), and 
a to the carbon next to it. The position of any double bond(s) is indi- 
cated by A followed by a superscript number indicating the lower- 
numbered carbon in the double bond, (b) For polyunsaturated fatty 
acids (PUFAs), an alternative convention numbers the carbons in the 
opposite direction, assigning the number 1 to the methyl carbon at the 
other end of the chain; this carbon is also designated w (omega; the last 
letter in the Greek alphabet). The positions of the double bonds are 
indicated relative to the o) carbon. 
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TABLE 10-1 



Carbon 
skeleton 



Occurring Fatty Acids: Structure, Properties, and Nomencfatur 



Structure* 



12:0 

14:0 

16:0 

18:0 
20:0 
24:0 

16:1(A 9 ) 
18:1 (A 9 ) 
18:2(A 9 ' 12 ) 

18:3(A 9 ' 12 ' 15 ) 

20:4(A 5 ' 8 ' 1U4 ) 



Systematic name 1 



CH 3 (CH 2 ) 10 COOH 
CH 3 (CH 2 ) 12 COOH 
CH 3 (CH 2 ) 14 COOH 
CH 3 (CH 2 ) 16 COOH 
CH 3 (CH 2 ) 18 COOH 
CH 3 (CH 2 ) 22 COOH 



CH 3 (CH 2 ) 5 CH= 
CH(CH 2 ) 7 COOH 

CH 3 (CH 2 ) 7 CH= 
CH(CH 2 ) 7 COOH 

CH 3 (CH 2 ) 4 CH= 
CHCH 2 CH= 
CH(CH 2 ) 7 COOH 

CH 3 CH 2 CH=CHCH 2 CH= 
CHCH 2 CH= 
CH(CH 2 ) 7 COOH 

CH 3 (CH 2 ) 4 CH= 
CHCH 2 CH= 
CHCH 2 CH=CHCH 2 CH= 
CH(CH 2 ) 3 COOH 



Common name 
(derivation) 



Melting 



Solubility at 30 °C 
(mg/g solvent) 



point (°C) Water Benzene 



?z-Dodecanoic acid 



%-Tetradecanoic acid 



n-Hexadecanoic acid 



n-Octadecanoic acid 



?z-Eicosanoic acid 



n-Tetracosanoic 
acid 

czs-9-Hexadecenoic 
acid 

czs-9-Octadecenoic 
acid 

cis-,cis-9, 12- 
Octadecadienoic 
acid 

cis~, cis~, cisS, 12,15- 
Octadecatrienoic 
acid 

cis-,cis-,cis-,cis-5, 8,11, 
14- Icosatetraenoic 
acid 



Laurie acid 
(Latin laurus, 
"laurel plant") 

Myristic acid 
(Latin Myristica, 
nutmeg genus) 

Palmitic acid 
(Latin palma, 
"palm tree") 

Stearic acid 
(Greek stear, 
"hard fat") 

Arachidic acid 
(Latin Arachis, 
legume genus) 

Lignoceric acid 

(Latin lignum, 

"wood" 4- cera, "wax") 
Palmitoleic acid 

Oleic acid (Latin 
oleum, "oil") 

Linoleic acid 
(Greek linon, 
"flax") 

o'-Linolenic acid 



Arachidonic acid 



44.2 0.063 2,600 



53.9 0.024 



63.1 0.0083 



69.6 0.0034 



76.5 



874 



348 



124 



1 to -0.5 
13.4 
1-5 

-11 

-49.5 



•All acids are shown in their nonionized form. At pH 7, all free fatty acids have an ionized carboxylate. Note that numbering of carbon atoms begins at the carboxyl carbon. 
fnr h l P - e n iX H n H -I™™ 1 " unbranched structure - For instance - "dodecanoic" simply indicates 12 carbon atoms, which could be arranged in a variety of branched 

con^ 



for example, the 16-carbon saturated palmitic acid is ab- 
breviated 16:0, and the 18-carbon oleic acid, with one 
double bond, is 18:1. The positions of any double bonds 
are specified relative to the carboxyl carbon, numbered 1, 
by superscript numbers following A (delta); a 20-carbon 
fatty acid with one double bond between C-9 and C-10 
(C-l being the carboxyl carbon) and another between 
C-12 and C-13 is designated 20:2(A 9 ' 12 ). ■ 

The most commonly occurring fatty acids have even 
numbers of carbon atoms in an unbranched chain of 12 to 
24 carbons (Table 10-1). As we shall see in Chapter 21, 
the even number of carbons results from the mode of 
synthesis of these compounds, which involves succes- 
sive condensations of two-carbon (acetate) units. 



There is also a common pattern in the location of 
double bonds; in most monounsaturated fatty acids the 
double bond is between C-9 and C-10 (A 9 ), and the 
other double bonds of polyunsaturated fatty acids are 
generally A 12 and A 15 . (Arachidonic acid is an excep- 
tion to this generalization.) The double bonds of 
polyunsaturated fatty acids are almost never conju- 
gated (alternating single and double bonds, as in 
— CH=CH— CH=CH— ), but are separated by a meth- 
ylene group: — CH=CH— CH 2 — CH=CH— (Fig. 10-lb). 
In nearly all naturally occurring unsaturated fatty 
acids, the double bonds are in the cis configuration. 
Trans fatty acids are produced by fermentation in the 
rumen of dairy animals and are obtained from dairy 
products and meat. 



KEY CONVENTION: The family of polyunsaturated fatty 
acids (PUFAs) with a double bond between the third 
and fourth carbon from the methyl end of the chain are 
of special importance in human nutrition. Because the 
physiological role of PUFAs is related more to the posi- 
tion of the first double bond near the methyl end of the 
chain than to the carboxyl end, an alternative nomen- 
clature is sometimes used for these fatty acids. The car- 
bon of the methyl group — that is, the carbon most 
distant from the carboxyl group — is called the o) 
(omega) carbon and is given the number 1 (Fig. 10-lb). 
In this convention, PUFAs with a double bond between 
C-3 and C-4 are called omega-3 (co-3) fatty acids, and 
those with a double bond between C-6 and C-7 are 
omega-6 (w-6) fatty acids. ■ 

B5 Humans require but do not have the enzymatic 
mM capacity to synthesize the omega-3 PUFA a- 
Linolenic acid (ALA; 18:3(A 9,12 ' 15 ), in the standard con- 
vention), and must therefore obtain it in the diet. From 
ALA, humans can synthesize two other omega-3 PUFAs 
important in cellular function: eicosapentaenoic acid 
(EPA; 20:5(A 5;8 ' 1U4 ' 17 ), shown in Fig. 10-lb) and do- 
cosahexaenoic acid (DHA; 22:6(A 4 ' 7 ' 10 ' 13 ' 16 ' 19 )). An im- 
balance of omega-6 and omega-3 PUFAs in the diet is 
associated with an increased risk of cardiovascular dis- 
ease. The optimal dietary ratio of omega-6 to omega-3 
PUFAs is between 1:1 and 4:1, but the ratio in the diets 
of most North Americans is closer to 10:1 to 30:1, lead- 
ing to an increased frequency of heart disease and stroke. 
The "Mediterranean diet," which has been associated 
with lowered cardiovascular risk, is richer in omega-3 
PUFAs, obtained in leafy vegetables (salads) and fish 
oils. The latter oils are especially rich in EPA and DHA, 
and fish oil supplements are often prescribed for indi- 
viduals with a history of cardiovascular disease. ■ 

The physical properties of the fatty acids, and of 
compounds that contain them, are largely determined 
by the length and degree of unsaturation of the hydro- 
carbon chain. The nonpolar hydrocarbon chain accounts 
for the poor solubility of fatty acids in water. Laurie acid 
(12:0, M r 200), for example, has a solubility in water of 
0.063 mg/g— much less than that of glucose (M T 180), 
which is 1,100 mg/g. The longer the fatty acyl chain and 
the fewer the double bonds, the lower is the solubility in 
water. The carboxylic acid group is polar (and ionized at 
neutral pH) and accounts for the slight solubility of 
short-chain fatty acids in water. 

Melting points are also strongly influenced by the 
length and degree of unsaturation of the hydrocarbon 
chain. At room temperature (25 °C), the saturated fatty 
acids from 12:0 to 24:0 have a waxy consistency, 
whereas unsaturated fatty acids of these lengths are oily 
liquids. This difference in melting points is due to 
different degrees of packing of the fatty acid molecules 
(Fig. 10-2). In the fully saturated compounds, free ro- 
tation around each carbon-carbon bond gives the hy- 
drocarbon chain great flexibility; the most stable 
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Saturated Mixture of saturated and 

fatty acids unsaturated fatty acids 



FIGURE 10-2 The packing of fatty acids into stable aggregates. The 

extent of packing depends on the degree of saturation, (a) Two represen- 
tations of the fully saturated acid stearic acid, 1 8:0 (stearate at pH 7), in 
its usual extended conformation. Each line segment of the zigzag repre- 
sents a single bond between adjacent carbons, (b) The cis double bond 
(shaded) in oleic acid, 1 8:1 (A 9 ) (oleate), restricts rotation and introduces 
a rigid bend in the hydrocarbon tail. All other bonds in the chain are free 
to rotate, (c) Fully saturated fatty acids in the extended form pack into 
nearly crystalline arrays, stabilized by many hydrophobic interactions, 
(d) The presence of one or more fatty acids with cis double bonds inter- 
feres with this tight packing and results in less stable aggregates. 

conformation is the fully extended form, in which the 
steric hindrance of neighboring atoms is minimized. 
These molecules can pack together tightly in nearly 
crystalline arrays, with atoms all along their lengths in 
van der Waals contact with the atoms of neighboring 
molecules. In unsaturated fatty acids, a cis double bond 
forces a kink in the hydrocarbon chain. Fatty acids with 
one or several such kinks cannot pack together as 
tightly as fully saturated fatty acids, and their interac- 
tions with each other are therefore weaker. Because less 
thermal energy is needed to disorder these poorly or- 
dered arrays of unsaturated fatty acids, they have 
markedly lower melting points than saturated fatty 
acids of the same chain length (Table 10-1). 

In vertebrates, free fatty acids (unesterified fatty 
acids, with a free carboxylate group) circulate in the 
blood bound noncovalently to a protein carrier, serum 
albumin. However, fatty acids are present in blood 
plasma mostly as carboxylic acid derivatives such as 
esters or amides. Lacking the charged carboxylate 
group, these fatty acid derivatives are generally even 
less soluble in water than are the free fatty acids. 
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Triacylglycerols Are Fatty Acid Esters 
of Giycero! 

The simplest lipids constructed from fatty acids are the 
triacylglycerols, also referred to as triglycerides, fats, 
or neutral fats. Triacylglycerols are composed of three 
fatty acids each in ester linkage with a single glycerol 
(Fig. 10-3). Those containing the same kind of fatty 
acid in all three positions are called simple triacylglyc- 
erols and are named after the fatty acid they contain. 
Simple triacylglycerols of 16:0, 18:0, and 18:1, for exam- 
ple, are tripalmitin, tristearin, and triolein, respectively. 
Most naturally occurring triacylglycerols are mixed; 
they contain two or three different fatty acids. To name 
these compounds unambiguously, the name and posi- 
tion of each fatty acid must be specified. 

Because the polar hydroxyls of glycerol and the po- 
lar carboxylates of the fatty acids are bound in ester 
linkages, triacylglycerols are nonpolar, hydrophobic mol- 
ecules, essentially insoluble in water. Lipids have lower 
specific gravities than water, which explains why mix- 
tures of oil and water (oil-and-vinegar salad dressing, 




1-Stearoyl, 2-linoleoyl, 3-palmitoyl glycerol, 
a mixed triacylglycerol 



FIGURE 10-3 Glycerol and a triacylglycerol. The mixed triacylglyc- 
erol shown here has three different fatty acids attached to the glycerol 
backbone, When glycerol has different fatty acids at C-1 and C-3, C-2 
is a chiral center (p. 16). 



for example) have two phases: oil, with the lower specific 
gravity, floats on the aqueous phase. 

Triacylglycerols Provide Stored Energy 
and insulation 

In most eukaryotic cells, triacylglycerols form a sepa- 
rate phase of microscopic, oily droplets in the aqueous 
cytosol, serving as depots of metabolic fuel. In verte- 
brates, specialized cells called adipocytes, or fat cells, 
store large amounts of triacylglycerols as fat droplets 
that nearly fill the cell (Fig. 10-4a). Triacylglycerols 
are also stored as oils in the seeds of many types of 
plants, providing energy and biosynthetic precursors 
during seed germination (Fig. 10-4b). Adipocytes and 
germinating seeds contain lipases, enzymes that cat- 
alyze the hydrolysis of stored triacylglycerols, releas- 
ing fatty acids for export to sites where they are 
required as fuel. 




FIGURE 10-4 Fat stores in cells, (a) Cross section of four guinea pig 
adipocytes, showing huge fat droplets that virtually fill the cells. Also 
visible are several capillaries in cross section, (b) Cross section of a 
cotyledon cell from a seed of the plant Arabidopsis. The large dark 
structures are protein bodies, which are surrounded by stored oils in 
the light-colored oil bodies. 
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There are two significant advantages to using tria- 
cylglycerols as stored fuels, rather than polysaccha- 
rides such as glycogen and starch. First, the carbon 
atoms of fatty acids are more reduced than those of 
sugars, and oxidation of triacylglycerols yields more 
than twice as much energy, gram for gram, as the oxi- 
dation of carbohydrates. Second, because triacylglyc- 
erols are hydrophobic and therefore unhydrated, the 
organism that carries fat as fuel does not have to carry 
the extra weight of water of hydration that is associ- 
ated with stored polysaccharides (2 g per gram of 
polysaccharide). Humans have fat tissue (composed 
primarily of adipocytes) under the skin, in the abdom- 
inal cavity, and in the mammary glands. Moderately 
obese people with 15 to 20 kg of triacylglycerols de- 
posited in their adipocytes could meet their energy 
needs for months by drawing on their fat stores. In 
contrast, the human body can store less than a day's 
energy supply in the form of glycogen. Carbohydrates 
such as glucose and glycogen do offer certain advan- 
tages as quick sources of metabolic energy, one of 
which is their ready solubility in water. 



In some animals, triacylglycerols stored under the 
skin serve not only as energy stores but as insulation 
against low temperatures. Seals, walruses, penguins, 
and other warm-blooded polar animals are amply 
padded with triacylglycerols. In hibernating animals 
(bears, for example), the huge fat reserves accumu- 
lated before hibernation serve the dual purposes of in- 
sulation and energy storage (see Box 17-1). The low 
density of triacylglycerols is the basis for another re- 
markable function of these compounds. In sperm 
whales, a store of triacylglycerols and waxes allows 
the animals to match the buoyancy of their bodies to 
that of their surroundings during deep dives in cold 
water (Box 10-1). 

Partial Hydrogenation of Cooking Oils Produces 
Trans Fatty Acids 

01 Most natural fats, such as those in vegetable 
I oils, dairy products, and animal fat, are complex 
mixtures of simple and mixed triacylglycerols. These 
contain a variety of fatty acids differing in chain length 



BOX 10-1 Sperm Whales: Fatheads of the Deep 



Studies of sperm whales have uncovered another way 
in which triacylglycerols are biologically useful. The 
sperm whale's head is very large, accounting for more 
than one-third of its total body weight. About 90% of 
the weight of the head is made up of the spermaceti 
organ, a blubbery mass that contains up to 3,600 kg 
(about 4 tons) of spermaceti oil, a mixture of triacyl- 
glycerols and waxes containing an abundance of un- 
saturated fatty acids. This mixture is liquid at the 
normal resting body temperature of the whale, about 
37 °C, but it begins to crystallize at about 31 °C and be- 
comes solid when the temperature drops several more 
degrees. 

The probable biological function of spermaceti oil 
has been deduced from research on the anatomy and 
feeding behavior of the sperm whale. These mammals 
feed almost exclusively on squid in very deep water. In 
their feeding dives they descend 1,000 m or more; the 
deepest recorded dive is 3,000 m (almost 2 miles). At 
these depths, there are no competitors for the very 
plentiful squid; the sperm whale rests quietly, waiting 
for schools of squid to pass. 

For a marine animal to remain at a given depth with- 
out a constant swimming effort, it must have the same 
density as the surrounding water. The sperm whale un- 
dergoes changes in buoyancy to match the density of its 
surroundings — from the tropical ocean surface to great 
depths where the water is much colder and thus denser. 
The key is the freezing point of spermaceti oil. When the 



temperature of the oil is lowered several degrees during 
a deep dive, it congeals or crystallizes and becomes 
denser. Thus the buoyancy of the whale changes to 
match the density of seawater. Various physiological 
mechanisms promote rapid cooling of the oil during a 
dive. As the whale returns to the surface, the congealed 
spermaceti oil warms and melts, decreasing its density 
to match that of the surface water. Thus we see in 
the sperm whale a remarkable anatomical and biochem- 
ical adaptation. The triacylglycerols and waxes synthe- 
sized by the whale contain fatty acids of the necessary 
chain length and degree of unsaturation to give the 
spermaceti oil the proper melting point for the animal's 
diving habits. 



Spermaceti 
organ 




Unfortunately for the sperm whale population, sper- 
maceti oil was at one time considered the finest lamp oil 
and continues to be commercially valuable as a lubri- 
cant. Several centuries of intensive hunting of these 
mammals have driven sperm whales onto the endan- 
gered species list. 
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and degree of saturation (Fig. 10-5). Vegetable oils 
such as corn (maize) and olive oil are composed largely 
of triacylglycerols with unsaturated fatty acids and thus 
are liquids at room temperature. Triacylglycerols con- 
taining only saturated fatty acids, such as tristearin, the 
major component of beef fat, are white, greasy solids at 
room temperature. 

When lipid-rich foods are exposed too long to the 
oxygen in air, they may spoil and become rancid. The un- 
pleasant taste and smell associated with rancidity result 
from the oxidative cleavage of double bonds in unsatu- 
rated fatty acids, which produces aldehydes and car- 
boxylic acids of shorter chain length and therefore 
higher volatility. To improve the shelf life of vegetable 
oils used in cooking, and to increase their stability at the 
high temperatures used in deep-frying, commercial veg- 
etable oils are subjected to partial hydrogenation. This 
process converts many of the cis double bonds in the 
fatty acids to single bonds and increases the melting 
temperature of the oils so that they are more nearly solid 
at room temperature (margarine is produced from veg- 
etable oil in this way). Partial hydrogenation has another, 
undesirable, effect: some cis double bonds are converted 
to trans double bonds. There is now strong evidence that 



C 16 and C 18 C 16 and C 18 C 4 to C 14 
saturated unsaturated saturated 
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Olive oil 
liquid 

Natural fats at 25 °C 
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soft solid 
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FIGURE 10-5 Fatty acid composition of three food fats. Olive oil, 
butter, and beef fat consist of mixtures of triacylglycerols, differing in 
their fatty acid composition. The melting points of these fats— and 
hence their physical state at room temperature (25 °Q— are a direct 
function of their fatty acid composition. Olive oil has a high proportion 
of long-chain (C 16 and C 18 ) unsaturated fatty acids, which accounts for 
its liquid state at 25 °C. The higher proportion of long-chain (C 16 and 
C 18 ) saturated fatty acids in butter increases its melting point, so butter 
is a soft solid at room temperature. Beef fat, with an even higher pro- 
portion of long-chain saturated fatty acids, is a hard solid. 



dietary intake of trans fatty acids (often referred to sim- 
ply as "trans fats") leads to a higher incidence of cardio- 
vascular disease, and that avoiding these fats in the diet 
substantially reduces the risk of coronary heart disease. 
Dietary trans fatty acids raise the level of triacylglycerols 
and of LDL ("bad") cholesterol in the blood, and lower 
the level of HDL ("good") cholesterol, and these changes 
alone are enough to increase the risk of coronary heart 
disease. But trans fatty acids may have further adverse 
effects. They seem, for example, to increase the body's 
inflammatory response, which is another risk factor for 
heart disease. (See Chapter 21 for a description of LDL 
and HDL — low-density and high-density lipoprotein — 
cholesterol and their health effects.) 

Many fast foods are deep-fried in partially hydro- 
genated vegetable oils and therefore contain high levels 
of trans fatty acids (Table 10-2). In view of the detri- 
mental effects of these fats, some countries (Denmark, 
for example) and some cities (New York City and 
Philadelphia) severely restrict the use of partially hydro- 
genated oils in restaurants. French fries prepared in a 
chain fast-food restaurant in Denmark now contain al- 
most no detectable trans fatty acids, whereas the same 
product prepared in the United States contains 5 to 10 g 
of trans fatty acids per serving (Table 10-2). The delete- 
rious effects of trans fats occur at intakes of 2 to 7 g/day 
(20 to 60 kcal in a daily caloric intake of 2000 kcal; note 
that a nutritional Calorie is the equivalent of the kilcalo- 
rie used by chemists and biochemists, so a 2000 Calorie 
diet is the equivalent of a 2000 kcal diet). A single serv- 
ing of french fries in a U.S. restaurant may contain this 
amount of trans fatty acid! Many other prepared foods, 
baked goods, and snacks on the shelves of supermarkets 
have comparably high levels of trans fats. ■ 



TABLE 10-2 



Trans Fatty Acids in Some Typical 
Fast Foods and Snacks 



■ 





Trans fatty acid content 




In a typical 


As%of 




serving (g) 


total fatty acids 


French fries 


4.7-6.1 


28-36 


Breaded fish burger 


5.6 


28 


Breaded chicken 






nuggets 


5.0 


25 


Pizza 


1.1 


9 


Corn tortilla chips 


1.6 


22 


Doughnut 


2.7 


25 


Muffin 


0.7 


14 


Chocolate bar 


02 


2 



Source: Adapted from Table 1 in Mozaffarian, D„ Katan, M.B., Ascherio, RH., Stampfer, 
M.J., & Willet, W.C. (2006) Trans fatty acids and cardiovascular disease. N. Engl J 
Med. 354, 1604-1605. 

Note: All data for foods prepared with partially hydrogenated vegetable oil in the United 
States in 2002. 
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Waxes Serve as Energy Stores and 
Water Repellents 

Biological waxes are esters of long-chain (C 14 to C 36 ) 
saturated and unsaturated fatty acids with long-chain 
(C 16 to C 30 ) alcohols (Fig. 10-6). Their melting 
points (60 to 100 °C) are generally higher than those 
of triacylglycerols. In plankton, the free-floating mi- 
croorganisms at the bottom of the food chain for ma- 
rine animals, waxes are the chief storage form of 
metabolic fuel. 

Waxes also serve a diversity of other functions re- 
lated to their water-repellent properties and their firm 
consistency. Certain skin glands of vertebrates secrete 
waxes to protect hair and skin and keep it pliable, lu- 
bricated, and waterproof. Birds, particularly water- 
fowl, secrete waxes from their preen glands to keep 
their feathers water-repellent. The shiny leaves of 
holly, rhododendrons, poison ivy, and many tropical 
plants are coated with a thick layer of waxes, which 
prevents excessive evaporation of water and protects 
against parasites. 

Biological waxes find a variety of applications in 
the pharmaceutical, cosmetic, and other industries. 
Lanolin (from lamb's wool), beeswax (Fig. 10-6), car- 
nauba wax (from a Brazilian palm tree), and wax ex- 
tracted from spermaceti oil (from whales; see Box 10-1) 
are widely used in the manufacture of lotions, ointments, 
and polishes. 



CH 3 (CH 2 ) 14 -C-0— CH 2 -(CH 2 ) 28 -CH 3 

^ y " * -J 

Palmitic acid 1-Triacontanol 

(a) 




(b) 



FIGURE 10-6 Biological wax. (a) Triacontanoylpalmitate, the major 
component of beeswax, is an ester of palmitic acid with the alcohol tri- 
acontanol. (b) A honeycomb, constructed of beeswax, is firm at 25 °C 
and completely impervious to water. The term "wax" originates in the 
Old English weax, meaning "the material of the honeycomb." 



SUMMARY 10.1 Storage Lipids 

■ Lipids are water-insoluble cellular components, 
of diverse structure, that can be extracted by 
nonpolar solvents. 

■ Almost all fatty acids, the hydrocarbon components 
of many lipids, have an even number of carbon 
atoms (usually 12 to 24); they are either saturated 
or unsaturated, with double bonds almost always 
in the cis configuration. 

■ Triacylglycerols contain three fatty acid molecules 
esterified to the three hydroxyl groups of glycerol. 
Simple triacylglycerols contain only one type of 
fatty acid; mixed triacylglycerols, two or three 
types. Triacylglycerols are primarily storage fats; 
they are present in many foods. 

■ Partial hydrogenation of vegetable oils in the food 
industry converts some cis double bonds to the 
trans configuration. Trans fatty acids in the diet are 
an important risk factor for coronary heart disease. 

1 0.2 Structural Lipids in Membranes 

The central architectural feature of biological mem- 
branes is a double layer of lipids, which acts as a barrier 
to the passage of polar molecules and ions. Membrane 
lipids are amphipathic: one end of the molecule is 
hydrophobic, the other hydrophilic. Their hydrophobic 
interactions with each other and their hydrophilic inter- 
actions with water direct their packing into sheets called 
membrane bilayers. In this section we describe five gen- 
eral types of membrane lipids: glycerophospholipids, in 
which the hydrophobic regions are composed of two 
fatty acids joined to glycerol; galactolipids and sul- 
folipids, which also contain two fatty acids esterified to 
glycerol, but lack the characteristic phosphate of phos- 
pholipids; archaeal tetraether lipids, in which two very 
long alkyl chains are ether-linked to glycerol at both 
ends; sphingolipids, in which a single fatty acid is joined 
to a fatty amine, sphingosine; and sterols, compounds 
characterized by a rigid system of four fused hydrocar- 
bon rings. 

The hydrophilic moieties in these amphipathic com- 
pounds may be as simple as a single — OH group at one 
end of the sterol ring system, or they may be much more 
complex. In glycerophospholipids and some sphin- 
golipids, a polar head group is joined to the hydrophobic 
moiety by a phosphodiester linkage; these are the phos- 
pholipids. Other sphingolipids lack phosphate but have 
a simple sugar or complex oligosaccharide at their polar 
ends; these are the glycolipids (Fig. 10-7). Within 
these groups of membrane lipids, enormous diversity re- 
sults from various combinations of fatty acid "tails" and 
polar "heads." The arrangement of these lipids in mem- 
branes, and their structural and functional roles therein, 
are considered in the next chapter. 
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FIGURE 10-8 (.-Glycerol 3-phosphate, the backbone of phospholipids. 

Glycerol itself is nol chiral. as it has a plane of symmetry through C-2, 
However, glycerol is prochiral— it can be converted to a chiral com- 
pound by adding a substituent such as phosphate to either of the 
— CH 2 OH groups. One unambiguous nomenclature for glycerol phos- 
phate is the dl system (described on p. 74), in which the isomers are 
named according to their stereochemical relationships to glyceralde- 
hyde isomers. By this system, the stereoisomer of glycerol phosphate 
found in most lipids is correctly named either L-glycerol 3-phosphate or 
D-glycerol 1 -phosphate. Another way to specify stereoisomers is the sn 
(stereospecific numbering) system, in which C-1 is, by definition, the 
group of the prochiral compound that occupies the pro-S position. The 
common form of glycerol phosphate in phospholipids is, by this system, 
sn-glycerol 3-phosphate. 
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FIGURE 10-7 Some common types of storage and membrane lipids. 

All the lipid types shown here have either glycerol or sphingosine as 
the backbone (pink screen), to which are attached one or more long- 
chain alkyl groups (yellow) and a polar head group (blue). In triacyl- 
glycerols, glycerophospholipids, galactolipids, and sulfolipids, the 
alkyl groups are fatty acids in ester linkage. Sphingol ipids contain a 

Glycerophospholipids Are Derivatives of 
PhosphatidicAcid 

Glycerophospholipids, also called phosphoglycerides, 
are membrane lipids in which two fatty acids are attached 
in ester linkage to the first and second carbons of glyc- 
erol, and a highly polar or charged group is attached 
through a phosphodiester linkage to the third carbon. 
Glycerol is prochiral; it has no asymmetric carbons, but 
attachment of phosphate at one end converts it into a 
chiral compound, which can be correctly named either 
L-glycerol 3-phosphate, D-glycerol 1 -phosphate, or sn- 
glycerol 3-phosphate (Fig. 10-8). Glycerophospholipids 
are named as derivatives of the parent compound, phos- 
phatide acid (Fig. 10-9), according to the polar 
alcohol in the head group. Phosphatidylcholine and phos- 
phatidylethanolamine have choline and ethanolamine 
as their polar head groups, for example. In all these 



single fatty acid, in amide linkage to the sphingosine backbone. The 
membrane lipids of archaea are variable; that shown here has two very 
long, branched alkyl chains, each end in ether linkage with a glycerol 
moiety. In phospholipids the polar head group is joined through a 
phosphodiester, whereas glycolipids have a direct glycosidic linkage 
between the head-group sugar and the backbone glycerol. 

compounds, the head group is joined to glycerol through a 
phosphodiester bond, in which the phosphate group bears 
a negative charge at neutral pH. The polar alcohol may be 
negatively charged (as in phosphatidylinositol 4,5-bisphos- 
phate), neutral (phosphatidylserine), or positively charged 
(phosphatidylcholine, phosphatidylethanolamine). As we 
shall see in Chapter 1 1 , these charges contribute greatly to 
the surface properties of membranes. 

The fatty acids in glycerophospholipids can be any 
of a wide variety, so a given phospholipid (phosphatidyl- 
choline, for example) may consist of several molecular 
species, each with its unique complement of fatty acids. 
The distribution of molecular species is specific for 
different organisms, different tissues of the same organ- 
ism, and different glycerophospholipids in the same cell 
or tissue. In general, glycerophospholipids contain a C 16 
or C 18 saturated fatty acid at C-1 and a Ci 8 or C 20 unsat- 
urated fatty acid at C-2. With few exceptions, the 
biological significance of the variation in fatty acids and 
head groups is not yet understood. 

Some Glycerophospholipids Have Ether-Linked Fatty Acids 

Some animal tissues and some unicellular organisms are 
rich in ether lipids, in which one of the two acyl chains 
is attached to glycerol in ether, rather than ester, link- 
age. The ether-linked chain may be saturated, as in the 
alkyl ether lipids, or may contain a double bond between 
C-1 and C-2, as in plasmalogens (Fig. 10-10). Verte- 
brate heart tissue is uniquely enriched in ether lipids; 
about half of the heart phospholipids are plasmalogens. 
The membranes of halophilic bacteria, ciliated protists, 
and certain invertebrates also contain high proportions 
of ether lipids. The functional significance of ether lipids 
in these membranes is unknown; perhaps their resist- 
ance to the phospholipases that cleave ester-linked fatty 
acids from membrane lipids is important in some roles. 
At least one ether lipid, platelet-activating fac- 
tor, is a potent molecular signal. It is released from 
leukocytes called basophils and stimulates platelet aggre- 
gation and the release of serotonin (a vasoconstrictor) 
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FIGURE 10-9 Glycerophospholipids. The common glycerophospho- 
lipids are diacylglycerols linked to head-group alcohols through a 
phosphodiester bond. Phosphatidic acid, a phosphomonoester, is 
the parent compound. Each derivative is named for the head-group 
alcohol (X), with the prefix "phosphatidyl-." In cardiolipin, two phos- 
phatidic acids share a single glycerol (R 1 and R 2 are fatty acyl groups). 
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FIGURE 10-10 Ether lipids. Plasmalogens have an ether-linked 
alkenyl chain where most glycerophospholipids have an ester-linked 
fatty acid (compare Fig. 10-9). Platelet-activating factor has a long 
ether-linked alkyl chain at C-1 of glycerol, but C-2 is ester-linked to 
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alcohol is ethanolamine in plasmalogens and choline in platelet- 
activating factor 
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from platelets. It also exerts a variety of effects on liver, 
smooth muscle, heart, uterine, and lung tissues and 
plays an important role in inflammation and the allergic 
response. ■ 

Chloroplasts Contain Galactolipids and Sulfolipids 

The second group of membrane lipids are those that pre- 
dominate in plant cells: the galactolipids, in which one 
or two galactose residues are connected by a glycosidic 
linkage to C-3 of a 1,2-diacylglycerol (Fig. 10-11; see 
also Fig. 10-7). Galactolipids are localized in the thy- 
lakoid membranes (internal membranes) of chloroplasts; 
they make up 70% to 80% of the total membrane lipids of 
a vascular plant. They are probably the most abundant 
membrane lipids in the biosphere. Phosphate is often the 
limiting plant nutrient in soil, and perhaps the evolution- 
ary pressure to conserve phosphate for more critical 
roles favored plants that made phosphate-free lipids. 
Plant membranes also contain sulfolipids, in which a sul- 
fonated glucose residue is joined to a diacylglycerol in 
glycosidic linkage. In sulfolipids, the sulfonate on the 
head group bears a negative charge like that of the phos- 
phate group in phospholipids (Fig. 10-11). 

Archaea Contain Unique Membrane Lipids 

The archaea, most of which live in ecological niches 
with extreme conditions— high temperatures (boiling 
water), low pH, high ionic strength, for example— have 



membrane lipids containing long-chain (32 carbons) 
branched hydrocarbons linked at each end to glycerol 
(Fig. 10-12). These linkages are through ether bonds, 
which are much more stable to hydrolysis at low pH and 
high temperature than are the ester bonds found in the 
lipids of bacteria and eukaryotes. In their fully extended 
form, these archaeal lipids are twice the length of phos- 
pholipids and sphingolipids, and can span the full width of 
the surface membrane. At each end of the extended mol- 
ecule is a polar head consisting of glycerol linked to either 
phosphate or sugar residues. The general name for these 
compounds, glycerol dialkyl glycerol tetraethers 
(GDGTs), reflects their unique structure. The glycerol 
moiety of the archaeal lipids is not the same stereoisomer 
as that in the lipids of Bacteria and Eukarya; the central 
carbon is in the R configuration in Archaea, in the S con- 
figuration in the other domains (Fig. 10-8). 

Sphingolipids Are Derivatives of Sphingosine 

Sphingolipids, the fourth large class of membrane 
lipids, also have a polar head group and two nonpolar 
tails, but unlike glycerophospholipids and galactolipids 
they contain no glycerol. Sphingolipids are composed of 
one molecule of the long-chain amino alcohol sphingo- 
sine (also called 4-sphingenine) or one of its derivatives, 
one molecule of a long-chain fatty acid, and a polar head 
group that is joined by a glycosidic linkage in some cases 
and a phosphodiester in others (Fig. 10-13). 
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FIGURE 10-11 Three glycolipids of 
chloroplast thylakoid membranes. In 

monogalactosyldiacylglycerols (MGDGs) 
and digalactosyldiacylglycerols (DGDGs), 
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from linoleic acid, 18:2(A 9 ' 12 ), and the 
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nosyldiacylglycerol, the sulfonate carries 
a fixed negative charge. 
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(on the condensation of isoprene units, see Fig. 21-36; also, compare 
the diphytanyl groups with the 20-carbon phytol side chain of chloro- 
phylls in Fig. 19-4 7a). In this extended form, the diphytanyl groups are 
about twice the length of a 16-carbon fatty acid typically found in the 



membrane lipids of bacteria and eukaryotes. The glycerol moieties in 
the archaeal lipids are in the R configuration, in contrast to those of 
bacteria and eukaryotes, which have the S configuration. Archaeal 
lipids differ in the substituents on the glycerols. In the molecule shown 
here, one glycerol is linked to the disaccharide a-glucopyranosyl- 
(1->2)-/3-galactofuranose; the other glycerol is linked to a glycerol 
phosphate head group. 
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FIGURE 10-13 Sphingolipids. The first three carbons at the polar end of 
sphingosine are analogous to the three carbons of glycerol in glyc- 
erophospholipids. The amino group at C-2 bears a fatty acid in amide 
linkage. The fatty acid is usually saturated or monounsaturated, with 1 6, 
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18, 22, or 24 carbon atoms. Ceramide is the parent compound for this 
group. Other sphingolipids differ in the polar head group (X) attached at 
C-1 . Gangliosides have very complex oligosaccharide head groups. Stan- 
dard symbols for sugars are used in this figure, as shown in Table 7-1 . 




Carbons C-l, C-2, and C-3 of the sphingosine mole- 
cule are structurally analogous to the three carbons of 
glycerol in glycerophospholipids. When a fatty acid is at- 
tached in amide linkage to the — NH 2 on C-2, the result- 
ing compound is a ceramide, which is structurally 
similar to a diacylglycerol. Ceramide is the structural 
parent of all sphingolipids. 

There are three subclasses of sphingolipids, all de- 
rivatives of ceramide but differing in their head groups: 
sphingomyelins, neutral (uncharged) glycolipids, and 
gangliosides. Sphingomyelins contain phosphocholine 
or phosphoethanolamine as their polar head group and 
are therefore classified along with glycerophospholipids 
as phospholipids (Fig. 10-7). Indeed, sphingomyelins 
resemble phosphatidylcholines in their general proper- 
ties and three-dimensional structure, and in having 
no net charge on their head groups (Fig. 10-14). 
Sphingomyelins are present in the plasma membranes 
of animal cells and are especially prominent in myelin, a 
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FIGURE 10-14 The similarities in shape and molecular structure of 
phosphatidylcholine (a glycerophospholipid) and sphingomyelin (a 
sphingolipid) are clear when their space-filling and structural formulas 
are drawn as here. 



membranous sheath that surrounds and insulates the ax- 
ons of some neurons — thus the name "sphingomyelins." 

Glycosphingolipids, which occur largely in the 
outer face of plasma membranes, have head groups with 
one or more sugars connected directly to the —OH at 
C-l of the ceramide moiety; they do not contain phos- 
phate. Cerebrosides have a single sugar linked to 
ceramide; those with galactose are characteristically 
found in the plasma membranes of cells in neural tissue, 
and those with glucose in the plasma membranes of cells 
in nonneural tissues. Globosides are glycosphingolipids 
with two or more sugars, usually D-glucose, D-galactose, 
or AT-acetyl-D-galactosamine. Cerebrosides and globo- 
sides are sometimes called neutral glycolipids, as they 
have no charge at pH 7. 

Gangliosides, the most complex sphingolipids, have 
oligosaccharides as their polar head groups and one or 
more residues of AT-acetylneuraminic acid (NeuSAc), a 
sialic acid (often simply called "sialic acid"), at the ter- 
mini. Sialic acid gives gangliosides the negative charge at 
pH 7 that distinguishes them from globosides. Ganglio- 
sides with one sialic acid residue are in the GM (M for 
mono-) series, those with two are in the GD (D for di-) se- 
ries, and so on (GT, three sialic acid residues; GQ, four). 
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Sphingolipids at Cell Surfaces Are Sites of 
Biological Recognition 

When sphingolipids were discovered more than a cen- 
tury ago by the physician-chemist Johann Thudichum, 
their biological role seemed as enigmatic as the Sphinx, 
for which he therefore named them. In humans, at least 
60 different sphingolipids have been identified in cellu- 
lar membranes. Many of these are especially prominent 
in the plasma membranes of neurons, and some are 
clearly recognition sites on the cell surface, but a spe- 
cific function for only a few sphingolipids has been dis- 
covered thus far. The carbohydrate moieties of certain 
sphingolipids define the human blood groups and there- 
fore determine the type of blood that individuals can 
safely receive in blood transfusions (Fig. 10-15). 

Gangliosides are concentrated in the outer surface 
of cells, where they present points of recognition for ex- 
tracellular molecules or surfaces of neighboring cells. 
The kinds and amounts of gangliosides in the plasma 
membrane change dramatically during embryonic de- 
velopment. Tumor formation induces the synthesis of a 
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FIGURE 10-15 Glycosphingolipids as determinants of blood 
groups. The human blood groups (O, A, B) are determined in 
part by the oligosaccharide head groups of these glycosphingolipids. 
The same three oligosaccharides are also found attached to certain 
blood proteins of individuals of blood types O, A, and B, respectively. 
Standard symbols for sugars are used here (see Table 7-1). 

new complement of gangliosides, and very low concen- 
trations of a specific ganglioside have been found to 
induce differentiation of cultured neuronal tumor 
cells. Investigation of the biological roles of diverse 
gangliosides remains fertile ground for future research. 

Phospholipids and Sphingoids Are 
Degraded in Lysosomes 

Most cells continually degrade and replace their mem- 
brane lipids. For each hydrolyzable bond in a 
glycerophospholipid, there is a specific hydrolytic en- 
zyme in the lysosome (Fig. 10-16). Phospholipases of 
the A type remove one of the two fatty acids, producing 
a lysophospholipid. (These esterases do not attack the 
ether link of plasmalogens.) Lysophospholipases re- 
move the remaining fatty acid. 

Gangliosides are degraded by a set of lysosomal enzymes 
that catalyze the stepwise removal of sugar units, finally 
yielding a ceramide. A genetic defect in any of these hy- 
drolytic enzymes leads to the accumulation of gangliosides in 
the cell, with severe medical consequences (Box 10-2). 

Sterols Have Four Fused Carbon Rings 

Sterols are structural lipids present in the membranes of 
most eukaryotic cells. The characteristic structure of this 
fifth group of membrane lipids is the steroid nucleus, con- 
sisting of four fused rings, three with six carbons and one 
with five (Fig. 10-17). The steroid nucleus is almost pla- 
nar and is relatively rigid; the fused rings do not allow ro- 
tation about C — C bonds. Cholesterol, the major sterol 
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FIGURE 10-16 The specificities of phospholipases. Phospholipases A T 
and A 2 hydrolyze the ester bonds of intact glycerophospholipids at C-l 
and C-2 of glycerol, respectively. When one of the fatty acids has been 
removed by a type A phospholipase, the second fatty acid is removed 
by a lysophospholipase (not shown). Phospholipases C and D each 
split one of the phosphodiester bonds in the head group. Some 
phospholipases act on only one type of glycerophospholipid, such as 
phosphatidylinositol 4,5-bisphosphate (shown here) or phosphatidyl- 
choline; others are less specific. 
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FIGURE 10-17 Cholesterol. The stick structure of cholesterol is visible 
through a transparent surface contour model of the molecule. In the 
chemical structure, the rings are labeled A through D to simplify refer- 
ence to derivatives of the steroid nucleus; the carbon atoms are num- 
bered in blue. The C-3 hydroxyl group (pink in both representations) is 
the polar head group. For storage and transport of the sterol, this hy- 
droxyl group condenses with a fatty acid to form a sterol ester. 
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BOX 10-2 



M E D I C I N 



Abnormal Accumulations of Membrane Lipids: Some 
Inherited Human Diseases 




The polar lipids of membranes undergo constant meta- 
bolic turnover, the rate of their synthesis normally coun- 
terbalanced by the rate of breakdown. The breakdown 
of lipids is promoted by hydrolytic enzymes in lyso- 
somes, each enzyme capable of hydrolyzing a specific 
bond. When sphingolipid degradation is impaired by a 
defect in one of these enzymes (Fig. 1), partial break- 
down products accumulate in the tissues, causing 
serious disease. 

For example, Niemann-Pick disease is caused by a 
rare genetic defect in the enzyme sphingomyelinase, 
which cleaves phosphocholine from sphingomyelin. 
Sphingomyelin accumulates in the brain, spleen, and 
liver. The disease becomes evident in infants and causes 
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mental retardation and early death. More common is 
Tay-Sachs disease, in which ganglioside GM2 accumu- 
lates in the brain and spleen (Fig. 2) owing to lack of the 
enzyme hexosaminidase A. The symptoms of Tay-Sachs 
disease are progressive developmental retardation, 
paralysis, blindness, and death by the age of 3 or 4 years. 

Genetic counseling can predict and avert many in- 
heritable diseases. Tests on prospective parents can de- 
tect abnormal enzymes, then DNA testing can 
determine the exact nature of the defect and the risk it 
poses for offspring. Once a pregnancy occurs, fetal cells 
obtained by sampling a part of the placenta (chorionic 
villus sampling) or the fluid surrounding the fetus 
(amniocentesis) can be tested in the same way. 

FIGURE 1 Pathways for the breakdown of GM1, globoside, and sphin- 
gomyelin to ceramide. A defect in the enzyme hydrolyzing a particular 
step is indicated by ®; the disease that results from accumulation of 
the partial breakdown product is noted. 
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FIGURE 2 Electron micrograph 
of a portion of a brain cell from 
an infant with Tay-Sachs dis- 
ease, obtained post mortem, 
showing abnormal ganglioside 
deposits in the lysosomes. 



in animal tissues, is ampmpathic, with a polar head group 
(the hydroxyl group at C-3) and a nonpolar hydrocarbon 
body (the steroid nucleus and the hydrocarbon side chain 
at C-17), about as long as a 16-carbon fatty acid in its ex- 
tended form. Similar sterols are found in other eukary- 



otes: stigmasterol in plants and ergosterol in fungi, for ex- 
ample. Bacteria cannot synthesize sterols; a few bacterial 
species, however, can incorporate exogenous sterols into 
their membranes. The sterols of all eukaryotes are syn- 
thesized from simple five-carbon isoprene subunits, as 
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are the fat-soluble vitamins, quinones, and dolichols de- 
scribed in Section 10.3. 

In addition to their roles as membrane constituents, 
the sterols serve as precursors for a variety of products 
with specific biological activities. Steroid hormones, for 
example, are potent biological signals that regulate gene 
expression. Bile acids are polar derivatives of choles- 
terol that act as detergents in the intestine, emulsifying 
dietary fats to make them more readily accessible to 
digestive lipases. 




Taurocholic acid 
(a bile acid) 

We return to cholesterol and other sterols in later 
chapters, to consider the structural role of cholesterol in 
biological membranes (Chapter 11), signaling by steroid 
hormones (Chapter 12), and the remarkable biosynthe- 
tic pathway to cholesterol and transport of cholesterol 
by lipoprotein carriers (Chapter 21). 

SUMMARY 10.2 Structural Lipids in 
Membranes 

■ The polar lipids, with polar heads and nonpolar 
tails, are major components of membranes. The 
most abundant are the glycerophospholipids, which 
contain fatty acids esterified to two of the hydroxyl 
groups of glycerol, and a second alcohol, the head 
group, esterified to the third hydroxyl of glycerol 
via a phosphodiester bond. Other polar lipids are 
the sterols. 

■ Glycerophospholipids differ in the structure of 
their head group; common glycerophospholipids 
are phosphatidylethanolamine and 
phosphatidylcholine. The polar heads of the 
glycerophospholipids are charged at pH near 7. 

■ Chloroplast membranes are rich in galactolipids, 
composed of a diacylglycerol with one or two linked 
galactose residues, and sulfolipids, diacylglycerols 
with a linked sulfonated sugar residue and thus a 
negatively charged head group. 

■ Archaea have unique membrane lipids, with 
long-chain alkyl groups ether-linked to glycerol at 
both ends and with sugar residues and/or phosphate 
joined to the glycerol to provide a polar or charged 
head group. These lipids are stable under the harsh 
conditions in which archaea live. 

■ The sphingolipids contain sphingosine, a long-chain 
aliphatic amino alcohol, but no glycerol. 
Sphingomyelin has, in addition to phosphoric acid 



and choline, two long hydrocarbon chains, one 
contributed by a fatty acid and the other by 
sphingosine. Three other classes of sphingolipids 
are cerebrosides, globosides, and gangliosides, 
which contain sugar components. 

■ Sterols have four fused rings and a hydroxyl group. 
Cholesterol, the major sterol in animals, is both a 
structural component of membranes and precursor 
to a wide variety of steroids. 

10.3 Lipids as Signals, Cofactors, and Pigments 

The two functional classes of lipids considered thus far 
(storage lipids and structural lipids) are major cellular 
components; membrane lipids make up 5% to 10% of the 
dry mass of most cells, and storage lipids more than 80% 
of the mass of an adipocyte. With some important excep- 
tions, these lipids play a passive role in the cell; lipid fu- 
els are stored until oxidized by enzymes, and membrane 
lipids form impermeable barriers around cells and cellu- 
lar compartments. Another group of lipids, present in 
much smaller amounts, have active roles in the meta- 
bolic traffic as metabolites and messengers. Some serve 
as potent signals — as hormones, carried in the blood 
from one tissue to another, or as intracellular messengers 
generated in response to an extracellular signal (hor- 
mone or growth factor). Others function as enzyme co- 
factors in electron-transfer reactions in chloroplasts and 
mitochondria, or in the transfer of sugar moieties in a va- 
riety of glycosylation reactions. A third group consists of 
lipids with a system of conjugated double bonds: pigment 
molecules that absorb visible light. Some of these act as 
light-capturing pigments in vision and photosynthesis; 
others produce natural colorations, such as the orange of 
pumpkins and carrots and the yellow of canary feathers. 
Finally, a very large group of volatile lipids produced in 
plants serve as signals that pass through the air, allowing 
plants to communicate with each other, and to invite an- 
imal friends and deter foes. We describe in this section a 
few representatives of these biologically active lipids. In 
later chapters, their synthesis and biological roles are 
considered in more detail. 

Phosphatidylinositois and Sphingosine Derivatives Act as 
Intracellular Signals 

Phosphatidylinositol and its phosphorylated derivatives 
act at several levels to regulate cell structure and 
metabolism. Phosphatidylinositol 4,5-bisphosphate 
(Fig. 10-9) in the cytoplasmic (inner) face of plasma 
membranes serves as a reservoir of messenger mole- 
cules that are released inside the cell in response to ex- 
tracellular signals interacting with specific surface 
receptors. Extracellular signals such as the hormone 
vasopressin activate a specific phospholipase C in the 
membrane, which hydrolyzes phosphatidylinositol 4,5- 
bisphosphate to release two products that act as intra- 
cellular messengers: inositol 1,4,5-trisphosphate (IP3), 
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which is water-soluble, and diacylglycerol, which re- 
mains associated with the plasma membrane. IP 3 trig- 
gers release of Ca 2+ from the endoplasmic reticulum, 
and the combination of diacylglycerol and elevated 
cytosolic Ca 2+ activates the enzyme protein kinase C. 
By phosphorylating specific proteins, this enzyme 
brings about the cell's response to the extracellular sig- 
nal. This signaling mechanism is described more fully 
in Chapter 12 (see Fig. 12-10). 

Inositol phospholipids also serve as points of nucle- 
ation for supramolecular complexes involved in signaling 
or in exocytosis. Certain signaling proteins bind specifi- 
cally to phosphatidylinositol 3,4,5-trisphosphate in the 
plasma membrane, initiating the formation of multien- 
zyme complexes at the membrane's cytosolic surface. 
Formation of phosphatidylinositol 3,4,5-trisphosphate in 
response to extracellular signals therefore brings the 
proteins together in signaling complexes at the surface 
of the plasma membrane (see Fig. 12-16). 

Membrane sphingolipids also can serve as sources 
of intracellular messengers. Both ceramide and sphin- 
gomyelin (Fig. 10-13) are potent regulators of protein 
kinases, and ceramide or its derivatives are involved in 
the regulation of cell division, differentiation, migration, 
and programmed cell death (also called apoptosis; see 
Chapter 12). 

Eicosanoids Carry Messages to Nearby Cells 

Eicosanoids are paracrine hormones, substances 
that act only on cells near the point of hormone 
synthesis instead of being transported in the blood to act 
on cells in other tissues or organs. These fatty acid deriv- 
atives have a variety of dramatic effects on vertebrate 



tissues. They are involved in reproductive function; in the 
inflammation, fever, and pain associated with injury or 
disease; in the formation of blood clots and the regulation 
of blood pressure; in gastric acid secretion; and in various 
other processes important in human health or disease. 

All eicosanoids are derived from arachidonic acid 
(20:4(A 5 ' 8 ' n ' 14 )) (Fig. 10-18), the 20-carbon polyun- 
saturated fatty acid from which they take their general 
name (Greek eikosi, "twenty"). There are three classes 
of eicosanoids: prostaglandins, thromboxanes, and 
leukotrienes. 

Prostaglandins (PG) contain a five-carbon ring 
originating from the chain of arachidonic acid. Their 
name derives from the prostate gland, the tissue from 
which they were first isolated by Bengt Samuelsson and 
Sune Bergstrom. Two groups of prostaglandins were 
originally defined: PGE (ether-soluble) and PGF (fosfat 
(Swedish for phosphate) buffer-soluble). Each group 
contains numerous subtypes, named PGE l5 PGE 2 , PGF 1? 
and so forth. Prostaglandins have an array of functions. 
Some stimulate contraction of the smooth muscle of the 
uterus during menstruation and labor. Others affect 
blood flow to specific organs, the wake-sleep cycle, and 
the responsiveness of certain tissues to hormones such 
as epinephrine and glucagon. Prostaglandins in a third 
group elevate body temperature (producing fever) and 
cause inflammation and pain. 

The thromboxanes have a six-membered ring con- 
taining an ether. They are produced by platelets (also 
called thrombocytes) and act in the formation of blood 
clots and the reduction of blood flow to the site of a clot. 
As shown by John Vane, the nonsteroidal antiinflamma- 
tory drugs (NSAIDs) — aspirin, ibuprofen, and meclofe- 
namate, for example — inhibit the enzyme prostaglandin 
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FIGURE 10-18 Arachidonic acid and some eicosanoid deriva- 
tives. Arachidonic acid (arachidonate at pH 7) is the precursor 
of eicosanoids, including the prostaglandins, thromboxanes, and 
leukotrienes. In prostaglandin E-,, C-8 and C-12 of arachidonate are 
joined to form the characteristic five-membered ring. In thromboxane 
A 2 , the C-8 and C-12 are joined and an oxygen atom is added to form 
the six-membered ring. Leukotriene A 4 has a series of three conjugated 
double bonds. Nonsteroidal antiinflammatory drugs (NSAIDs) such as 
aspirin and ibuprofen block the formation of prostaglandins and 
thromboxanes from arachidonate by inhibiting the enzyme cyclooxy- 
genase (prostaglandin H 2 synthase). 
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H 2 synthase (also called cyclooxygenase, or COX), 
which catalyzes an early step in the pathway from 
arachidonate to prostaglandins and thromboxanes 
(Fig. 10-18; see also Fig. 21-15). 

Leukotrienes, first found in leukocytes, contain 
three conjugated double bonds. They are powerful bio- 
logical signals. For example, leukotriene D 4 , derived 
from leukotriene A 4 , induces contraction of the smooth 
muscle lining the airways to the lung. Overproduction of 
leukotrienes causes asthmatic attacks, and leukotriene 
synthesis is one target of antiasthmatic drugs such as 
prednisone. The strong contraction of the smooth mus- 
cle of the lungs that occurs during anaphylactic shock is 
part of the potentially fatal allergic reaction in individu- 
als hypersensitive to bee stings, penicillin, or other 
agents. ■ 

Steroid Hormones Carry Messages between Tissues 

Steroids are oxidized derivatives of sterols; they 
have the sterol nucleus but lack the alkyl chain 
attached to ring D of cholesterol, and they are more po- 
lar than cholesterol. Steroid hormones move through 
the bloodstream (on protein carriers) from their site of 
production to target tissues, where they enter cells, 
bind to highly specific receptor proteins in the nucleus, 
and trigger changes in gene expression and thus metab- 
olism. Because hormones have very high affinity for 
their receptors, very low concentrations of hormones 
(nanomolar or less) are sufficient to produce responses 
in target tissues. The major groups of steroid hormones 
are the male and female sex hormones and the hor- 
mones produced by the adrenal cortex, Cortisol and al- 
dosterone (Fig. 10-19). Prednisone and prednisolone 
are steroid drugs with potent antiinflammatory activi- 
ties, mediated in part by the inhibition of arachidonate 
release by phospholipase A 2 (Fig. 10-18) and conse- 
quent inhibition of the synthesis of leukotrienes, 
prostaglandins, and thromboxanes. These drugs have a 
variety of medical applications, including the treatment 
of asthma and rheumatoid arthritis. ■ 

Vascular plants contain the steroidlike brassinolide 
(Fig. 10-19), a potent growth regulator that increases 
the rate of stem elongation and affects the orientation of 
cellulose microfibrils in the cell wall during growth. 

Vascular Plants Produce Thousands of Volatile Signals 

Plants produce literally thousands of different lipophilic 
compounds, volatile substances that are used to attract 
pollinators, to repel herbivores, to attract organisms 
that defend the plant against herbivores, and to commu- 
nicate with other plants. Jasmonate, for example (see 
Fig. 12-32), derived from the fatty acid 18:3(A 9 ' 12 ' 15 ) in 
membrane lipids, triggers the plant's defenses in re- 
sponse to insect-inflicted damage. The methyl ester of 
jasmonate gives the characteristic fragrance of jasmine 
oil, which is widely used in the perfume industry. Many 
of the plant volatiles are derived from fatty acids, or 
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FIGURE 10-19 Steroids derived from cholesterol. Testos- 
terone, the male sex hormone, is produced in the testes. Estra- 
diol, one of the female sex hormones, is produced in the ovaries and 
placenta. Cortisol and aldosterone are hormones synthesized in the 
cortex of the adrenal gland; they regulate glucose metabolism and salt 
excretion, respectively. Prednisolone and prednisone are synthetic 
steroids used as antiinflammatory agents. Brassinolide is a growth reg- 
ulator found in vascular plants. 

from compounds made by the condensation of five- 
carbon isoprene units; these include geraniol (the char- 
acteristic scent of geraniums), /3-pinene (pine trees), 
limonene (limes), menthol, and carvone (see Fig. l-23a), 
to name but a few. 
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Vitamins A and D Are Hormone Precursors 

During the first third of the twentieth century, a 
major focus of research in physiological chemistry 
was the identification of vitamins, compounds that are 
essential to the health of humans and other vertebrates 
but cannot be synthesized by these animals and must 
therefore be obtained in the diet. Early nutritional studies 
identified two general classes of such compounds: those 
soluble in nonpolar organic solvents (fat-soluble vitamins) 
and those that could be extracted from foods with aque- 
ous solvents (water-soluble vitamins). Eventually the fat- 
soluble group was resolved into the four vitamin groups A, 
D, E, and K, all of which are isoprenoid compounds syn- 
thesized by the condensation of multiple isoprene units. 
Two of these (D and A) serve as hormone precursors. 

Vitamin D 3 , also called cholecalciferol, is nor- 
mally formed in the skin from 7-dehydrocholesterol in a 
photochemical reaction driven by the UV component of 
sunlight (Fig. 1 0-20a). Vitamin D 3 is not itself biolog- 
ically active, but it is converted by enzymes in the liver 
and kidney to 1,25-dihydroxycholecalciferol, a hor- 
mone that regulates calcium uptake in the intestine and 
calcium levels in kidney and bone. Deficiency of vitamin 
D leads to defective bone formation and the disease 
rickets, for which administration of vitamin D produces 
a dramatic cure (Fig. 10-20b). Vitamin D 2 (ergocalcif- 
erol) is a commercial product formed by UV irradiation 
of the ergosterol of yeast. Vitamin D 2 is structurally 



similar to D 3 , with slight modification to the side chain 
attached to the sterol D ring. Both have the same bio- 
logical effects, and D 2 is commonly added to milk and 
butter as a dietary supplement. Like steroid hormones, 
the product of vitamin D metabolism, 1,25-dihydroxyc- 
holecalciferol, regulates gene expression by interacting 
with specific nuclear receptor proteins (see p. 1143). 

Vitamin A (retinol), in its various forms, functions 
as a hormone and as the visual pigment of the vertebrate 
eye (Fig. 10-21). Acting through receptor proteins in 
the cell nucleus, the vitamin A derivative retinoic acid 
regulates gene expression in the development of epithe- 
lial tissue, including skin. Retinoic acid is the active in- 
gredient in the drug tretinoin (Retin-A), used in the 
treatment of severe acne and wrinkled skin. Retinal, an- 
other vitamin A derivative, is the pigment that initiates 
the response of rod and cone cells of the retina to light, 
producing a neuronal signal to the brain. This role of 
retinal is described in detail in Chapter 12. 

Vitamin A was first isolated from fish liver oils; liver, 
eggs, whole milk, and butter are also good dietary 
sources. In vertebrates, /3-carotene, the pigment that 
gives carrots, sweet potatoes, and other yellow vegeta- 
bles their characteristic color, can be enzymatically con- 
verted to vitamin A. Deficiency of vitamin A leads to a 
variety of symptoms in humans, including dryness of the 
skin, eyes, and mucous membranes; retarded develop- 
ment and growth; and night blindness, an early symptom 
commonly used in diagnosing vitamin A deficiency. ■ 
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FIGURE 10-20 Vitamin D 3 production and metabolism. 

(a) Cholecalciferol (vitamin D 3 ) is produced in the skin by UV 
irradiation of 7-dehydrocholesterol, which breaks the bond shaded pink. 
In the liver, a hydroxyl group is added at C-25; in the kidney, a second 
hydroxylation at C-1 produces the active hormone, 1 ,25- 
dihydroxycholecalciferol, This hormone regulates the metabolism of 
Ca 2+ in kidney, intestine, and bone, (b) Dietary vitamin D prevents rick- 
ets, a disease once common in cold climates where heavy clothing 
blocks the UV component of sunlight necessary for the production of vi- 
tamin D 3 in skin, In this detail from a large mural by John Steuart Curry, 
The Social Benefits of Biochemical Research (1 943), the people and an- 
imals on the left show the effects of poor nutrition, including the bowed 
legs of a boy with classical rickets. On the right are the people and ani- 
mals made healthier with the "social benefits of research," including the 
use of vitamin D to prevent and treat rickets. This mural is in the Depart- 
ment of Biochemistry at the University of Wisconsin-Madison. 
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FIGURE 10-21 Vitamin A t and its precursor and derivatives, (a) (3- 

Carotene is the precursor of vitamin At . Isoprene structural units are set 
off by dashed red lines (see p. 359). Cleavage of /3-carotene yields two 
molecules of vitamin At (retinol) (b). Oxidation at C-1 5 converts retinol 
to the aldehyde, retinal (c), and further oxidation produces retinoic acid 
(d), a hormone that regulates gene expression. Retinal combines with 
the protein opsin to form rhodopsin (not shown)/ a visual pigment 




Retinoic acid 
(d) 



visible jjf 
light 




all-Jrans-Retmal 
(e) 



Hormonal 
signal to 
epithelial 
cells 



Neuronal 
signal 
to brain 



widespread in nature. In the dark, retinal of rhodopsin is in the 11-c/s 
form (c). When a rhodopsin molecule is excited by visible lights the 
1 1 -c/s-retinal undergoes a series of photochemical reactions that con- 
vert it to all-r/a/is-retinal (e), forcing a change in the shape of the entire 
rhodopsin molecule. This transformation in the rod cell of the verte- 
brate retina sends an electrical signal to the brain that is the basis of 
visual transduction, a topic we address in more detail in Chapter 12. 



Vitamins E and K and the Lipid Quinones Are 
Oxidation-Reduction Cofactors 

|7| Vitamin E is the collective name for a group of 
El closely related lipids called tocopherols, all of 

which contain a substituted aromatic ring and a long iso- 
prenoid side chain (Fig. 10-22a). Because they are hy- 
drophobic, tocopherols associate with cell membranes, 
lipid deposits, and lipoproteins in the blood. Tocopherols 
are biological antioxidants. The aromatic ring reacts with 
and destroys the most reactive forms of oxygen radicals 
and other free radicals, protecting unsaturated fatty acids 
from oxidation and preventing oxidative damage to mem- 
brane lipids, which can cause cell fragility. Tocopherols are 
found in eggs and vegetable oils and are especially abun- 
dant in wheat germ. Laboratory animals fed diets depleted 
of vitamin E develop scaly skin, muscular weakness and 
wasting, and sterility. Vitamin E deficiency in humans is 
very rare; the principal symptom is fragile erythrocytes. 

The aromatic ring of vitamin K (Fig. 10-22b) un- 
dergoes a cycle of oxidation and reduction during the 
formation of active prothrombin, a blood plasma protein 
essential in blood clotting. Prothrombin is a proteolytic 
enzyme that splits peptide bonds in the blood protein 
fibrinogen to convert it to fibrin, the insoluble fibrous 



protein that holds blood clots together. Henrik Dam and 
Edward A. Doisy independently discovered that vitamin K 
deficiency slows blood clotting, which can be fatal. Vita- 
min K deficiency is very uncommon in humans, aside 
from a small percentage of infants who suffer from hem- 
orrhagic disease of the newborn, a potentially fatal dis- 
order. In the United States, newborns are routinely 
given a 1 mg injection of vitamin K. Vitamin K x (phyllo- 
quinone) is found in green plant leaves; a related form, 
vitamin K 2 (menaquinone) , is formed by bacteria living 
in the vertebrate intestine. 




Edward A, Doisy, Henrik Dam, 

1893-1986 1895-1976 



362] Lipids 



(a) 

Vitamin E: an antioxidant 
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electron carrier (n = 4 to 8) 
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Dolichol: a sugar carrier 
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FIGURE 10-22 Some other biologically active isoprenoid compounds has 10 isoprene units. Dolichols of animals have 17 to 21 isoprene 
or derivatives. Units derived from isoprene are set off by dashed red units (85 to 105 carbon atoms), bacterial dolichols have 11, and those 
lines. In most mammalian tissues, ubiquinone (also called coenzyme Q) of plants and fungi have 1 4 to 24. 



Warfarin (Fig. 10-22c) is a synthetic compound that 
inhibits the formation of active prothrombin. It is partic- 
ularly poisonous to rats, causing death by internal 
bleeding. Ironically, this potent rodenticide is also an in- 
valuable anticoagulant drug for treating humans at risk 
for excessive blood clotting, such as surgical patients 
and those with coronary thrombosis. ■ 

Ubiquinone (also called coenzyme Q) and plasto- 
quinone (Fig. 10-22d, e) are isoprenoids that function 
as lipophilic electron carriers in the oxidation-reduction 
reactions that drive ATP synthesis in mitochondria and 
chloroplasts, respectively Both ubiquinone and plasto- 
quinone can accept either one or two electrons and 
either one or two protons (see Fig. 19-2). 

Dolichols Activate Sugar Precursors for Biosynthesis 

During assembly of the complex carbohydrates of bacte- 
rial cell walls, and during the addition of polysaccharide 



units to certain proteins (glycoproteins) and lipids (gly- 
colipids) in eukaryotes, the sugar units to be added are 
chemically activated by attachment to isoprenoid alco- 
hols called dolichols (Fig. 10-22f). These compounds 
have strong hydrophobic interactions with membrane 
lipids, anchoring the attached sugars to the membrane, 
where they participate in sugar-transfer reactions. 

Many Natural Pigments Are Lipidic Conjugated Dienes 

Conjugated dienes have carbon chains with alternating 
single and double bonds. Because this structural arrange- 
ment allows the derealization of electrons, the com- 
pounds can be excited by low-energy electromagnetic 
radiation (visible light), giving them colors visible to hu- 
mans and other animals. Carotene (Fig. 10-21) is yellow- 
orange; similar compounds give bird feathers their 
striking reds, oranges, and yellows (Fig. 10-23). Like 
sterols, steroids, dolichols, vitamins A, E, D, and K, 
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FIGURE 10-23 Lipids as pigments in plants and bird feathers. Com- 
pounds with long conjugated systems absorb light in the visible region of 
the spectrum. Subtle differences in the chemistry of these compounds 
produce pigments of strikingly different colors. Birds acquire the pig- 



ments that color their feathers red or yellow by eating plant materials that 
contain carotenoid pigments, such as canthaxanthin and zeaxanthin. The 
differences in pigmentation between male and female birds are the result 
of differences in intestinal uptake and processing of carotenoids. 



ubiquinone, and plastoquinone, these pigments are syn- 
thesized from five-carbon isoprene derivatives; the 
biosynthetic pathway is described in detail in Chapter 21. 



SUMMARY 10.3 



Lipids as Signals, 
Cofactors, and Pigments 



Some types of lipids, although present in relatively 
small quantities, play critical roles as cofactors or 
signals. 

Phosphatidylinositol bisphosphate is hydrolyzed to 
yield two intracellular messengers, diacylglycerol and 
inositol 1,4,5-trisphosphate. Phosphatidylinositol 
3,4,5-trisphosphate is a nucleation point for 
supramolecular protein complexes involved in 
biological signaling. 

Prostaglandins, thromboxanes, and leukotrienes 
(the eicosanoids) , derived from arachidonate, are 
extremely potent hormones. 

Steroid hormones, derived from sterols, serve as 
powerful biological signals, such as the sex 
hormones. 

Vitamins D, A, E, and K are fat-soluble compounds 
made up of isoprene units. All play essential roles 
in the metabolism or physiology of animals. Vitamin 
D is precursor to a hormone that regulates calcium 
metabolism. Vitamin A furnishes the visual pigment 
of the vertebrate eye and is a regulator of gene 
expression during epithelial cell growth. Vitamin E 
functions in the protection of membrane lipids from 
oxidative damage, and vitamin K is essential in the 
blood-clotting process. 

Ubiquinones and plastoquinones, also isoprenoid 
derivatives, are electron carriers in mitochondria 
and chloroplasts, respectively. 



■ Dolichols activate and anchor sugars to cellular 
membranes; the sugar groups are then used in the 
synthesis of complex carbohydrates, glycolipids, 
and glycoproteins. 

■ Lipidic conjugated dienes serve as pigments in 
flowers and fruits and give bird feathers their 
striking colors. 

10.4 Working with Lipids 

Because lipids are insoluble in water, their extraction 
and subsequent fractionation require the use of organic 
solvents and some techniques not commonly used in 
the purification of water-soluble molecules such as 
proteins and carbohydrates. In general, complex mix- 
tures of lipids are separated by differences in polarity 
or solubility in nonpolar solvents. Lipids that contain 
ester- or amide-linked fatty acids can be hydrolyzed by 
treatment with acid or alkali or with specific hydrolytic 
enzymes (phospholipases, glycosidases) to yield their 
components for analysis. Some methods commonly 
used in lipid analysis are shown in Figure 10-24 and 
discussed below. 

Lipid Extraction Requires Organic Solvents 

Neutral lipids (triacylglycerols, waxes, pigments, and so 
forth) are readily extracted from tissues with ethyl 
ether, chloroform, or benzene, solvents that do not per- 
mit lipid clustering driven by hydrophobic interactions. 
Membrane lipids are more effectively extracted by more 
polar organic solvents, such as ethanol or methanol, 
which reduce the hydrophobic interactions among lipid 
molecules while also weakening the hydrogen bonds 
and electrostatic interactions that bind membrane lipids 
to membrane proteins. A commonly used extractant is a 
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FIGURE 10-24 Common procedures in the extraction, separation, 
and identification of cellular lipids, (a) Tissue is homogenized in a 
chloroform/methanol/water mixture, which on addition of water and 
removal of unextractable sediment by centrifugation yields two 
phases. Different types of extracted lipids in the chloroform phase may 
be separated by (b) adsorption chromatography on a column of silica 
gel, through which solvents of increasing polarity are passed, or 
(c) thin-layer chromatography (TLC), in which lipids are carried up a 
silica gel-coated plate by a rising solvent front, less polar lipids travel- 
ing farther than more polar or charged lipids. TLC with appropriate 
solvents can also be used to separate closely related lipid species; for 
example, the charged lipids phosphatidylserine, phosphatidylglycerol, 
and phosphatidylinositol are easily separated by TLC. 

For the determination of fatty acid composition, a lipid fraction 
containing ester-linked fatty acids is transesterified in a warm aqueous 
solution of NaOH and methanol (d), producing a mixture of fatty acyl 
methyl esters. These methyl esters are then separated on the basis of 
chain length and degree of saturation by (e) gas-liquid chromatogra- 
phy (GLC) or (f) high-performance liquid chromatography (HPLC). Pre- 
cise determination of molecular mass by mass spectrometry allows 
unambiguous identification of individual lipids, 



mixture of chloroform, methanol, and water, initially in 
volume proportions (1:2:0.8) that are miscible, produc- 
ing a single phase. After tissue is homogenized in this 
solvent to extract all lipids, more water is added to the 
resulting extract and the mixture separates into two 
phases, methanol/water (top phase) and chloroform 
(bottom phase). The lipids remain in the chloroform 
layer, and the more polar molecules such as proteins 
and sugars partition into the methanol/water layer. 
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Adsorption Chromatography Separates Lipids of 
Different Polarity 

Complex mixtures of tissue lipids can be fractionated by 
chromatographic procedures based on the different po- 
larities of each class of lipid. In adsorption chromatogra- 
phy (Fig. 10-24b), an insoluble, polar material such as 
silica gel (a form of silicic acid, Si(OH) 4 ) is packed into 
a glass column, and the lipid mixture (in chloroform 
solution) is applied to the top of the column. (In high- 
performance liquid chromatography, the column is of 
smaller diameter and solvents are forced through the 
column under high pressure.) The polar lipids bind 
tightly to the polar silicic acid, but the neutral lipids pass 
directly through the column and emerge in the first 
chloroform wash. The polar lipids are then eluted, in or- 
der of increasing polarity, by washing the column with 
solvents of progressively higher polarity. Uncharged but 
polar lipids (cerebrosides, for example) are eluted with 
acetone, and very polar or charged lipids (such as 
glycerophospholipids) are eluted with methanol. 

Thin-layer chromatography on silicic acid employs 
the same principle (Fig. 10-24c). A thin layer of silica gel 
is spread onto a glass plate, to which it adheres. A small 
sample of lipids dissolved in chloroform is applied near 
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one edge of the plate, which is dipped in a shallow con- 
tainer of an organic solvent or solvent mixture; the entire 
setup is enclosed in a chamber saturated with the solvent 
vapor. As the solvent rises on the plate by capillary ac- 
tion, it carries lipids with it. The less polar lipids move 
farthest, as they have less tendency to bind to the silicic 
acid. The separated lipids can be detected by spraying 
the plate with a dye (rhodamine) that fluoresces when 
associated with lipids, or by exposing the plate to iodine 
fumes. Iodine reacts reversibly with the double bonds in 
fatty acids, such that lipids containing unsaturated fatty 
acids develop a yellow or brown color. Several other 
spray reagents are also useful in detecting specific lipids. 
For subsequent analysis, regions containing separated 
lipids can be scraped from the plate and the lipids 
recovered by extraction with an organic solvent. 

Gas-Liquid Chromatography Resolves Mixtures of 
Volatile Lipid Derivatives 

Gas-liquid chromatography separates volatile compo- 
nents of a mixture according to their relative tendencies 
to dissolve in the inert material packed in the chro- 
matography column or to volatilize and move through 
the column, carried by a current of an inert gas such as 
helium. Some lipids are naturally volatile, but most 
must first be derivatized to increase their volatility 
(that is, lower their boiling point). For an analysis of 
the fatty acids in a sample of phospholipids, the lipids 
are first transesterified: heated in a methanol/HCl or 
methanol/NaOH mixture to convert fatty acids esteri- 
fied to glycerol into their methyl esters (Fig. 10-24d). 
These fatty acyl methyl esters are then loaded onto the 
gas-liquid chromatography column, and the column is 
heated to volatilize the compounds. Those fatty acyl es- 
ters most soluble in the column material partition into 
(dissolve in) that material; the less soluble lipids are car- 
ried by the stream of inert gas and emerge first from the 
column. The order of elution depends on the nature of 
the solid adsorbant in the column and on the boiling 
point of the components of the lipid mixture. Using 
these techniques, mixtures of fatty acids of various 
chain lengths and various degrees of unsaturation can 
be completely resolved (Fig. 10-24e). 

Specific Hydrolysis Aids in Determination of Lipid Structure 

Certain classes of lipids are susceptible to degradation un- 
der specific conditions. For example, all ester-linked fatty 
acids in triacylglycerols, phospholipids, and sterol esters 
are released by mild acid or alkaline treatment, and some- 
what harsher hydrolysis conditions release amide-bound 
fatty acids from sphingolipids. Enzymes that specifically 
hydrolyze certain lipids are also useful in the determina- 
tion of lipid structure. Phospholipases A, C, and D 
(Fig. 10-16) each split particular bonds in phospholipids 
and yield products with characteristic solubilities and 
chromatographic behaviors. Phospholipase C, for example, 



releases a water-soluble phosphoryl alcohol (such as phos- 
phocholine from phosphatidylcholine) and a chloroform- 
soluble diacylglycerol, each of which can be characterized 
separately to determine the structure of the intact 
phospholipid. The combination of specific hydrolysis with 
characterization of the products by thin-layer, gas-liquid, 
or high-performance liquid chromatography often allows 
determination of a lipid structure. 

Mass Spectrometry Reveals Complete Lipid Structure 

To establish unambiguously the length of a hydrocarbon 
chain or the position of double bonds, mass spectromet- 
ric analysis of lipids or their volatile derivatives is in- 
valuable. The chemical properties of similar lipids (for 
example, two fatty acids of similar length unsaturated at 
different positions, or two isoprenoids with different 
numbers of isoprene units) are very much alike, and 
their order of elution from the various chromatographic 
procedures often does not distinguish between them. 
When the eluate from a chromatography column is 
sampled by mass spectrometry, however, the compo- 
nents of a lipid mixture can be simultaneously separated 
and identified by their unique pattern of fragmentation 
(Fig. 10-25). 

Lipidomics Seeks to Catalog All Lipids and Their Functions 

In exploring the biological role of lipids in cells and tis- 
sues, it is important to know which lipids are present 
and in what proportions, and to know how this lipid 
composition changes with embryonic development, dis- 
ease, or drug treatment. As lipid biochemists have be- 
come aware of the thousands of different naturally 
occurring lipids, they have proposed a new nomencla- 
ture system, with the aim of making it easier to compile 
and search databases of lipid composition. The system 
places each lipid in one of eight chemical groups (Table 
10-3) designated by two letters. Within these groups, 
finer distinctions are indicated by numbered classes and 
subclasses. For example, all glycerophosphocholines are 
GP01; the subgroup of glycerophosphocholines with two 
fatty acids in ester linkage is designated GP0101; with 
one fatty acid ether-linked at position 1 and one in ester 
linkage at position 2, this becomes GP0102. Specific 
fatty acids are designated by numbers that give every 
lipid its own unique identifier, so that each individual 
lipid, including lipid types not yet discovered, can be un- 
ambiguously described in terms of a 12-character iden- 
tifier. One factor used in this classification is the nature 
of the biosynthetic precursor. For example, prenol lipids 
(dolichols and vitamins E and K, for example) are 
formed from isoprenyl precursors. Polyketides, which 
we have not discussed in this chapter, include some nat- 
ural products, many toxic, with biosynthetic pathways 
related to those for fatty acids. The eight chemical cate- 
gories in Table 10-3 do not coincide perfectly with the 
divisions according to biological function that we have 
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FIGURE 10-25 Determination of fatty acid structure by mass spec- 
trometry. The fatty acid is first converted to a derivative that minimizes 
migration of the double bonds when the molecule is fragmented by 
electron bombardment. The derivative shown here is a picolinyl ester 
of linoleic acid— 1 8:2(A 912 ) (M r 371)— in which the alcohol is picoli- 
nol (red). When bombarded with a stream of electrons, this molecule 
is volatilized and converted to a parent ion {M + ; M r 371), in which the 
N atom bears the positive charge, and a series of smaller fragments 
produced by breakage of C— C bonds in the fatty acid. The mass spec- 
trometer separates these charged fragments according to their 
mass/charge ratio (m/z). (To review the principles of mass spectrome- 
try see Box 3-2.) 
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The prominent ions at m/z = 92, 108, 151, and 164 contain the 
pyridine ring of the picolinol and various fragments of the carboxyl group, 
showing that the compound is indeed a picolinyl ester. The molecular ion, 
M + (m/z = 371), confirms the presence of a C 18 fatty acid with two dou- 
ble bonds. The uniform series of ions 14 atomic mass units (u) apart repre- 
sents loss of each successive methyl and methylene group from the methyl 
end of the acyl chain (beginning at C-1 8; the right end of the molecule as 
shown here), until the ion at m/z = 300 is reached. This is followed by a 
gap of 26 u for the carbons of the terminal double bond, at m/z = 274; a 
further gap of 14 u for the C-1 1 methylene group, at m/z = 260; and so 
forth. By this means the entire structure is determined, although these data 
alone do not reveal the configuration (cis or trans) of the double bonds. 



used in this chapter. For example, the structural lipids 
of membranes include both glycerophospholipids and 
sphingolipids, separate categories in Table 10-3. Each 
method of categorization has its advantages. 

The application of mass spectrometric techniques 
with high throughput and high resolution can provide 
quantitative catalogs of all the lipids present in a spe- 
cific cell type — the lipidome — under particular condi- 
tions, and of the ways in which the lipidome changes 
with differentiation, disease such as cancer, or drug 
treatment. An animal cell contains about a thousand 
different lipid species, each presumably having a spe- 



cific function. These functions are known for a growing 
number of lipids, but the still largely unexplored 
lipidome offers a rich source of new problems for the 
next generation of biochemists and cell biologists to 
solve. 

SUMMARY 10.4 Working with Lipids 

■ In the determination of lipid composition, the lipids 
are first extracted from tissues with organic 
solvents and separated by thin-layer, gas-liquid, or 
high-performance liquid chromatography 



■ WHTig- Eight Major Categories of Biological Lipids 


Category 


Category code 


Examples 


Fatty acids 


FA 


Oleate, stearoyl-CoA, palmitoylcarnitine 


Glycerolipids 


GL 


Di- and triacylglycerols 


Glycerophospholipids 


GP 


Phosphatidylcholine, phosphatidyls erine, phosphatidylethanolamine 


Sphingolipids 


SP 


Sphingomyelin, ganglioside GM2 


Sterol lipids 


ST 


Cholesterol, progesterone, bile acids 


Prenol lipids 


PR 


Farnesol, geraniol, retinol, ubiquinone 


Saccharolipids 


SL 


Lipopolysaccharide 


Polyketides 


PK 


Tetracycline, anatoxin B x 
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Phospholipases specific for one of the bonds in a 
phospholipid can be used to generate simpler 
compounds for subsequent analysis. 

Individual lipids are identified by their 
chromatographic behavior, their susceptibility to 
hydrolysis by specific enzymes, or mass 
spectrometry. 

Lipidomics combines powerful analytical techniques 
to determine the full complement of lipids in a 
cell or tissue (the lipidome) and to assemble 
annotated databases that allow comparisons 
between lipids of different cell types and under 
different conditions. 
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Problems 



1. Operational Definition of Lipids How is the definition 
of "lipid" different from the types of definitions used for other 
biomolecules that we have considered, such as amino acids, 
nucleic acids, and proteins? 

2. Melting Points of Lipids The melting points of a series 
of 18-carbon fatty acids are: stearic acid, 69.6 °C; oleic acid, 
13.4 °C; linoleic acid, -5 °C; and linolenic acid, -11 °C 



(a) What structural aspect of these 18-carbon fatty acids 
can be correlated with the melting point? 

(b) Draw all the possible triacylglycerols that can be con- 
structed from glycerol, palmitic acid, and oleic acid. Rank 
them in order of increasing melting point. 

(c) Branched-chain fatty acids are found in some bacterial 
membrane lipids. Would their presence increase or decrease 
the fluidity of the membranes (that is, give them a lower or 
higher melting point)? Why? 

3. Preparation of Bearnaise Sauce During the prepara- 
tion of bearnaise sauce, egg yolks are incorporated into melted 
butter to stabilize the sauce and avoid separation. The stabiliz- 
ing agent in the egg yolks is lecithin (phosphatidylcholine) . 
Suggest why this works. 

4. Isoprene Units in Isoprenoids Geraniol, farnesol, and 
squalene are called isoprenoids, because they are synthesized 
from five-carbon isoprene units. In each compound, circle the 
five-carbon units representing isoprene units (see Fig. 10-22). 




Farnesol 




Squalene 

5. Naming Lipid Stereoisomers The two compounds be- 
low are stereoisomers of carvone with quite different proper- 
ties; the one on the left smells like spearmint, and that on the 
right, like caraway. Name the compounds using the RS system. 
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6. RS Designations for Alanine and Lactate Draw (using 
wedge-bond notation) and label the (it?) and (S) isomers of 
2-aminopropanoic acid (alanine) and 2-hydroxypropanoic 
acid (lactic acid) . 



HoN' 



I 



COOH 



CH, 



OH' 



^COOH 
CH 3 



2-Aminopropanoic acid 
(alanine) 



2-Hydroxypropanoic acid 
(lactic acid) 



7. Hydrophobic and Hydrophilic Components of Mem- 
brane Lipids A common structural feature of membrane 
lipids is their amphipathic nature. For example, in phos- 
phatidylcholine, the two fatty acid chains are hydrophobic and 
the phosphocholine head group is hydrophilic. For each of the 
following membrane lipids, name the components that serve 
as the hydrophobic and hydrophilic units: (a) phos- 
phatidylethanolamine; (b) sphingomyelin; (c) galactosylcere- 
broside; (d) ganglioside; (e) cholesterol. 

8. Structure of Omega-6 Fatty Acid Draw the structure of 
the omega-6 fatty acid 16:1. 

9. Catalytic Hydrogenation of Vegetable Oils Catalytic hy- 
drogenation, used in the food industry, converts double bonds in 
the fatty acids of the oil triacylglycerols to — CH 2 — CH 2 — . How 
does this affect the physical properties of the oils? 

10. Alkali Lability of Triacylglycerols A common proce- 
dure for cleaning the grease trap in a sink is to add a product 
that contains sodium hydroxide. Explain why this works. 

11. Deducing Lipid Structure from Composition 

Compositional analysis of a certain lipid shows that it has 
exactly one mole of fatty acid per mole of inorganic phos- 
phate Could this be a glycerophospholipid? A ganglioside? A 
sphingomyelin? 

12. Deducing Lipid Structure from Molar Ratio of Com- 
ponents Complete hydrolysis of a glycerophospholipid yields 
glycerol, two fatty acids (16:1 (A 9 ) and 16:0), phosphoric acid, 
and serine in the molar ratio 1:1:1:1:1. Name this lipid and 
draw its structure. 

13. Impermeability of Waxes What property of the waxy 
cuticles that cover plant leaves makes the cuticles imperme- 
able to water? 

B9 14. The Action of Phospholipases The venom 
Kjfll of the Eastern diamondback rattler and the Indian 
cobra contains phospholipase A 2 , which catalyzes the hy- 
drolysis of fatty acids at the C-2 position of glycerophos- 
pholipids. The phospholipid breakdown product of this 
reaction is lysolecithin (lecithin is phosphatidylcholine). 
At high concentrations, this and other lysophospholipids 
act as detergents, dissolving the membranes of erythro- 
cytes and lysing the cells. Extensive hemolysis may be life- 
threatening. 



(a) All detergents are amphipathic. What are the hy- 
drophilic and hydrophobic portions of lysolecithin? 

(b) The pain and inflammation caused by a snake bite can 
be treated with certain steroids. What is the basis of this 
treatment? 

(c) Though the high levels of phospholipase A 2 in venom 
can be deadly, this enzyme is necessary for a variety of normal 
metabolic processes. What are these processes? 

15. Lipids in Blood Group Determination We note in Fig- 
ure 10-15 that the structure of glycosphingolipids determines 
the blood groups A, B, and 0 in humans. It is also true that gly- 
coproteins determine blood groups. How can both statements 
be true? 

16. Intracellular Messengers from Phosphatidylinositols 

When the hormone vasopressin stimulates cleavage of 
phosphatidylinositol 4,5-bisphosphate by hormone-sensitive 
phospholipase C, two products are formed. What are they? Com- 
pare their properties and their solubilities in water, and predict 
whether either would diffuse readily through the cytosol. 

17. Storage of Fat-Soluble Vitamins In contrast to water 
soluble vitamins, which must be part of our daily diet, fat-soluble 
vitamins can be stored in the body in amounts sufficient for many 
months. Suggest an explanation for this difference. 

18. Hydrolysis of Lipids Name the products of mild hydroly- 
sis with dilute NaOH of (a) l-stearoyl-2,3-dipalmitoylglycerol; 
(b) l-palmitoyl-2-oleoylphosphatidylcholine. 

19. Effect of Polarity on Solubility Rank the following in 
order of increasing solubility in water: a triacylglycerol, a dia- 
cylglycerol, and a monoacylglycerol, all containing only 
palmitic acid, 

20. Chromatographic Separation of Lipids A mixture of 
lipids is applied to a silica gel column, and the column is 
then washed with increasingly polar solvents. The mixture 
consists of phosphatidylserine, phosphatidylethanolamine, 
phosphatidylcholine, cholesteryl palmitate (a sterol ester), 
sphingomyelin, palmitate, n-tetradecanol, triacylglycerol, and 
cholesterol. In what order will the lipids elute from the column? 
Explain your reasoning. 

21. Identification of Unknown Lipids Johann Thudichum, 
who practiced medicine in London about 100 years ago, also 
dabbled in lipid chemistry in his spare time. He isolated a vari- 
ety of lipids from neural tissue, and characterized and named 
many of them. His carefully sealed and labeled vials of isolated 
lipids were rediscovered many years later. 

(a) How would you confirm, using techniques not avail- 
able to Thudichum, that the vials labeled "sphingomyelin" and 
"cerebroside" actually contain these compounds? 

(b) How would you distinguish sphingomyelin from phos- 
phatidylcholine by chemical, physical, or enzymatic tests? 

22. Ninhydrin to Detect Lipids on TLC Plates Ninhydrin 
reacts specifically with primary amines to form a purplish-blue 
product. A thin-layer chromatogram of rat liver phospholipids 
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is sprayed with ninhydrin, and the color is allowed to develop. 
Which phospholipids can be detected in this way? 

Data Analysis Problem 

23. Determining the Structure of the Abnormal Lipid in 
Tay-Sachs Disease Box 10-2, Figure 1, shows the pathway 
of breakdown of gangliosides in healthy (normal) individuals 
and individuals with certain genetic diseases. Some of the 
data on which the figure is based were presented in a paper 
by Lars Svennerholm (1962). Note that the sugar NeuBAc, 
A^acetylneuraminic acid, represented in the Box 10-2 figure 
as ♦, is a sialic acid. 

Svennerholm reported that "about 90% of the monosialio- 
gangliosides isolated from normal human brain" consisted of a 
compound with ceramide, hexose, ^acetylgalactosamine, and 
Af-acetylneuraminic acid in the molar ratio 1:3:1:1. 

(a) Which of the gangliosides (GM1 through GM3 and glo- 
boside) in Box 10-2, Figure 1, fits this description? Explain 
your reasoning. 

(b) Svennerholm reported that 90% of the gangliosides 
from a patient with Tay-Sachs had a molar ratio (of the same 
four components given above) of 1:2:1:1. Is this consistent 
with the Box 10-2 figure? Explain your reasoning 

To determine the structure in more detail, Svennerholm 
treated the gangliosides with neuraminidase to remove the N- 
acetylneuraminic acid. This resulted in an asialoganglioside 
that was much easier to analyze, He hydrolyzed it with acid, 
collected the ceramide-containing products, and determined 
the molar ratio of the sugars in each product. He did this for 
both the normal and the Tay-Sachs gangliosides. His results 
are shown below. 
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(c) Based on these data, what can you conclude about the 
structure of the normal ganglioside? Is this consistent with the 
structure in Box 10-2? Explain your reasoning. 

(d) What can you conclude about the structure of the Tay- 
Sachs ganglioside? Is this consistent with the structure in 
Box 10-2? Explain your reasoning. 

Svennerholm also reported the work of other researchers 
who "permethylated" the normal asialoganglioside . Permethylation 
is the same as exhaustive methylation: a methyl group is added to 
every free hydroxyl group on a sugar. They found the following 
permethylated sugars: 2,3,6-trimethylglycopyranose; 2,3,4,6- 
tetramethylgalactopyranose; 2,4,6-trimethylgalactopyranose; and 
4,6-dimethyl-2-deoxy-2-aminogalactopyranose. 

(e) To which sugar of GM1 does each of the permethylated 
sugars correspond? Explain your reasoning. 

(f) Based on all the data presented so far, what pieces of 
information about normal ganglioside structure are missing? 

Reference 

Svennerholm, L. (1962) The chemical structure of normal human 

brain and Tay-Sachs gangliosides. Biochem Biophys Res Comm. 9, 
436-441 
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The first cell probably came into being when a mem- 
brane formed, enclosing a small volume of aqueous 
solution and separating it from the rest of the uni- 
verse. Membranes define the external boundaries of 
cells and regulate the molecular traffic across that 
boundary (Fig. 11-1); in eukaryotic cells, they divide 
the internal space into discrete compartments to segre- 
gate processes and components. They organize complex 
reaction sequences and are central to both biological en- 
ergy conservation and cell-to-cell communication. The 
biological activities of membranes flow from their re- 
markable physical properties. Membranes are flexible, 
self-sealing, and selectively permeable to polar solutes. 



Membrane 
bilayer 




FIGURE 11-1 Biological membranes. Viewed in cross section, all ceil 
membranes share a characteristic trilaminar appearance. This erythro- 
cyte was stained with osmium tetroxide and viewed with an electron 
microscope. The plasma membrane appears as a three-layer struc- 
ture, 5 to 8 nm (50 to 80 A) thick. The trilaminar image consists of 
two electron-dense layers (the osmium, bound to the inner and outer 
surfaces of the membrane) separated by a less dense central region. 



Their flexibility permits the shape changes that accom- 
pany cell growth and movement (such as amoeboid 
movement). With their ability to break and reseal, two 
membranes can fuse ; as in exocytosis, or a single mem- 
brane-enclosed compartment can undergo fission to 
yield two sealed compartments, as in endocytosis or cell 
division, without creating gross leaks through cellular 
surfaces. Because membranes are selectively perme- 
able, they retain certain compounds and ions within 
cells and within specific cellular compartments, while 
excluding others. 

Membranes are not merely passive barriers. They 
include an array of proteins specialized for promoting or 
catalyzing various cellular processes. At the cell surface, 
transporters move specific organic solutes and inorganic 
ions across the membrane; receptors sense extracellular 
signals and trigger molecular changes in the cell; adhe- 
sion molecules hold neighboring cells together. Within 
the cell, membranes organize cellular processes such as 
the synthesis of lipids and certain proteins, and the en- 
ergy transductions in mitochondria and chloroplasts. 
Because membranes consist of just two layers of mole- 
cules, they are very thin— essentially two-dimensional. 
Intermolecular collisions are far more probable in this 
two-dimensional space than in three-dimensional space, 
so the efficiency of enzyme-catalyzed processes organ- 
ized within membranes is vastly increased. 

In this chapter we first describe the composition of 
cellular membranes and their chemical architecture— 
the molecular structures that underlie their biological 
functions. Next, we consider the remarkable dynamic 
features of membranes, in which lipids and proteins 
move relative to each other. Cell adhesion, endocytosis, 
and the membrane fusion accompanying neurotransmit- 
ter secretion illustrate the dynamic roles of membrane 
proteins. We then turn to the protein-mediated passage 
of solutes across membranes via transporters and ion 
channels. In later chapters we discuss the roles of mem- 
branes in signal transduction (Chapters 12 and 23), 
energy transduction (Chapter 19), lipid synthesis 
(Chapter 21), and protein synthesis (Chapter 27). 
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1 1 .1 The Composition and Architecture 
of Membranes 

One approach to understanding membrane function 
is to study membrane composition— to determine, for 
example, which components are common to all mem- 
branes and which are unique to membranes with specific 
functions. So before describing membrane structure and 
function we consider the molecular components of mem- 
branes: proteins and polar lipids, which account for almost 
all the mass of biological membranes, and carbohydrates, 
present as part of glycoproteins and glycolipids. 

Each Type of Membrane Has Characteristic 
Lipids and Proteins 

The relative proportions of protein and lipid vary 
with the type of membrane (Table 11-1), reflecting the 
diversity of biological roles. For example, certain neurons 
have a myelin sheath, an extended plasma membrane 
that wraps around the cell many times and acts as a pas- 
sive electrical insulator. The myelin sheath consists pri- 
marily of lipids, whereas the plasma membranes of 
bacteria and the membranes of mitochondria and chloro- 
plasts, the sites of many enzyme-catalyzed processes, 
contain more protein than lipid (in mass per total mass). 

For studies of membrane composition, the first task 
is to isolate a selected membrane. When eukaryotic cells 
are subjected to mechanical shear, their plasma mem- 
branes are torn and fragmented, releasing cytoplasmic 
components and membrane-bounded organelles such as 
mitochondria, chloroplasts, lysosomes, and nuclei. 
Plasma membrane fragments and intact organelles can 
be isolated by techniques described in Chapter 1 (see 
Fig. 1-8) and in Worked Example 2-1, p. 53 

Cells clearly have mechanisms to control the kinds 
and amounts of membrane lipid they synthesize and to 
target specific lipids to particular organelles. Each 
kingdom, each species, each tissue or cell type, and the 
organelles of each cell type have a characteristic set of 
membrane lipids. Plasma membranes, for example, are 
enriched in cholesterol and contain no detectable 



cardiolipin (Fig. 11-2); this distribution is reversed in 
the inner mitochondrial membrane, which has very low 
cholesterol and high cardiolipin. In all but a few cases, 
the functional significance of these combinations is not 
yet known. 
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FIGURE 11-2 Lipid composition of the plasma membrane and or- 
ganelle membranes of a rat hepatocyte. The functional specialization 
of each membrane type is reflected in its unique lipid composition. 
Cholesterol is prominent in plasma membranes but barely detectable 
in mitochondrial membranes. Cardiolipin is a major component of the 
inner mitochondrial membrane but not of the plasma membrane. 
Phosphatidylserine, phosphatidylinositol, and phosphatidylglycerol 
are relatively minor components (yellow) of most membranes but serve 
critical functions; phosphatidylinositol and its derivatives, for example, 
are important in signal transductions triggered by hormones. Sphin- 
golipids, phosphatidylcholine, and phosphatidylethanolamine are 
present in most membranes, but in varying proportions. Glycolipids, 
which are major components of the chloroplast membranes of plants, 
are virtually absent from animal cells. 
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jor Components of Plasma Membranes in Various Organ 

Components (% by weight) 
Protein Phospholipid Sterol Sterol type Other lipids 
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Note: Values do not add up to 100% in every case, because there are components other than protein, phospholipids, and sterol; plants for 
example, have high levels of glycolipids. 
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The protein composition of membranes from differ- 
ent sources varies even more widely than their lipid 
composition, reflecting functional specialization. In ad- 
dition, some membrane proteins are covalently linked to 
oligosaccharides. For example, in glycophorin, a glyco- 
protein of the erythrocyte plasma membrane, 60% of 
the mass consists of complex oligosaccharides cova- 
lently attached to specific amino acid residues. Ser, Thr, 
and Asn residues are the most common points of attach- 
ment (see Fig. 7-29). The sugar moieties of surface gly- 
coproteins influence the folding of the proteins, as well 
as their stability and intracellular destination, and they 
play a significant role in the specific binding of ligands to 
glycoprotein surface receptors (see Fig. 7-35). 

Some membrane proteins are covalently attached to 
one or more lipids, which serve as hydrophobic anchors 
that hold the proteins to the membrane, as we shall see 

All Biological Membranes Share 
Some Fundamental Properties 

Membranes are impermeable to most polar or charged 
solutes, but permeable to nonpolar compounds; they 
are 5 to 8 nm (50 to 80 A) thick and appear trilaminar 
when viewed in cross section with the electron micro- 
scope (Fig. 11-1). The combined evidence from elec- 
tron microscopy and studies of chemical composition, 
as well as physical studies of permeability and the mo- 



tion of individual protein and lipid molecules within 
membranes, led to the development of the fluid mo- 
saic model for the structure of biological membranes 
(Fig. 11-3). Phospholipids form a bilayer in which the 
nonpolar regions of the lipid molecules in each layer 
face the core of the bilayer and their polar head groups 
face outward, interacting with the aqueous phase on 
either side. Proteins are embedded in this bilayer 
sheet, held by hydrophobic interactions between the 
membrane lipids and hydrophobic domains in the pro- 
teins. Some proteins protrude from only one side of the 
membrane; others have domains exposed on both 
sides. The orientation of proteins in the bilayer is 
asymmetric, giving the membrane "sidedness": the 
protein domains exposed on one side of the bilayer are 
different from those exposed on the other side, reflect- 
ing functional asymmetry The individual lipid and pro- 
tein units in a membrane form a fluid mosaic with a 
pattern that, unlike a mosaic of ceramic tile and mor- 
tar, is free to change constantly. The membrane mosaic 
is fluid because most of the interactions among its 
components are noncovalent, leaving individual lipid 
and protein molecules free to move laterally in the 
plane of the membrane. 

We now look at some of these features of the fluid 
mosaic model in more detail and consider the experi- 
mental evidence that supports the basic model but has 
necessitated its refinement in several ways. 
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FIGURE 11-3 Fluid mosaic model for membrane structure. The fatty 
acyl chains in the interior of the membrane form a fluid, hydrophobic 
region. Integral proteins float in this sea of lipid/ held by hydrophobic 
interactions with their nonpolar amino acid side chains. Both proteins 
and lipids are free to move laterally in the plane of the bilayer, but 
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movement of either from one leaflet of the bilayer to the other is re- 
stricted. The carbohydrate moieties attached to some proteins and 
lipids of the plasma membrane are exposed on the extracellular sur- 
face of the membrane. 
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A Lipid Bilayer is the Basic Structural 
Element of Membranes 

Glycerophospholipids, sphingolipids, and sterols are vir- 
tually insoluble in water. When mixed with water, they 
spontaneously form microscopic lipid aggregates, clus- 
tering together, with their hydrophobic moieties in con- 
tact with each other and their hydrophilic groups 
interacting with the surrounding water. This clustering 
reduces the amount of hydrophobic surface exposed to 
water and thus minimizes the number of molecules in 
the shell of ordered water at the lipid-water interface 
(see Fig. 2-7), resulting in an increase in entropy. Hy- 
drophobic interactions among lipid molecules provide 
the thermodynamic driving force for the formation and 
maintenance of these clusters. 

Depending on the precise conditions and the nature 
of the lipids, three types of lipid aggregate can form 
when amphipathic lipids are mixed with water (Fig. 
11-4). Micelles are spherical structures that contain 
anywhere from a few dozen to a few thousand amphi- 
pathic molecules. These molecules are arranged with 
their hydrophobic regions aggregated in the interior, 
where water is excluded, and their hydrophilic head 
groups at the surface, in contact with water. Micelle for- 
mation is favored when the cross-sectional area of the 
head group is greater than that of the acyl side chain(s), 
as in free fatty acids, lysophospholipids (phospholipids 
lacking one fatty acid), and detergents such as sodium 
dodecyl sulfate (SDS; p. 89). 

A second type of lipid aggregate in water is the bi- 
layer, in which two lipid monolayers (leaflets) form a 
two-dimensional sheet. Bilayer formation is favored if the 
cross-sectional areas of the head group and acyl side 
chain(s) are similar, as in glycerophospholipids and 
sphingolipids. The hydrophobic portions in each mono- 
layer, excluded from water, interact with each other. The 



hydrophilic head groups interact with water at each sur- 
face of the bilayer. Because the hydrophobic regions at its 
edges (Fig. ll-4b) are in contact with water, the bilayer 
sheet is relatively unstable and spontaneously folds back 
on itself to form a hollow sphere, a vesicle (Fig. ll-4c). 
The continuous surface of vesicles eliminates exposed 
hydrophobic regions, allowing bilayers to achieve maxi- 
mal stability in their aqueous environment. Vesicle forma- 
tion also creates a separate aqueous compartment. It is 
likely that the precursors to the first living cells resem- 
bled lipid vesicles, their aqueous contents segregated 
from their surroundings by a hydrophobic shell. 

The lipid bilayer is 3 nm (30 A) thick. The hydrocar- 
bon core, made up of the — CH 2 — and — CH 3 of the 
fatty acyl groups, is about as nonpolar as decane, and 
vesicles formed in the laboratory from pure lipids (lipo- 
somes) are essentially impermeable to polar solutes, as 
is the lipid bilayer of biological membranes (although 
the latter, as we shall see, are permeable to solutes for 
which they have specific transporters). 

Plasma membrane lipids are asymmetrically distrib- 
uted between the two monolayers of the bilayer, although 
the asymmetry, unlike that of membrane proteins, is not 
absolute. In the plasma membrane of the erythrocyte, 
for example, choline-containing lipids (phosphatidyl- 
choline and sphingomyelin) are typically found in the 
outer (extracellular, or exoplasmic) leaflet (Fig. 11-5), 
whereas phosphatidylserine, phosphatidyl thanolamine, 
and the phosphatidylinositols are much more common 
in the inner (cytoplasmic) leaflet. Changes in the distri- 
bution of lipids between plasma membrane leaflets have 
biological consequences. For example, only when the 
phosphatidylserine in the plasma membrane moves into 
the outer leaflet is a platelet able to play its role in for- 
mation of a blood clot. For many other cell types, phos- 
phatidylserine exposure on the outer surface marks a 
cell for destruction by programmed cell death. 
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FIGURE 11-4 Amphipathic lipid aggregates that form in water, (a) In 

micelles, the hydrophobic chains of the fatty acids are sequestered at 
the core of the sphere. There is virtually no water in the hydrophobic 
interior, (b) In an open bilayer, all acyl side chains except those at the 



(c) Vesicle 

edges of the sheet are protected from interaction with water, (c) When 
a two-dimensional bilayer folds on itself, it forms a closed bilayer, a 
three-dimensional hollow vesicle (liposome) enclosing an aqueous 
cavity. 
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FIGURE 11-5 Asymmetric distribution of phospholipids between 
the inner and outer monolayers of the erythrocyte plasma mem- 
brane. The distribution of a specific phospholipid is determined by 
treating the intact cell with phospholipase C, which cannot reach 
lipids in the inner monolayer (leaflet) but removes the head groups of 
lipids in the outer monolayer. The proportion of each head group re- 
leased provides an estimate of the fraction of each lipid in the outer 
monolayer 



Three Types of Membrane Proteins Differ 
in Their Association with the Membrane 

Integral membrane proteins are very firmly associ- 
ated with the lipid bilayer, and are removable only 
by agents that interfere with hydrophobic interactions, 
such as detergents, organic solvents, or denaturants 
(Fig. 11-6). Peripheral membrane proteins associ- 
ate with the membrane through electrostatic interac- 
tions and hydrogen bonding with the hydrophilic 
domains of integral proteins and with the polar head 
groups of membrane lipids. They can be released by 
relatively mild treatments that interfere with electro- 
static interactions or break hydrogen bonds; a commonly 
used agent is carbonate at high pH. Amphitropic 
proteins are found both in the cytosol and in associ- 
ation with membranes. Their affinity for membranes 
results in some cases from the protein's noncovalent 
interaction with a membrane protein or lipid, and in 
other cases from the presence of one or more lipids 
covalently attached to the amphitropic protein (see 
Fig. 11-14). Generally, the reversible association of 
amphitropic proteins with the membrane is regu- 
lated; for example, phosphorylation or ligand binding 
can force a conformational change in the protein, ex- 
posing a membrane-binding site that was previously 
inaccessible. 
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FIGURE 11-6 Peripheral, integral, and amphitropic proteins. Mem- 
brane proteins can be operationally distinguished by the conditions re- 
quired to release them from the membrane. Most peripheral proteins 
are released by changes in pH or ionic strength, removal of Ca 2+ by a 
chelating agent or addition of urea or carbonate. Integral proteins are 
extractable with detergents, which disrupt the hydrophobic interac- 
tions with the lipid bilayer and form micelle-like clusters around indi- 
vidual protein molecules. Integral proteins covalently attached to a 
membrane lipid, such as a glycosyl phosphatidylinositol (GPI; see 
Fig. 1 1-14), can be released by treatment with phospholipase C. Am- 
phitropic proteins are sometimes associated with membranes and 
sometimes not, depending on some type of regulatory process such as 
reversible palmitoylation. 

Many Membrane Proteins Span the Lipid Bilayer 

Membrane protein topology (the localization of protein 
domains relative to the lipid bilayer) can be deter- 
mined with reagents that react with protein side chains 
but cannot cross membranes— polar chemical reagents 
that react with primary amines of Lys residues, for ex- 
ample, or enzymes such as trypsin that cleave proteins 
but cannot cross the membrane. The human erythro- 
cyte is convenient for such studies because it has no 
membrane-bounded organelles; the plasma membrane 
is the only membrane present. If a membrane protein 
in an intact erythrocyte reacts with a membrane-im- 
permeant reagent, that protein must have at least one 
domain exposed on the outer (extracellular) face of 
the membrane. Trypsin cleaves extracellular domains 
but does not affect domains buried within the bilayer 
or exposed on the inner surface only, unless the plasma 
membrane is broken to make these domains accessible 
to the enzyme. 
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Experiments with such topology-specific reagents 
show that the erythrocyte glycoprotein glycophorin 
spans the plasma membrane. Its amino-terminal domain 
(bearing the carbohydrate chains) is on the outer surface 
and is cleaved by trypsin. The carboxyl terminus pro- 
trudes on the inside of the cell, where it cannot react with 
impermeant reagents. Both the amino-terminal and car- 
boxyl-terminal domains contain many polar or charged 
amino acid residues and are therefore hydrophilic. How- 
ever, a segment in the center of the protein (residues 75 
to 93) contains mainly hydrophobic amino acid residues, 
suggesting that glycophorin has a transmembrane seg- 
ment arranged as shown in Figure 1 1-7. 

These noncrystallographic experiments also revealed 
that the orientation of glycophorin in the membrane is 
asymmetric: its amino-terminal segment is always on the 
outside. Similar studies of other membrane proteins show 




131 

FIGURE 11-7 Transbilayer disposition of glycophorin in an erythro- 
cyte. One hydrophilic domain, containing all the sugar residues, is on 
the outer surface, and another hydrophilic domain protrudes from the 
inner face of the membrane. Each red hexagon represents a tetrasac- 
charide (containing two NeuSAc (sialic acid), Gal, and GalNAc) O- 
linked to a Ser or Thr residue; the blue hexagon represents an 
oligosaccharide N-linked to an Asn residue. The relative size of the 
oligosaccharide units is larger than shown here. A segment of 19 hy- 
drophobic residues (residues 75 to 93) forms an a helix that traverses 
the membrane bilayer (see Fig. 11-1 1a). The segment from residues 
64 to 74 has some hydrophobic residues and probably penetrates the 
outer face of the lipid bilayer, as shown. 



that each has a specific orientation in the bilayer, giving 
the membrane a distinct sidedness. For glycophorin, 
and for all other glycoproteins of the plasma membrane, 
the glycosylated domains are invariably found on the 
extracellular face of the bilayer. As we shall see, the 
asymmetric arrangement of membrane proteins results 
in functional asymmetry. All the molecules of a given ion 
pump, for example, have the same orientation in the 
membrane and pump ions in the same direction. 

Integral Proteins Are Held in the Membrane by 
Hydrophobic Interactions with Lipids 

The firm attachment of integral proteins to membranes is 
the result of hydrophobic interactions between mem- 
brane lipids and hydrophobic domains of the protein. 
Some proteins have a single hydrophobic sequence in the 
middle (as in glycophorin) or at the amino or carboxyl 
terminus. Others have multiple hydrophobic sequences, 
each of which, when in the a-helical conformation, is 
long enough to span the lipid bilayer (Fig. 11-8). 

One of the best-studied membrane-spanning pro- 
teins, bacteriorhodopsin, has seven very hydrophobic in- 
ternal sequences and crosses the lipid bilayer seven 
times. Bacteriorhodopsin is a light-driven proton pump 
densely packed in regular arrays in the purple membrane 
of the bacterium Halobacterium salinarum. X-ray crys- 
tallography reveals a structure with seven a-helical seg- 
ments, each traversing the lipid bilayer, connected by 
nonhelical loops at the inner and outer face of the mem- 
brane (Fig. 11-9). In the amino acid sequence of bacte- 
riorhodopsin, seven segments of about 20 hydrophobic 
residues can be identified, each forming an a helix that 
spans the bilayer. The seven helices are clustered to- 
gether and oriented not quite perpendicular to the bilayer 
plane, a pattern that (as we shall see in Chapter 12) is a 
common motif in membrane proteins involved in signal 
reception. Hydrophobic interactions between the nonpo- 
lar amino acids and the fatty acyl groups of the membrane 
lipids firmly anchor the protein in the membrane. 

Crystallized membrane proteins solved (i.e., their 
molecular structure deduced) by crystallography often 
include molecules of phospholipids, which are pre- 
sumed to be positioned in the crystals as they are in the 
native membranes. Many of these phospholipid mole- 
cules lie on the protein surface, their head groups inter- 
acting with polar amino acid residues at the inner and 
outer membrane-water interfaces and their side chains 
associated with nonpolar residues. These annular 
lipids form a bilayer shell (annulus) around the protein, 
oriented roughly as expected for phospholipids in a bi- 
layer (Fig. 11-10). Other phospholipids are found at 
the interfaces between monomers of multisubunit mem- 
brane proteins, where they form a "grease seal." Yet oth- 
ers are embedded deep within a membrane protein, 
often with their head groups well below the plane of the 
bilayer. For example, succinate dehydrogenase (Com- 
plex II, found in mitochondria; see Fig. 19-10) has sev- 
eral deeply embedded phospholipid molecules. 
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FIGURE 11-8 Integral membrane proteins. For known proteins of 
the plasma membrane, the spatial relationships of protein domains to 
the lipid bilayer fall into six categories, Types I and II have a single 
transmembrane helix; the amino-terminal domain is outside the cell in 
type I proteins and inside in type II. Type 111 proteins have multiple 
transmembrane helices in a single polypeptide. In type IV proteins, 
transmembrane domains of several different polypeptides assemble to 
form a channel through the membrane. TypeV proteins are held to the 
bilayer primarily by covalently linked lipids (see Fig. 1 1-14), and type 
VI proteins have both transmembrane helices and lipid (GPI) anchors, 
In this figure, and in figures throughout the book, we represent 
transmembrane protein segments in their most likely conformations: as 
a helices of six to seven turns. Sometimes these helices are shown sim- 
ply as cylinders. As relatively few membrane protein structures have 
been deduced by x-ray crystallography, our representation of the ex- 
tramembrane domains is arbitrary and not necessarily to scale. 



FIGURE 11-10 Lipid annuli associated with two integral membrane 
proteins, (a) The crystal structure of sheep aquaporin (PDB ID 2B60), 
a transmembrane water channel, includes a shell of phospholipids po- 
sitioned with their head groups (blue) at the expected positions on the 
inner and outer membrane surfaces and their hydrophobic acyl chains 
(gold) intimately associated with the surface of the protein exposed to 
the bilayer. The lipid forms a "grease seal" around the protein, which is 
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FIGURE 11-9 Bacteriorhodopsin, a membrane-spanning protein. 

(PDB ID 2AT9) The single polypeptide chain folds into seven hy- 
drophobic a helices, each of which traverses the lipid bilayer roughly 
perpendicular to the plane of the membrane. The seven transmem- 
brane helices are clustered, and the space around and between them 
is filled with the acyl chains of membrane lipids. The light-absorbing 
pigment retinal (see Fig. 10-21) is buried deep in the membrane in 
contact with several of the helical segments (not shown). The helices 
are colored to correspond with the hydropathy plot in Figure 1 1-1 1b. 




depicted as a green surface representation, (b) The crystal structure of 
the F 0 integral protein complex of the V-type Na + -ATPase from Entero- 
coccus hirae (PDB ID 2BL2) has 10 identical subunits, each with four 
transmembrane helices, surrounding a central cavity filled with phos- 
phatidylglycerol (PG). Here five of the subunits have been cut away to 
reveal the PG molecules associated with each subunit around the inte- 
rior of this structure. 
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The Topology of an Integral Membrane Protein Can 
Sometimes Be Predicted from its Sequence 

Determination of the three-dimensional structure of a 
membrane protein— that is, its topology— is generally 
much more difficult than determining its amino acid se- 
quence, either directly or by gene sequencing. The 
amino acid sequences are known for thousands of mem- 
brane proteins, but relatively few three-dimensional 
structures have been established by crystallography or 
NMR spectroscopy. The presence of unbroken sequences 
of more than 20 hydrophobic residues in a membrane 
protein is commonly taken as evidence that these se- 
quences traverse the lipid bilayer, acting as hydrophobic 
anchors or forming transmembrane channels. Virtually 
all integral proteins have at least one such sequence. 
Application of this logic to entire genomic sequences 
leads to the conclusion that in many species, 20% to 
30% of all proteins are integral membrane proteins. 

What can we predict about the secondary structure 
of the membrane-spanning portions of integral pro- 
teins? An a-helical sequence of 20 to 25 residues is 
just long enough to span the thickness (30 A) of the 
lipid bilayer (recall that the length of an a helix is 1.5 A 
(0.15 nm) per amino acid residue). A polypeptide chain 
surrounded by lipids, having no water molecules with 
which to hydrogen-bond, will tend to form a helices or p 
sheets, in which intrachain hydrogen bonding is maxi- 
mized. If the side chains of all amino acids in a helix are 
nonpolar, hydrophobic interactions with the surround- 
ing lipids further stabilize the helix. 

Several simple methods of analyzing amino acid 
sequences yield reasonably accurate predictions of 
secondary structure for transmembrane proteins. The 
relative polarity of each amino acid has been deter- 
mined experimentally by measuring the free-energy 
change accompanying the movement of that amino 
acid side chain from a hydrophobic solvent into water. 
This free energy of transfer, which can be expressed as 
a hydropathy index (see Table 3-1), ranges from 
very exergonic for charged or polar residues to very 
endergonic for amino acids with aromatic or aliphatic 
hydrocarbon side chains. The overall hydropathy index 
(hydrophobicity) of a sequence of amino acids is esti- 
mated by summing the free energies of transfer for the 
residues in the sequence. To scan a polypeptide se- 
quence for potential membrane-spanning segments, an 
investigator calculates the hydropathy index for suc- 
cessive segments (called windows) of a given size, 
from 7 to 20 residues. For a window of seven residues, 
for example, the indices for residues 1 to 7, 2 to 8, 3 to 
9, and so on, are plotted as in Figure 11-11 (plotted 
for the middle residue in each window— residue 4 for 
residues 1 to 7, for example). A region with more than 
20 residues of high hydropathy index is presumed 
to be a transmembrane segment. When the sequences 
of membrane proteins of known three-dimensional 
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(b) Bacteriorhodopsin 

FIGURE 11-11 Hydropathy plots. Hydropathy index (see Table 3-1) is 
plotted against residue number for two integral membrane proteins. 
The hydropathy index for each amino acid residue in a sequence of 
defined length, or "window/' is used to calculate the average hy- 
dropathy for that window. The horizontal axis shows the residue num- 
ber in the middle of the window, (a) Glycophorin from human 
erythrocytes has a single hydrophobic sequence between residues 75 
and 93 (yellow); compare this with Figure 11-7. (b) Bacteri- 
orhodopsin, known from independent physical studies to have seven 
transmembrane helices (see Fig. 11-9), has seven hydrophobic re- 
gions. Note, however, that the hydropathy plot is ambiguous in the re- 
gion of segments 6 and 7. X-ray crystallography has confirmed that 
this region has two transmembrane segments. 

structure are scanned in this way, we find a reasonably 
good correspondence between predicted and known 
membrane-spanning segments. Hydropathy analysis 
predicts a single hydrophobic helix for glycophorin 
(Fig. 11-1 la) and seven transmembrane segments for 
bacteriorhodopsin (Fig. 11-1 lb)— in agreement with 
experimental studies. 

On the basis of their amino acid sequences and hy- 
dropathy plots, many of the transport proteins de- 
scribed in this chapter are believed to have multiple 
membrane-spanning helical regions— that is, they are 
type III or type IV integral proteins (Fig. 11-8). When 
predictions are consistent with chemical studies of pro- 
tein localization (such as those described above for gly- 
cophorin and bacteriorhodopsin), the assumption that 
hydrophobic regions correspond to membrane-spanning 
domains is much better justified. 
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FIGURE 11-12 Tyr and Trp residues of membrane proteins clustering 
at the water-lipid interface. The detailed structures of these five inte- 
gral membrane proteins are known from crystallographic studies. The 
K + channel (PDB ID 1 BL8) is from the bacterium Streptomyces lividans 
(see Fig. 1 1-48); maltoporin (PDB ID 1 AF6), outer membrane phospho- 
lipase A (PDB ID 1QD5), OmpX (PDB ID 1QJ9), and phosphoporin 



E (PDB ID 1PHO) are proteins of the outer membrane of E, coli. 
Residues of Tyr (orange) and Trp (red) are found predominantly where 
the nonpolar region of acyl chains meets the polar head group region. 
Charged residues (Lys, Arg, Glu, Asp; shown in blue) are found almost 
exclusively in the aqueous phases. 



A further remarkable feature of many transmem- 
brane proteins of known structure is the presence of Tyr 
and Trp residues at the interface between lipid and wa- 
ter (Fig. 11-12). The side chains of these residues ap- 
parently serve as membrane interface anchors, able to 
interact simultaneously with the central lipid phase and 
the aqueous phases on either side of the membrane. An- 
other generalization about amino acid location relative 
to the bilayer is described by the positive-inside rule: 
the positively charged Lys, His, and Arg residues of 
membrane proteins occur more commonly on the cyto- 
plasmic face of membranes. 

Not all integral membrane proteins are composed of 
transmembrane a helices. Another structural motif com- 
mon in bacterial membrane proteins is the ft barrel (see 
Fig. 4-1 7b), in which 20 or more transmembrane seg- 
ments form /3 sheets that line a cylinder (Fig. 11-13). 
The same factors that favor a-helix formation in the hy- 
drophobic interior of a lipid bilayer also stabilize /3 
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FIGURE 11-13 Membrane proteins with /3-barreI structure. Three pro- 
teins of the £, coli outer membrane are shown, viewed in the plane of 
the membrane. FepA (PDB ID 1 FEP), involved in iron uptake, has 22 
membrane-spanning p strands. OmpLA (derived from PDB ID 1 QD5), 
a phospholipase, is a 12-stranded /3 barrel that exists as a dimer in the 
membrane. Maltoporin (derived from PDB ID 1MAL), a maltose trans- 
porter, is a trimer; each monomer consists of 1 6 (3 strands. 



barrels: when no water molecules are available to 
hydrogen-bond with the carbonyl oxygen and nitrogen of 
the peptide bond, maximal intrachain hydrogen bonding 
gives the most stable conformation. Planar /3 sheets do 
not maximize these interactions and are generally not 
found in the membrane interior; barrels allow all possi- 
ble hydrogen bonds and are apparently common among 
membrane proteins. Porins, proteins that allow certain 
polar solutes to cross the outer membrane of gram- 
negative bacteria such as E. coli, have many-stranded j3 
barrels lining the polar transmembrane passage. 

A polypeptide is more extended in the /3 conforma- 
tion than in an a helix; just seven to nine residues of j3 
conformation are needed to span a membrane. Recall 
that in the jS conformation, alternating side chains project 
above and below the sheet (see Fig. 4-6). In /J strands of 
membrane proteins, every second residue in the mem- 
brane-spanning segment is hydrophobic and interacts 
with the lipid bilayer; aromatic side chains are commonly 
found at the lipid-protein interface. The other residues 
may or may not be hydrophilic. The hydropathy plot is 
not useful in predicting transmembrane segments for 
proteins with p barrel motifs, but as the database of 
known j8-barrel motifs increases, sequence-based predic- 
tions of transmembrane /3 conformations have become 
feasible. For example, some outer membrane proteins of 
gram-negative bacteria (Fig. 11-13) have been correctly 
predicted, by sequence analysis, to contain p barrels. 

Covalently Attached Lipids Anchor 
Some Membrane Proteins 

Some membrane proteins contain one or more cova- 
lently linked lipids, which may be of several types: long- 
chain fatty acids, isoprenoids, sterols, or glycosylated 
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derivatives of phosphatidyKnositol (GPIs; Fig. 11-14). 
The attached lipid provides a hydrophobic anchor that 
inserts into the lipid bilayer and holds the protein at the 
membrane surface. The strength of the hydrophobic in- 
teraction between a bilayer and a single hydrocarbon 
chain linked to a protein is barely enough to anchor the 
protein securely, but many proteins have more than one 
attached lipid moiety Other interactions, such as ionic 
attractions between positively charged Lys residues in 
the protein and negatively charged lipid head groups, 
probably contribute to the stability of the attachment. 
The association of these lipid-linked proteins with the 
membrane is certainly weaker than that for integral 
membrane proteins and is, in at least some cases, re- 
versible. But treatment with alkaline carbonate does not 
release GPMinked proteins, which are therefore, by the 
working definition, integral proteins. 

Beyond merely anchoring a protein to the mem- 
brane, the attached lipid may have a more specific 
role. In the plasma membrane, proteins with GPI an- 
chors are exclusively on the outer face and are clus- 
tered in certain regions, as we shall see (pp. 384-386), 
whereas other types of lipid-linked proteins (with far- 
nesyl or geranylgeranyl groups attached; Fig. 11-14) 
are exclusively on the inner face. In polarized epithe- 
lial cells (such as intestinal epithelial cells, see Fig. 
11-44), in which apical and basal surfaces have 



different roles, GPI-linked proteins are directed 
specifically to the apical surface. Attachment of a spe- 
cific lipid to a newly synthesized membrane protein 
therefore has a targeting function, directing the pro- 
tein to its correct membrane location. 



SUMMARY 1 1.1 



The Composition and 
Architecture of Membranes 



Biological membranes define cellular boundaries, 
divide cells into discrete compartments, organize 
complex reaction sequences, and act in signal 
reception and energy transformations. 

Membranes are composed of lipids and proteins in 
varying combinations particular to each species, 
cell type, and organelle. The lipid bilayer is the 
basic structural unit. 

Peripheral membrane proteins are loosely associated 
with the membrane through electrostatic interactions 
and hydrogen bonds or by covalently attached lipid 
anchors. Integral proteins associate firmly with 
membranes by hydrophobic interactions between the 
lipid bilayer and their nonpolar amino acid side 
chains, which are oriented toward the outside of the 
protein molecule. Amphitropic proteins associate 
reversibly with membranes. 



FIGURE 11-14 Lipid-linked membrane proteins. Covalently attached lipids 
anchor membrane proteins to the lipid bilayer. A palmitoyl group is shown at- 
tached by thioester linkage to a Cys residue; an N-myristoyl group is generally 
attached to an amino-terminal Gly; the farnesyl and geranylgeranyl groups at- 
tached to carboxyl-terminal Cys residues are isoprenoids of 15 and 20 car- 
bons, respectively. These three lipid-protein assemblies are found only on the 
inner face of the plasma membrane. Glycosyl phosphatidylinositol (GPI) an- 
chors are derivatives of phosphatidylinositol in which the inositol bears a short 
oligosaccharide covalently joined to the carboxyl-terminal residue of a pro- 
tein through phosphoethanolamine- GPI-linked proteins are always on the ex- 
tracellular face of the plasma membrane. 
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Many membrane proteins span the lipid bilayer 
several times, with hydrophobic sequences of about 
20 amino acid residues forming transmembrane a 
helices. Multistranded /3 barrels are also common in 
integral proteins in bacterial membranes. Tyr and 
Trp residues of transmembrane proteins are 
commonly found at the lipid- water interface. 

The lipids and proteins of membranes are 
inserted into the bilayer with specific sidedness; 
thus membranes are structurally and functionally 
asymmetric. Plasma membrane glycoproteins are 
always oriented with the oligosaccharide-bearing 
domain on the extracellular surface. 



(a) Paracrystalline state (gel) 



11.2 Membrane Dynamics 



One remarkable feature of all biological membranes is 
their flexibility — their ability to change shape without 
losing their integrity and becoming leaky. The basis for 
this property is the noncovalent interactions among 
lipids in the bilayer and the mobility allowed to individ- 
ual lipids because they are not covalently anchored to 
one another. We turn now to the dynamics of mem- 
branes: the motions that occur and the transient struc- 
tures allowed by these motions. 

Acyl Groups in the Bilayer Interior Are 
Ordered to Varying Degrees 

Although the lipid bilayer structure is quite stable, its in- 
dividual phospholipid and sterol molecules have much 
freedom of motion (Fig. 11-15). The structure and 
flexibility of the lipid bilayer depend on the kinds of 
lipids present, and change with temperature. Below nor- 
mal physiological temperatures, the lipids in a bilayer 
form a semisolid gel phase, in which all types of motion 
of individual lipid molecules are strongly constrained; 
the bilayer is paracrystalline (Fig. 11-1 5a). Above phys- 
iological temperatures, individual hydrocarbon chains of 
fatty acids are in constant motion produced by rotation 
about the carbon-carbon bonds of the long acyl side 
chains. In this liquid-disordered state, or fluid state 
(Fig. 11-1 5b), the interior of the bilayer is more fluid 
than solid and the bilayer is like a sea of constantly mov- 
ing lipid. At intermediate (physiological) temperatures, 
the lipids exist in a liquid-ordered state; there is less 
thermal motion in the acyl chains of the lipid bilayer, but 
lateral movement in the plane of the bilayer still takes 
place. These differences in bilayer state are easily ob- 
served in liposomes composed of a single lipid, but bio- 
logical membranes contain many lipids with a variety of 
fatty acyl chains and thus do not show sharp phase 
changes with temperature. 

At temperatures in the physiological range for a 
mammal (about 20 to 40°C), long-chain saturated fatty 
acids (such as 16:0 and 18:0) pack into a liquid-ordered 
array, but the kinks in unsaturated fatty acids (see Fig. 
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FIGURE 11-15 Two extreme states of bilayer lipids, (a) In the paracrys- 
talline state, or gel phase, polar head groups are uniformly arrayed at 
the surface, and the acyl chains are nearly motionless and packed with 
regular geometry, (b) In the liquid-disordered state, or fluid state, acyl 
chains undergo much thermal motion and have no regular organiza- 
tion. Intermediate between these extremes is the liquid-ordered state, 
in which individual phospholipid molecules can diffuse laterally but 
the acyl groups remain extended and more or less ordered. 



10-2) interfere with packing, favoring the liquid-disor- 
dered state. Shorter-chain fatty acyl groups have the 
same effect. The sterol content of a membrane (which 
varies greatly with organism and organelle; Table 11-1) 
is another important determinant of lipid state. The 
rigid planar structure of the steroid nucleus, inserted 
between fatty acyl side chains, reduces the freedom of 
neighboring acyl chains to move by rotation about their 
carbon-carbon bonds, forcing the chains into their fully 
extended conformation. The presence of sterols there- 
fore reduces the fluidity in the core of the bilayer, thus 
favoring the liquid-ordered phase, and increases the 
thickness of the lipid leaflet (as described below). 

Cells regulate their lipid composition to achieve a 
constant membrane fluidity under various growth condi- 
tions. For example, bacteria synthesize more unsatu- 
rated fatty acids and fewer saturated ones when 
cultured at low temperatures than when cultured at 
higher temperatures (Table 11-2). As a result of this ad- 
justment in lipid composition, membranes of bacteria 
cultured at high or low temperatures have about the 
same degree of fluidity. 

Transbilayer Movement of Lipids Requires Catalysis 

At physiological temperatures, transbilayer — or "flip- 
flop"— diffusion of a lipid molecule from one leaflet of 
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TABLE 11-2 



Fatty Acid Composition of £ colt Cells Cul 
Temperatures 



lis Cultured at Different 

Percentage of total fatty acids* 






10 °C 


20 °C 


30 °C 


40 °C 


Myristic acid (14:0) 


4 


4 


4 


8 


Palmitic acid (16:0) 


18 


25 


29 


48 


Palmitoleic acid (16:1) 


26 


24 


23 


9 


Oleic acid (18:1) 


38 


34 


30 


12 


Hydroxymyristic acid 


13 


10 


10 


8 


Ratio of unsaturated to saturated 1 


2.9 


2.0 


1.6 


0.38 



Source: Data from Marr, A G. & Ingraham, J.L. (1962) Effect of temperature on the composition of fatty acids in Escherichia coii. J. 
Bacteriot. 84, 1260. 

*The exact fatty acid composition depends not only on growth temperature but on growth stage and growth medium composition. 

f Ratios calculated as the total percentage of 16:1 plus 18:1 divided by the total percentage of 14:0 plus 16:0 Hydroxymyristic acid was 
omitted from this calculation 



the bilayer to the other (Fig. ll-16a) occurs very 
slowly if at all in most membranes, although lateral dif- 
fusion in the plane of the bilayer is very rapid (Fig. 
11— 16b). Transbilayer movement requires that a polar 
or charged head group leave its aqueous environment 
and move into the hydrophobic interior of the bilayer, a 
process with a large, positive free-energy change. There 
are, however, situations in which such movement is es- 
sential. For example, in the ER, membrane glycerophos- 
pholipids are synthesized on the cytosolic surface, 
whereas sphingolipids are synthesized or modified on 
the lumenal surface. To get from their site of synthesis 
to their eventual point of deposition, these lipids must 
undergo flip-flop diffusion. 

Several families of proteins, including the flip- 
pases, floppases, and scramblases (Fig. ll-16c), facil- 
itate the transbilayer movement of lipids, providing a 
path that is energetically more favorable and much 
faster than the uncatalyzed movement. The combina- 
tion of asymmetric biosynthesis of membrane lipids, 
very slow uncatalyzed flip-flop diffusion, and the pres- 
ence of selective, energy-dependent lipid translocators 
is responsible for the transbilayer asymmetry in lipid 
composition shown in Figure 11-5. Besides contribut- 
ing to this asymmetry of composition, the energy- 
dependent transport of lipids to one bilayer leaflet 
may, by creating a larger surface on one side of the 
bilayer, be important in generating the membrane cur- 
vature essential in the budding of vesicles. 

Flippases catalyze translocation of the aminophos- 
pholipids phosphatidylethanolamine and phosphatidyl- 
serine from the extracellular to the cytosolic leaflet of the 
plasma membrane, contributing to the asymmetric dis- 
tribution of phospholipids: phosphatidylethanolamine 
and phosphatidylserine primarily in the cytosolic leaflet, 
and the sphingolipids and phosphatidylcholine in the 
outer leaflet. Keeping phosphatidylserine out of the ex- 
tracellular leaflet is important: its exposure on the outer 
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FIGURE 11-16 Motion of single phospholipids in a bilayer. (a) Uncat- 
alyzed movement from one leaflet to the other is very slow, but (b) lat- 
eral diffusion within the leaflet is very rapid, requiring no catalysis, 
(c) Three types of phospholipid translocaters in the plasma mem- 
brane. Flippases translocate primarily aminophospholipids (phos- 
phatidylethanolamine (PE), phosphatidylserine (PS)) from the outer (exo- 
plasmic) leaflet to the inner (cytosolic) leaflet; they require ATP and are 
members of the P-type ATPase family. Floppases move phospholipids 
from the cytosolic to the outer leaflet, require ATP, and are members of 
the ABC transporter family. Scramblases equilibrate phospholipids 
across both leaflets; they do not require ATP but are activated by Ca 2+ . 
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surface triggers apoptosis (programmed cell death; see 
Chapter 12) and engulfment by macrophages that carry 
phosphatidylserine receptors. Flippases also act in the 
ER, where they move newly synthesized phospholipids 
from their site of synthesis in the cytosolic leaflet to the 
lumenal leaflet. Flippases consume about one ATP per 
molecule of phospholipid translocated, and they are 
structurally and functionally related to the P-type ATP- 
ases (active transporters) described on page 396. 

Floppases move plasma membrane phospholipids 
from the cytosolic to the extracellular leaflet, and like 
flippases are ATP-dependent. Floppases are members of 
the ABC transporter family described on page 400, all 
of which actively transport hydrophobic substrates out- 
ward across the plasma membrane. Scramblases are 
proteins that move any membrane phospholipid across 
the bilayer down its concentration gradient (from the 
leaflet where it has a higher concentration to the leaflet 
where it has a lower concentration); their activity is not 
dependent on ATP Scramblase activity leads to con- 
trolled randomization of the head-group composition on 
the two faces of the bilayer. The activity rises sharply with 
an increase in cytosolic Ca 2+ concentration, which may 
result from cell activation, cell injury, or apoptosis; as 
noted above, exposure of phosphatidylserine on the 
outer surface marks a cell for apoptosis and engulfment 
by macrophages. Finally, a group of proteins that act pri- 
marily to move phosphatidylinositol lipids across lipid 
bilayers, the phosphatidylinositol transfer proteins, are 
believed to have important roles in lipid signaling and 
membrane trafficking. 

Lipids and Proteins Diffuse Laterally in the Bilayer 

Individual lipid molecules can move laterally in the 
plane of the membrane by changing places with neigh- 
boring lipid molecules; that is, they undergo Brownian 
movement within the bilayer (Fig. 11— 16b), which can 
be quite rapid. A molecule in the outer leaflet of the ery- 
throcyte plasma membrane, for example, can diffuse lat- 
erally so fast that it circumnavigates the erythrocyte in 
seconds. This rapid lateral diffusion in the plane of the 
bilayer tends to randomize the positions of individual 
molecules in a few seconds. 

Lateral diffusion can be shown experimentally by 
attaching fluorescent probes to the head groups of lipids 
and using fluorescence microscopy to follow the probes 
over time (Fig. 11-17). In one technique, a small re- 
gion (5 ^tm 2 ) of a cell surface with fluorescence-tagged 
lipids is bleached by intense laser radiation so that the 
irradiated patch no longer fluoresces when viewed with 
less-intense (nonbleaching) light in the fluorescence mi- 
croscope. However, within milliseconds, the region re- 
covers its fluorescence as unbleached lipid molecules 
diffuse into the bleached patch and bleached lipid mole- 
cules diffuse away from it. The rate of^/luorescence re- 
covery after photobleaching, or FRAP, is a measure of 
the rate of lateral diffusion of the lipids. Using the FRAP 
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FIGURE 11-17 Measurement of lateral diffusion rates of lipids by 
fluorescence recovery after photobleaching (FRAP). Lipids in the 
outer leaflet of the plasma membrane are labeled by reaction with a 
membrane-impermeant fluorescent probe (red), so the surface is uni- 
formly labeled when viewed with a fluorescence microscope. A small 
area is bleached by irradiation with an intense laser beam and be- 
comes nonfluorescent. With the passage of time, labeled lipid mole- 
cules diffuse into the bleached region, and it again becomes 
fluorescent. Researchers can track the time course of fluorescence re- 
turn and determine a diffusion coefficient for the labeled lipid. The dif- 
fusion rates are typically high; a lipid moving at this speed could 
circumnavigate an E. coli cell in one second. (The FRAP method can 
also be used to measure lateral diffusion of membrane proteins.) 
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technique, researchers have shown that some mem- 
brane lipids diffuse laterally at rates of up to 1 /rni/s. 

Another technique, single particle tracking, allows 
one to follow the movement of a single lipid molecule in 
the plasma membrane on a much shorter time scale. Re- 
sults from these studies confirm rapid lateral diffusion 
within small, discrete regions of the cell surface and 
show that movement from one such region to a nearby 
region ("hop diffusion") is inhibited; membrane lipids 
behave as though corralled by fences that they can occa- 
sionally cross by hop diffusion (Fig. 11-18). 

Many membrane proteins seem to be afloat in a sea of 
lipids. Like membrane lipids, these proteins are free to 
diffuse laterally in the plane of the bilayer and are in con- 
stant motion, as shown by the FRAP technique with fluo- 
rescence-tagged surface proteins. Some membrane 
proteins associate to form large aggregates ("patches") 
on the surface of a cell or organelle in which individual 
protein molecules do not move relative to one another; 
for example, acetylcholine receptors form dense, near- 
crystalline patches on neuronal plasma membranes at 
synapses. Other membrane proteins are anchored to in- 
ternal structures that prevent their free diffusion. In the 
erythrocyte membrane, both glycophorin and the chlo- 
ride-bicarbonate exchanger (p. 395) are tethered to spec- 
trin, a filamentous cytoskeletal protein (Fig. 11-19). 
One possible explanation for the pattern of lateral diffu- 
sion of lipid molecules shown in Figure 11-18 is that 
membrane proteins immobilized by their association with 
spectrin form the "fences" that define the regions of rela- 
tively unrestricted lipid motion. 
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FIGURE 11-18 Hop diffusion of individual lipid molecules. The mo- 
tion of a single fluorescently labeled lipid molecule in a cell surface is 
recorded on video by fluorescence microscopy, with a time resolution 
of 25 fis (equivalent to 40,000 frames/s). The track shown here repre- 
sents a molecule followed for 56 ms (2,250 frames); the trace begins in 
the purple area and continues through blue, green, and orange The 
pattern of movement indicates rapid diffusion within a confined region 
(about 250 nm in diameter, shown by a single color), with occasional 
hops into an adjoining region. This finding suggests that the lipids are 
corralled by molecular fences that they occasionally jump 
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FIGURE 11-19 Restricted motion of the erythrocyte chloride- 
bicarbonate exchanger and glycophorin. The proteins span the mem- 
brane and are tethered to spectrin, a cytoskeletal protein, by another 
protein, ankyrin, limiting their lateral mobility. Ankyrin is anchored in 
the membrane by a covalently bound palmitoyl side chain (see Fig. 
11-14). Spectrin, a long, filamentous protein, is cross-linked at junc- 
tional complexes containing actin. A network of cross-linked spectrin 
molecules attached to the cytoplasmic face of the plasma membrane 
stabilizes the membrane, making it resistant to deformation. This net- 
work of anchored membrane proteins may form the "corral" suggested 
by the experiment shown in Figure 11-18; the lipid tracks shown here 
are confined to regions defined by the tethered membrane proteins. 

Sphingolipids and Cholesterol Cluster Together 
in Membrane Rafts 

We have seen that diffusion of membrane lipids from one 
bilayer leaflet to the other is very slow unless catalyzed, 
and that the different lipid species of the plasma mem- 
brane are asymmetrically distributed in the two leaflets of 
the bilayer (Fig. 11-5). Even within a single leaflet, the 
lipid distribution is not random. Glycosphingolipids (cere- 
brosides and gangliosides), which typically contain long- 
chain saturated fatty acids, form transient clusters in the 
outer leaflet that largely exclude glycerophospholipids, 
which typically contain one unsaturated fatty acyl group 
and a shorter saturated acyl group. The long, saturated 
acyl groups of sphingolipids can form more compact, more 
stable associations with the long ring system of cholesterol 
than can the shorter, often unsaturated, chains of phos- 
pholipids. The cholesterol-sphingolipid microdomains in 
the outer monolayer of the plasma membrane, visible with 
atomic force microscopy (Box 11-1), are slightly thicker 
and more ordered (less fluid) than neighboring mi- 
crodomains rich in phospholipids and are more difficult to 
dissolve with nonionic detergents; they behave like liquid- 
ordered sphingolipid rafts adrift on an ocean of liquid- 
disordered phospholipids (Fig. 11-20, p. 386). 

These lipid rafts are remarkably enriched in two 
classes of integral membrane proteins: those anchored 
to the membrane by two covalently attached long-chain 
saturated fatty acids (two palmitoyl groups or a palmi- 
toyl and a myristoyl group) and GPI-anchored proteins 
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METHOD 



Atomic Force Microscopy to Visualize Membrane Proteins 



In atomic force microscopy (AFM) , the sharp tip of a mi- 
croscopic probe attached to a flexible cantilever is 
drawn across an uneven surface such as a membrane 
(Fig. 1). Electrostatic and van der Waals interactions 
between the tip and the sample produce a force that 
moves the probe up and down (in the z dimension) as it 
encounters hills and valleys in the sample. A laser beam 
reflected from the cantilever detects motions of as little 
as 1 A. In one type of atomic force microscope, the force 
on the probe is held constant (relative to a standard 
force, on the order of piconewtons) by a feedback cir- 
cuit that causes the platform holding the sample to rise 
or fall to keep the force constant. A series of scans in the 
x and y dimensions (the plane of the membrane) yields 
a three-dimensional contour map of the surface with 
resolution near the atomic scale — 0.1 nm in the vertical 
dimension, 0.5 to 1.0 nm in the lateral dimensions. The 
membrane rafts shown in Figure 1 l-20b were visualized 
by this technique. 

In favorable cases, AFM can be used to study single 
membrane protein molecules. Single molecules of bac- 
teriorhodopsin (see Fig. 11-9) in the purple membranes 
of the bacterium Halobacterium salinarum are seen 
as highly regular structures (Fig. 2a). When several im- 
ages of individual units are superimposed with the help 
of a computer, the real parts of the image reinforce each 
other and the noise in individual images is averaged out, 
yielding a high-resolution image of the protein (inset in 
Fig. 2a). AFM of purified coli aquaporin, reconstituted 
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constant pressure on cantilever 
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FIGURE 1 



into lipid bilayers and viewed as if from the outside of a 
cell, shows the fine details of the protein's periplasmic 
domains (Fig. 2b). And AFM reveals that F Q , the proton- 
driven rotor of the chloroplast ATP synthase (p. 760), is 
composed of many subunits (14 in Fig. 2c) arranged in 
a circle. 
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(Fig. 11-14). Presumably these lipid anchors, like the 
acyl chains of sphingolipids, form more stable associa- 
tions with the cholesterol and long acyl groups in rafts 
than with the surrounding phospholipids. (It is notable 
that other lipid-linked proteins, those with covalently 
attached isoprenyl groups such as farnesyl, are not pref- 
erentially associated with the outer leaflet of sphin- 
golipid/cholesterol rafts (Fig. ll-20a).) The "raft" and 
"sea" domains of the plasma membrane are not rigidly 
separated; membrane proteins can move into and out of 



lipid rafts on a time scale of seconds. But in the shorter 
time scale (microseconds) more relevant to many mem- 
brane-mediated biochemical processes, many of these 
proteins reside primarily in a raft. 

We can estimate the fraction of the cell surface oc- 
cupied by rafts from the fraction of the plasma mem- 
brane that resists detergent solubilization, which can be 
as high as 50% in some cases: the rafts cover half of the 
ocean (Fig. ll-20b). Indirect measurements in cultured 
fibroblasts suggest a diameter of roughly 50 nm for an 
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FIGURE 11-20 Membrane microdomains (rafts), (a) Stable associations 
of sphingolipids and cholesterol in the outer leaflet produce a mi- 
crodomain, slightly thicker than other membrane regions, that is en- 
riched with specific types of membrane proteins, GPI-linked proteins 
are common in the outer leaflet of these rafts, and proteins with one or 
several covalently attached long-chain acyl groups are common in the 
inner leaflet. Caveolin is especially common in inwardly curved rafts 
called caveolae {see Fig. 1 1-21). Proteins with attached prenyl groups 
(such as Ras; see Box 1 2-2) tend to be excluded from rafts, (b) In this ar- 
tificial membrane— reconstituted (on a mica surface) from cholesterol, 
synthetic phospholipid (dioleoylphosphatidylcholine), and the GPI- 
linked protein placental alkaline phosphatase— the greater thickness of 
raft regions is visualized by atomic force microscopy (see Box 11-1). 
The rafts protrude from a lipid bilayer ocean (the black surface is the top 
of the upper monolayer); sharp peaks represent GPI-linked proteins. 
Note that these peaks are found almost exclusively in the rafts, 

individual raft, which corresponds to a patch containing 
a few thousand sphingolipids and perhaps 10 to 50 
membrane proteins. Because most cells express more 
than 50 different kinds of plasma membrane proteins, it 
is likely that a single raft contains only a subset of mem- 
brane proteins and that this segregation of membrane 
proteins is functionally significant. For a process that in- 
volves interaction of two membrane proteins, their pres- 
ence in a single raft would hugely increase the likelihood 
of their collision. Certain membrane receptors and sig- 
naling proteins, for example, seem to be segregated to- 
gether in membrane rafts. Experiments show that 
signaling through these proteins can be disrupted by 
manipulations that deplete the plasma membrane of 
cholesterol and destroy lipid rafts. 



Caveolin is an integral membrane protein with two 
globular domains connected by a hairpin-shaped hy- 
drophobic domain, which binds the protein to the cytoplas- 
mic leaflet of the plasma membrane. Three palmitoyl 
groups attached to the carboxyl-temunal globular domain 
further anchor it to the membrane. Caveolin (actually, a 
family of related caveolins) binds cholesterol in the mem- 
brane, and the presence of caveolin forces the associated 
lipid bilayer to curve inward, forming caveolae ("little 
caves") in the surface of the cell (Fig. 11-21). Caveolae 
are unusual rafts: they involve both leaflets of the bilayer — 
the cytoplasmic leaflet, from which the caveolin globular 
domains project, and the extracellular leaflet, a typical 
sphingolipid/cholesterol raft with associated GPI-anchored 
proteins. Caveolae are implicated in a variety of cellular 
functions, including membrane trafficking within cells and 
the transduction of external signals into cellular responses. 
The receptors for insulin and other growth factors, as well 
as certain GTP-binding proteins and protein kinases asso- 
ciated with transmembrane signaling, seem to be localized 
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FIGURE 11-21 Caveolin forces inward curvature of a membrane. 

Caveolae are small invaginations in the plasma membrane, as seen in 
(a) an electron micrograph of an adipocyte surface-labeled with an 
electron-dense marker, (b) Each caveolin monomer has a central hy- 
drophobic domain and three long-chain acyl groups (red), which hold 
the molecule to the inside of the plasma membrane. When several 
caveolin dinners are concentrated in a small region (a raft), they force a 
curvature in the lipid bilayer, forming a caveola. Cholesterol molecules 
in the bilayer are shown in orange. 
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in rafts and perhaps in caveolae. We discuss some possible 
roles of rafts in signaling in Chapter 12. 

Membrane Curvature and Fusion Are Central 
to Many Biological Processes 

Caveolin is not unique in its ability to induce curvature in 
membranes. Changes of curvature are central to one of 
the most remarkable features of biological membranes: 
their ability to undergo fusion with other membranes 
without losing their continuity Although membranes are 
stable, they are by no means static. Within the eukaryotic 
endomembrane system (which includes the nuclear mem- 
brane, endoplasmic reticulum, Golgi, and various small 
vesicles), the membranous compartments constantly re- 
organize. Vesicles bud from the ER to carry newly synthe- 
sized lipids and proteins to other organelles and to the 
plasma membrane. Exocytosis, endocytosis, cell division, 
fusion of egg and sperm cells, and entry of a membrane- 
enveloped virus into its host cell all involve membrane re- 
organization in which the fundamental operation is fusion 
of two membrane segments without loss of continuity 
(Fig. 11-22). Most of these processes begin with a local 
increase in membrane curvature. Three mechanisms for 
inducing membrane curvature are shown in Figure 
11-23. A protein that is intrinsically curved may force 
curvature in a bilayer by binding to it; the binding energy 
provides the driving force for the increase in bilayer curva- 
ture. Alternatively, many subunits of a scaffold protein 
may assemble into curved supramolecular complexes and 
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stabilize curves that spontaneously form in the bilayer. Or, 
a protein may insert one or more hydrophobic helices into 
one face of the bilayer, expanding its area relative to the 
other face and thereby forcing curvature. 

Specific fusion of two membranes requires that (1) 
they recognize each other; (2) their surfaces become 
closely apposed, which requires the removal of water 
molecules normally associated with the polar head 
groups of lipids; (3) their bilayer structures become lo- 
cally disrupted, resulting in fusion of the outer leaflet of 
each membrane (hemifusion); and (4) their bilayers fuse 
to form a single continuous bilayer. The fusion occurring 
in receptor-mediated endocytosis, or regulated secretion, 
also requires that (5) the process is triggered at the ap- 
propriate time or in response to a specific signal. Integral 
proteins called fusion proteins mediate these events, 
bringing about specific recognition and a transient local 
distortion of the bilayer structure that favors membrane 
fusion. (Note that these fusion proteins are unrelated to 
the products encoded by two fused genes, also called fu- 
sion proteins, discussed in Chapter 9.) 



A protein with intrinsic curvature on 
its surface interacts strongly with a 
curved membrane surface, allowing 
both membrane and protein to 
achieve their lowest energy. 





FIGURE 11-22 Membrane fusion. The fusion of two membranes is 
central to a variety of cellular processes involving organelles and the 
plasma membrane. 



If a membrane region spontaneously 
curves, monomeric subunits of 
certain proteins can polymerize into 
a superstructure that favors and 
maintains the curvature. 



A protein with one or more 
amphipathic helices inserted into 
one leaflet of the bilayer crowds the 
lipids in that leaflet, forcing the 
membrane to bend. 

FIGURE 11-23 Three models for protein-induced curvature of 
membranes. 
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FIGURE 11-24 Membrane fusion during neurotransmitter release at a 
synapse. The secretory vesicle membrane contains the v-SNARE 
synaptobrevin (red). The target (plasma) membrane contains the 
t-SNAREs syntaxin (blue) and SNAP25 (violet). When a local increase in 
[Ca 2+ ] signals release of neurotransmitter, the v-SNARE, SNAP25, and 
t-SNARE interact, forming a coiled bundle of four a helices, pulling the 
two membranes together and disrupting the bilayer locally. This leads 
first to hemifusion, joining the inner leaflets of the two membranes, 
then to complete membrane fusion and neurotransmitter release. 



A well-studied example of membrane fusion is that 
occurring at synapses, when intracellular vesicles 
loaded with neurotransmitter fuse with the plasma 
membrane. This process involves a family of proteins 
called SNARES (Fig. 11-24). SNAREs in the cytoplas- 
mic face of the intracellular vesicle are called v- 
SNAREs; those in the target membrane with which the 
vesicle fuses (the plasma membrane during exocytosis) 
are t-SNAREs. Two other proteins, SNAP25 and NSF, 
are also involved. During fusion, a v-SNARE and t- 
SNARE bind to each other and undergo a structural 
change that produces a bundle of long thin rods made 
up of helices from both SNARES and two helices from 
SNAP25 (Fig. 11-24). The two SNAREs initially interact 
at their ends, then zip up into the bundle of helices. This 
structural change pulls the two membranes into contact 
and initiates the fusion of their lipid bilayers. 

The complex of SNAREs and SNAP25 is the target 
of the powerful Clostridium botulinum toxin, a pro- 
tease that cleaves specific bonds in these proteins, pre- 
venting neurotransmission and thereby causing the 
death of the organism. Because of its very high speci- 
ficity for these proteins, purified botulinum toxin has 
served as a powerful tool for dissecting the mechanism 
of neurotransmitter release in vivo and in vitro. 

Integral Proteins of the Plasma Membrane Are involved in 
Surface Adhesion, Signaling, and Other Cellular Processes 

Several families of integral proteins in the plasma mem- 
brane provide specific points of attachment between 
cells, or between a cell and extracellular matrix pro- 
teins. Integrins are surface adhesion proteins that me- 
diate a cell's interaction with the extracellular matrix 
and with other cells, including some pathogens. Inte- 
grins also carry signals in both directions across the 
plasma membrane, integrating information about the 
extracellular and intracellular environments. All inte- 
grins are heterodimeric proteins composed of two un- 
like subunits, a and /3, each anchored to the plasma 
membrane by a single transmembrane helix. The large 
extracellular domains of the a and /3 subunits combine 
to form a specific binding site for extracellular proteins 
such as collagen and fibronectin, which contain a com- 
mon determinant of integrin binding, the sequence 
Arg-Gly-Asp (RGD). We discuss the signaling functions 
of integrins in more detail in Chapter 12 (p. 455). 

Other plasma membrane proteins involved in sur- 
face adhesion are the cadherins, which undergo ho- 
mophilic ("with same kind") interactions with identical 
cadherins in an adjacent cell. Selectins have extracel- 
lular domains that, in the presence of Ca 2+ , bind spe- 
cific polysaccharides on the surface of an adjacent cell. 
Selectins are present primarily in the various types of 
blood cells and in the endothelial cells that line blood 
vessels (see Fig. 7-31). They are an essential part of the 
blood-clotting process. 

Integral membrane proteins play roles in many other 
cellular processes. They serve as transporters and ion 
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channels (discussed in Section 11.3) and as receptors for 
hormones, neurotransmitters, and growth factors (Chap- 
ter 12). They are central to oxidative phosphorylation and 
photophosphorylation (Chapter 19) and to cell-cell and 
cell-antigen recognition in the immune system (Chap- 
ter 5). Integral proteins are also important players in the 
membrane fusion that accompanies exocytosis, endocyto- 
sis, and the entry of many types of viruses into host cells. 

SUMMARY 1 1.2 Membrane Dynamics 

■ Lipids in a biological membrane can exist in 
liquid-ordered or liquid-disordered states; in the 
latter state, thermal motion of acyl chains makes 
the interior of the bilayer fluid. Fluidity is affected 
by temperature, fatty acid composition, and sterol 
content. 

■ Flip-flop diffusion of lipids between the inner and 
outer leaflets of a membrane is very slow except 
when specifically catalyzed by flippases, floppases, 
or scramblases. 

■ Lipids and proteins can diffuse laterally within the 
plane of the membrane, but this mobility is limited 
by interactions of membrane proteins with internal 
cytoskeletal structures and interactions of lipids 
with lipid rafts. One class of lipid rafts consists of 
sphingolipids and cholesterol with a subset of 
membrane proteins that are GPI-linked or attached 
to several long-chain fatty acyl moieties. 
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u Caveolin is an integral membrane protein that 
associates with the inner leaflet of the plasma 
membrane, forcing it to curve inward to form 
caveolae, probably involved in membrane transport 
and signaling. 

m Specific proteins cause local membrane curvature 
and mediate the fusion of two membranes, which 
accompanies processes such as endocytosis, 
exocytosis, and viral invasion. 

m Integrins are transmembrane proteins of the plasma 
membrane that act both to attach cells to each other 
and to carry messages between the extracellular 
matrix and the cytoplasm. 

1 1 .3 Solute Transport across Membranes 

Every living cell must acquire from its surroundings the 
raw materials for biosynthesis and for energy production, 
and must release to its environment the byproducts of 
metabolism. A few nonpolar compounds can dissolve in 
the lipid bilayer and cross the membrane unassisted, but 
for transmembrane movement of any polar compound or 
ion, a membrane protein is essential In some cases a 
membrane protein simply facilitates the diffusion of a 
solute down its concentration gradient, but transport can 
also occur against a gradient of concentration, electrical 
charge, or both, in which case the process requires 
energy (Fig. J 1-25). The energy may come directly 
from ATP hydrolysis or may be 
supplied in the form of one solute 
moving down its electrochemical 
gradient, with the release of 
enough energy to drive another 
solute up its gradient. Ions may 
also move across membranes via 
ion channels formed by proteins, 
or they may be carried across by 
ionophores, small molecules that 
mask the charge of ions and allow 
them to diffuse through the lipid 
bilayer. With very few exceptions, 
the traffic of small molecules 
across the plasma membrane is 
mediated by proteins such as 
transmembrane channels, carriers, 
or pumps. Within the eukaryotic 
cell, different compartments have 
different concentrations of ions 
and of metabolic intermediates 
and products, and these, too, must 
move across intracellular mem- 
branes in tightly regulated, pro- 
tein-mediated processes. 
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FIGURE 11-25 Summary of 
transport types. 
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FIGURE 11-26 Movement of solutes across a per- 
meable membrane, (a) Net movement of an elec- 
trically neutral solute is toward the side of lower 
solute concentration until equilibrium is achieved. 
The solute concentrations on the left and right 
sides of the membrane are designated C| and C 2 . 
The rate of transmembrane movement (indicated 
by the arrows) is proportional to the concentration 
gradient, C 2 /C r , (b) Net movement of an electri- 
cally charged solute is dictated by a combination 
of the electrical potential (V m ) and the chemical 
concentration difference (C 2 /Ci) across the mem- 
brane; net ion movement continues until this elec- 
trochemical potential reaches zero. 



Passive Transport is Facilitated by Membrane Proteins 

When two aqueous compartments containing unequal 
concentrations of a soluble compound or ion are sepa- 
rated by a permeable divider (membrane), the solute 
moves by simple diffusion from the region of higher 
concentration, through the membrane, to the region of 
lower concentration, until the two compartments have 
equal solute concentrations (Fig. ll-26a). When ions of 
opposite charge are separated by a permeable mem- 
brane, there is a transmembrane electrical gradient, a 
membrane potential, V m (expressed in millivolts) . This 
membrane potential produces a force opposing ion move- 
ments that increase V m and driving ion movements that 
reduce V m (Fig. ll-26b). Thus the direction in which a 
charged solute tends to move spontaneously across a 
membrane depends on both the chemical gradient (the 
difference in solute concentration) and the electrical gra- 
dient (F m ) across the membrane. Together, these two 
factors are referred to as the electrochemical gradient 
or electrochemical potential. This behavior of solutes 
is in accord with the second law of thermodynamics: mol- 
ecules tend to spontaneously assume the distribution of 
greatest randomness and lowest energy. 

To pass through a lipid bilayer, a polar or charged 
solute must first give up its interactions with the water 
molecules in its hydration shell, then diffuse about 3 nm 
(30 A) through a substance (lipid) in which it is poorly 
soluble (Fig. 11-27). The energy used to strip away the 
hydration shell and to move the polar compound from 
water into lipid, then through the lipid bilayer, is regained 
as the compound leaves the membrane on the other 
side and is rehydrated. However, the intermediate stage 
of transmembrane passage is a high-energy state com- 
parable to the transition state in an enzyme-catalyzed 
chemical reaction. In both cases, an activation barrier 
must be overcome to reach the intermediate stage 
(Fig. 11-27; compare with Fig. 6-3). The energy of 
activation (AG*) for translocation of a polar solute 
across the bilayer is so large that pure lipid bilayers are 
virtually impermeable to polar and charged species over 
periods of time relevant to cell growth and division. 



Membrane proteins lower the activation energy for 
transport of polar compounds and ions by providing an alter- 
native path through the bilayer for specific solutes. Proteins 
that bring about this facilitated diffusion, or passive 
transport, are not enzymes in the usual sense; their "sub- 
strates" are moved from one compartment to another but 
are not chemically altered. Membrane proteins that speed 




FIGURE 1 1 -27 Energy changes accompanying passage of a hydrophilic 
solute through the lipid bilayer of a biological membrane, (a) In sim- 
ple diffusion, removal of the hydration shell is highly endergonic, and 
the energy of activation (AC*) for diffusion through the bilayer is very 
high, (b) A transporter protein reduces the AC* for transmembrane dif- 
fusion of the solute. It does this by forming noncovalent interactions 
with the dehydrated solute to replace the hydrogen bonding with wa- 
ter and by providing a hydrophilic transmembrane pathway. 
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the movement of a solute across a membrane by facilitating 
diffusion are called transporters or permeases. 

Like enzymes, transporters bind their substrates with 
stereochemical specificity through multiple weak, non- 
covalent interactions. The negative free-energy change 
associated with these weak interactions, AG binding , 
counterbalances the positive free-energy change that 
accompanies loss of the water of hydration from the 



substrate, AG 



, thereby lowering AG* for trans- 



dehydration 

membrane passage (Fig. 11-27). Transporters span the 
lipid bilayer several times, forming a transmembrane 
channel lined with hydrophilic amino acid side chains. 
The channel provides an alternative path for a specific 
substrate to move across the lipid bilayer without its 
having to dissolve in the bilayer, further lowering AG* 
for transmembrane diffusion. The result is an increase 
of several to many orders of magnitude in the rate of 
transmembrane passage of the substrate. 

Transporters Can Be Grouped into Superfamiiies 
Based on Their Structures 

We know from genomic studies that transporters con- 
stitute a significant fraction of all proteins encoded in 
the genomes of both simple and complex organisms. 
There are probably a thousand or more different trans- 
porters in the human genome. Transporters fall within 
two very broad categories: carriers and channels (Fig. 
11-28). Carriers bind their substrates with high stereo- 
specificity, catalyze transport at rates well below the 
limits of free diffusion, and are saturable in the same 
sense as are enzymes: there is some substrate concen- 
tration above which further increases will not produce a 
greater rate of transport. Channels generally allow 
transmembrane movement at rates orders of magnitude 
greater than those typical of carriers, rates approaching 
the limit of unhindered diffusion. Channels typically 
show less stereospecificity than carriers and are usually 
not saturable. Most channels are oligomeric complexes 
of several, often identical, subunits, whereas many car- 
riers function as monomeric proteins. The classification 
as carrier or channel is the broadest distinction among 
transporters. Within each of these categories are super- 
families of various types, defined not only by their pri- 

Transporters 



mary sequences but by their secondary structures. 
Some channels are constructed primarily of helical 
transmembrane segments, others have /3-barrel struc- 
tures. Among the carriers, some simply facilitate diffu- 
sion down a concentration gradient; they are the 
passive transporter superfamily. Active trans- 
porters can drive substrates across the membrane 
against a concentration gradient, some using energy 
provided directly by a chemical reaction (primary ac- 
tive transporters) and some coupling uphill transport of 
one substrate with downhill transport of another (sec- 
ondary active transporters). We now consider some 
well-studied representatives of the main transporter 
superfamiiies. You will encounter some of these trans- 
porters again in later chapters in the context of the 
metabolic pathways in which they participate. 

The Glucose Transporter of Erythrocytes 
Mediates Passive Transport 

Energy-yielding metabolism in erythrocytes depends on 
a constant supply of glucose from the blood plasma, 
where the glucose concentration is maintained at about 
5 ium. Glucose enters the erythrocyte by facilitated dif- 
fusion via a specific glucose transporter, at a rate about 
50,000 times greater than uncatalyzed transmembrane 
diffusion. The glucose transporter of erythrocytes 
(called GLUT1 to distinguish it from related glucose 
transporters in other tissues) is a type III integral pro- 
tein (M r —45,000) with 12 hydrophobic segments, each 
of which is believed to form a membrane-spanning helix. 
The detailed structure of GLUT1 is not yet known, but 
one plausible model suggests that the side-by-side as- 
sembly of several helices produces a transmembrane 
channel lined with hydrophilic residues that can hydro- 
gen-bond with glucose as it moves through the channel 
(Fig. 11-29). 

The process of glucose transport can be described 
by analogy with an enzymatic reaction in which the 
"substrate" is glucose outside the cell (S out ), the "prod- 
uct" is glucose inside (S in ), and the "enzyme" is the 
transporter, T. When the initial rate of glucose uptake is 
measured as a function of external glucose concentra- 
tion (Fig. 11-30), the resulting plot is hyperbolic; at 
high external glucose concentrations the rate of uptake 
approaches ^ max - Formally, such a transport process 
can be described by the equations 
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FIGURE 11-28 Classification of transporters. 



T x 



S in + T 2 S in • T 2 

in which k u /c_ 1} and so forth, are the forward and re- 
verse rate constants for each step; T x is the transporter 
conformation in which the glucose-binding site faces 
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FIGURE 11-29 Proposed structure of GLUT1. (a) Transmembrane he- 
lices are represented here as oblique (angled) rows of three or four 
amino acid residues, each row depicting one turn of the a helix. Nine 
of the 1 2 helices contain three or more polar or charged residues (blue 
or red), often separated by several hydrophobic residues (yellow). This 
representation of topology is not intended to represent three-dimen- 
sional structure, (b) A helical wheel diagram shows the distribution of 
polar and nonpolar residues on the surface of a helical segment. The 
helix is diagrammed as though observed along its axis from the amino 
terminus. Adjacent residues in the linear sequence are connected, and 
each residue is placed around the wheel in the position it occupies in 
the helix; recall that 3.6 residues are required to make one complete 
turn of the a helix. In this example, the polar residues (blue) are on one 
side of the helix and the hydrophobic residues (yellow) on the other. 
This is, by definition, an amphipathic helix, (c) Side-by-side associa- 
tion of four amphipathic helices, each with its polar face oriented 
toward the central cavity, can produce a transmembrane channel lined 
with polar (and charged) residues. This channel provides many oppor- 
tunities for hydrogen bonding with glucose as it moves through. 



out, and T 2 the conformation in which it faces in. The 
steps are summarized in Figure 1 1-3 1 . Given that every 
step in this sequence is reversible, the transporter is, in 
principle, equally able to move glucose into or out of the 
cell. However, glucose always moves down its concentra- 
tion gradient, which normally means into the cell. Glucose 
that enters a cell is generally metabolized immediately, 
and the intracellular glucose concentration is thereby 
kept low relative to its concentration in the blopd. 




Extracellular glucose 
(a) concentration, [S] out (him) 




(b) K t 

FIGURE 11-30 Kinetics of glucose transport into erythrocytes, (a) The 

initial rate of glucose entry into an erythrocyte, V 0f depends on the 
initial concentration of glucose on the outside, [S] out . (b) Double- 
reciprocal plot of the data in (a). The kinetics of facilitated diffusion is 
analogous to the kinetics of an enzyme-catalyzed reaction. Compare 
these plots with Figure 6-1 1 , and with Figure 1 in Box 6-1 . Note that 
K t is analogous to K mf the Michaelis constant. 
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FIGURE 11-31 Model of glucose transport into erythrocytes by 
GLUT1. The transporter exists in two conformations: J u with the 
glucose-binding site exposed on the outer surface of the plasma mem- 
brane, and T 2 , with the binding site exposed on the inner surface. Glu- 
cose transport occurs in four steps. (T) Glucose in blood plasma binds 
to a stereospecific site on T^; this lowers the activation energy 
for (2) a conformational change from glucose out • Ti to glucose in ■ T 2 , 
effecting the transmembrane passage of the glucose. ® Glucose is re- 
leased fromT 2 into the cytoplasm, and @ the transporter returns to the 
Ti conformation, ready to transport another glucose molecule. 
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j TABLE 11-3 


Glucose Transporters in the Human Genome 






Transporter 


Tissue(s) where expressed 


Gene 


Role* 


GLUT1 


Ubiquitous 




rsasai glucose upLaKe 


GLUT2 


Liver, pancreatic islets, intestine 


QT H9 A 9 


In liver, removal of excess glucose from blood; 








in pancreas, regulation of insulin release 


GLUT3 


Brain (neuronal) 


SLC2A3 


Basal glucose uptake 


GLUT4 


Muscle, fat, heart 


SLC2A4 


Activity increased by insulin 


GLUTS 


Intestine, testis, kidney, sperm 


SLC2A5 


Primarily fructose transport 


GLUT6 


Spleen, leukocytes, brain 


SLC2A6 


Possibly no transporter function 


GLUT7 


Liver microsomes 


SLC2A7 




GLUT8 


Testis, blastocyst, brain 


SLC2A8 




GLUT9 


ijivci, jviui icy 


SLC2A9 




GLUT10 


Liver, pancreas 


SLC2A10 




GLUT11 


Heart, skeletal muscle 


SLC2A11 




GLUT 12 


Skeletal muscle, adipose, small intestine 


SLC2A12 





*Dash indicates role uncertain. 



which differ only in the position of one hydroxyl group, 
the values of K t are 20 and 30 him, respectively; and for 
L-glucose, K t exceeds 3,000 mM. Thus GLUT1 shows the 
three hallmarks of passive transport: high rates of diffu- 
sion down a concentration gradient, saturability, and 
specificity. 

Twelve glucose transporters are encoded in the 
human genome, each with its unique kinetic proper- 
ties, patterns of tissue distribution, and function 
(Table 11-3). In liver, GLUT2 transports glucose out of 
hepatocytes when liver glycogen is broken down to 
replenish blood glucose. GLUT2 has a K t of about 66 
mM and can therefore respond to increased levels of in- 
tracellular glucose (produced by glycogen breakdown) 
by increasing outward transport. Skeletal and heart 
muscle and adipose tissue have yet another glucose 
transporter, GLUT4 (K t = 5 mM), which is distin- 
guished by its response to insulin: its activity increases 
when insulin signals a high blood glucose concentra- 
tion, thus increasing the rate of glucose uptake into 
muscle and adipose tissue (Box 11-2 describes some 
malfunctions of this transporter). 

The Chloride-Bicarbonate Exchanger Catalyzes 
Electroneutral Cotransport of Anions across 
the Plasma Membrane 

The erythrocyte contains another facilitated diffusion sys- 
tem, an anion exchanger that is essential in C0 2 transport 
to the lungs from tissues such as skeletal muscle and liver. 
Waste C0 2 released from respiring tissues into the blood 
plasma enters the erythrocyte, where it is converted to bi- 
carbonate (HCO3 ) by the enzyme carbonic anhydrase. 
(Recall that HCO3 is the primary buffer of blood pH; see 
Fig. 2-20.) The HCO3 reenters the blood plasma for 



The rate equations for glucose transport can be 
derived exactly as for enzyme-catalyzed reactions 
(Chapter 6), yielding an expression analogous to the 
Michaelis-Menten equation: 

in which V 0 is the initial velocity of accumulation of 
glucose inside the cell when its concentration in the 
surrounding medium is [S] out , and K t (iT transport ) is a 
constant analogous to the Michaelis constant, a combi- 
nation of rate constants that is characteristic of each 
transport system. This equation describes the initial 
velocity, the rate observed when [S] in = 0. As is the 
case for enzyme-catalyzed reactions, the slope-inter- 
cept form of the equation describes a linear plot of 1/V 0 
against l/[S] out , from which we can obtain values of K t 
and V max (Fig. ll-30b). When [S] out = K t , the rate of 
uptake is V2F max ; the transport process is half-satu- 
rated. The concentration of glucose in blood is 4.5 to 5 
mM, about three times K t9 which ensures that GLUT1 is 
nearly saturated with substrate and operates near 

v max- 

Because no chemical bonds are made or broken in 
the conversion of S out to S in , neither "substrate" nor 
"product" is intrinsically more stable, and the process 
of entry is therefore fully reversible. As [S] in approaches 
[S] out , the rates of entry and exit become equal. Such a 
system is therefore incapable of accumulating glucose 
within a cell at concentrations above that in the sur- 
rounding medium; it simply equilibrates glucose on the 
two sides of the membrane much faster than would oc- 
cur in the absence of a specific transporter. GLUT1 is 
specific for D-glucose, with a measured K t of 1.5 mM. 
For the close analogs D-mannose and D-galactose. 
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SOX 1 1 -2 ■ Defective Glucose and Water Transport in 

igmjUgm Two Forms of Diabetes 



When ingestion of a carbohydrate-rich meal causes 
blood glucose to exceed the usual concentration be- 
tween meals (about 5 mM), excess glucose is taken up 
by the myocytes of cardiac and skeletal muscle (which 
store it as glycogen) and by adipocytes (which convert 
it to triacylglycerols). Glucose uptake into myocytes 
and adipocytes is mediated by the glucose transporter 
GLUT4. Between meals, some GLUT4 is present in the 
plasma membrane, but most is sequestered in the 
membranes of small intracellular vesicles (Fig. 1). In- 
sulin released from the pancreas in response to high 
blood glucose triggers the movement of these intracel- 
lular vesicles to the plasma membrane, where they 
fuse, thus exposing GLUT4 molecules on the outer sur- 
face of the cell (see Fig. 12-16). With more GLUT4 mol- 
ecules in action, the rate of glucose uptake increases 
15-fold or more. When blood glucose levels return to 
normal, insulin release slows and most GLUT4 mole- 
cules are removed from the plasma membrane and 
stored in vesicles. 

In type 1 (juvenile-onset) diabetes mellitus, the in- 
ability to release insulin (and thus to mobilize glucose 



transporters) results in low rates of glucose uptake into 
muscle and adipose tissue. One consequence is a pro- 
longed period of high blood glucose after a carbohydrate- 
rich meal. This condition is the basis for the glucose 
tolerance test used to diagnose diabetes (Chapter 23). 

The water permeability of epithelial cells lining the 
renal collecting duct in the kidney is due to the presence 
of an aquaporin (AQP-2) in their apical plasma mem- 
branes (facing the lumen of the duct). Vasopressin (an- 
tidiuretic hormone, ADH) regulates the retention of 
water by mobilizing AQP-2 molecules stored in vesicle 
membranes within the epithelial cells, much as insulin 
mobilizes GLUT4 in muscle and adipose tissue. When 
the vesicles fuse with the epithelial cell plasma mem- 
brane, water permeability greatly increases and more 
water is reabsorbed from the collecting duct and re- 
turned to the blood. When the vasopressin level drops, 
AQP-2 is resequestered within vesicles, reducing water 
retention. In the relatively rare human disease diabetes 
insipidus, a genetic defect in AQP-2 leads to impaired 
water reabsorption by the kidney. The result is excre- 
tion of copious volumes of very dilute urine. 



® 



Insulin 



When insulin interacts with its receptor, vesicles 
move to surface and fuse with the plasma 
membrane, increasing the number of glucose 
transporters in the plasma membrane. 
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membrane 
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Glucose transporters 
"stored" within ceil in 
membrane vesicles. 



Patches of the endosome enriched with 
glucose transporters bud off to become 
small vesicles, ready to return to the 
surface when insulin levels rise again. 




When insulin level drops, 
glucose transporters are 
removed from the plasma 
membrane by endocytosis, 
forming small vesicles. 



5? 



The smaller 
vesicles fuse with 
larger endosome. 



FIGURE 1 Transport of glucose into a myocyte by GLUT4 is regulated by insulin. 
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FIGURE 11-32 Chloride-bicarbonate exchanger of the erythrocyte 
membrane. This cotransport system allows the entry and exit of HCO^ 
without changing the membrane potential. Its role is to increase the 
C0 2 -carrying capacity of the blood. 

transport to the lungs (Fig. 11-32). Because HCO3 is 
much more soluble in blood plasma than is C0 2 , this 
roundabout route increases the capacity of the blood to 
carry carbon dioxide from the tissues to the lungs. In the 
lungs, HCO3 reenters the erythrocyte and is converted to 
C0 2 , which is eventually released into the lung space and 
exhaled. To be effective, this shuttle requires very rapid 
movement of HCO3 across the erythrocyte membrane. 

The chloride-bicarbonate exchanger, also called 
the anion exchange (AE) protein, increases the rate 
of HCO3 transport across the erythrocyte membrane 
more than a millionfold. Like the glucose transporter, it is 
an integral protein that probably spans the membrane at 
least 12 times. This protein mediates the simultaneous 



movement of two anions: for each HCO3 ion that moves 
in one direction, one Cl~ ion moves in the opposite direc- 
tion (Fig. 11-33), with no net transfer of charge; the ex- 
change is electronentral. The coupling of CF and 
HCO3 movements is obligatory; in the absence of chlo- 
ride, bicarbonate transport stops. In this respect, the an- 
ion exchanger is typical of those systems, called 
cotransport systems, that simultaneously carry two 
solutes across a membrane. When, as in this case, the 
two substrates move in opposite directions, the process 
is antiport. In symport, two substrates are moved si- 
multaneously in the same direction. Transporters that 
carry only one substrate, such as the erythrocyte glucose 
transporter, are known as uniport systems (Fig. 11-33). 

The human genome has genes for three closely re- 
lated chloride-bicarbonate exchangers, all with the 
same predicted transmembrane topology. Erythrocytes 
contain the AE1 transporter, AE2 is prominent in liver, 
and AE3 is present in plasma membranes of the brain, 
heart, and retina. Similar anion exchangers are also 
found in plants and microorganisms. 

Active Transport Results in Solute Movement against a 
Concentration or Electrochemical Gradient 

In passive transport, the transported species always moves 
down its electrochemical gradient and is not accumulated 
above the equilibrium concentration. Active transport, 
by contrast, results in the accumulation of a solute above 
the equilibrium point. Active transport is thermodynam- 
ically unfavorable (endergonic) and takes place only 
when coupled (directly or indirectly) to an exergonic 
process such as the absorption of sunlight, an oxidation 
reaction, the breakdown of ATP, or the concomitant flow 
of some other chemical species down its electrochemical 
gradient. In primary active transport, solute accumula- 
tion is coupled directly to an exergonic chemical reaction, 
such as conversion of ATP to ADP + P { (Fig. 11-34). 



Uniport 



Symport 




Cotransport 



FIGURE 11-33 Three general classes of transport systems. Transporters 
differ in the number of solutes (substrates) transported and the direc- 
tion in which each solute moves. Examples of all three types of trans- 
porters are discussed in the text. Note that this classification tells us 
nothing about whether these are energy-requiring (active transport) or 
energy-independent (passive transport) processes. 
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FIGURE 11-34 Two types of active transport, (a) In primary active 
transport the energy released by ATP hydrolysis drives solute move- 
ment against an electrochemical gradient, (b) In secondary active 
transport, a gradient of ion X (often Na + ) has been established by pri- 
mary active transport. Movement of X down its electrochemical gradi- 
ent now provides the energy to drive cotransport of a second solute (S) 
against its electrochemical gradient. 
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Secondary active transport occurs when endergonic 
(uphill) transport of one solute is coupled to the exergonic 
(downhill) flow of a different solute that was originally 
pumped uphill by primary active transport. 

The amount of energy needed for the transport of a 
solute against a gradient can be calculated from the ini- 
tial concentration gradient. The general equation for the 
free-energy change in the chemical process that con- 
verts S to P is 

AG = AG' 0 + RT\n ([P]/[S]) (11-2) 

where AG'° is the standard free-energy change, R is the 
gas constant, 8.315 J/mol • K, and T is the absolute tem- 
perature. When the "reaction" is simply transport of a 
solute from a region where its concentration is C Y to a 
region where its concentration is C 2 , no bonds are made 
or broken and AG' 0 is zero. The free-energy change for 
transport, AG t) is then 

AG t = RT In (XVCj) g(ll-3) 

If there is a 10-fold difference in concentration between 
two compartments, the cost of moving 1 mol of an un- 
charged solute at 25 °C uphill across a membrane sepa- 
rating the compartments is 

AG = (8.315 J/mol -K) (298 K) In (10/1) - 5,700 J/mol 

= 5.7 kJ/mol 

Equation 11-3 holds for all uncharged solutes. 

Hi WORKED EXAMPLE 11-1 Energy Cost of Pumping 

an Uncharged Solute 

Calculate the energy cost (free-energy change) of 
pumping an uncharged solute against a 1.0 X 10 4 -fold 
concentration gradient at 25 °C. 

Solution: Begin with Equation 11-3. Substitute 1.0 x 10 4 
for ((VCD, 8.315 J/mol • K for R, and 298 K for T: 

AG t = RT In (C 2 /d) 

= (8.315 J/mol -K)(298 K) In (1.0 x 10 4 ) 

= 23 kJ/mol 



When the solute is an ion, its movement without an 
accompanying counterion results in the endergonic sep- 
aration of positive and negative charges, producing an 
electrical potential; such a transport process is said to 
be electrogenic. The energetic cost of moving an ion 
depends on the electrochemical potential (p. 390), the 
sum of the chemical and electrical gradients: 

AG t = RTln (C 2 /d) + ZJAtp §(11-4) 

where Z is the charge on the ion, J is the Faraday con- 
stant (96,480 J/V • mol), and Aif/ is the transmembrane 
electrical potential (in volts). Eukaryotic cells typically 
have plasma membrane potentials of about 0.05 V (with 



the inside negative relative to the outside), so the sec- 
ond term of Equation 1 1-4 can make a significant contri- 
bution to the total free-energy change for transporting 
an ion. Most cells maintain more than a 10-fold differ- 
ence in ion concentrations across their plasma or intra- 
cellular membranes, and for many cells and tissues 
active transport is therefore a major energy-consuming 
process. 

WORKED EXAMPLE 11-2 Energy Cost of Pumping 

a Charged Solute 

Calculate the energy cost (free-energy change) of 
pumping Ca 2+ from the cytosol, where its concentration 
is about 1.0 X 10" 7 m, to the extracellular fluid, where its 
concentration is about 1.0 mM. Assume a temperature of 
37 °C (body temperature in a mammal) and a standard 
transmembrane potential of 50 mV (inside negative) for 
the plasma membrane. 

Solution: In this calculation, both the concentration gra- 
dient and the electrical potential must be taken into ac- 
count. In Equation 1 1-4, substitute 8.315 J/mol • K for R, 
310 K for T, 1.0 X 10" 3 for C 2 , 1.0 X 10~ 7 for C u 96,500 
J/V • mol for J, +2 (the charge on a Ca 2+ ion) for Z, and 
0.050 V for Aip. Note that the transmembrane potential is 
50 mV (inside negative) , so the change in potential when 
an ion moves from inside to outside is 50 mV. 

AG t = RT In (CyCi) + Z3 Aip 

1 0 X 10~ 3 

= (8.315 J/mol -10(310 K) In — ^— 

1.0 x 10" 7 

+ 2 (96,500 J/V*mol)(0.050 V) 
= 33 kJ/mol 



The mechanism of active transport is of fundamen- 
tal importance in biology. As we shall see in Chapter 19, 
ATP is formed in mitochondria and chloroplasts by a 
mechanism that is essentially ATP-driven ion transport 
operating in reverse. The energy made available by the 
spontaneous flow of protons across a membrane is cal- 
culable from Equation 11-4; remember that AG for flow 
down an electrochemical gradient has a negative value, 
and AG for transport of ions against an electrochemical 
gradient has a positive value. 

P-Type ATPases Undergo Phosphorylation 
during Their Catalytic Cycles 

The family of active transporters called P-type ATP- 
ases are cation transporters that are reversibly phospho- 
rylated by ATP (thus the name P-type) as part of the 
transport cycle. Phosphorylation forces a conformational 
change that is central to movement of the cation across 
the membrane. The human genome encodes at least 70 
P-type ATPases that share similarities in amino acid se- 
quence and topology, especially near the Asp residue 



that undergoes phosphorylation. All are integral proteins 
with 8 or 10 predicted membrane-spanning regions in a 
single polypeptide (type III in Fig. 11-8), and all are sen- 
sitive to inhibition by the phosphate analog vanadate. 

O- Cr 

I I 
0=P-Cr o=V-Cr 

I I 
OH OH 

Phosphate Vanadate 

The P-type ATPases are widespread in eukaryotes 
and bacteria. In animal tissues, the Ca 2+ ATPase (a uni- 
porter for Ca 2+ ions) and the Na + K + ATPase (an an- 
tiporter for Na + and K + ions) are P-type ATPases that 
maintain differences in ionic composition between the cy- 
tosol and the extracellular medium. Parietal cells in the 
lining of the mammalian stomach have a P-type ATPase 
that pumps H + and K + across the plasma membrane, 
thereby acidifying the stomach contents. Lipid flippases, 
as we noted above, are structurally and functionally re- 
lated to P-type transporters. In vascular plants, a P-type 
ATPase pumps protons out of cells, establishing an elec- 
trochemical difference of as much as 2 pH units and 250 
mV across the plasma membrane. A similar P-type ATP- 
ase in the bread mold Neurospora pumps protons out 
of cells to establish an inside-negative membrane poten- 
tial, which is used to drive the uptake of substrates and 
ions from the surrounding medium by secondary active 
transport. Bacteria use P-type ATPases to pump out toxic 
heavy metal ions such as Cd 2+ and Cu 2+ . 

The best-understood P-type pumps are the Ca 2+ 
pumps that maintain a low concentration of Ca 2+ in the 
cytosol of virtually all cells. The plasma membrane 
Ca 2+ pump moves calcium ions out of the cell, and an- 
other P-type pump in the endoplasmic reticulum moves 
Ca 2+ into the ER lumen. (In muscle cells, Ca 2+ is nor- 
mally sequestered in a specialized form of ER called the 
sarcoplasmic reticulum; release of this Ca 2+ triggers 
muscle contraction.) 

The sarcoplasmic and endoplasmic reticulum 
calcium (SERCA) pumps are P-type ATPases closely 
related in structure and mechanism. The SERCA pump 
of the sarcoplasmic reticulum, which comprises 80% of 
the protein in that membrane, is a single polypeptide 
(M r —110,000) that spans the membrane 10 times 
(Fig. 1 1-35). Three cytosolic domains formed by long 
loops connect the transmembrane helices: the N do- 
main, where the nucleotide ATP and Mg 2+ bind; the P 
domain, which contains thephosphorylated Asp residue 
characteristic of all P-type ATPases; and the A (actua- 
tor) domain, which communicates movements of the N 
and P domains to the two Ca 2+ -binding sites. The M do- 
main contains the transmembrane helices and the Ca 2+ - 
binding sites, which are located near the middle of the 
membrane bilayer, 40 to 50 A from the phosphorylated 
Asp residue — thus Asp phosphorylation-dephosphory- 
lation does not directly affect Ca 2+ binding. 
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FIGURE 11-35 The Ca 2+ pump of sarcoplasmic reticulum: a SERCA 
pump. (PDB ID 1EUL) Ten transmembrane helices (the M domain, 
framed in pink) surround the path for Ca 2+ movement through the 
membrane. Two of the helices are interrupted near the middle of the 
bilayer; their nonhelical regions form the binding sites for two Ca 2 + 
ions (purple). The carboxylate groups of an Asp residue in one helix 
and a Glu residue in another contribute to the Ca 2+ -binding sites 
Three globular domains extend from the cytosolic side: the N 
(nucleotide-binding) domain (framed in green) contains the binding 
site for ATP; the P (phosphorylation) domain (orange circle) contains 
the Asp 351 residue that undergoes reversible phosphorylation, and the 
A (actuator) domain (blue triangle) mediates the structural changes that 
alter the affinity of the Ca 2+ -binding site and its exposure to cytosol 
or lumen. Note the long distance between the phosphorylation site 
and the Ca 2+ -binding site. This SERCA pump structure is the prototype 
for all P-type ATPases, and it contains some residues (shown in red) 
that are conserved in all members of the P-type ATPase family. 

The mechanism postulated for SERCA pumps 
(Fig. 11-86) takes into account the large conforma- 
tional changes and the phosphorylation-dephosphory- 
lation of the critical Asp residue in the P domain that is 
known to occur during a catalytic cycle. Each catalytic 
cycle moves two Ca 2+ ions across the membrane and 
converts an ATP to ADP and P it The role of ATP binding 
and hydrolysis is to bring about the interconversion of 
two conformations (El and E2) of the transporter. In 
the El conformation, the two Ca 2+ -binding sites are ex- 
posed on the cytosolic side of the ER or sarcoplasmic 
reticulum and bind Ca 2+ with high affinity. ATP bind- 
ing and Asp phosphorylation drive a conformational 
change from El to E2 in which the Ca 2+ -binding sites 
are now exposed on the lumenal side of the membrane 
and their affinity for Ca 2+ is greatly reduced, causing 
Ca 2+ release into the lumen. By this mechanism, the 
energy released by hydrolysis of ATP during one 
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FIGURE 11-36 Postulated mechanism of the SERCA pump. The transport 
cycle begins with the protein in the E1 conformation, with the Ca 2+ - 
binding sites facing the cytosol. Two Ca 2+ ions bind, then ATP binds to 
the transporter and phosphorylates Asp 351 , forming E1-P. Phosphory- 
lation favors the second conformation, E2-P, in which the Ca 2+ -binding 
sites, now with a reduced affinity for Ca 2+ , are accessible on the other 
side of the membrane (the lumen or extracellular space), and the re- 
leased Ca 2+ diffuses away. Finally, E2-P is dephosphorylated, returning 
the protein to the E1 conformation for another round of transport. 

phosphorylation-dephosphorylation cycle drives Ca 2+ 
across the membrane against a large electrochemical 
gradient. 

A variation on this basic mechanism is seen in 
the Na + K + ATPase of the plasma membrane, discov- 
ered by Jens Skou in 1957. This cotransporter couples 
phosphorylation-dephosphorylation of the critical Asp 
residue to the simultaneous movement of both Na + and 
K + against their electrochemical gradients (Fig. 11-37). 
The Na + K + ATPase is responsible for maintaining low 
Na + and high K + concentrations in the cell relative to the 
extracellular fluid (Fig. 11-38). For each molecule of 
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FIGURE 11-37 Postulated mechanism of the Na + K + ATPase. 
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FIGURE 11-38 Role of the Na + K + ATPase in animal cells. In animal cells, 
this active transport system is primarily responsible for setting and main- 
taining the intracellular concentrations of Na + and K + and for generating 
the membrane potential. It does this by moving three Na + out of the cell 
for every two K + it moves in. The electrical potential across the plasma 
membrane is central to electrical signaling in neurons, and the gradient of 
Na + is used to drive the uphill cotransport of solutes in many cell types. 



Extracellular fluid [K + ] = 4 mM 
or blood plasma [Na + ] = 145 mM 
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ATP converted to ADP and P i? 
the transporter moves two K + 
ions inward and three Na + ions 
outward across the plasma 
membrane. Cotransport is 
therefore electrogenic — it cre- 
ates a net separation of charge 
across the membrane; in ani- 
mals, this produces the mem- 
brane potential of -50 to -70 
mV (inside negative relative to 
outside) that is characteristic 
of most cells and is essential to the conduction of action 
potentials in neurons. The central role of the Na + K + ATP- 
ase is reflected in the energy invested in this single reac- 
tion: about 25% of the total energy consumption of a 
human at rest! 

F-Type ATPases Are Reversible, ATP-Driven Proton Pumps 

F-type ATPase active transporters catalyze the uphill 
transmembrane passage of protons driven by ATP hy- 
drolysis. The "F-type" designation derives from the 
identification of these ATPases as energy-coupling 
/actors. The F 0 integral membrane protein complex 
(Fig. 11-39; subscript o denotes its inhibition by the 
drug oligomycin) provides a transmembrane pathway 
for protons, and the peripheral protein F 1 (subscript 1 
indicating this was the first of several factors isolated 
from mitochondria) uses the energy of ATP to drive pro- 
tons uphill (into a region of higher H + concentration). 




FIGURE 11-39 F 0 F-, ATPase/ ATP synthase. F-type ATRases have a periph- 
eral domain, F v consisting of three a subunits, three /3 subunits, oneS sub- 
unit (purple), and a central shaft (the y subunit, green) joined to the integral 
domain through e. The integral domain, F Q (orange), has multiple copies of 
the c subunit (12 shown here), one a subunit, and two b subunits. F Q pro- 
vides a transmembrane channel through which protons are pumped (red 
arrows), about four for each ATP hydrolyzed on the (3 subunits of Ft . The re- 
markable mechanism by which these two events are coupled is described 
in detail in Chapter 1 9. It involves rotation of F Q relative to Ft (black arrow). 
The structures of the and A Q A^ ATfases are essentially similar to that 
of FoFt, and the mechanisms are probably similar, too. 



The F 0 Fx organization of proton-pumping transporters 
must have developed very early in evolution. Bacteria 
such as E. coli use an ATPase complex in their 
plasma membrane to pump protons outward, and ar- 
chaea have a closely homologous proton pump, the 
ATPase. 

The reaction catalyzed by F-type ATPases is re- 
versible, so a proton gradient can supply the energy to 
drive the reverse reaction, ATP synthesis (Fig. 11-40). 
When functioning in this direction, the F-type ATPases 
are more appropriately named ATP synthases. ATP 
synthases are central to ATP production in mitochon- 
dria during oxidative phosphorylation and in chloro- 
plasts during photophosphorylation, as well as in 
bacteria and archaea. The proton gradient needed to 
drive ATP synthesis is produced by other types of pro- 
ton pumps powered by substrate oxidation or sunlight. 
We provide a detailed description of these processes in 
Chapter 19. 

V-type ATPases, a class of proton-transporting 
ATPases structurally (and possibly mechanistically) re- 
lated to the F-type ATPases, are responsible for acidify- 
ing intracellular compartments in many organisms 
(thus Vior vacuolar). Proton pumps of this type main- 
tain the vacuoles of fungi and higher plants at a pH be- 
tween 3 and 6, well below that of the surrounding 
cytosol (pH 7.5). V-type ATPases are also responsible 
for the acidification of lysosomes, endosomes, the Golgi 
complex, and secretory vesicles in animal cells. All V- 
type ATPases have a similar complex structure, with an 
integral (transmembrane) domain (V 0 ) that serves as a 
proton channel and a peripheral domain (V x ) that con- 
tains the ATP-binding site and the ATPase activity. The 
mechanism by which V-type ATPases couple ATP hy- 
drolysis to the uphill transport of protons is not under- 
stood in detail. 
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FIGURE 11-40 Reversibility of F-type ATPases. An ATP-driven proton 
transporter also can catalyze ATP synthesis (red arrows) as protons flow 
down their electrochemical gradient. This is the central reaction in the 
processes of oxidative phosphorylation and photophosphorylation, 
both described in detail in Chapter 19. 
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FIGURE 1 1 —41 An ABC transporter of F. co//. The vitamin B 12 importer 
BtuCD (PDB ID 1 L7V) is a homodimer with 1 0 transmembrane helical 
domains (blue) in each monomer and two nucleotide-binding domains 
(NBDs; red) that extend into the cytoplasm. The residues involved in 
ATP binding and hydrolysis are shown as ball-and-stick structures. 



ABC Transporters Use ATP to Drive the Active Transport 
of a Wide Variety of Substrates 

EH ABC transporters (Fig. 11-41) constitute a 
HI large family of ATP-dependent transporters that 
pump amino acids, peptides, proteins, metal ions, vari- 
ous lipids, bile salts, and many hydrophobic compounds, 
including drugs, out of cells against a concentration gra- 
dient. One ABC transporter in humans, the multi-drug 
transporter (MDR1), is responsible for the striking 
resistance of certain tumors to some generally effective 
antitumor drugs. MDR1 has a broad substrate specificity 
for hydrophobic compounds, including, for example, the 
chemotherapeutic drugs adriamycin, doxorubicin, and 
vinblastine. By pumping these drugs out of the cell, the 
transporter prevents their accumulation within a tumor 
and thus blocks their therapeutic effects. MDR1 is an in- 
tegral membrane protein (M T 170,000) with 12 trans- 
membrane segments and two ATP-binding domains 
("cassettes"), which give the family its name: ^TP-bind- 
ing cassette transporters. ■ 

All ABC transporters have two nucleotide-binding do- 
mains (NBDs) and two transmembrane domains. In some 
cases, all these domains are in a single long polypeptide; 
other ABC transporters have two subunits, each con- 
tributing an NBD and a domain with six (or in some cases 
10; Fig. 11-41) transmembrane helices. Many ABC trans- 
porters are in the plasma membrane, but some types are 
also found in the endoplasmic reticulum and in the mem- 
branes of mitochondria and lysosomes. Most ABC trans- 
porters act as pumps, but at least some members of the 
superfamily act as ion channels that are opened and 
closed by ATP hydrolysis. The CFTR transporter (see Box 
11-3) is a Cl~ channel operated by ATP hydrolysis. 

The NBDs of all ABC proteins are similar in se- 
quence and presumably in three-dimensional structure; 
they are the conserved molecular motor that can be 



coupled to a wide variety of pumps and channels. When 
coupled with a pump, the ATP-driven motor moves 
solutes against a concentration gradient; when coupled 
with an ion channel, the motor opens and closes the 
channel, using ATP as energy source. The stoichiometry 
of ABC pumps is about one ATP hydrolyzed per molecule 
of substrate transported, but neither the mechanism of 
coupling nor the site of substrate binding is known. 

I Some ABC transporters have very high specificity 
fl| for a single substrate; others are more promiscuous. 
The human genome contains at least 48 genes that encode 
ABC transporters, many of which are involved in maintain- 
ing the lipid bilayer and in transporting sterols, sterol 
derivatives, and fatty acids throughout the body. The flip- 
pases that move membrane lipids from one leaflet of the bi- 
layer to the other are ABC transporters, and the cellular 
machinery for exporting excess cholesterol includes an 
ABC transporter. Mutations in the genes that encode 
some of these proteins contribute to several genetic dis- 
eases, including cystic fibrosis (Box 11-3), Tangier disease 
(p. 843), retinal degeneration, anemia, and liver failure. 

ABC transporters are also present in simpler animals 
and in plants and microorganisms. Yeast has 31 genes 
that encode ABC transporters, Drosophila has 56, and 
E. coli has 80, representing 2% of its entire genome. 
The presence of ABC transporters that confer antibiotic 
resistance in pathogenic microbes (Pseudomonas 
aeruginosa, Staphylococcus aureus, Candida albi- 
cans, Neisseria gonorrhoeae, and Plasmodium falci- 
parum) is a serious public health concern and makes 
these transporters attractive targets for drug design. ■ 

Ion Gradients Provide the Energy for Secondary Active Transport 

The ion gradients formed by primary transport of Na + or 
H + can in turn provide the driving force for cotransport 
of other solutes. Many cell types contain transport sys- 
tems that couple the spontaneous, downhill flow of 
these ions to the simultaneous uphill pumping of an- 
other ion, sugar, or amino acid (Table 11-4). 

Cotransport Systems Driven by Gradients of 
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BOX 1 1-3 i£ ; - ; A Defective Ion Channel in Cystic Fibrosis 



Cystic fibrosis (CF) is a serious and relatively common 
hereditary disease of humans. About 5% of white Amer- 
icans are carriers, having one defective and one normal 
copy of the gene. Only individuals with two defective 
copies show the severe symptoms of the disease: ob- 
struction of the gastrointestinal and respiratory tracts, 
commonly leading to bacterial infection of the airways 
and death due to respiratory insufficiency before the 
age of 30. In CF, the thin layer of mucus that normally 
coats the internal surfaces of the lungs is abnormally 
thick, obstructing air flow and providing a haven for 
pathogenic bacteria, particularly Staphylococcus au- 
reus and Pseudomonas aeruginosa. 

The defective gene in CF patients was discovered in 
1989. It encodes a membrane protein called cystic/ibrosis 
transmembrane conductance regulator, or CFTR. This 
protein has two segments, each containing six transmem- 
brane helices, two nucleotide-binding domains (NBDs), 
and a regulatory region (Fig. 1). CFTR is therefore very 
similar to other ABC transporter proteins. The normal 
CFTR protein proved to be an ion channel specific for CF 
ions. The channel conducts CF across the plasma mem- 
brane when both NBDs have bound ATP, and it closes 

FIGURE 1 Three states of the cystic fibrosis transmembrane conduc- 
tance regulator, CFTR. The protein has two segments, each with six 
transmembrane helices, and three functionally significant domains 
extend from the cytoplasmic surface: NBDt and NBD 2 (green) are 
nucleotide-binding domains that bind ATP, and a regulatory domain 
(blue) is the site of phosphorylation by cAMP-dependent protein 
kinase. When this R domain is phosphorylated but no ATP is bound to 
the NBDs (left), the channel is closed. The binding of ATP opens the 
channel (middle) until the bound ATP is hydrolyzed. When the regu- 
latory domain is unphosphorylated (right), it binds the NBD domains 
and prevents ATP binding and channel opening. The most commonly 
occurring mutation leading to CF is the deletion of Phe 508 in the 
NBD-, domain (left), CFTR is a typical ABC transporter in all but two 
respects: most ABC transporters lack the regulatory domain, and CFTR 
acts as an ion channel (for Cl~), not as a typical transporter. 




FIGURE 2 Mucus lining the surface of the lungs traps bacteria. In 
healthy lungs (shown here), these bacteria are killed and swept away 
by the action of cilia. In CF, this mechanism is impaired, resulting in re- 
curring infections and progressive damage to the lungs. 

when the ATP on one of the NBDs is broken down to ADP 
and Pi. The CF channel is further regulated by phospho- 
rylation of several Ser residues in the regulatory domain, 
catalyzed by cAMP-dependent protein kinase (Chapter 
12). When the regulatory domain is not phosphorylated, 
the CF channel is closed. The mutation responsible for 
CF in 70% of cases results in deletion of a Phe residue at 
position 508. The mutant protein folds incorrectly, which 
interferes with its insertion in the plasma membrane, re- 
sulting in reduced CF movement across the plasma 
membranes of epithelial cells that line the airways (Fig. 
2), the digestive tract, and exocrine glands (pancreas, 
sweat glands, bile ducts, and vas deferens). 

Diminished export of CF is accompanied by dimin- 
ished export of water from cells, causing the mucus on 
their surfaces to become dehydrated, thick, and exces- 
sively sticky. In normal circumstances, cilia on the epithe- 
lial cells that line the inner surface of the lungs constantly 
sweep away bacteria that settle in this mucus, but the 
thick mucus in individuals with CF hinders this process. 
Frequent infections by bacteria such as S. aureus and P 
aeruginosa result, causing progressive damage to the 
lungs and reduced respiratory efficiency. Respiratory fail- 
ure is commonly the cause of death in people with CF. 
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FIGURE 11-42 Lactose uptake in E. coli. (a) The primary transport of 
H + out of the cell, driven by the oxidation of a variety of fuels, estab- 
lishes both a proton gradient and an electrical potential (inside nega- 
tive) across the membrane. Secondary active transport of lactose into 
the cell involves symport of H + and lactose by the lactose transporter. 
The uptake of lactose against its concentration gradient is entirely 
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dependent on this inflow of protons driven by the electrochemical gra- 
dient, (b) When the energy-yielding oxidation reactions of metabolism 
are blocked by cyanide (CN'), the lactose transporter allows equilibra- 
tion of lactose across the membrane via passive transport. Mutations 
that affect Glu 325 or Arg 302 have the same effect as cyanide. The dashed 
line represents the concentration of lactose in the surrounding medium. 



The lactose transporter (lactose permease) of 

E. coli is the well-studied prototype for proton-driven 
cotransporters. This protein consists of a single polypep- 
tide chain (417 residues) that functions as a monomer to 
transport one proton and one lactose molecule into the 
cell, with the net accumulation of lactose (Fig. 11-42). 
E. coli normally produces a gradient of protons and 
charge across its plasma membrane by oxidizing fuels 
and using the energy of oxidation to pump protons out- 
ward. (This mechanism is discussed in detail in Chapter 
19.) The lipid bilayer is impermeable to protons, but the 
lactose transporter provides a route for proton reentry, 
and lactose is simultaneously carried into the cell by 
symport. The endergonic accumulation of lactose is 
thereby coupled to the exergonic flow of protons into the 
cell, with a negative overall free-energy change. 

The lactose transporter is one member of the ma- 
jor facilitator superfamily (MFS) of transporters, 
which comprises 28 families. Almost all proteins in this 



superfamily have 12 transmembrane domains (the few 
exceptions have 14). The proteins share relatively little 
sequence homology, but the similarity of their secondary 
structures and topology suggests a common tertiary 
structure. The crystallographic solution of the E. coli lac- 
tose transporter may provide a glimpse of this general 
structure (Fig. ll-43a). The protein has 12 transmem- 
brane helices, and connecting loops protrude into the cy- 
toplasm or the periplasmic space (between the plasma 
membrane and outer membrane or cell wall). The six 
amino-terminal and six carboxyl-terminal helices form 
very similar domains, to produce a structure with a rough 
twofold symmetry. In the crystallized form of the protein, 
a large aqueous cavity is exposed on the cytoplasmic side 
of the membrane. The substrate-binding site is in this 
cavity, more or less in the middle of the membrane. The 
side of the transporter facing outward (the periplas- 
mic face) is closed tightly, with no channel big enough 
for lactose to enter. The proposed mechanism for 
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FIGURE 11-43 The lactose transporter (lactose permease) of f. coli. 

(a) Ribbon representation viewed parallel to the plane of the membrane 
shows the 12 transmembrane helices arranged in two nearly symmetric 
domains, shown in different shades of purple. In the form of the protein 
for which the crystal structure was determined, the substrate sugar (red) 
is bound near the middle of the membrane where it is exposed to the cy- 
toplasm (derived from PDB ID 1 PV7). (b) The postulated second confor- 



mation of the transporter, related to the first by a large, reversible confor- 
mational change in which the substrate-binding site is exposed first to 
the periplasm (conformation on the right), where lactose is picked up, 
then to the cytoplasm (conformation on the left), where the lactose is re- 
leased. The interconversion of the two forms is driven by changes in the 
pairing of charged (protonatable) side chains such as those of Glu 325 and 
Arg 302 (green), which is affected by the transmembrane proton gradient. 
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transmembrane passage of the substrate (Fig. ll-43b) 
involves a rocking motion between the two domains, 
driven by substrate binding and proton movement, alter- 
nately exposing the substrate-binding domain to the cy- 
toplasm and to the periplasm. This "rocking banana" 
model is similar to that shown in Figure 11-31 for GLUT1. 

How is proton movement into the cell coupled with 
lactose uptake? Extensive genetic studies of the lactose 
transporter have established that of the 417 residues in 
the protein, only 6 are absolutely essential for cotransport 
of H + and lactose — some for lactose binding, others for 
proton transport. Mutation in either of two residues 
(Glu 325 and Arg 302 ; Fig. 11-43) results in a protein still 
able to catalyze facilitated diffusion of lactose but inca- 
pable of coupling H + flow to uphill lactose transport. A 
similar effect is seen in wild-type (unmutated) cells when 
their ability to generate a proton gradient is blocked with 
CN~: the transporter carries out facilitated diffusion nor- 
mally, but it cannot pump lactose against a concentration 
gradient (Fig. ll-42b). The balance between the two 
conformations of the lactose transporter is affected by 
changes in charge pairing between side chains. 

In intestinal epithelial cells, glucose and certain 
amino acids are accumulated by symport with Na + , 
down the Na + gradient established by the Na + K + ATP- 
ase of the plasma membrane (Fig. 11-44). The apical 
surface of the intestinal epithelial cell is covered with 
microvilli, long thin projections of the plasma membrane 
that greatly increase the surface area exposed to the 
intestinal contents. Na + -glucose symporters in the 
apical plasma membrane take up glucose from the intes- 
tine in a process driven by the downhill flow of Na + : 

2Nao Ut + glucose out >2Na£ + glucose in 
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FIGURE 11-44 Glucose transport in intestinal epithelial cells. Glu- 
cose is cotransported with Na + across the apical plasma membrane 
into the epithelial cell. It moves through the cell to the basal surface, 
where it passes into the blood via GLUT2, a passive glucose uniporter. 
The Na + K + ATPase continues to pump Na + outward to maintain the 
Na + gradient that drives glucose uptake. 



The energy required for this process comes from two 
sources: the greater concentration of Na + outside than 
inside (the chemical potential) and the membrane 
(electrical) potential, which is inside negative and 
therefore draws Na + inward. 



WORKED EXAMPLE 11-3 



[glucose] h 



Energetics of Pumping 
by Symport 

- ratio that can be 



Calculate the maximum r 

[glucose] out 

achieved by the plasma membrane Na + -glucose sym- 
porter of an epithelial cell, when [Na + ] in is 12 mM, 



[Na^ 



is 145 mM, the membrane potential is -50 mV 



(inside negative), and the temperature is 37 °C. 

Solution: Using Equation 11-4 (p. 396), we can calcu- 
late the energy inherent in an electrochemical Na + 
gradient— that is, the cost of moving one Na + ion up 
this gradient: 

AG t = RT\n l ^^ + ZJA<p 
[Na] in 

We then substitute standard values for R, T, and J, and 
the given values for [Na + ] (expressed as molar concen- 
trations), + 1 for Z (because Na + has a positive 
charge), and 0.050 V for A(/>. Note that the membrane 
potential is -50 mV (inside negative), so the change in 
potential when an ion moves from inside to outside is 
50 mV. 



AG t = (8.315 J/mol-K)(310 K)ln 



1.45 X 1(T 



1.2 X 10" 2 
+ 1(96,500 J/V-mol)(0.050 V) 

= 11.2kJ/mol 

This AG t is the potential energy per mole of Na + in the 
Na + gradient that is available to pump glucose. Given 
that two Na + ions pass down their electrochemical gra- 
dient and into the cell for each glucose carried in by 
symport, the energy available to pump 1 mol of glucose 
is 2 X 11.2 kJ/mol = 22.4 kJ/mol. We can now calculate 
the concentration ratio of glucose that can be achieved 
by this pump (from Equation 11-3, p. 396): 



AG t = RT In 



[glucose]^ 



[glucose] ou t 

Rearranging, then substituting the values of AG t , R, and 
T, gives 



in 



[glucose] in AG t 



22.4kJ/mol 



[glucose] out R,T (8.315 J/mol-K)(310 K) 
[glucose] in 869 



= 8.69 



[glucose] out 



= 5.94 X 10 s 

Thus the cotransporter can pump glucose inward until 
its concentration inside the epithelial cell is about 6,000 
times that outside (in the intestine). 
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As glucose is pumped from the intestine into the ep- 
ithelial cell at the apical surface, it is simultaneously 
moved from the cell into the blood by passive transport 
through a glucose transporter (GLUT2) in the basal sur- 
face (Fig. 11-44). The crucial role of Na + in symport 
and antiport systems such as this requires the continued 
outward pumping of Na + to maintain the transmem- 
brane Na + gradient. 

■ Because of the essential role of ion gradients in 
active transport and energy conservation, com- 
pounds that collapse ion gradients across cellular 
membranes are effective poisons, and those that are 
specific for infectious microorganisms can serve as an- 
tibiotics. One such substance is valinomycin, a small 
cyclic peptide that neutralizes the K + charge by sur- 
rounding it with six carbonyl oxygens (Fig. 11-45). 
The hydrophobic peptide then acts as a shuttle, carry- 
ing K + across membranes down its concentration gra- 
dient and deflating that gradient. Compounds that 
shuttle ions across membranes in this way are called 
ionophores ("ion bearers"). Both valinomycin and 
monensin (a Na + -carrying ionophore) are antibiotics; 
they kill microbial cells by disrupting secondary trans- 
port processes and energy-conserving reactions. Mon- 
ensin is widely used as an antifungal and antiparasitic 
agent. ■ 




FIGURE 11-45 Valinomycin, a peptide ionophore that binds K + . In this 
image, the surface contours are shown as a transparent mesh, through 
which a stick structure of the peptide and a K + atom (green) are visible. 
The oxygen atoms (red) that bind K + are part of a central hydrophilic 
cavity. Hydrophobic amino acid side chains (yellow) coat the outside 
of the molecule. Because the exterior of the K + -valinomycin complex 
is hydrophobic, the complex readily diffuses through membranes, car- 
rying K + down its concentration gradient The resulting dissipation of 
the transmembrane ion gradient kills microbial cells, making valino- 
mycin a potent antibiotic. 



Aquaporins Form Hydrophilic Transmembrane 
Channels for the Passage of Water 




A family of integral membrane 
proteins discovered by Peter 
Agre, the aquaporins (AQPs), 

provide channels for rapid 
movement of water molecules 
across all plasma membranes. 
Eleven aquaporins are known in 
mammals, each with a specific 
localization and role (Table 11-5, 
p. 406). Erythrocytes, which 
swell or shrink rapidly in re- 



Peter Agre , 

sponse to abrupt changes m ex- 
tracellular osmolarity as blood travels through the renal 
medulla, have a high density of aquaporin in their plasma 
membrane (2 X 10 5 copies of AQP-1 per cell). Water se- 
cretion by the exocrine glands that produce sweat, saliva, 
and tears occurs through aquaporins. Seven different 
aquaporins play roles in urine production and water reten- 
tion in the nephron (the functional unit of the kidney). 
Each renal AQP has a specific localization in the nephron, 
and each has specific properties and regulatory features. 
For example, AQP-2 in the epithelial cells of the renal col- 
lecting duct is regulated by vasopressin (also called anti- 
diuretic hormone): more water is reabsorbed in the kidney 
when the vasopressin level is high. Mutant mice with no 
AQP-1 gene have increased urine output (polyuria) and 
decreased urine-concentrating ability, the result of de- 
creased water permeability of the proximal tubule. In hu- 
mans, genetically defective AQPs are known to be 
responsible for a variety of diseases, including a relatively 
rare form of diabetes that is accompanied by polyuria 
(Box 11-2). 

Aquaporins are found in all organisms. The plant 
Arabidopsis thaliana has 38 AQP genes, reflecting the 
critical roles of water movement in plant physiology. 
Changes in turgor pressure, for example, require rapid 
movement of water across a membrane (see p. 52). 

Water molecules flow through an AQP-1 channel at 
a rate of about 10 9 s" 1 . For comparison, the highest 
known turnover number for an enzyme is that for cata- 
lase, 4 x 10 r s"" 1 , and many enzymes have turnover 
numbers between 1 s" 1 and 10 4 s" 1 (see Table 6-7). 
The low activation energy for passage of water through 
aquaporin channels (AG* < 15 kJ/mol) suggests that wa- 
ter moves through the channels in a continuous stream, 
in the direction dictated by the osmotic gradient. (For a 
discussion of osmosis, see p. 52.) Aquaporins do not al- 
low passage of protons (hydronium ions, H 3 0 + ), which 
would collapse membrane electrochemical gradients. 
What is the basis for this extraordinary selectivity? 

We find an answer in the structure of AQP-1, as de- 
termined by x-ray crystallography. AQP-1 (Fig. ll-46a) 
consists of four identical monomers (each M r 28,000), 
each of which forms a transmembrane pore with a 
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FIGURE 11-46 Aquaporin. The protein is a tetramer of identical sub- 
units, each with a transmembrane pore, (a) A monomer of 
spinach aquaporin SoPIP2;1 (derived from PDB ID 2B5F), viewed in 
the plane of the membrane. The helices form a central pore, and two 
short helical segments (green) contain the Asn-Pro-Ala (NPA) sequences, 
found in all aquaporins, that form part of the water channel, (b) This 
cartoon of bovine aquaporin 1 (derived from PDB ID 1 J4N) shows that 
the pore (brown; filled with water molecules shown in red and white) 



narrows at His 180 to a diameter of 2.8 A (about the size of a water mol- 
ecule), limiting passage of molecules larger than H 2 0. The positive 
charge of Arg 195 repels cations, including H 3 0 + , preventing their pas- 
sage through the pore. The two short helices shown in green are ori- 
ented with their positively charged dipoles pointed at the pore in such 
a way as to force a water molecule to reorient as it passes through; this 
breaks up hydrogen-bonded chains of water molecules, preventing 
proton passage by "proton hopping" (see Fig. 2-13). 



diameter sufficient to allow passage of water molecules in 
single file. Each monomer has six transmembrane helical 
segments and two shorter helices, both of which contain 
the sequence Asn-Pro-Ala (NPA). The six transmem- 
brane helices form the pore through the monomer, and 
the two short loops containing the NPA sequences extend 
toward the middle of the bilayer from opposite sides. 
Their NPA regions overlap in the middle of the membrane 
to form part of the specificity filter — the structure that al- 
lows only water to pass (Fig. ll-46b). 

The water channel narrows to a diameter of 2.8 A 
near the center of the membrane, severely restricting 
the size of molecules that can travel through. The posi- 
tive charge of a highly conserved Arg residue at this bot- 
tleneck discourages the passage of cations such as 
H 3 0 + The residues that line the channel of each AQP-1 
monomer are generally nonpolar, but carbonyl oxygens 
in the peptide backbone, projecting into the narrow part 
of the channel at intervals, can hydrogen-bond with in- 
dividual water molecules as they pass through; the two 
Asn residues (Asn 76 and Asn 192 ) in the NPA loops also 
form hydrogen bonds with the water. The structure of 
the channel does not permit formation of a chain of wa- 



ter molecules close enough to allow proton hopping (see 
Fig. 2-13), which would effectively move protons across 
the membrane. Critical Arg and His residues and elec- 
tric dipoles formed by the short helices of the NPA loops 
provide positive charges in positions that repel any pro- 
tons that might leak through the pore, and prevent hy- 
drogen bonding between adjacent water molecules. 

An aquaporin isolated from spinach is known to be 
"gated" — open when two critical Ser residues near the 
intracellular end of the channel are phosphorylated, and 
closed when they are dephosphorylated. Both the open 
and closed structures have been determined by crystal- 
lography. Phosphorylation favors a conformation that 
presses two nearby Leu residues and a His residue into 
the channel, blocking the movement of water past that 
point and effectively closing the channel. Other aqua- 
porins are regulated in other ways, allowing rapid 
changes in membrane permeability to water. 

Although generally highly specific for water, some 
AQPs also allow glycerol or urea to pass at high rates 
(Table 11-5); these AQPs are believed to be important 
in the metabolism of glycerol. AQP-7, for example, 
found in the plasma membranes of adipocytes (fat 
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I^J^^JJ Permeability Characteristics and Predominant Distribution of Known Mammalian Aquaporins 


Amiftnorin 




Tissue distribution 


Subcellular distribution* 


AQP-0 


Water (low) 


Lens 


Plasma membrane 


AQP-1 


Water (high) 


Erythrocyte, kidney, lung, vascular 
endothelium, brain, eye 


Plasma membrane 


AQP-2 


Water (high) 


Kidney, vas deferens 


Apical plasma membrane, 
intracellular vesicles 


AQP-3 


Water (high), glycerol 


Kidney, skin, lung, eye, colon 


Basolateral plasma membrane 




(high), urea (moderate) 




AQP-4 


Water (high) 


Brain, muscle, kidney, lung, stomach, 
small intestine 


Basolateral plasma membrane 


AQP-5 


Water (high) 


Salivarv t?land lacrimal sland ^wpat 
gland, lung, cornea 


JT.pn^al picLolUa ILLclUUidllc 


AQP-6 


Water (low) , anions 
(N0 3 " > CI") 


Kidney 


Tntrar'plliilpr vptiiplpQ 

J.1 LLl CLVj Cll LLLCLL V COlLlCO 


AQP-7 


Water (high), glycerol 
(high) > urea (high) , 
arsenite 


Adipose tissue, kidney, testis 


Plasma membrane 


AQP-8 r 


Water (high) 


Testis, kidney, liver, pancreas, small 
intestine, colon 


Plasma membrane, intracellular 
vesicles 


AQP-9 


Water (low), glycerol 
(high), urea (high), 
arsenite 


Liver, leukocyte, brain, testis 


Plasma membrane 


AQP-10 


Water (low), glycerol 
(high), urea (high) 


Small intestine 


Intracellular vesicles 



Source: Data from King, L.S., Kozono, D., & Agre, R (2004) From structure to disease: the evolving tale of aquaporin biology. Nat. Rev. 5, 688. 

♦Aquaporins that are present primarily in the apical or in the basolateral membrane are noted as localized in one of these membranes; those 
present in both membranes are described as localized in the plasma membrane. 

f AQP-8 might also be permeated by urea. 



cells), transports glycerol efficiently. Mice with defec- 
tive AQP-7 develop obesity and adult-onset diabetes, 
presumably as a result of their inability to move glycerol 
into or out of adipocytes as triacylglycerols are con- 
verted to free fatty acids and glycerol, and vice versa. 

Ion-Selective Channels Allow Rapid Movement 
of Ions across Membranes 

Ion-selective channels — first recognized in neurons 
and now known to be present in the plasma membranes 
of all cells, as well as in the intracellular membranes of 
eukaryotes — provide another mechanism for moving in- 
organic ions across membranes. Ion channels, together 
with ion pumps such as the Na + K + ATPase, determine a 
plasma membrane's permeability to specific ions and reg- 
ulate the cytosolic concentration of ions and the mem- 
brane potential. In neurons, very rapid changes in the 
activity of ion channels cause the changes in membrane 
potential (action potentials) that carry signals from one 
end of a neuron to the other. In myocytes, rapid opening 
of Ca 2+ channels in the sarcoplasmic reticulum releases 
the Ca 2+ that triggers muscle contraction. We discuss 
the signaling functions of ion channels in Chapter 12. 
Here we describe the structural basis for ion-channel 



function, using as examples a voltage-gated K + channel, 
the neuronal Na + channel, and the acetylcholine recep- 
tor ion channel 

Ion channels are distinct from ion transporters in 
at least three ways. First, the rate of flux through chan- 
nels can be several orders of magnitude greater than 
the turnover number for a transporter — 10 7 to 10 s 
ions/s for an ion channel, approaching the theoretical 
maximum for unrestricted diffusion. By contrast, the 
turnover rate of the Na + K + ATPase is about 100 s" 1 ! 
Second, ion channels are not saturable: rates do not 
approach a maximum at high substrate concentration. 
Third, they are gated in response to some cellular 
event. In ligand-gated channels (which are generally 
oligomeric), binding of an extracellular or intracellular 
small molecule forces an allosteric transition in the 
protein, which opens or closes the channel. In volt- 
age-gated ion channels, a change in transmembrane 
electrical potential (7 m ) causes a charged protein do- 
main to move relative to the membrane, opening or 
closing the channel. Both types of gating can be very 
fast. A channel typically opens in a fraction of a mil- 
lisecond and may remain open for only milliseconds, 
making these molecular devices effective for very fast 
signal transmission in the nervous system. 
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ion-Channe! Function is Measured Electrically 

Because a single ion channel typically remains open for 
only a few milliseconds, monitoring this process is be- 
yond the limit of most biochemical measurements. Ion 
fluxes must therefore be measured electrically, either as 
changes in V m (in the millivolt range) or as electric cur- 
rents / (in the microampere or picoampere range), using 
microelectrodes and appropriate amplifiers. In patch- 
clamping, a technique developed by Erwin Neher and 
Bert Sakmann in 1976, very small currents are meas- 
ured through a tiny region of the membrane surface 
containing only one or a few ion-channel molecules 
(Fig. 11-47). The researcher can measure the size and 
duration of the current that flows during one opening of 
an ion channel and can determine how often a channel 

Channel 




Patch of membrane 
placed in aqueous 
solution 



Micropipette 
Electrodes 



Electronics to hold transmembrane 
potential (V m ) constant and measure 
current flowing across membrane 



FIGURE 11-47 Electrical measurements of ion-channel function. The 

"activity" of an ion channel is estimated by measuring the flow of ions 
through it using the patch-clamp technique A finely drawn-out 
pipette (micropipette) is pressed against the cell surface, and negative 
pressure in the pipette forms a pressure seal between pipette and mem- 
brane. As the pipette is pulled away from the cell, it pulls off a tiny 
patch of membrane {which may contain one or a few ion channels) Af- 
ter placing the pipette and attached patch in an aqueous solution, the 
researcher can measure channel activity as the electric current that 
flows between the contents of the pipette and the aqueous solution In 
practice, a circuit is set up that "clamps" the transmembrane potential 
at a given value and measures the current that must flow to maintain 
this voltage. With highly sensitive current detectors, researchers can 
measure the current flowing through a single ion channel, typically a 
few picoamperes. The trace showing the current as a function of time 
(in milliseconds) reveals how fast the channel opens and closes, how 
frequently it opens, and how long it stays open. Clamping the V m at dif- 
ferent values permits determination of the effect of membrane poten- 
tial on these parameters of channel function. 



opens and how that frequency is affected by membrane 
potential, regulatory ligands, toxins, and other agents. 
Patch-clamp studies have revealed that as many as 10 4 
ions can move through a single ion channel in 1 ms. Such 
an ion flux represents a huge amplification of the initial 
signal; for example, only two acetylcholine molecules 
are needed to open an acetylcholine receptor channel 
(as described below). 




Erwin Neher 



Bert Sakmann 




Roderick MacKinnon 



The Structure of a K + Channel Reveals 
the Basis for Its Specificity 

The structure of a potassium 
channel from the bacterium 
Streptomyces lividans, deter- 
mined crystallographically by 
Roderick MacKinnon in 1998, 
provides much insight into the 
way ion channels work. This 
bacterial ion channel is related 
in sequence to all other known 
K + channels and serves as the 
prototype for such channels, 
including the voltage-gated K + 
channel of neurons. Among the 
members of this protein family, the similarities in se- 
quence are greatest in the "pore region," which contains 
the ion selectivity filter that allows K + (radius 1.33 A) to 
pass 10,000 times more readily than Na + (radius 0.95 A)— 
at a rate (about 10 s ions/s) approaching the theoretical 
limit for unrestricted diffusion. 

The K + channel consists of four identical subunits 
that span the membrane and form a cone within a cone 
surrounding the ion channel, with the wide end of 
the double cone facing the extracellular space (Fig. 
11-48). Each subunit has two transmembrane a he- 
lices as well as a third, shorter helix that contributes 
to the pore region. The outer cone is formed by one of 
the transmembrane helices of each subunit. The inner 
cone, formed by the other four transmembrane he- 
lices, surrounds the ion channel and cradles the ion 
selectivity filter. 

Both the ion specificity and the high flux through 
the channel are understandable from what we know of 
the channel's structure. At the inner and outer plasma 
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K + with hydrating 
water molecules 

(c) 



FIGURE 11-48 The K + channel of Streptomyces lividans. (PDB ID 
1 BL8) (a) Viewed in the plane of the membrane, the channel consists 
of eight transmembrane helices (two from each of four identical sub- 
units), forming a cone with its wide end toward the extracellular space. 
The inner helices of the cone (lighter colored) line the transmembrane 
channel, and the outer helices interact with the lipid bilayer, Short seg- 
ments of each subunit converge in the open end of the cone to make a 
selectivity filter, (b) This view, perpendicular to the plane of the mem- 
brane, shows the four subunits arranged around a central channel just 
wide enough for a single K + ion to pass, (c) Diagram of a K + channel 
in cross section, showing the structural features critical to function. 
(See also Fig. 1 1-49.,) 

membrane surfaces, the entryways to the channel have 
several negatively charged amino acid residues, which 
presumably increase the local concentration of cations 
such as K + and Na + . The ion path through the mem- 
brane begins (on the inner surface) as a wide, water- 
filled channel in which the ion can retain its hydration 



sphere. Further stabilization is provided by the short 
helices in the pore region of each subunit, with the par- 
tial negative charges of their electric dipoles pointed at 
K + in the channel. About two-thirds of the way through 
the membrane, this channel narrows in the region of the 
selectivity filter, forcing the ion to give up its hydrating 
water molecules. Carbonyl oxygen atoms in the back- 
bone of the selectivity filter replace the water molecules 
in the hydration sphere, forming a series of perfect coor- 
dination shells through which the K + moves. This favor- 
able interaction with the filter is not possible for Na + , 
which is too small to make contact with all the potential 
oxygen ligands. The preferential stabilization of K + is 
the basis for the ion selectivity of the filter, and muta- 
tions that change residues in this part of the protein 
eliminate the channel's ion selectivity. The K + -binding 
sites of the filter are flexible enough to collapse to fit any 
Na + that enters the channel, and this conformational 
change closes the channel. 

There are four potential K + -binding sites along the 
selectivity filter, each composed of an oxygen "cage" 
that provides ligands for the K + ions (Fig. 11-49). In 
the crystal structure, two K + ions are visible within the 
selectivity filter, about 7.5 A apart, and two water mole- 
cules occupy the unfilled positions. K + ions pass 
through the filter in single file; their mutual electrostatic 
repulsion most likely just balances the interaction of 
each ion with the selectivity filter and keeps them mov- 
ing. Movement of the two K + ions is concerted: first they 
occupy positions 1 and 3, then they hop to positions 2 
and 4 (Fig. ll-48c). The energetic difference between 




FIGURE 11-49 K + binding sites in the selectivity pore of the K + chan- 
nel. (PDB ID 1J95) Carbonyl oxygens (red) of the peptide backbone in 
the selectivity filter protrude into the channel, interacting with and sta- 
bilizing a K + ion passing through. These ligands are perfectly posi- 
tioned to interact with each of four K + ions, but not with the smaller 
Na + ions. This preferential interaction with K + is the basis for the ion 
selectivity. The mutual repulsion between K + ions results in occupa- 
tion of only two of the four K + sites at a time (both green or both blue) 
and counteracts the tendency for a lone K + to stay bound in one site. 
The combined effect of K + binding to carbonyl oxygens and repulsion 
between K + ions ensures that each ion keeps moving, changing posi- 
tions within 1 0 to 1 00 ns, and that there are no large energy barriers to 
ion flow through the membrane. 
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these two configurations (1, 3 and 2, 4) is very small; en- 
ergetically, the selectivity pore is not a series of hills and 
valleys but a flat surface, which is ideal for rapid ion 
movement through the channel. The structure of the 
channel seems to have been optimized during evolution 
to give maximal flow rates and high specificity. 

Voltage-gated K + channels are more complex struc- 
tures than that illustrated in Figure 11-48, but they are 
variations on the same theme. For example, the mam- 
malian voltage-gated K + channels in the Shaker family 
have an ion channel like that of the bacterial channel 
shown in Figure 11-48, but with additional protein do- 
mains that sense the membrane potential, move in re- 
sponse to a change in potential, and in moving trigger 



the opening or closing of the K + channel (Fig. 11-50). 
The critical transmembrane helix in the voltage-sensing 
domain of Shaker K + channels contains four Arg 
residues; the positive charges on these residues cause 
the helix to move relative to the membrane in response 
to changes in the transmembrane electrical field (the 
membrane potential). 

Cells also have channels that specifically conduct 
Na + or Ca 2+ , and exclude K + . In each case, the ability to 
discriminate among cations requires both a cavity in the 
binding site of just the right size (neither too large nor 
too small) to accommodate the ion and the precise posi- 
tioning within the cavity of carbonyl oxygens that can 
replace the ion's hydration shell. This fit can be achieved 
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FIGURE 11-50 Structural basis for voltage gating in the K + channel. 

(PDB ID 2A79) This crystal structure of the Kv1 .2-/32 subunit complex 
from rat brain shows the basic K + channel (corresponding to that shown 
in Fig. 1 1^8) with the extra machinery necessary to make the channel 
sensitive to gating by membrane potential: four transmembrane helical 
extensions of each subunit and four j3 subunits. The entire complex, 
viewed (a) in the plane of the membrane and (b) perpendicular to the 
membrane plane (as viewed from outside the membrane), is represented 
as in Figure 1 1^18, with each subunit in a different color; each of the four 
p subunits is colored like the subunit with which it associates. In (b), each 
transmembrane helix of one subunit (red) is numbered, S1 to S6. S5 and 
S6 from each of four subunits form the channel itself, and are compara- 
ble to the two transmembrane helices of each subunit in Figure 1 1-48. S1 
to S4 are four transmembrane helices. The S4 helix contains the highly 



conserved Arg residues and is believed to be the chief moving part of the 
voltage-sensing mechanism, (c) A schematic diagram of the voltage- 
gated channel, showing the basic pore structure (center) and the extra 
structures that make the channel voltage-sensitive; S4, the Arg-containing 
helix, is orange. For clarity, the /3 subunits are not shown in this view. 
Normally, the transmembrane electrical potential (inside negative) exerts 
a pull on positively charged Arg side chains in S4, toward the cytosolic 
side. When the membrane is depolarized the pull is lessened, and with 
complete reversal of the membrane potential, S4 is drawn toward the ex- 
tracellular side, (d) This movement of S4 is physically coupled to opening 
and closing of the K + channel, which is shown here in its open and 
closed conformations. Although K + is present in the closed channel, the 
pore closes on the bottom, near the cytosol, preventing K + passage. 
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with molecules smaller than proteins; for example, vali- 
nomycin (Fig. 11-45) can provide the precise fit that 
gives high specificity for the binding of one ion rather 
than another. Chemists have designed small molecules 
with very high specificity for binding of Li + (radius 
0.60 A), Na + (radius 0.95 A), K + (radius 1.33 A), or Rb + 
(radius 1.48 A). The biological versions, however— the 
channel proteins— not only bind specifically but con- 
duct ions across membranes in a gated fashion. 

Gated Ion Channels Are Central in Neuronal Function 

Virtually all rapid signaling between neurons and their 
target tissues (such as muscle) is mediated by the rapid 
opening and closing of ion channels in plasma mem- 
branes. For example, Na + channels in neuronal plasma 
membranes sense the transmembrane electrical gradi- 
ent and respond to changes by opening or closing. These 
voltage-gated ion channels are typically very selective 
for Na + over other monovalent or divalent cations (by 
factors of 100 or more) and have very high flux rates 
(>10 7 ions/s). Closed in the resting state, Na + channels 
are opened— activated— by a reduction in the mem- 
brane potential; they then undergo very rapid inactiva- 
tion. Within milliseconds of opening, a channel closes 
and remains inactive for many milliseconds. Activation 
followed by inactivation of Na + channels is the basis for 
signaling by neurons (see Fig. 12-25). 

Another very well-studied ion channel is the nico- 
tinic acetylcholine receptor, which functions in the 
passage of an electrical signal from a motor neuron to a 
muscle fiber at the neuromuscular junction (signaling 
the muscle to contract). Acetylcholine released by the 
motor neuron diffuses a few micrometers to the plasma 
membrane of a myocyte, where it binds to an acetyl- 
choline receptor. This forces a conformational change in 
the receptor, causing its ion channel to open. The result- 
ing inward movement of positively charged ions into the 
myocyte depolarizes its plasma membrane and triggers 
contraction. The acetylcholine receptor allows Na + , 
Ca 2+ , and K + to pass through its channel with equal 
ease, but other cations and all anions are unable to pass. 
Movement of Na + through an acetylcholine receptor ion 
channel is unsaturable (its rate is linear with respect to 
extracellular [Na + ]) and very fast— about 2 X 10 7 ions/s 
under physiological conditions. 

Q 

O — CH 2 — CH 2 -^N — CH 3 
CH 3 

Acetylcholine 

The acetylcholine receptor channel is typical of many 
other ion channels that produce or respond to electrical 
signals: it has a "gate" that opens in response to stimula- 
tion by a signal molecule (in this case acetylcholine) and 
an intrinsic timing mechanism that closes the gate after a 



split second. Thus the acetylcholine signal is transient — 
an essential feature of all electrical signal conduction. 

Based on similarities between the amino acid se- 
quences of other ligand-gated ion channels and the 
acetylcholine receptor, neuronal receptor channels that 
respond to the extracellular signals y-aminobutyric 
acid (GAB A), glycine, and serotonin are grouped to- 
gether in the acetylcholine receptor superfamily, and 
probably share three-dimensional structure and gating 
mechanisms. The GABA A and glycine receptors are 
anion channels specific for CI" or HCO3, whereas the 
serotonin receptor, like the acetylcholine receptor, is 
cation-specific. 

Another class of ligand-gated ion channels respond 
to intracellular ligands: 3',5'-cyclic guanosine mono- 
nucleotide (cGMP) in the vertebrate eye, cGMP and 
cAMP in olfactory neurons, and ATP and inositol 1,4,5- 
trisphosphate (IP 3 ) in many cell types. These channels 
are composed of multiple subunits, each with six trans- 
membrane helical domains. We discuss the signaling 
functions of these ion channels in Chapter 12. 

Table 11-6 shows some transporters discussed in 
other chapters in the context of the pathways in which 
they act. 

Defective Ion Channels Can Have Severe 
Physiological Consequences 

|S The importance of ion channels to physiological 
mm processes is clear from the effects of mutations in 
specific ion-channel proteins (Table 11-7, Box 11-3). 
Genetic defects in the voltage-gated Na + channel of the 
myocyte plasma membrane result in diseases in which 
muscles are periodically either paralyzed (as in hyper- 
kalemic periodic paralysis) or stiff (as in paramyotonia 
congenita). Cystic fibrosis is the result of a mutation 
that changes one amino acid in the protein CFTR, a CI" 
ion channel; the defective process here is not neuro- 
transmission but secretion by various exocrine gland 
cells with activities tied to CF ion fluxes. 

Many naturally occurring toxins act on ion channels, 
and the potency of these toxins further illustrates 
the importance of normal ion-channel function. Tetro- 
dotoxin (produced by the puffer fish, Sphaeroides 
rubripes) and saxitoxin (produced by the marine di- 
noflagellate Gonyaulax, which causes "red tides") act 
by binding to the voltage-gated Na + channels of neurons 
and preventing normal action potentials. Puffer fish is 
an ingredient of the Japanese delicacy fugu, which may 
be prepared only by chefs specially trained to separate 
succulent morsel from deadly poison. Eating shellfish 
that have fed on Gonyaulax can also be fatal; shellfish 
are not sensitive to saxitoxin, but they concentrate it in 
their muscles, which become highly poisonous to organ- 
isms higher up the food chain. The venom of the black 
mamba snake contains dendrotoxin, which interferes 
with voltage-gated K + channels. Tubocurarine, the active 
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TABLE 11-6 



Transport Systems Described Elsewhere in This Text 

Transport system and location Figure number 



Role 



Adenine nucleotide antiporter of mitochondrial 19-28 
inner membrane 

Acyl-carnitine/carnitine transporter of mitochondrial 17-6 
inner membrane 

P r H + symporter of mitochondrial inner membrane 19-28 

Malate-a-ketoglutarate transporter of mitochondrial 19-29 
inner membrane 

Glutamate-aspartate transporter of mitochondrial 19-29 
inner membrane 

Citrate transporter of mitochondrial inner membrane 21-10 

Pyruvate transporter of mitochondrial inner 21-10 
membrane 

Fatty acid transporter of myocyte plasma 1 7-3 
membrane 

Complex I, III, and IV proton transporters of 19-16 
mitochondrial inner membrane 

Thermogenin (uncoupler protein), a proton pore of 19-34, 23-35 
mitochondrial inner membrane 



Cytochrome bf complex, a proton transporter of 
chloroplast thylakoid 

Bacteriorhodopsin, a light-driven proton pump 

F Q F 1 ATPase/ATP synthase of mitochondrial inner 
membrane, chloroplast thylakoid, and bacterial 
plasma membrane 

Pj-triose phosphate antiporter of chloroplast inner 
membrane 

Bacterial protein transporter 

Protein translocase of ER 

Nuclear pore protein translocase 

LDL receptor in animal cell plasma membrane 

Glucose transporter of animal cell plasma 
to membrane; regulated by insulin 

IP 3 -gated Ca 2+ channel of endoplasmic reticulum 

cGMP-gated Ca 2+ channel of retinal rod and cone 
cells 

Voltage-gated Na + channel of neuron 



19-59 

19-66 

19- 64 

20- 15, 20-16 

27-44 

27-38 

27-12 
21^2 

12-16 

12-10 

12-36 

12-25 



Imports substrate ADP for oxidative 
phosphorylation, and exports product ATP 

Imports fatty acids into matrix for j8 oxidation 

Supplies Pi for oxidative phosphorylation 
Shuttles reducing equivalents (as malate) from 
matrix to cytosol 

Completes shuttling begun by 
malate-a-ketoglutarate shuttle 

Provides cytosolic citrate as source of acetyl-CoA 

for lipid synthesis 
Is part of mechanism for shuttling citrate from 

matrix to cytosol 
Imports fatty acids for fuel 

Acts as energy-conserving mechanism in 
oxidative phosphorylation, converting 
electron flow into proton gradient 

Allows dissipation of proton gradient in 
mitochondria as means of thermogenesis 
and/or disposal of excess fuel 

Acts as proton pump, driven by electron flow 
through the Z scheme; source of proton 
gradient for photosynthetic ATP synthesis 

Is light-driven source of proton gradient for ATP 

synthesis in halophilic bacterium 
Interconverts energy of proton gradient and ATP 

during oxidative phosphorylation and 

photophosphorylation 

Exports photosynthetic product from stroma; 

imports Pi for ATP synthesis 
Exports secreted proteins through plasma 

membrane 

Transports into ER proteins destined for plasma 

membrane, secretion, or organelles 
Shuttles proteins between nucleus and cytoplasm 
Imports, by receptor-mediated endocytosis, lipid 

carrying particles 
Increases capacity of muscle and adipose tissue to 

take up excess glucose from blood 
Allows signaling via changes of cytosolic Ca 2+ 

concentration 
Allows signaling via rhodopsin linked to cAMP 

phosphodiesterase in vertebrate eye 
Creates action potentials in neuronal signal 

transmission 



component of curare (used as an arrow poison in the 
Amazon region) , and two other toxins from snake ven- 
oms, cobrotoxin and bungarotoxin, block the acetyl- 
choline receptor or prevent the opening of its ion 



channel. By blocking signals from nerves to muscles, all 
these toxins cause paralysis and possibly death. On the 
positive side, the extremely high affinity of bungaro- 
toxin for the acetylcholine receptor (K d = 10' 15 m) has 



^412^ Biological Membranes and Transport 



TABLE 11 -7 



Ion channel 



Some Diseases Resulting 



from Ion Channel 

Affected gene 



Na + (voltage-gated, skeletal muscle) SCN4A 

Na + (voltage-gated, neuronal) SCNIA 

Na + (voltage-gated, cardiac muscle) SCN5A 

Ca 2+ (neuronal) CACNAIA 

Ca 2+ (voltage-gated, retina) CACNAIF 

Ca 2+ (polycystic 1) PKDl 

K + (neuronal) KCNQ4 

K + (voltage-gated, neuronal) KCNQ2 

Nonspecific cation (cGMP-gated, retinal) CNCGl 

Acetylcholine receptor (skeletal muscle) CHRNAl 

cr CFTR 



Defects 
Disease 



Hyperkalemic periodic paralysis (or paramyotonia congenita) 

Generalized epilepsy with febrile seizures 

Long QT syndrome 3 

Familial hemiplegic migraine 

Congenital stationary night blindness 

Polycystic kidney disease 

Dominant deafness 

Benign familial neonatal convulsions 

Retinitis pigmentosa 

Congenital myasthenic syndrome 

Cystic fibrosis 



proved useful experimentally: the radiolabeled toxin was 
used to quantify the receptor during its purification. ■ 




D-Tubocurarine chloride 

SUMMARY 1 1 .3 Solute Transport across 
Membranes 

■ Movement of polar compounds and ions across 
biological membranes requires transporter proteins. 
Some transporters simply facilitate passive diffusion 
across the membrane from the side with higher 



concentration to the side with lower. Others 
transport solutes against an electrochemical 
gradient; this requires a source of metabolic 
energy. 

■ Carriers, like enzymes, show saturation and 
stereospeciflcity for their substrates. Transport via 
these systems may be passive or active. Primary 
active transport is driven by ATP or electron-transfer 
reactions; secondary active transport is driven by 
coupled flow of two solutes, one of which (often H + 
or Na + ) flows down its electrochemical gradient as 
the other is pulled up its gradient. 

■ The GLUT transporters, such as GLUT1 of 
erythrocytes, carry glucose into cells by facilitated 
diffusion. These transporters are uniporters, 
carrying only one substrate. Symporters permit 
simultaneous passage of two substances in the 
same direction; examples are the lactose transporter 
of E. coli, driven by the energy of a proton gradient 
(lactose-H + symport), and the glucose transporter 
of intestinal epithelial cells, driven by a Na + gradient 
(glucose-Na + symport). Antiporters mediate 
simultaneous passage of two substances in opposite 
directions; examples are the chloride-bicarbonate 
exchanger of erythrocytes and the ubiquitous 
Na + K + ATPase. 

m In animal cells, Na + K + ATPase maintains the 
differences in cytosolic and extracellular 
concentrations of Na + and K + , and the resulting 
Na + gradient is used as the energy source for a 
variety of secondary active transport processes. 

■ The Na + K + ATPase of the plasma membrane and 
the Ca 2+ transporters of the sarcoplasmic and 
endoplasmic reticulum (the SERCA pumps) are 
examples of P-type ATPases; they undergo 
reversible phosphorylation during their catalytic 
cycle. F-type ATPase proton pumps (ATP 
synthases) are central to energy-conserving 
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mechanisms in mitochondria and chloroplasts. 
V-type ATPases produce gradients of protons 
across some intracellular membranes, including 
plant vacuolar membranes. 

m ABC transporters carry a variety of substrates 
(including many drugs) out of cells, using ATP as 
energy source. 

■ Ionophores are lipid-soluble molecules that bind 
specific ions and carry them passively across 
membranes, dissipating the energy of 
electrochemical ion gradients. 

m Water moves across membranes through aquaporins. 
Some aquaporins are regulated; some also transport 
glycerol or urea. 

■ Ion channels provide hydrophilic pores through 
which select ions can diffuse, moving down 

their electrical or chemical concentration gradients; 
they characteristically are unsaturable, have 
very high flux rates, and are highly specific for 
one ion. Most are voltage- or ligand-gated. The 
neuronal Na + channel is voltage-gated, and 
the acetylcholine receptor ion channel is gated 
by acetylcholine, which triggers conformational 
changes that open and close the transmembrane 
path. 
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Terms in bold are defined in the glossary. 
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Problems 

1. Determining the Cross-Sectional Area of a Lipid 
Molecule When phospholipids are layered gently onto the 
surface of water, they orient at the air-water interface with 
their head groups in the water and their hydrophobic tails in 
the air. An experimental apparatus (a) has been devised that 
reduces the surface area available to a layer of lipids. By meas- 
uring the force necessary to push the lipids together, it is pos- 
sible to determine when the molecules are packed tightly in a 
continuous monolayer; as that area is approached, the force 
needed to further reduce the surface area increases sharply 
(b) How would you use this apparatus to determine the aver- 
age area occupied by a single lipid molecule in the monolayer? 



Force applied here 
to compress 
monolayer (1 



(a) 



1 - 




2. Evidence for a Lipid Bilayer In 1925, E. Gorter and F. 
Grendel used an apparatus like that described in Problem 1 to 
determine the surface area of a lipid monolayer formed by 
lipids extracted from erythrocytes of several animal species. 
They used a microscope to measure the dimensions of individ- 
ual cells, from which they calculated the average surface area 
of one erythrocyte. They obtained the data shown in the table. 
Were these investigators justified in concluding that "chromo- 
cytes [erythrocytes] are covered by a layer of fatty substances 
that is two molecules thick" (i.e., a lipid bilayer)? 

Total surface 





Volume of 


Number 


area of lipid 


Total surface 




packed 


of cells 


monolayer 


area of one 


Animal 


cells (mL) 


(per mm 3 ) 


from cells (m 2 ) 


cell (^tm 2 ) 


Dog 


40 


8,000,000 


62 


98 


Sheep 


10 


9,900,000 


6.0 


29.8 


Human 


1 


4,740,000 


0.92 


99.4 



(b) 



0.6 1.0 
Area (nm 2 /molecule) 



Source: Data from Gorter, E & Grendel, F. (1925) On bimolecular layers 
of lipoids on the chromocytes of the blood. J. Exp Med 41, 439-443. 

3. Number of Detergent Molecules per Micelle When a 
small amount of the detergent sodium dodecyl sulfate 
(SDS; Na + CH 3 (CH 2 ) n OS03) is dissolved in water, the deter- 
gent ions enter the solution as monomeric species. As more 
detergent is added, a concentration is reached (the critical mi- 
celle concentration) at which the monomers associate to form 
micelles The critical micelle concentration of SDS is 8.2 ium. 
The micelles have an average particle weight (the sum of 
the molecular weights of the constituent monomers) of 
18,000. Calculate the number of detergent molecules in the 
average micelle. 

4. Properties of Lipids and Lipid Bilayers Lipid bilayers 
formed between two aqueous phases have this important 
property: they form two-dimensional sheets, the edges of 
which close upon each other and undergo self-sealing to form 
vesicles (liposomes). 

(a) What properties of lipids are responsible for this prop- 
erty of bilayers? Explain. 

(b) What are the consequences of this property for the 
structure of biological membranes? 

5. Length of a Fatty Acid Molecule The carbon-carbon 
bond distance for single-bonded carbons such as those in a 
saturated fatty acyl chain is about 1.5 A. Estimate the length of 
a single molecule of palmitate in its fully extended form. If two 
molecules of palmitate were placed end to end, how would 
their total length compare with the thickness of the lipid bi- 
layer in a biological membrane? 

6. Temperature Dependence of Lateral Diffusion The 

experiment described in Figure 11-17 was performed at 37 °C. 
If the experiment were carried out at 10 °C, what effect would 
you expect on the rate of diffusion? Why? 

7. Synthesis of Gastric Juice: Energetics Gastric juice 
(pH 1.5) is produced by pumping HC1 from blood plasma (pH 
7.4) into the stomach, Calculate the amount of free energy 
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required to concentrate the H + in 1 L of gastric juice at 37 °C. 
Under cellular conditions, how many moles of ATP must be hy- 
drolyzed to provide this amount of free energy? The free- 
energy change for ATP hydrolysis under cellular conditions is 
about -58 kJ/mol (as explained in Chapter 13). Ignore the 
effects of the transmembrane electrical potential. 

8. Energetics of the Na + K + ATPase For a typical verte- 
brate cell with a membrane potential of -0.070 V (inside neg- 
ative), what is the free-energy change for transporting 1 mol of 
Na + from the cell into the blood at 37 °C? Assume the concen- 
tration of Na + inside the cell is 12 mM and that in blood plasma 
is 145 mM. 

9. Action of Ouabain on Kidney Tissue Ouabain specifi- 
cally inhibits the Na + K + ATPase activity of animal tissues but 
is not known to inhibit any other enzyme. When ouabain is 
added to thin slices of living kidney tissue, it inhibits oxygen 
consumption by 66% Why? What does this observation tell us 
about the use of respiratory energy by kidney tissue? 

10. Energetics of Symport Suppose you determined ex- 
perimentally that a cellular transport system for glucose, 
driven by symport of Na + , could accumulate glucose to con- 
centrations 25 times greater than in the external medium, 
while the external [Na + ] was only 10 times greater than the in- 
tracellular [Na + ]. Would this violate the laws of thermodynam- 
ics? If not, how could you explain this observation? 

11. Location of a Membrane Protein The following observa- 
tions are made on an unknown membrane protein, X. It can be 
extracted from disrupted erythrocyte membranes into a concen- 
trated salt solution, and it can be cleaved into fragments by 
proteolytic enzymes. Treatment of erythrocytes with proteolytic 
enzymes followed by disruption and extraction of membrane 
components yields intact X. However, treatment of erythrocyte 
"ghosts" (which consist of just plasma membranes, produced by 
disrupting the cells and washing out the hemoglobin) with prote- 
olytic enzymes followed by disruption and extraction yields ex- 
tensively fragmented X, What do these observations indicate 
about the location of X in the plasma membrane? Do the proper- 
ties of X resemble those of an integral or peripheral membrane 
protein? 

12. Membrane Self-sealing Cellular membranes are self- 
sealing— if they are punctured or disrupted mechanically, they 
quickly and automatically reseal What properties of mem- 
branes are responsible for this important feature? 

13. Lipid Melting Temperatures Membrane lipids in tissue 
samples obtained from different parts of the leg of a reindeer 
have different fatty acid compositions. Membrane lipids from 
tissue near the hooves contain a larger proportion of unsatu- 
rated fatty acids than those from tissue in the upper leg. What 
is the significance of this observation? 

14. Flip-Flop Diffusion The inner leaflet (monolayer) of 
the human erythrocyte membrane consists predominantly of 
phosphatidylethanolarnine and phosphatidylserine. The outer 
leaflet consists predominantly of phosphatidylcholine and 



sphingomyelin. Although the phospholipid components of the 
membrane can diffuse in the fluid bilayer, this sidedness is pre- 
served at all times. How? 

15. Membrane Permeability At pH 7, tryptophan crosses a 
lipid bilayer at about one-thousandth the rate of indole, a 
closely related compound: 




a 



Suggest an explanation for this observation. 

16. Water Flow through an Aquaporin A human erythro- 
cyte has about 2 X 10 5 AQP-1 monomers. If water molecules 
flow through the plasma membrane at a rate of 5 X 10 8 per 
AQP-1 tetramer per second, and the volume of an erythrocyte 
is 5 X 10 -11 mL, how rapidly could an erythrocyte halve its 
volume as it encountered the high osmolarity (1 m) in the in- 
terstitial fluid of the renal medulla? Assume that the erythro- 
cyte consists entirely of water. 

17. Labeling the Lactose Transporter A bacterial lactose 
transporter, which is highly specific for lactose, contains a Cys 
residue that is essential to its transport activity. Covalent reac- 
tion of 7V-ethylmaleimide (NEM) with this Cys residue irre- 
versibly inactivates the transporter. A high concentration of 
lactose in the medium prevents inactivation by NEM, presum- 
ably by sterically protecting the Cys residue, which is in or near 
the lactose-binding site, You know nothing else about the trans- 
porter protein. Suggest an experiment that might allow you to 
determine theM r of the Cys-containing transporter polypeptide. 

18. Predicting Membrane Protein Topology from Se- 
quence You have cloned the gene for a human erythrocyte 
protein, which you suspect is a membrane protein. From the 
nucleotide sequence of the gene, you know the amino acid se- 
quence. From this sequence alone, how would you evaluate 
the possibility that the protein is an integral protein? Suppose 
the protein proves to be an integral protein, either type I or 
type II. Suggest biochemical or chemical experiments that 
might allow you to determine which type it is. 

19. Intestinal Uptake of Leucine You are studying the up- 
take of L-leucine by epithelial cells of the mouse intestine. 
Measurements of the rate of uptake of L-leucine and several of 
its analogs, with and without Na + in the assay buffer, yield the 
results given in the table. What can you conclude about the 
properties and mechanism of the leucine transporter? Would 
you expect L-leucine uptake to be inhibited by ouabain? 



Uptake in Uptake in 

presence of Na + absence of Na + 



Substrate 


V 

r max 


tft(mM) 


v 

' max 


K t (vm) 


L-Leucine 


420 


0.24 


23 


0.2 


D-Leucine 


310 


4.7 


5 


4 7 


L-Valine 


225 


0.31 


19 


0.31 



Problems 417 



20. Effect of an Ionophore on Active Transport Consider 
the leucine transporter described in Problem 19. Would 7 max 
and/or K t change if you added a Na + ionophore to the assay so- 
lution containing Na + ? Explain. 

21. Surface Density of a Membrane Protein E. coli can 
be induced to make about 10,000 copies of the lactose trans- 
porter (M T 31,000) per cell. Assume thatE. coli is a cylinder 1 
fim in diameter and 2 /xm long. What fraction of the plasma 
membrane surface is occupied by the lactose transporter mol- 
ecules? Explain how you arrived at this conclusion. 

22. Use of the Helical Wheel Diagram A helical wheel is a 
two-dimensional representation of a helix, a view along its cen- 
tral axis (see Fig. 1 1— 29b; see also Fig. 4-4d). Use the helical 
wheel diagram below to determine the distribution of amino 
acid residues in a helical segment with the sequence -Val- 
Asp-Arg-Val-Phe-Ser-Asn-Val-Cys-Thr-His-Leu-Lys-Thr- 
Leu-Gln-Asp-Lys- 




What can you say about the surface properties of this 
helix? How would you expect the helix to be oriented in the 
tertiary structure of an integral membrane protein? 

23. Molecular Species in the E. coli Membrane The 

plasma membrane of E coli is about 75% protein and 25% 
phospholipid by weight. How many molecules of membrane 
lipid are present for each molecule of membrane protein? As- 
sume an average protein M r of 50,000 and an average phos- 
pholipid M T of 750. What more would you need to know to 
estimate the fraction of the membrane surface that is covered 
by lipids? 

Biochemistry on the Internet 

24. Membrane Protein Topology The receptor for the 
hormone epinephrine in animal cells is an integral membrane 
protein (M T 64,000) that is believed to have seven membrane- 
spanning regions. 

(a) Show that a protein of this size is capable of spanning 
the membrane seven times. 

(b) Given the amino acid sequence of this protein, how 
would you predict which regions of the protein form the 
membrane-spanning helices? 



(c) Go to the Protein Data Bank (www.rcsb.org) Use 
the PDB identifier 1DEP to retrieve the data page for a por- 
tion of the jS-adrenergic receptor (one type of epinephrine 
receptor) isolated from a turkey. Using Jmol to explore the 
structure, predict whether this portion of the receptor is lo- 
cated within the membrane or at the membrane surface. 
Explain. 

(d) Retrieve the data for a portion of another receptor, the 
acetylcholine receptor of neurons and myocytes, using the 
PDB identifier 1A11. As in (c), predict where this portion of 
the receptor is located and explain your answer. 

If you have not used the PDB, see Box 4-4 (p. 129) for 
more information. 

Data Analysis Problem — — 

25. The Fluid Mosaic Model of Biological Membrane 
Structure Figure 11-3 shows the currently accepted fluid 
mosaic model of biological membrane structure. This 
model was presented in detail in a review article by S. J. 
Singer in 1971. In the article, Singer presented the three 
models of membrane structure that had been proposed by 
that time: 




C 

A. The Davson-Danielli-Robertson Model. This was the 
most widely accepted model in 1971 , when Singer's review was 
published. In this model, the phospholipids are arranged as a 
bilayer. Proteins are found on both surfaces of the bilayer, at- 
tached to it by ionic interactions between the charged head 
groups of the phospholipids and charged groups in the pro- 
teins. Crucially, there is no protein in the interior of the bilayer. 

B. The Benson Lipoprotein Subunit Model. Here, the pro- 
teins are globular and the membrane is a protein-lipid mixture. 
The hydrophobic tails of the lipids are embedded in the hy- 
drophobic parts of the proteins. The lipid head groups are 
exposed to the solvent. There is no lipid bilayer. 
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C The Lipid-Globular Protein Mosaic Model. This is the 
model shown in Figure 1 1-3. The lipids form a bilayer and pro- 
teins are embedded in it, some extending through the bilayer 
and others not. Proteins are anchored in the bilayer by hy- 
drophobic interactions between the hydrophobic tails of the 
lipids and hydrophobic portions of the protein. 

For the data given below, consider how each piece of in- 
formation aligns with each of the three models of membrane 
structure. Which model(s) are supported, which are not sup- 
ported, and what reservations do you have about the data or 
their interpretation? Explain your reasoning 

(a) When cells were fixed, stained with osmium tetroxide, 
and examined in the electron microscope, they gave images 
like that in Figure 11-1: the membranes showed a "railroad 
track" appearance, with two dark-staining lines separated by a 
light space. 

(b) The thickness of membranes in cells fixed and stained 
in the same way was found to be 5 to 9 nm. The thickness of a 
"naked" phospholipid bilayer, without proteins, was 4 to 4.5 nm. 
The thickness of a single monolayer of proteins was about 1 nm. 

(c) In Singer's words: "The average amino acid composi- 
tion of membrane proteins is not distinguishable from that of 
soluble proteins. In particular, a substantial fraction of the 
residues is hydrophobic" (p. 165). 

(d) As described in Problems 1 and 2 of this chapter, re- 
searchers had extracted membranes from cells, extracted 
the lipids, and compared the area of the lipid monolayer with 
the area of the original cell membrane. The interpretation of 
the results was complicated by the issue illustrated in the 
graph of Problem 1; the area of the monolayer depended on 
how hard it was pushed. With very light pressures, the ratio of 



monolayer area to cell membrane area was about 2.0. At 
higher pressures — thought to be more like those found in 
cells — the ratio was substantially lower. 

(e) Circular dichroism spectroscopy uses changes in po- 
larization of UV light to make inferences about protein second- 
ary structure (see Fig. 4-9). On average, this technique showed 
that membrane proteins have a large amount of a helix and little 
or no /3 sheet. This finding was consistent with most membrane 
proteins having a globular structure. 

(f) Phospholipase C is an enzyme that removes the polar 
head group (including the phosphate) from phospholipids. In 
several studies, treatment of intact membranes with phospho- 
lipase C removed about 70% of the head groups without dis- 
rupting the "railroad track" structure of the membrane. 

(g) Singer described a study in which "a glycoprotein of 
molecular weight about 31,000 in human red blood cell mem- 
branes is cleaved by tryptic treatment of the membranes into 
soluble glycopeptides of about 10,000 molecular weight, while 
the remaining portions are quite hydrophobic" (p. 199). 
Trypsin treatment did not cause gross changes in the mem- 
branes, which remained intact. 

Singer's review also included many more studies in this 
area. In the end, though, the data available in 1971 did not con- 
clusively prove Model C was correct. As more data have accu- 
mulated, this model of membrane structure has been accepted 
by the scientific community 

Reference 

Singer, S.J. (1971) The molecular organization of biological mem- 
branes. In Structure and Function of Biological Membranes (Roth- 
field, L.I., ed,), pp. 145-222, Academic Press, Inc., New York, 



When I first entered the study of hormone action, some 25 years ago, 
there was a widespread feeling among biologists that hormone action 
could not be studied meaningfully in the absence of organized cell 
structure. However, as I reflected on the history of biochemistry, it 
seemed to me there was a real possibility that hormones might act at 
the molecular level. 

—Earl W. Sutherland, Nobel Address, 1971 
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The ability of cells to receive and act on signals from 
beyond the plasma membrane is fundamental to life. 
Bacterial cells receive constant input from mem- 
brane proteins that act as information receptors, sam- 
pling the surrounding medium for pH, osmotic strength, 
the availability of food, oxygen, and light, and the pres- 
ence of noxious chemicals, predators, or competitors for 
food. These signals elicit appropriate responses, such as 
motion toward food or away from toxic substances or the 
formation of dormant spores in a nutrient-depleted 



medium. In multicellular organisms, cells with different 
functions exchange a wide variety of signals. Plant cells 
respond to growth hormones and to variations in sunlight. 
Animal cells exchange information about the concentra- 
tions of ions and glucose in extracellular fluids, the inter- 
dependent metabolic activities taking place in different 
tissues, and, in an embryo, the correct placement of cells 
during development. In all these cases, the signal repre- 
sents information that is detected by specific receptors 
and converted to a cellular response, which always in- 
volves a chemical process. This conversion of informa- 
tion into a chemical change, signal transduction, is a 
universal property of living cells. 



12.1 General Features of Signal Transduction 

Signal transductions are remarkably specific and exquis- 
itely sensitive. Specificity is achieved by precise molec- 
ular complementarity between the signal and receptor 
molecules (Fig. 12-la), mediated by the same kinds 
of weak (noncovalent) forces that mediate enzyme- 
substrate and antigen-antibody interactions. Multicel- 
lular organisms have an additional level of specificity, 
because the receptors for a given signal, or the intracel- 
lular targets of a given signal pathway, are present only 
in certain cell types. Thyrotropin-releasing hormone, for 
example, triggers responses in the cells of the anterior 
pituitary but not in hepatocytes, which lack receptors 
for this hormone. Epinephrine alters glycogen metabo- 
lism in hepatocytes but not in adipocytes; in this case, 
both cell types have receptors for the hormone, but 
whereas hepatocytes contain glycogen and the glycogen- 
metabolizing enzyme that is stimulated by epinephrine, 
adipocytes contain neither. 

Three factors account for the extraordinary sensi- 
tivity of signal transducers: the high affinity of receptors 
for signal molecules, cooperativity (often but not al- 
ways) in the ligand-receptor interaction, and amplifica- 
tion of the signal by enzyme cascades. The affinity 
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(a) Specificity 

Signal molecule fits 
binding site on its 
complementary receptor; 
other signals do not fit. 



Receptor 



I 

Effect 



(c) Desensitization/Adaptation 

Receptor activation triggers 
a feedback circuit that shuts 
off the receptor or removes 
it from the cell surface. 



Signal 



Xj Receptor 




Response 



(b) Amplification 

When enzymes activate 
enzymes, the number of 
affected molecules 
increases geometrically 
in an enzyme cascade. 
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FIGURE 12-1 Four features of signal-transducing systems. 



(d) Integration 

When two signals have 
opposite effects on a 
metabolic characteristic 
such as the concentration 
of a second messenger X, 
or the membrane potential 
V m , the regulatory outcome 
results from the integrated 
input from both receptors. 
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between signal (ligand) and receptor can be expressed 
as the dissociation constant K d) usually 10~ 10 m or less — 
meaning that the receptor detects picomolar concentra- 
tions of a signal molecule. Receptor-ligand interactions 
are quantified by Scatchard analysis, which yields a 
quantitative measure of affinity (X d ) and the number of 
ligand-binding sites in a receptor sample (Box 12-1). 

Cooperativity in receptor-ligand interactions results 
in large changes in receptor activation with small changes 
in ligand concentration (recall the effect of cooperativity 
on oxygen binding to hemoglobin; see Fig. 5-12). Ampli- 
fication by enzyme cascades results when an enzyme 
associated with a signal receptor is activated and, in turn, 
catalyzes the activation of many molecules of a second en- 
zyme, each of which activates many molecules of a third 
enzyme, and so on (Fig. 12-lb). Such cascades can pro- 
duce amplifications of several orders of magnitude within 
milliseconds. The response to a signal must also be termi- 
nated such that the downstream effects are in proportion 
to the strength of the original stimulus. 

The sensitivity of receptor systems is subject to 
modification. When a signal is present continuously, de- 
sensitization of the receptor system results (Fig. 12-1 c); 
when the stimulus falls below a certain threshold, the 
system again becomes sensitive. Think of what happens 
to your visual transduction system when you walk from 
bright sunlight into a darkened room or from darkness 
into the light. 

A final noteworthy feature of signal-transducing 
systems is integration (Fig. 12-ld), the ability of the 
system to receive multiple signals and produce a uni- 
fied response appropriate to the needs of the cell or 
organism. Different signaling pathways converse with 
each other at several levels, generating a wealth of in- 
teractions that maintain homeostasis in the cell and 
the organism. 



One of the revelations of research on signaling is the 
remarkable degree to which signaling mechanisms have 
been conserved during evolution. Although the number 
of different biological signals (Table 12-1) is probably in 
the thousands, and the kinds of responses elicited by 
these signals are comparably numerous, the machinery 
for transducing all of these signals is built from about 10 
basic types of protein components. 

In this chapter we examine some examples of the 
major classes of signaling mechanisms, looking at how 
they are integrated in specific biological functions such 
as the transmission of nerve signals; responses to hor- 
mones and growth factors; the senses of sight, smell, 
and taste; and control of the cell cycle. Often, the end 
result of a signaling pathway is the phosphorylation of a 
few specific target-cell proteins, which changes their ac- 
tivities and thus the activities of the cell. Throughout 
our discussion we emphasize the conservation of funda- 
mental mechanisms for the transduction of biological 
signals and the adaptation of these basic mechanisms to 
a wide range of signaling pathways. 

We consider the molecular details of several represen- 
tative signal-transduction systems, classified according 
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Some Signals to Which Cells Respond 



Antigens 

Cell surface glycoproteins/ 

oligosaccharides 
Developmental signals 
Extracellular matrix 

components 
Growth factors 
Hormones 



Light 

Mechanical touch 

Neurotransmitters 

Nutrients 

Odorants 

Pheromones 

Tastants 



12.1 General Features of Signal Transduction 



BOX 12-1 



METHODS 



catchard Analysis Quantifies the Receptor-Ligand Interaction 



The cellular actions of a hormone begin when the hormone 
(ligand, L) binds specifically and tightly to its protein re- 
ceptor (R) on or in the target cell. Binding is mediated 
by noncovalent interactions (hydrogen-bonding, hy- 
drophobic, and electrostatic) between the complemen- 
tary surfaces of ligand and receptor. Receptor-ligand 
interaction brings about a conformational change that 
alters the biological activity of the receptor, which may 
be an enzyme, an enzyme regulator, an ion channel, or a 
regulator of gene expression. 

Receptor-ligand binding is described by the equation 



R + L = 

Receptor Ligand 



RL 

Receptor-ligand complex 



This binding, like that of an enzyme to its substrate, de- 
pends on the concentrations of the interacting compo- 
nents and can be described by an equilibrium constant: 



R + L 

Receptor Ligand 



RL 

Receptor-ligand complex 



[RL] _ &4 
[R][L] " A. 



where K a is the association constant and K d is the disso- 
ciation constant. 

Like enzyme-substrate binding, receptor-ligand 
binding is saturable. As more ligand is added to a fixed 
amount of receptor, an increasing fraction of receptor 
molecules is occupied by ligand (Fig. la). A rough mea- 
sure of receptor-ligand affinity is given by the concentra- 
tion of ligand needed to give half-saturation of the 
receptor. Using Scatchard analysis of receptor-ligand 
binding, we can estimate both the dissociation constant 
K d and the number of receptor-binding sites in a given 
preparation. When binding has reached equilibrium, the 
total number of possible binding sites, B max , equals the 
number of unoccupied sites, represented by [R], plus 
the number of occupied or ligand-bound sites, [RL]; that 
is, B max = [R] + [RL]. The number of unbound sites can 
be expressed in terms of total sites minus occupied 
sites: [R] = B max - [RL]. The equilibrium expression can 
now be written 



[RL] 



[L](B n 



[RL]) 



Rearranging to obtain the ratio of receptor-bound ligand 
to free (unbound) ligand, we get 



[Bound] [RL] 



[Free] 



[L] 
1 



[RL]) 



[RL]) 



From this slope-intercept form of the equation, we can 
see that a plot of [bound ligand]/[free ligand] versus 
[bound ligand] should give a straight line with a slope of 




Total binding 
Specific binding 



Nonspecific binding 



(a) 



Total hormone added, [L] + [RL] 



i 3 




(b) 



Bound hormone, [RL] 



FIGURE 1 Scatchard analysis of a receptor-ligand interaction. A radio- 
labeled ligand (L) — a hormone, for example — is added at several con- 
centrations to a fixed amount of receptor (R), and the fraction of the 
hormone bound to receptor is determined by separating the receptor- 
hormone complex (RL) from free hormone. 

(a) A plot of [RL] versus [L] + [RL] (total hormone added) is hy- 
perbolic, rising toward a maximum for [RL] as the receptor sites be- 
come saturated. To control for nonsaturable, nonspecific binding 
sites (eicosanoid hormones bind nonspecificaliy to the lipid bilayer, 
for example), a separate series of binding experiments is also neces- 
sary. A large excess of unlabeled hormone is added along with the di- 
lute solution of labeled hormone. The unlabeled molecules compete 
with the labeled molecules for specific binding to the saturable site 
on the receptor, but not for the nonspecific binding. The true value for 
specific binding is obtained by subtracting nonspecific binding from 
total binding. 

(b) A linear plot of [RL]/[L] versus [RL] gives K d and S max for the 
receptor-hormone complex. Compare these plots with those of V 0 ver- 
sus [S] and 1/V 0 versus 1/[S] for an enzyme-substrate complex (see Fig. 
6-12, Box 6-1). 



-K a (-l/iT d ) and an intercept on the abscissa of B max , 
the total number of binding sites (Fig. lb). Hormone- 
ligand interactions typically have K d values of 10~ 9 to 
10~ u m, corresponding to very tight binding. 

Scatchard analysis is reliable for the simplest cases, 
but as with Lineweaver-Burk plots for enzymes, when 
the receptor is an allosteric protein, the plots deviate 
from linearity. 
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G protein-coupled 
receptor 

External ligand (S) binding 
to receptor (R) activates an 
intracellular GTP-binding 
protein (G), which regulates 
an enzyme (Enz) that 
generates an intracellular 
second messenger, X. 



Receptor tyrosine kinase 

Ligand binding activates 
tyrosine kinase activity 
by autophosphorylation, 



Receptor guanylyl 
cyclase 

Ligand binding to 
extracellular domain 
stimulates formation 
of second messenger 
cyclic GMP. 



Adhesion receptor 
(integrin) 

Binds molecules in 
extracellular matrix, 
changes conformation, 
thus altering its 
interaction with 
cytoskeleton, 




Plasma 
membrane 



Nuclear 
envelope 




Protein 



Gated ion channel 

Opens or closes in 
response to concentration 
of signal ligand or 
membrane potential. 



Nuclear receptor 

Steroid binding 
allows the receptor 
to regulate the 
expression of 
specific genes. 




Protein 



FIGURE 12-2 Six general types of signal transducers. 



to the type of receptor. The trigger for each system is dif- 
ferent, but the general features of signal transduction are 
common to all: a signal interacts with a receptor; the acti- 
vated receptor interacts with cellular machinery, produc- 
ing a second signal or a change in the activity of a cellular 
protein; the metabolic activity of the target cell undergoes 
a change; and finally, the transduction event ends. To illus- 
trate these general features of signaling systems, we will 
look at examples of six basic receptor types (Fig. 12-2). 

1 . G protein-coupled receptors that indirectly 
activate (through GTP-binding proteins, or G 
proteins) enzymes that generate intracellular second 
messengers. This type of receptor is illustrated by 
the ^-adrenergic receptor system that detects 
epinephrine (adrenaline) (Section 12.2). 

2. Receptor tyrosine kinases, plasma membrane 
receptors that are also enzymes. When one of these 
receptors is activated by its extracellular ligand, it 
catalyzes the phosphorylation of several cytosolic 
or plasma membrane proteins. The insulin receptor 
is one example (Section 12.3); the receptor for 
epidermal growth factor (EGF-R) is another. 

3. Receptor guanylyl cyclases, which are also plasma 
membrane receptors with an enzymatic cytoplasmic 
domain. The intracellular second messenger for 
these receptors, cyclic guanosine monophosphate 
(cGMP), activates a cytosolic protein kinase that 
phosphorylates cellular proteins and thereby 
changes their activities (Section 12.4). 

4. Gated ion channels of the plasma membrane that 
open and close (hence the term "gated") in response 



to the binding of chemical ligands or changes in 
transmembrane potential. These are the simplest 
signal transducers. The acetylcholine receptor 
ion channel is an example of this mechanism 
(Section 12.6). 

5. Adhesion receptors that interact with macromo- 
lecular components of the extracellular matrix (such 
as collagen) and convey instructions to the cyto- 
skeletal system about cell migration or adherence 
to the matrix. Integrins illustrate this general type 
of transduction mechanism (Section 12.7). 

6. Nuclear receptors (steroid receptors) that bind 
specific ligands (such as the hormone estrogen) 
and alter the rate at which specific genes are 
transcribed and translated into cellular proteins. 
Because steroid hormones function through 
mechanisms intimately related to the regulation of 
gene expression, we consider them here only briefly 
(Section 12.8) and defer a detailed discussion of 
their action until Chapter 28. 

As we begin this discussion of biological signaling, a 
word about the nomenclature of signaling proteins is in 
order. These proteins are typically discovered in one 
context and named accordingly, then prove to be in- 
volved in a broader range of biological functions for 
which the original name is not helpful. For example, the 
retinoblastoma protein, pRb, was initially identified as 
the site of a mutation that contributes to cancer of the 
retina (retinoblastoma) , but it is now known to function 
in many pathways essential to cell division in all cells, 
not just those of the retina. Some genes and proteins are 
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given noncommittal names: the tumor suppressor pro- 
tein p53, for example, is a protein of 53 kDa, but its 
name gives no clue to its great importance in the regula- 
tion of cell division and the development of cancer. In 
this chapter we generally define these protein names as 
we encounter them, introducing the names commonly 
used by researchers in the field. Don't be discouraged 
if you can't get them all straight the first time you en- 
counter them! 



blockers," prescribed for such diverse conditions as hy- 
pertension, cardiac arrhythmia, glaucoma, anxiety, and 
migraine headache. At least 150 of the GPCRs found in 
the human genome are still "orphan receptors": their nat- 
ural ligands are not yet identified, and so we know noth- 
ing about their biology. The jS-adrenergic receptor, with 
well-understood biology and pharmacology, is the proto- 
type for all GPCRs, and our discussion of signal-trans- 
ducing systems begins there. ■ 



SUMMARY 12.1 



General Features of Signal 
Transduction 



■ All cells have specific and highly sensitive signal- 
transducing mechanisms, which have been 
conserved during evolution. 

■ A wide variety of stimuli act through specific protein 
receptors in the plasma membrane. 

■ The receptors bind the signal molecule, amplify the 
signal, integrate it with input from other receptors, 
and transmit the information into the cell. If the 
signal persists, receptor desensitization reduces or 
ends the response. 

■ Multicellular organisms have six general types of 
signaling mechanisms: plasma membrane proteins 
that act through G proteins; receptor tyrosine 
kinases; receptor guanylyl cyclases that act through 
a protein kinase; gated ion channels; adhesion 
receptors that carry information between the 
extracellular matrix and the cytoskeleton; and 
nuclear receptors that bind steroids and alter gene 
expression. 

12.2 G Protein-Coupled Receptors and 
Second Messengers 

Three essential components define signal transduction 
through G protein-coupled receptors (GPCRs): a 

plasma membrane receptor with seven transmembrane 
helical segments, an effector enzyme in the plasma 
membrane that generates an intracellular second mes- 
senger, and a guanosine nucleotide-binding protein 
(G protein) that activates the effector enzyme. The 
G protein, stimulated by the activated receptor, 
exchanges bound GDP for GTP; the GTP-protein dissoci- 
ates from the occupied receptor and binds to a nearby 
enzyme, altering its activity. The human genome encodes 
about 350 GPCRs for detecting hormones, growth factors, 
and other endogenous ligands, and perhaps 500 that serve 
as olfactory (smell) and gustatory (taste) receptors. 

GPCRs have been implicated in many common 
human diseases, including allergies, depression, 
blindness, diabetes, and various cardiovascular defects 
with serious health consequences. Close to half of all 
drugs on the market target one GPCR or another. For 
example, the /3-adrenergic receptor, which mediates 
the effects of epinephrine, is the target of the "beta 



The /2-Adrenergic Receptor System Acts through 
the Second Messenger cAMP 

Epinephrine sounds the alarm when some threat re- 
quires the organism to mobilize its energy-generating 
machinery; it signals the need to fight or flee. Epineph- 
rine action begins when the hormone binds to a protein 
receptor in the plasma membrane of an epinephrine- 
sensitive cell. Adrenergic receptors ("adrenergic" 
reflects the alternative name for epinephrine, adrena- 
line) are of four general types, a u a 2 ,Pi, and /3 2 , defined 
by differences in their affinities and responses to a 
group of agonists and antagonists. Agonists are struc- 
tural analogs that bind to a receptor and mimic the ef- 
fects of its natural ligand; antagonists are analogs 
that bind the receptor without triggering the normal 
effect and thereby block the effects of agonists, includ- 
ing the biological ligand. In some cases, the affinity of 
the synthetic agonist or antagonist for the receptor is 
greater than that of the natural agonist (Fig. 12-3). 
The four types of adrenergic receptors are found in dif- 
ferent target tissues and mediate different responses 
to epinephrine. Here we focus on the ^-adrenergic 
receptors of muscle, liver, and adipose tissue. These 
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FIGURE 12-3 Epinephrine and its synthetic analogs. Epinephrine, also 
called adrenaline, is released from the adrenal gland and regulates 
energy-yielding metabolism in muscle, liver, and adipose tissue. It also 
serves as a neurotransmitter in adrenergic neurons. Its affinity for its 
receptor is expressed as a dissociation constant for the receptor-ligand 
complex. Isoproterenol and propranolol are synthetic analogs, one an 
agonist with an affinity for the receptor that is higher than that of 
epinephrine, and the other an antagonist with extremely high affinity. 
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receptors mediate changes in fuel metabolism, as de- 
scribed in Chapter 23, including the increased break- 
down of glycogen and fat. Adrenergic receptors of the 
Pi and /3 2 subtypes act through the same mechanism, so 
in our discussion, "/3-adrenergic" applies to both types. 

The jS-adrenergic receptor is an integral protein 
with seven hydrophobic regions of 20 to 28 amino acid 
residues that "snake" back and forth across the plasma 
membrane seven times (thus the alternative names for 
GPCRs: serpentine receptors or heptahelical re- 
ceptors). The binding of epinephrine to a site on the re- 
ceptor deep within the plasma membrane (Fig. 12-4a, 
step (1)) promotes a conformational change in the re- 
ceptor's intracellular domain that affects its interaction 
with the second protein in the signal-transduction path- 
way, stimulatory G protein, or G s , on the cytoplasmic 
side of the membrane, Alfred G. Gilman and Martin Rod- 



bell discovered that active G s stimulates the production 
of cAMP (cyclic AMP) by adenylyl cyclase (see below) 
in the plasma membrane. G s , the prototype for a family 
of G proteins that act in biosignaling (see Box 12-2), is 
heterotrimeric, with subunit structure a/3y. When the 
nucleotide-binding site of G s (on the a subunit) is occu- 
pied by GTP, G s is turned on and can activate adenylyl 
cyclase (AC in Fig. 12-4a); with GDP bound to the site, 
G s is switched off. The activated /3-adrenergic receptor 
interacts with G s , catalyzing replacement of bound GDP 
with GTP and converting G s to its active form (step (2)). 
As this occurs, the (3 and y subunits of G s dissociate 
from the a subunit as a jSy dimer, and G sa , with its 
bound GTP, moves in the plane of the membrane from 
the receptor to a nearby molecule of adenylyl cyclase 
(step (3)). G sa is held to the membrane by a covalently 
attached palmitoyl group (see Fig. 11-14). 
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FIGURE 12-4 Transduction of the epinephrine signal: the ^-adrenergic 
pathway, (a) The mechanism that couples binding of epinephrine (E) to its 
receptor (Rec) with activation of adenylyl cyclase (AC); the seven steps 
are discussed further in the text. The same adenylyl cyclase molecule in 
the plasma membrane may be regulated by a stimulatory G protein (G s ), 



as shown/ or an inhibitory G protein (Gj, not shown). G s and Gj are un- 
der the influence of different hormones. Hormones that induce GTP 
binding to Gj cause inhibition of adenylyl cyclase/ resulting in lower cel- 
lular [cAMP]. (b) The combined action of the enzymes that catalyze 
steps (5) and©, forming then inactivating the second messenger/ cAMP. 
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BOX 12- 



MEDICINE 



G Proteins: Binary Switches in Health and Disease 



Alfred G. Gilman and Martin Rodbell (Fig. 1) discovered 
the critical roles of guanosine nucleotide-binding proteins 
(G proteins) in a wide variety of cellular processes, in- 
cluding sensory perception, signaling for cell division, 
growth and differentiation, intracellular movements of 
proteins and membrane vesicles, and protein synthesis. 
The human genome encodes nearly 200 of these pro- 
teins, which differ in size and subunit structure, intra- 
cellular location, and function. But all G proteins share a 
common feature: they can become activated and then, 
after a brief period, can inactivate themselves, thereby 
serving as molecular binary switches with built-in 
timers. This superfamily of proteins includes the 
trimeric G proteins involved in adrenergic signaling (G s 
and Gj) and vision (transducin); small G proteins such 
as that involved in insulin signaling (Ras) and others 
that function in vesicle trafficking (ARF and Rab) , trans- 
port into and out of the nucleus (Ran; see Fig. 27-42), 
and timing of the cell cycle (Rho); and several proteins 
involved in protein synthesis (initiation factor IF2 and 
elongation factors EF-Tu and EF-G; see Chapter 26). 
Many G proteins have covalently bound lipids, which 
give them an affinity for membranes and dictate their lo- 
cations in the cell. 

All G proteins have the same core structure and 
use the same mechanism for switching between an 
inactive conformation, favored when GDP is bound, 
and an active conformation, favored when GTP is 
bound. We can use the Ras protein (-20 kDa), a mini- 
mal signaling unit, as a prototype for all members of 
this superfamily (Fig. 2). 

In the GTP-bound conformation, the G protein ex- 
poses previously buried regions (called switch I and 
switch II) that interact with proteins downstream in the 
signaling pathway, until the G protein inactivates itself by 
hydrolyzing its bound GTP to GDP. The critical determi- 
nant of G-protein conformation is the y phosphate of 
GTP, which interacts with a region called the P loop 




Switch I 



Switch IT 




FIGURE 2 The Ras protein, the prototype for all G proteins (PDB ID 
5P21). Mg 2+ -GTP (ball-and-stick structure) is held by critical residues 
in the phosphate-binding P loop (blue), and by Thr 35 in the switch I 
(red) and Gly 60 in the switch II (green) regions. Ala 146 gives specificity 
for GTP over ATP. 

(phosphate-binding; Fig. 3). In Ras, GTP phosphates 
bind to a Lys residue in the P loop and to two critical 
residues, Thr 35 in switch I and Gly 60 in switch II, that 
hydrogen-bond with the oxygens of the y phosphate of 
GTP These hydrogen bonds act like a pair of springs 

{continued on next page) 
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FIGURE 1 Alfred G. Gilman (left) and Martin Rodbell (1925-1998; 
right). Their Nobel lectures on the discovery and exploration of G pro- 
teins are available at www.nobelprize.org. 



FIGURE 3 When bound GTP is hydrolyzed by the GTPase activities of 
Ras and its GAP, loss of hydrogen bonds to Thr 35 and Gly 60 allows the 
switch I and switch II regions to relax into a conformation in which they 
are no longer available to interact with downstream targets such as Raf. 
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holding the protein in its active conformation. When GTP 
is cleaved to GDP and Pj is released, these hydrogen 
bonds are lost; the protein relaxes into its inactive con- 
formation, burying the sites that interact with other part- 
ners in its active state. Ala 146 hydrogen-bonds to the 
guanine oxygen, allowing GTP, but not ATP, to bind. 

The intrinsic GTPase activity of G proteins is in- 
creased up to 10 5 -fold by GTPase activator proteins 
(GAPs), also called, in the case of heterotrimeric G pro- 
teins, regulators of G protein signaling (RGSs; 
Fig. 4). GAPs (and RGSs) thus determine how long the 
switch remains "on." They contribute a critical Arg residue 
that reaches into the G-protein GTPase active site and 
assists in catalysis. The intrinsically slow process of re- 
placing bound GDP with GTP, switching the protein on, is 
catalyzed by guanosine nucleotide-exchange factors 
(GEFs) associated with the G protein (Fig. 4). 
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FIGURE 4 Factors that regulate the activity of G proteins (green) Inactive 
G proteins (both small G proteins such as Ras and heterotrimeric G pro- 
teins such as G s ) interact with upstream GTP-GDP exchange factors, 
GEFs (red; often these are activated receptors such as rhodopsin, ^-adren- 
ergic receptors, and Sos ), and are activated by GTP binding. In this form, 
the G proteins activate downstream effector enzymes (blue; enzymes 
such as cGMP phosphodiesterase, adenylyl cyclase, and Raf), GTPase ac- 
tivator proteins (GAPs, in the case of small G proteins) and regulators of 
G protein signaling (RGSs) (yellow), by modulating the GTPase activity of 
G proteins, determine how long the G protein will remain active. 



Because G proteins play crucial roles in so many sig- 
naling processes, it is not surprising that defects in G pro- 
teins lead to a variety of diseases. In about 25% of all 
human cancers (and in a much higher proportion of certain 
types of cancer), there is a mutation in a Ras protein — 
typically in one of the critical residues around the GTP- 
binding site or in the P loop — that virtually eliminates its 
GTPase activity. Once activated by GTP binding, this Ras 
protein remains constitutively active, promoting cell divi- 
sion in cells that should not divide. The tumor suppressor 
gene NF1 encodes a GAP that enhances the GTPase ac- 
tivity of normal Ras. Mutations in NF1 that result in a 
nonfunctioning GAP leave Ras with only its intrinsic 
GTPase activity; once activated by GTP binding, Ras 
stays active, continuing to send the signal: divide. 

Defective heterotrimeric G proteins can also lead to 
disease. Mutations in the gene that encodes the a sub- 
unit of G s (which mediates changes in [cAMP] in re- 
sponse to hormonal stimuli) may result in a G a that is 
permanently active or permanently inactive. "Activat- 
ing" mutations generally occur in residues crucial to 
GTPase activity; they lead to a continuously elevated 
[cAMP], with significant downstream consequences, in- 
cluding undesirable cell proliferation. For example, such 
mutations are found in about 40% of pituitary tumors 
(adenomas). Individuals with "inactivating" mutations 
in G a are unresponsive to hormones (such as thyroid 
hormone) that act through cAMP. Mutation in the gene 
for the transducin a subunit (T a ), which is involved in 
visual signaling, leads to a type of night blindness, ap- 
parently due to defective interaction between the acti- 
vated T a subunit and the phosphodiesterase of the rod 
outer segment (see Fig. 12-38). A sequence variation in 
the gene encoding the /3 subunit of a heterotrimeric G 
protein is commonly found in individuals with hyperten- 
sion (high blood pressure), and this variant gene is sus- 
pected of involvement in obesity and atherosclerosis. 

The pathogenic bacteria that cause cholera and per- 
tussis (whooping cough) produce toxins that target G 
proteins, interfering with normal signaling in host cells. 
Cholera toxin, secreted by Vibrio cholerae in the in- 
testine of an infected person, is a heterodimeric protein. 
Subunit B recognizes and binds to specific gangliosides 
on the surface of intestinal epithelial cells and provides 
a route for subunit A to enter these cells. After entry, 
subunit A is broken into two pieces: the Al fragment 



Adenylyl cyclase is an integral protein of the 
plasma membrane, with its active site on the cytoplas- 
mic face. The association of active G sa with adenylyl cy- 
clase stimulates the cyclase to catalyze cAMP synthesis 
from ATP (Fig. 12-4a, step ®; see also Fig. 12-lb), rais- 
ing the cytosolic [cAMP]. The interaction between G sa 



and adenylyl cyclase is possible only when the "switch" 
regions of G sa are exposed by a GTP-induced conforma- 
tional change. 

The stimulation by G sa is self-limiting; G sa is a GTP- 
ase that turns itself off by converting its bound GTP 
to GDP (Fig. 12-5). The now inactive G sa dissociates 
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FIGURE 5 The bacterial toxins that cause cholera and whooping cough 
(pertussis) are enzymes that catalyze transfer of the ADP-ribose moiety 
of NAD + to an Arg residue of G s (in the case of cholera toxin, as shown 
here) or a Cys residue of Gj (pertussis toxin). The G proteins thus 



ADP-ribose 

modified fail to respond to normal hormonal stimuli. The pathology of 
both diseases results from defective regulation of adenylyl cyclase and 
overproduction of cAMR 



and the A2 fragment. Al then associates with the ADP- 
ribosylation factor ARF6, a small G protein in host cells, 
through residues in its switch I and switch II regions — 
which are accessible only when ARF6 is in its active 
(GTP-bound) form. This association with ARF6 activates 
Al, which catalyzes the transfer of ADP-ribose from 
NAD + to the critical Arg residue in the P loop of the a 
subunit of G s (Fig. 5). ADP-ribosylation blocks the GTP- 
ase activity of G s and thereby renders G s permanently 
active. This results in continuous activation of the adeny- 
lyl cyclase of intestinal epithelial cells, chronically high 
[cAMP], and chronically active PKA. PKA phosphorylates 
the CFTR CI" channel (see Box 11-3) and a Na + -H + ex- 
changer in the intestinal epithelial cells. The resultant 
efflux of NaCl triggers massive water loss through the in- 
testine as cells respond to the ensuing osmotic imbalance. 
Severe dehydration and electrolyte loss are the major 



pathologies in cholera. These can be fatal in the absence 
of prompt rehydration therapy. 

The pertussis toxin, produced by Bordetella per- 
tussis, catalyzes ADP-ribosylation of the a subunit of G h 
in this case preventing GDP-GTP exchange and blocking 
inhibition of adenylyl cyclase by G t . The bacterium infects 
the respiratory tract, where it destroys the ciliated ep- 
ithelial cells that normally sweep away mucus. Without 
this ciliary action, vigorous coughing is needed to clear 
the tract; this is the gasping cough that gives the disease 
its name (and spreads the bacterium to others). How the 
defect in G-protein signaling kills ciliated epithelial cells is 
not yet clear. 

Given the large number of G protein-coupled recep- 
tors in the human genome, it seems likely that future 
studies will reveal many more examples of how defec- 
tive G-protein signaling affects human health. 



from adenylyl cyclase, rendering the cyclase inactive. 
G sa reassociates with the (3y dimer (G s/3y ), and in- 
active G s is again available to interact with a hormone- 
bound receptor. A variety of G proteins act as binary 
switches in signaling systems with GPCRs and in 
many processes that involve membrane fusion or 



fission (Box 12-2). Q Trimeric G Proteins: Molecular On/Off 
Switches 

Epinephrine exerts its downstream effects through 
the increase in [cAMP] that results from the activation of 
adenylyl cyclase. Cyclic AMP, in turn, allosterically acti- 
vates cAMP-dependent protein kinase, also called 
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FIGURE 12-5 Self-inactivation of G s . The steps are further described in 
the text. The protein's intrinsic GTPase activity, in many cases stimu- 
lated by RGS proteins (regulators of G-protein signaling), determines 
how quickly bound GTP is hydrolyzed to GDP and thus how long the 
G protein remains active. 



protein kinase A or PKA (Fig. 12-4a, step (5)), which 
catalyzes the phosphorylation of other proteins, includ- 
ing glycogen phosphorylase b kinase. This enzyme is ac- 
tive when phosphorylated and can begin the process of 
mobilizing glycogen stores in muscle and liver in antici- 
pation of the need for energy. 

The inactive form of PKA contains two identical cat- 
alytic subunits (C) and two identical regulatory subunits 
(R) (Fig. 12-6a). The tetrameric R 2 C 2 complex is catalyt- 
ically inactive, because an autoinhibitory domain of each R 
subunit occupies the substrate-binding cleft of each C sub- 
unit. When cAMP binds to the R subunits, they undergo a 
conformational change that moves the autoinhibitory do- 
main of R out of the catalytic domain of C, and the R 2 C 2 
complex dissociates to yield two free, catalytically active C 
subunits. This same basic mechanism — displacement of an 
autoinhibitory domain — mediates the allosteric activation 
of many types of protein kinases by their second messen- 
gers (as in Figs 12-14 and 12-22, for example). The 
structure of the substrate-binding cleft in PKA is the proto- 
type for all known protein kinases (Fig. 12-6b); certain 
residues in this cleft region have identical counterparts in 
all of the more than 1,000 known protein kinases. 

As indicated in Figure 12^a (step ©), PKA regu- 
lates several enzymes downstream in the signaling path- 
way (Table 12-2). Although these downstream targets 
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FIGURE 12-6 Activation of cAMP-dependent protein kinase (PKA). 

(a) When [cAMP] is low, the two identical regulatory subunits (R; red) 
associate with the two identical catalytic subunits (C; blue). In this R 2 C 2 
complex, the inhibitor sequences of the R subunits lie in the substrate- 
binding cleft of the C subunits and prevent binding of protein substrates; 
the complex is therefore catalytically inactive. The amino-terminal 
sequences of the R subunits interact to form an R 2 dimer, the site of bind- 
ing to an A kinase anchoring protein (AKAP; green), described later in 
the text. When [cAMP] rises in response to a hormonal signal, each R 
subunit binds two cAMP molecules and undergoes a dramatic reorgan- 



ization that pulls its inhibitory sequence away from the C subunit, open- 
ing up the substrate-binding cleft and releasing each C subunit in its cat- 
alytically active form, (b) A crystal structure showing part of the R 2 C 2 
complex (PDB ID 1 U7E)— one C subunit (blue) and part of the R subunit 
(two shades of red). The amino-terminal dimerization region of the R 
subunit is omitted for simplicity. The small lobe of C contains the ATP- 
binding site, and the large lobe surrounds and defines the cleft where the 
protein substrate binds and undergoes phosphorylation at a Ser orThr 
residue. In this inactive form, the inhibitor sequence of R (bright red) 
blocks the substrate-binding cleft of C, inactivating it. 
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TABLE 12-2 



Enzyme/protein 



Some Enzymes and Other Proteins Regulated by cAMP-Dependent Phosphorylation {by PKA) 

Sequence phosphorylated* Pathway/process regulated 



Glycogen synthase 

Phosphorylase b kinase 
a subunit 
p subunit 

Pyruvate kinase (rat liver) 

Pyruvate dehydrogenase complex (type L) 

Hormone-sensitive lipase 

Phosphofructokinase-2/fructose 2,6-bisphosphatase 
Tyrosine hydroxylase 



Histone HI 
Histone H2B 

Cardiac phospholamban (cardiac pump regulator) 
Protein phosphatase-1 inhibitor- 1 
PKA consensus sequence 1 ^ 



RASCTSSS 

VEFRRL I 
RTKRSGSV 

GVLRRASVAZL 

GYLRRASV 

PMRRSV 

LQRRRGSSIPQ 
FIGRRQSL 



AKRKASGPPVS 

KKAKASRKESYSVYVYK 

AIRRAST 

IRRRRPTP 

xR[RK]x[ST]B 



Glycogen synthesis 

Glycogen breakdown 
Glycolysis 

Pyruvate to acetyl-CoA 
Triacylglycerol mobilization 
and fatty acid oxidation 

Glycolysis/gluconeogenesis 

Synthesis of L-dopa, 
dopamine, norepinephrine, 
and epinephrine 

DNA condensation 

DNA condensation 

Intracellular [Ca 2+ ] 

Protein dephosphorylation 

Many 



*The phosphorylated S orT residue is shown in red. Ail residues are given as their one-letter 
*x is any amino acid; B is any hydrophobic amino acid. See Box 3-3 for conventions used in 

have diverse functions, they share a region of sequence 
similarity around the Ser or Thr residue that undergoes 
phosphorylation, a sequence that marks them for regu- 
lation by PKA. The substrate-binding cleft of PKA rec- 
ognizes these sequences and phosphorylates their Thr 
or Ser residue. Comparison of the sequences of various 
protein substrates for PKA has yielded the consensus 
sequence — the neighboring residues needed to mark a 
Ser or Thr residue for phosphorylation (see Table 12-2). 

Signal transduction by adenylyl cyclase entails sev- 
eral steps that amplify the original hormone signal (Fig. 
12-7). First, the binding of one hormone molecule to 
one receptor molecule catalytically activates several G s 
molecules. Next, by activating a molecule of adenylyl cy- 
clase, each active G sa molecule stimulates the catalytic 
synthesis of many molecules of cAMP. The second mes- 
senger cAMP now activates PKA, each molecule of which 
catalyzes the phosphorylation of many molecules of the 
target protein — phosphorylase b kinase in Figure 12-7. 
This kinase activates glycogen phosphorylase b, which 
leads to the rapid mobilization of glucose from glycogen. 
The net effect of the cascade is amplification of the 
hormonal signal by several orders of magnitude, which 

FIGURE 12-7 Epinephrine cascade. Epinephrine triggers a series of 
reactions in hepatocytes in which catalysts activate catalysts, resulting 
in great amplification of the signal. Binding of a small number of mol- 
ecules of epinephrine to specific /3-adrenergic receptors on the cell 
surface activates adenylyl cyclase. The numbers of molecules shown 
are simply to illustrate amplification and are probably gross under- 
estimates, (Because two molecules of cAMP are required to activate 
one PKA catalytic subunit, this step does not amplify the signal.) 
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accounts for the very low concentration of epinephrine 
(or any other hormone) required for hormone activity. 

Several Mechanisms Cause Termination of 
the ^-Adrenergic Response 

To be useful, a signal-transducing system has to turn off 
after the hormonal or other stimulus has ended, and 
mechanisms for shutting off the signal are intrinsic to all 
signaling systems. Most systems also adapt to the con- 
tinued presence of the signal by becoming less sensitive 
to it, in the process of desensitization. The /3-adrenergic 
system illustrates both. When the concentration of epi- 
nephrine in the blood drops below the K d for its recep- 
tor, the hormone dissociates from the receptor and the 
latter resumes the inactive conformation, in which it can 
no longer activate G s . 

A second means of ending the response to f3- 
adrenergic stimulation is the hydrolysis of GTP bound 
to the G a subunit, catalyzed by the intrinsic GTPase ac- 
tivity of the G protein. Conversion of bound GTP to GDP 
favors the return of G a to the conformation in which it 
binds the G /3y subunits— the conformation in which the 
G protein is unable to interact with or stimulate adeny- 
lyl cyclase. This ends the production of cAMP. The rate 
of inactivation of G s depends on the GTPase activity, 
which for G a alone is very feeble. However, GTPase acti- 
vator proteins (GAPs) strongly stimulate this GTPase 
activity, causing more rapid inactivation of the G protein 
(see Box 12-2). GAPs can themselves be regulated by 



other factors, providing a fine-tuning of the response to 
^-adrenergic stimulation. A third mechanism for termi- 
nating the response is to remove the second messenger: 
hydrolysis of cAMP to 5 '-AMP (not active as a second 
messenger) by cyclic nucleotide phosphodiesterase 
(Fig. 12-4a, step (7); 12-4b). 

Finally, at the end of the signaling pathway, the 
metabolic effects that result from enzyme phosphoryla- 
tion are reversed by the action of phosphoprotein phos- 
phatases, which hydrolyze phosphorylated Ser, Thr, or 
TVr residues, releasing inorganic phosphate (P^. About 
150 genes in the human genome encode phosphoprotein 
phosphatases, fewer than the number encoding protein 
kinases (—500). Some of these phosphatases are known to 
be regulated; others may act constitutively. When [cAMP] 
drops and PKA returns to its inactive form (step (7) in 
Fig. 12-4a), the balance between phosphorylation and 
dephosphorylation is tipped toward dephosphorylation 
by these phosphatases. 

The ^-Adrenergic Receptor Is Desensitized by 
Phosphorylation and by Association with Arrestin 

The mechanisms for signal termination described above 
take effect when the stimulus ends. A different mecha- 
nism, desensitization, damps the response even while 
the signal persists. Desensitization of the /3-adrenergic 
receptor is mediated by a protein kinase that phosphory- 
lates the receptor on the intracellular domain that nor- 
mally interacts with G s (Fig. 12-8). When the receptor is 
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FIGURE 12-8 Desensitization of the /^-adrenergic receptor in the 
continued presence of epinephrine. This process is mediated by two 
proteins: /3-adrenergic protein kinase (/3ARK) and /3-arrestin (jSarr; also 
known as arrestin 2). 
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occupied by epinephrine, /S-adrenergic receptor ki- 
nase, or PARK (also commonly called GRK2; see 
below), phosphorylates several Ser residues near the 
carboxyl terminus of the receptor, which is on the cyto- 
plasmic side of the plasma membrane. Normally located 
in the cytosol, /3ARK is drawn to the plasma membrane 
by its association with the G s/3y subunits and is thus posi- 
tioned to phosphorylate the receptor. Receptor phos- 
phorylation creates a binding site for the protein 
j3-arrestin, or /?arr (also called arrestin 2), and binding 
of /3-arrestin effectively prevents further interaction 
between the receptor and the G protein. The binding of 
j3-arrestin also facilitates receptor sequestration, the re- 
moval of receptor molecules from the plasma membrane 
by endocytosis into small intracellular vesicles. Recep- 
tors in the endocytic vesicles are eventually dephospho- 
rylated and returned to the plasma membrane, 
completing the circuit and resensitizing the system to 
epinephrine. ^-Adrenergic receptor kinase is a member 
of a family of G protein-coupled receptor kinases 
(GRKs), all of which phosphorylate GPCRs on their car- 
boxyl-terminal cytoplasmic domains and play roles simi- 
lar to that of /3ARK in desensitization and resensitization 
of their receptors. At least five different GRKs and four 
different arrestins are encoded in the human genome; 
each GRK is capable of desensitizing a particular subset 
of GPCRs, and each arrestin can interact with many dif- 
ferent types of phosphorylated receptors. 

Cyclic AMP Acts as a Second Messenger 
for Many Regulatory Molecules 

Epinephrine is just one of many hormones, growth fac- 
tors, and other regulatory molecules that act by chang- 
ing the intracellular [cAMP] and thus the activity of PKA 
(Table 12-3). For example, glucagon binds to its recep- 
tors in the plasma membrane of adipocytes, activating 
(via a G s protein) adenylyl cyclase. PKA, stimulated by 
the resulting rise in [cAMP], phosphorylates and acti- 
vates two proteins critical to the mobilization of the fatty 
acids of stored fats (see Fig. 17-3). Similarly, the pep- 
tide hormone ACTH (adrenocorticotropic hormone, also 
called corticotropin) , produced by the anterior pituitary, 
binds to specific receptors in the adrenal cortex, activat- 
ing adenylyl cyclase and raising the intracellular [cAMP]. 
PKA then phosphorylates and activates several of the 
enzymes required for the synthesis of Cortisol and other 
steroid hormones. In many cell types, the catalytic sub- 
unit of PKA can also move into the nucleus, where it 
phosphorylates the cAMP response element binding 
protein (CREB), which alters the expression of spe- 
cific genes regulated by cAMP 

Some hormones act by inhibiting adenylyl cyclase, 
thus lowering [cAMP] and suppressing protein phos- 
phorylation. For example, the binding of somatostatin to 
its receptor leads to activation of an inhibitory G pro- 
tein, or Gi, structurally homologous to G s , that inhibits 
adenylyl cyclase and lowers [cAMP]. Somatostatin 
therefore counterbalances the effects of glucagon. In 



TABLE 12-3 



Some Signals That Use cAMP as 
Second Messenger 




Corticotropin (ACTH) 
Corticotropin-releasing hormone (CRH) 
Dopamine [D^ D 2 ] 
Epinephrine (^-adrenergic) 
Follicle-stimulating hormone (FSH) 
Glucagon 
Histamine [H 2 ] 
Luteinizing hormone (LH) 
Melanocyte-stimulating hormone (MSH) 
Odorants (many) 
Parathyroid hormone 
Prostaglandins E lt E 2 (PGE X , PGE 2 ) 
Serotonin [5-HT-la, 5-HT-2] 
Somatostatin 
Tastants (sweet, bitter) 
Thyroid-stimulating hormone (TSH) 

Note: Receptor subtypes in square brackets. Subtypes may have different transduction 
mechanisms. For example, serotonin is detected in some tissues by receptor subtypes 
5-HT-la and 5-HT-lb, which act through adenylyl cyclase and cAMR and in other 
tissues by receptor subtype 5-HT-lc r acting through the phospholipase C-IP 3 
mechanism (see Table 12-4). 

adipose tissue, prostaglandin E x (PGE^ see Fig. 10-18) 
inhibits adenylyl cyclase, thus lowering [cAMP] and 
slowing the mobilization of lipid reserves triggered by 
epinephrine and glucagon. In certain other tissues PGEi 
stimulates cAMP synthesis: its receptors are coupled to 
adenylyl cyclase through a stimulatory G protein, G s . In 
tissues with a 2 -adrenergic receptors, epinephrine low- 
ers [cAMP]; in this case, the receptors are coupled to 
adenylyl cyclase through an inhibitory G protein, G { . In 
short, an extracellular signal such as epinephrine or 
PGEi can have quite different effects on different tis- 
sues or cell types, depending on three factors: the type 
of receptor in the tissue, the type of G protein (G s or GO 
with which the receptor is coupled, and the set of PKA tar- 
get enzymes in the cells. By summing the influences that 
tend to increase and decrease [cAMP], a cell achieves the 
integration of signals that we noted as a general feature of 
signal-transducing mechanisms (Fig. 12-ld). 

A fourth factor that explains how so many types of 
signals can be mediated by a single second messenger 
(cAMP) is the confinement of the signaling process to 
a specific region of the cell by adaptor proteins— 
noncatalytic proteins that hold together other protein 
molecules that function in concert (further described 
below). AKAPs (A kinase anchoring proteins) are 
multivalent adaptor proteins; one part binds to the R 
subunits of PKA (see Fig. 12-6a) and another to a spe- 
cific structure in the cell, confining the PKA to the vicin- 
ity of that structure. For example, specific AKAPs bind 
PKA to microtubules, actin filaments, ion channels, 
mitochondria, or the nucleus. Different types of cells 
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FIGURE 12-9 Nucleation of supramolecular complexes by A kinase 
anchoring proteins (AKAPs). Several types of AKAPs (green) act as 
multivalent scaffolds, holding PKA catalytic subunits (blue), through 
the AKAP's interaction with the PKA regulatory subunits (red), in prox- 
imity to a particular region or organelle in the cell. AKAP79, at the cy- 
toplasmic surface of the plasma membrane, binds both PKA and 
adenylyl cyclase (AC). The cAMP produced by AC reaches the nearby 
PKA quickly and with very little dilution. AKAP79 can also bind (not 
shown here) PKA, PKAs target protein (an ion channel), and phospho- 
protein phosphatase, which removes phosphate from the target protein. 
AKAP250, also known as gravin, holds PKA to the plasma membrane 
while also binding cAMP phosphodiesterase (PDE), which terminates 
the PKA signal by converting cAMP to AMP. In both examples, the 
AKAP brings about a high local concentration of enzymes and second 
messengers, so that the signaling circuit remains highly localized. 

have different complements of AKAPs, so cAMP might 
stimulate phosphorylation of mitochondrial proteins in 
one cell and phosphorylation of actin filaments in an- 
other. In some cases, an AKAP connects PKA with the 
enzyme that triggers PKA activation (adenylyl cyclase) 
or terminates PKA action (cAMP phosphodiesterase or 
phosphoprotein phosphatase) (Fig. 12-9). The very 
close proximity of these activating and inactivating en- 
zymes presumably achieves a highly localized, and very 
brief, response. We will see later that some membrane- 
bound signaling proteins (including adenylyl cyclase) 
are localized to specific areas of the membrane in rafts 
or caveolae (see Section 12.5). 



As is now clear, to fully understand cellular signaling 
researchers need tools precise enough to detect and study 
the spatiotemporal aspects of signaling processes at the 
subcellular level and in real time. In studies of the intracel- 
lular localization of biochemical changes, biochemistry 
meets cell biology, and techniques that cross this boundary 
have become essential in understanding signaling path- 
ways. Fluorescent probes have found wide application in 
signaling studies. Labeling of functional proteins with a flu- 
orescent tag such as the green fluorescent protein (GFP) 
reveals their subcellular localizations (see Fig. 9-15a). 
Changes in the state of association of two proteins (such as 
the R and C subunits of PKA) can be seen by measuring 
the nonradiative transfer of energy between fluorescent 
probes attached to each protein, a technique called fluo- 
rescence resonance energy transfer (FRET; Box 12-3). 

Diacylglyceroi, Inositol Trisphosphate, and Ca 2+ 
Have Related Roles as Second Messengers 

A second broad class of GPCRs are coupled through a 
G protein to a plasma membrane phospholipase C 
(PLC) that is specific for the membrane phospholipid 
phosphatidylinositol 4,5-bisphosphate, or PIP 2 (see Fig. 
10-16). When one of the hormones that acts by this 
mechanism (Table 12-4) binds its specific receptor in the 
plasma membrane (Fig. 12-10, step ®), the receptor- 
hormone complex catalyzes GTP-GDP exchange on an 
associated G protein, G q (step (2)), activating it much 
like the jS-adrenergic receptor activates G s (Fig. 12-4). 
The activated G q in turn activates the PIP 2 -specific PLC 
(Fig. 12-10, step ©), which catalyzes (step ®) the 
production of two potent second messengers, diacyl- 
glyceroi and inositol 1,4,5-trisphosphate, or IP 3 
(not to be confused with PIP 3; p. 441). 




Inositol 1,4,5-trisphosphate (IP 3 ) 



Some Signals That Act through Phospholipase t, 1P 3 , and Ca 2+ 



Acetylcholine [muscarinic M x ] 

a r Adrenergic agonists 

Angiogenin 

Angiotensin II 

ATP [P 2x , P 2y ] 

Auxin 



Gastrin-releasing peptide 
Glutamate 

Gonadotropin-releasing hormone (GRH) 
Histamine [HJ 
Light (Drosophila) 
Oxytocin 



Platelet-derived growth factor (PDGF) 
Serotonin [5-HT-lc] 
Thyrotropin-releasing hormone (TRH) 
Vasopressin 



Note: Receptor subtypes are in square brackets; see footnote to Table 12-3. 
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FIGURE 12-10 Hormone-activated phospholipase C and IP 3 . Two in- 
tracellular second messengers are produced in the hormone-sensitive 
phosphatidylinositol system: inositol 1,4,5-trisphosphate (IP 3 ) and 
diacylglycerol. Both contribute to the activation of protein kinase C. By 
raising cytosolic [Ca 2+ ], IP 3 also activates other Ca 2+ -dependent 
enzymes; thus Ca 2+ also acts as a second messenger. 



Inositol triphosphate, a water-soluble compound, 
diffuses from the plasma membrane to the endoplasmic 
reticulum (ER), where it binds to specific IP 3 -gated 
Ca 2+ channels, causing them to open. The action of the 
SERCA pump (see Fig. 11-36) ensures that [Ca 2+ ] in 
the ER is orders of magnitude higher than that in the 
cytosol, so when these gated Ca 2+ channels open, Ca 2+ 
rushes into the cytosol (Fig. 12-10, step ©), and the 
cytosolic [Ca 2+ ] rises sharply to about 1CT 6 m. One ef- 
fect of elevated [Ca 2+ ] is the activation of protein ki- 
nase C (PKC). Diacylglycerol cooperates with Ca 2+ in 
activating PKC, thus also acting as a second messenger 
(step (©)■ Activation involves the movement of a PKC 



domain (the pseudosubstrate domain) away from its lo- 
cation in the substrate-binding region of the enzyme, al- 
lowing the enzyme to bind and phosphorylate proteins 
that contain a PKC consensus sequence— Ser or Thr 
residues embedded in an amino acid sequence recog- 
nized by PKC (step ®). There are several isozymes of 
PKC, each with a characteristic tissue distribution, tar- 
get protein specificity, and role. Their targets include 
cytoskeletal proteins, enzymes, and nuclear proteins 
that regulate gene expression. Taken together, this 
family of enzymes has a wide range of cellular actions, 
affecting neuronal and immune function and the regula- 
tion of cell division, for example. 




BOX 12-3 



METHODS 



I FRET: Biochemistry Visualized in a Living Ce 



Fluorescent probes are commonly used to detect rapid 
biochemical changes in single living cells. They can be 
designed to give an essentially instantaneous report 
(within nanoseconds) on the changes in intracellular 
concentration of a second messenger or in the activity of 
a protein kinase. Furthermore, fluorescence microscopy 
has sufficient resolution to reveal where in the cell such 
changes are occurring. In one widely used procedure, 
the fluorescent probes are derived from a naturally oc- 
curring fluorescent protein, the green fluorescent pro- 
tein (GFP) of the jellyfish Aequorea victoria (Fig. 1). 

When excited by absorption of a photon of light, 
GFP emits a photon (that is, it fluoresces) in the green 
region of the spectrum. GFP is an 11-stranded p barrel, 
and the light-absorbing/emitting center of the protein 
(its chromophore) comprises a modified (oxidized) 
form of the tripeptide -Ser 65 -Tvr 66 -Gly 67 - located 
within the barrel (Fig. 2). Oxidation of the tripeptide is 
catalyzed by the GFP protein itself (Fig. 3), with no 
other protein or cofactors required (other than molecu- 
lar oxygen), so it is possible to clone the protein into vir- 
tually any cell, where it can serve as a fluorescent 
marker for itself or for any protein to which it is fused (see 
Fig. 9-1 5a). Variants of GFP, with different fluorescence 
spectra, are produced by genetic engineering of its gene. 
For example, in the yellow fluorescent protein (YFP), 
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FIGURE 2 Green fluorescent protein (GFP), with the fluorescent chro- 
mophore shown in ball-and-stick form (derived from PDB ID 1GFL). 

Ala 206 in GFP is replaced by a Lys residue, changing the 
wavelength of light absorption and fluorescence. Other 
variants of GFP fluoresce blue (BFP) or cyan (CFP) 
light, and a related protein (mRFPl) fluoresces red light 
(Fig. 4). GFP and its variants are compact structures 
that retain their ability to fold into their native /3-barrel 
conformation even when fused with another protein. In- 
vestigators are using these fluorescent hybrid proteins 
as spectroscopic rulers to measure distances between 
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FIGURE 1 Aequorea victoria, a jellyfish abundant in 
Puget Sound, Washington State, 



FIGURE 3 The chromophore in GFP is derived from a series of three amino acids: 
-Ser 65 -Tyr 6b -Gly 67 -. Maturation of the chromophore involves an internal rearrangement, 
coupled to an oxidation reaction that takes place in multiple steps. An abbreviated 
mechanism is shown here. 
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FIGURE 4 Emission spectra of GFP variants. 

interacting components within a cell and, indirectly, to 
measure local concentrations of compounds that change 
the distance between two proteins. 

An excited fluorescent molecule such as GFP or 
YFP can dispose of the energy from the absorbed pho- 
ton in either of two ways: (1) by fluorescence, emitting 
a photon of slightly longer wavelength (lower energy) 
than the exciting light, or (2) by nonradiative fluores- 
cence resonance energy transfer (FRET), in which 
the energy of the excited molecule (the donor) passes 
directly to a nearby molecule (the acceptor) without 
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FIGURE 5 When the donor protein (CFP) is excited with monochro- 
matic light of wavelength 433 nm, it emits fluorescent light at 476 nm 
(left). When the (red) protein fused with CFP interacts with the (purple) 
protein fused with YFP, that interaction brings CFP and YFP close 
enough to allow fluorescence resonance energy transfer (FRET) be- 
tween them. Now, when CFP absorbs light of 433 nm, instead of fluo- 
rescing at 476 nm, it transfers energy directly to YFP, which then 
fluoresces at its characteristic emission wavelength, 527 nm. The ratio 
of light emission at 527 and 476 nm is therefore a measure of the in- 
teraction of the red and purple proteins. 



emission of a photon, exciting the acceptor (Fig. 5). 
The acceptor can now decay to its ground state by fluo- 
rescence; the emitted photon has a longer wavelength 
(lower energy) than both the original exciting light and 
the fluorescence emission of the donor. This second 
mode of decay (FRET) is possible only when donor and 
acceptor are close to each other (within 1 to 50 A); the 
efficiency of FRET is inversely proportional to the sixth 
power of the distance between donor and acceptor. 
Thus very small changes in the distance between donor 
and acceptor register as very large changes in FRET, 
measured as the fluorescence of the acceptor molecule 
when the donor is excited. With sufficiently sensitive 
light detectors, this fluorescence signal can be located 
to specific regions of a single, living cell. 

FRET has been used to measure [cAMP] in living 
cells. The gene for GFP is fused with that for the regula- 
tory subunit (R) of cAMP-dependent protein kinase 
(PKA), and the gene for BFP is fused with that for the 
catalytic subunit (C) (Fig. 6). When these two hybrid 
proteins are expressed in a cell, BFP (donor; excitation 
at 380 nm, emission at 460 nm) and GFP (acceptor; ex- 
citation at 475 nm, emission at 545 nm) in the inactive 

(continued on next page) 
cAMP-dependent protein kinase (PKA) 
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FIGURE 6 Measuring [cAMP] with FRET. Gene fusion creates hybrid 
proteins that exhibit FRET when the PKA regulatory (R) and catalytic 
(C) subunits are associated (low [cAMP]). When [cAMP] rises, the 
subunits dissociate and FRET ceases. The ratio of emission at 460 nm 
(dissociated) and 545 nm (complexed) thus offers a sensitive measure 
of [cAMP]. 
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FRET: Biochemistry Visualized in a Living Cell 
(continued from previous page) 



PKA (R 2 C 2 tetramer) are close enough to undergo FRET. 
Wherever in the cell [cAMP] increases, the R 2 C 2 complex 
dissociates into R 2 and 2C and the FRET signal is lost, be- 
cause donor and acceptor are now too far apart for effi- 
cient FRET. Viewed in the fluorescence microscope, the 
region of higher [cAMP] has a minimal GFP signal and 
higher BFP signal. Measuring the ratio of emission at 460 
nm and 545 nm gives a sensitive measure of the change in 
[cAMP]. By determining this ratio for all regions of the 
cell, the investigator can generate a false color image of 
the cell in which the ratio, or relative [cAMP], is repre- 
sented by the intensity of the color. Images recorded at 
timed intervals reveal changes in [cAMP] over time. 

A variation of this technology has been used to 
measure the activity of PKA in a living cell (Fig. 7). Re- 
searchers create a phosphorylation target for PKA by 
producing a hybrid protein containing four elements: 
YFP (acceptor); a short peptide with a Ser residue sur- 
rounded by the consensus sequence for PKA; a (P)-Ser- 
binding domain (called 14-3-3); and CFP (donor). When 
the Ser residue is not phosphorylated, 14-3-3 has no 
affinity for the Ser residue and the hybrid protein exists 
in an extended form, with the donor and acceptor too 
far apart to generate a FRET signal. Wherever PKA is 
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FIGURE 7 Measuring the activity of PKA with FRET. An engineered pro- 
tein links YFP and CFP via a peptide that contains a Ser residue sur- 
rounded by the consensus sequence for phosphorylation by PKA, and 
the 1 4-3-3 ®-Ser-binding domain. Active PKA phosphorylates the Ser 
residue, which docks with the 14-3-3 binding domain, bringing the 
fluorescence proteins close enough to allow FRET to occur, revealing 
the presence of active PKA. 

active in the cell, it phosphorylates the Ser residue of 
the hybrid protein, and 14-3-3 binds to the (P)-Ser. In 
doing so, it draws YFP and CFP together and a FRET 
signal is detected with the fluorescence microscope, re- 
vealing the presence of active PKA. 



I The action of a group of compounds known as tu- 
I mor promoters is attributable to their effects on 
PKC. The best understood of these are the phorbol es- 
ters, synthetic compounds that are potent activators of 
PKC. They apparently mimic cellular diacylglycerol as 
second messengers, but unlike the naturally occurring 
diacylglycerols they are not rapidly removed by metabo- 
lism. By continuously activating PKC, these synthetic 
tumor promoters interfere with the normal regulation of 
cell growth and division (as discussed in Section 12.12), 
thus promoting the formation of tumors. ■ 
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Calcium Is a Second Messenger That May 
Be Localized in Space and Time 

There are many variations on this basic scheme for Ca 2+ 
signaling. In many cell types that respond to extracellu- 
lar signals, Ca 2+ serves as a second messenger that 
triggers intracellular responses, such as exocytosis in 
neurons and endocrine cells, contraction in muscle, and 
cytoskeletal rearrangements during amoeboid move- 
ment. In unstimulated cells, cytosolic [Ca 2+ ] is kept 
very low (<10" 7 m) by the action of Ca 2+ pumps in the 
ER, mitochondria, and plasma membrane (as further 
discussed below). Hormonal, neural, or other stimuli 
cause either an influx of Ca 2+ into the cell through spe- 
cific Ca 2+ channels in the plasma membrane or the re- 
lease of sequestered Ca 2+ from the ER or mitochondria, 
in either case raising the cytosolic [Ca 2+ ] and triggering 
a cellular response. 

Changes in intracellular [Ca 2+ ] are detected by 
Ca 2+ -binding proteins that regulate a variety of Ca 2+ - 
dependent enzymes. Calmodulin (CaM;M" r 17,000) is 
an acidic protein with four high-affinity Ca 2+ -binding 
sites. When intracellular [Ca 2+ ] rises to about 10^ 6 m 
(1 jam), the binding of Ca 2+ to calmodulin drives a 
conformational change in the protein (Fig. 12-1 la). 
Calmodulin associates with a variety of proteins and, 
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FIGURE 12-11 Calmodulin. This is the protein mediator of many 
Ca 2+ -stimulated enzymatic reactions. Calmodulin has four high- 
affinity Ca 2+ -binding sites {K d ~ 0.1 to 1 jlam). (a) A ribbon model of the 
crystal structure of calmodulin (PDB ID 1CLL). The four Ca 2+ -binding 
sites are occupied by Ca 2+ (purple). The amino-terminal domain is on 
the left; the carboxyl-terminal domain on the right, (b) Calmodulin 
associated with a helical domain (red) of one of the many enzymes it 
regulates, calmodul in-dependent protein kinase II (PDB ID 1CDL). 
Notice that the long central a helix of calmodulin visible in (a) has 
bent back on itself in binding to the helical substrate domain. The 
central helix of calmodulin is clearly more flexible in solution than in 
the crystal, (c) Each of the four Ca 2+ -binding sites occurs in a helix- 
loop-helix motif called the EF hand, also found in many other Ca 2+ - 
binding proteins, 

in its Ca 2+ -bound state, modulates their activities 
(Fig. 12-1 lb) . It is a member of a family of Ca 2+ -binding 
proteins that also includes troponin (p. 178), which 
triggers skeletal muscle contraction in response to 
increased [Ca 2+ ]. This family shares a characteristic 
Ca 2+ -binding structure, the EF hand (Fig. 12-1 lc). 

Calmodulin is an integral subunit of the Ca 2+ / 
calmodulin-dependent protein kinases (CaM ki- 
nases, types I through IV). When intracellular [Ca 2+ ] 
increases in response to a stimulus, calmodulin binds 
Ca 2+ , undergoes a change in conformation, and acti- 
vates the CaM kinase. The kinase then phosphorylates 
target enzymes, regulating their activities. Calmodulin is 
also a regulatory subunit of phosphorylase b kinase of 
muscle, which is activated by Ca 2+ . Thus Ca 2+ triggers 
ATP-requiring muscle contractions while also activating 
glycogen breakdown, providing fuel for ATP synthesis. 
Many other enzymes are also known to be modulated 
by Ca 2+ through calmodulin (Table 12-5). The activity 
of the second messenger Ca 2+ , like that of cAMP, can 
be spatially restricted; after its release triggers a local 



TABLE 12-5 



Some Proteins Regulated by Ca 2+ 
and Calmodulin 

Adenylyl cyclase (brain) 

Ca 2+ /calmodulin-dependent protein kinases (CaM 

kinases I to IV) 
Ca 2+ -dependent Na + channel (Paramecium) 
Ca 2+ -release channel of sarcoplasmic reticulum 
Calcineurin (phosphoprotein phosphatase 2B) 
cAMP phosphodiesterase 
cAMP-gated olfactory channel 
cGMP-gated Na + , Ca 2+ channels (rod and cone cells) 
Glutamate decarboxylase 
Myosin light chain kinases 
NAD + kinase 
Nitric oxide synthase 
Phosphoinositide 3-kinase 
Plasma membrane Ca 2+ ATPase (Ca 2+ pump) 
RNA helicase (p68) 



response, Ca 2+ is generally removed before it can dif- 
fuse to distant parts of the cell. 

Very commonly, Ca 2+ level does not simply rise and 
then decrease, but rather oscillates with a period of a 
few seconds (Fig. 12-12)— even when the extracellu- 
lar concentration of the triggering hormone remains 
constant. The mechanism underlying [Ca 2+ ] oscillations 
presumably entails feedback regulation by Ca 2+ on 
some part of the Ca 2+ -release process. Whatever the 
mechanism, the effect is that one kind of signal (hor- 
mone concentration, for example) is converted into an- 
other (frequency and amplitude of intracellular [Ca 2+ ] 
"spikes"). Another variation is the occurrence of local- 
ized Ca 2+ "blips," "puffs," and "waves"— transient in- 
creases in [Ca 2+ ] that are limited to specific subcellular 
regions (Fig. 12-13), The Ca 2+ signal diminishes as 
r;r + diffuses away from the initial source (the Ca i_h 
channel), is sequestered in the ER, or is pumped out of 
the cell. 

There is significant cross-talk between the Ca 2+ and 
cAMP signaling systems. In some tissues, both the en- 
zyme that produces cAMP (adenylyl cyclase) and the 
enzyme that degrades cAMP (phosphodiesterase) are 
stimulated by Ca 2+ . Temporal and spatial changes in 
[Ca 2+ ] can therefore produce transient, localized 
changes in [cAMP]. We have noted already that PKA, the 
enzyme that responds to cAMP, is often part of a highly 
localized supramolecular complex assembled on scaf- 
fold proteins such as AKAPs. This subcellular localiza- 
tion of target enzymes, combined with temporal and 
spatial gradients in [Ca 2+ ] and [cAMP], allow a cell to re- 
spond to one or several signals with subtly nuanced 
metabolic changes, localized in space and time. 
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FIGURE 12-12 Triggering of oscillations in intracellular [Ca 2+ ] by 
extracellular signals, (a) A dye (fura) that undergoes fluorescence 
changes when it binds Ca 2+ is allowed to diffuse into cells, and its 
instantaneous light output is measured by fluorescence microscopy. 
Fluorescence intensity is represented by color; the color scale re- 
lates intensity of color to [Ca 2 + ], allowing determination of the ab- 
solute [Ca 2+ ]. In this case, thymocytes (cells of the thymus) have 
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been stimulated with extracellular ATP, which raises their internal 
[Ca 2+ ]. The cells are heterogeneous in their responses; some have 
high intracellular fCa 2+ ] (red), others much lower (blue), (b) When 
such a probe is used in a single hepatocyte, the agonist norepineph- 
rine (added at the arrow) causes oscillations of [Ca 2+ ] from 200 to 
500 nM. Similar oscillations are induced in other cell types by other 
extracellular signals 
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FIGURE 12-13 Transient and highly localized increases in [Ca 2+ ], 

(a) The IP 3 -gated Ca 2 + channels of the endoplasmic reticulum occur 
in clusters, each capable of independently responding to the IP 3 sig- 
nal. A relatively weak stimulus that produces a small rise in [IP 3 ] 
may cause a single channel to open briefly, resulting in a highly 
localized and transient "blip" in [Ca 2+ j. (b) A somewhat stronger 
stimulus that generates a larger increase in [IP 3 ] may cause all the 



• 



m m m& m # m 

0 ^ 



Ca 2+ channels in a cluster to open, producing a "puff of Ca 2+ in 
which the increase in [Ca 2+ ], its duration, and the area (volume) af- 
fected are larger than in a blip, (c) A sufficiently large puff produces 
elevated [Ca 2+ ] over an area great enough to include neighboring 
clusters of Ca 2+ channels. Opening of the channels in neighboring 
clusters propagates this effect, and the result is a wave of elevated 
[Ca 2+ ] moving along the ER. 
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SUMMARY 12.2 G Protein-Coupled 

Receptors and Second 
Messengers 

■ G protein-coupled receptors (GPCRs) act through 
heterotrimeric G proteins. On ligand binding, GPCRs 
catalyze the exchange of GTP for GDP on the 

G protein, causing dissociation of the G a subunit; 
G a then stimulates or inhibits the activity of an 
effector enzyme, changing the level of its second- 
messenger product. 

■ The /3-adrenergic receptor activates a stimulatory 
G protein, G s , thereby activating adenylyl cyclase 
and raising the concentration of the second 
messenger cAMP. Cyclic AMP stimulates 
cAMP-dependent protein kinase to phosphorylate 
key target enzymes, changing their activities. 

■ Enzyme cascades, in which a single molecule of 
hormone activates a catalyst to activate another 
catalyst, and so on, result in the large signal 
amplification that is characteristic of hormone 
receptor systems. 

■ Cyclic AMP concentration is eventually reduced by 
cAMP phosphodiesterase, and G s turns itself off by 
hydrolysis of its bound GTP to GDP, acting as a 
self -limiting binary switch. 

■ When the epinephrine signal persists, /3-adrenergic 
receptor-specific protein kinase and /3-arrestin 
temporarily desensitize the receptor and cause it to 
move into intracellular vesicles. 

■ Some receptors stimulate adenylyl cyclase through 
G s ; others inhibit it through Gi. Thus cellular [cAMP] 
reflects the integrated input of two (or more) signals. 

■ Noncatalytic adaptor proteins such as AKAPs hold 
together proteins involved in a signaling process, 
increasing the efficiency of their interactions and in 
some cases confining the process to a specific 
subcellular location. 

■ Some GPCRs act via a plasma membrane 
phospholipase C that cleaves PIP 2 to diacylglycerol 
and IP 3 . By opening Ca 2+ channels in the 
endoplasmic reticulum, IP 3 raises cytosolic [Ca 2+ ]. 
Diacylglycerol and Ca 2+ act together to activate 
protein kinase C, which phosphorylates and 
changes the activity of specific cellular proteins. 
Cellular [Ca 2+ ] also regulates (often through 
calmodulin) many other enzymes and proteins 
involved in secretion, cytoskeleton rearrangements, 
or contraction. 

123 Receptor Tyrosine Kinases 

The receptor tyrosine kinases (RTKs), a large family 
of plasma membrane receptors with intrinsic protein ki- 
nase activity, transduce extracellular signals by a mecha- 
nism fundamentally different from that of GPCRs. RTKs 



have a ligand-binding domain on the extracellular face of 
the plasma membrane and an enzyme active site on the 
cytoplasmic face, connected by a single transmembrane 
segment. The cytoplasmic domain is a protein kinase 
that phosphorylates Tyr residues in specific target 
proteins — a Tyr kinase. The receptors for insulin and 
epidermal growth factor are prototypes for this group. 

Stimulation of the Insulin Receptor Initiates a Cascade 
of Protein Phosphorylation Reactions 

Insulin regulates both metabolic enzymes and gene ex- 
pression. Insulin does not enter cells, but initiates a sig- 
nal that travels a branched pathway from the plasma 
membrane receptor to insulin-sensitive enzymes in the 
cytosol and to the nucleus, where it stimulates the tran- 
scription of specific genes. The active insulin receptor 
protein (INS-R) consists of two identical a subunits pro- 
truding from the outer face of the plasma membrane 
and two transmembrane (3 subunits with their carboxyl 
termini protruding into the cytosol (Fig. 12-14). The a 
subunits contain the insulin-binding domain, and the in- 
tracellular domains of the (5 subunits contain the protein 
kinase activity that transfers a phosphoryl group from 
ATP to the hydroxyl group of Tvr residues in specific 
target proteins. Signaling through INS-R begins when 
the binding of insulin activates the Tyr kinase activity, 
and each /3 subunit phosphorylates three critical Tyr 
residues near the carboxyl terminus of the other /3 sub- 
unit in the (a/3) 2 dimer. This autophosphorylation 
opens up the active site so that the enzyme can phos- 
phorylate Tyr residues of other target proteins. The 
mechanism of activation of the INS-R protein kinase is 
similar to that described for PKA and PKC: a region of 
the cytoplasmic domain (an autoinhibitory sequence) 
that normally occludes the active site moves out of the 
active site after being phosphorylated, opening up the 
site for the binding of target proteins (Fig. 12-14). 

One of the target proteins of INS-R (Fig. 12-15, 
step (D) is insulin receptor substrate- 1 (IRS-1; step (2)). 
Once phosphorylated on several of its Tyr residues, IRS-1 
becomes the point of nucleation for a complex of proteins 
(step (3)) that carry the message from the insulin recep- 
tor to end targets in the cytosol and nucleus, through a 
long series of intermediate proteins. First, a (PKTyr 
residue of IRS-1 binds to the SH2 domain of the protein 
Grb2. (SH2 is an abbreviation of Src homology 2, so 
named because the sequence of an SH2 domain is simi- 
lar to that of a domain in Src (pronounced sark), an- 
other protein Tyr kinase.) Several signaling proteins 
contain SH2 domains, all of which bind (P>-Tyr residues 
in a protein partner. Grb2 is an adaptor protein, with no 
intrinsic enzymatic activity. Its function is to bring to- 
gether two proteins (in this case, IRS-1 and the protein 
Sos) that must interact to enable signal transduction. In 
addition to its SH2 ((g^Tyr-binding) domain, Grb2 also 
contains a second protein-binding domain, SH3, that 
binds to a proline-rich region of Sos, recruiting Sos to 
the growing receptor complex. When bound to Grb2, 
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FIGURE 12-14 Activation of the insulin-receptor tyrosine kinase by 
autophosphorylation. (a) The insulin-binding region of the insulin re- 
ceptor lies outside the cell and comprises (b) two a subunits and the 
extracellular portions of two j3 subunits, intertwined to form the insulin 
binding site (pink; shown as a surface contour model of the crystal 
structure, derived from PDB ID2DTG). (The structure of the transmem- 
brane domain has not been solved by crystallography.) The binding of 
insulin (red; PDB ID 2CEU) is communicated through the single trans- 
membrane helix of each (3 subunit to the paired Tyr kinase domains in- 
side the cell, activating them to phosphorylate each other on three Tyr 
residues, (c) In the inactive form of the Tyr kinase domain (PDB ID 
1 1RK), the activation loop (blue) sits in the active site, and none of the 
critical Tyr residues (black and red ball-and-stick structures) are phos- 
phorylated. This conformation is stabilized by hydrogen bonding be- 
tween Tyr 1 162 and Asp 1132 , (d) Activation of the Tyr kinase allows each 
(3 subunit to phosphorylate three Tyr residues (Tyr 1158 , Tyr 1162 , Tyr 1163 ) 
on the other (3 subunit, shown here (PDB ID 1 1R3). (Phosphoryl groups 
are depicted as an orange space-filling phosphorus atom and red ball- 
and-stick oxygen atoms.) The introduction of three highly charged 
®-Tyr residues forces a 30 A change in the position of the activation 
loop, away from the substrate-binding site, which becomes available 
to bind and phosphorylate a target protein, shown here as a red arrow. 
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Sos acts as a guanosine nucleotide-exchange factor 
(GEF), catalyzing the replacement of bound GDP with 
GTP on Ras, a G protein. 

Ras is the prototype of a family of small G proteins 
that mediate a wide variety of signal transductions (see 
Box 12-2). Like the trimeric G protein that functions 
with the /3-adrenergic system (Fig. 12-5), Ras can exist 
in either the GTP-bound (active) or GDP-bound (inac- 
tive) conformation, but Ras (-20 kDa) acts as a 
monomer. When GTP binds, Ras can activate a protein 
kinase, Raf-1 (Fig. 12-15, step ©), the first of three 
protein kinases— Raf-1, MEK, and ERK— that form a 
cascade in which each kinase activates the next by 
phosphorylation (step ©). The protein kinases MEK 
and ERK are activated by phosphorylation of both a 
Thr and a Tyr residue. When activated, ERK mediates 
some of the biological effects of insulin by entering the 



nucleus and phosphorylating transcription factors, such 
as Elkl (step ©), that modulate the transcription of 
about 100 insulin-regulated genes (step ®). 

The proteins Raf-1, MEK, and ERK are members of 
three larger families, for which several nomenclatures 
are employed. ERK is in the MAPK family (mitogen- 
activated protein kinases; mitogens are extracellular sig- 
nals that induce mitosis and cell division). Soon after 
discovery of the first MAPK enzyme, that enzyme was 
found to be activated by another protein kinase, which 
was named MAP kinase kinase (MEK belongs to this fam- 
ily); and when a third kinase that activated MAP kinase 
kinase was discovered, it was given the slightly ludicrous 
family name MAP kinase kinase kinase (Raf-1 is in this 
family; Fig. 12-15, step (4)). Somewhat less cumbersome 
are the abbreviations for these three families: MAPK, 
MAPKK, and MAPKKK. Kinases in the MAPK and 
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® 

Insulin receptor binds 
insulin and undergoes 
autophosphorylation on its 
carboxyl-terminal Tyr residues. 



Insulin receptor 
phosphorylates IRS-1 
on its Tyr residues. 

® 




SH2 domain of Grb2 binds 
to ®-Tyr of IRS-1. Sos binds 
to Grb2, then to Ras, 
causing GDP release and 
GTP binding to Ras. 

® 

Activated Ras binds and 
activates Raf-1. 



Raf-1 phosphorylates 
MEK on two Ser residues, 
activating it. MEK 
phosphorylates ERK 
on a Thr and a Tyr residue, 
activating it. 



FIGURE 1 2-1 5 Regulation of gene expression 
by insulin through a MAP kinase cascade. The 

insulin receptor (INS-R) consists of two a sub- 
units on the outer face of the plasma mem- 
brane and two p subunits that traverse the 
membrane and protrude from the cytosolic 
face. Binding of insulin to the a subunits trig- 
gers a conformational change that allows the 
autophosphorylation of Tyr residues in the 
carboxyl-terminal domain of the jS subunits. 
Autophosphorylation further activates the Tyr 
kinase domain, which then catalyzes phos- 
phorylation of other target proteins. The signal- 
ing pathway by which insulin regulates the 
expression of specific genes consists of a 
cascade of protein kinases, each of which ac- 
tivates the next. INS-R is a Tyr-specific ki- 
nase; the other kinases (all shown in blue) 
phosphorylate Ser or Thr residues. MEK is a 
dual-specificity kinase, which phosphorylates 
both a Thr and a Tyr residue in ERK (extracellu- 
lar regulated kinase); MEK is mitogen-activated, 
ERK-activating kinase; SRF is serum response 
factor. 



ERK moves into 
the nucleus and 
phosphorylates 
nuclear transcription 
factors such as Elkl, 
activating them. 



DNA 
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Phosphorylated Elkl 
joins SRF to stimulate 
the transcription and 
translation of a set of 
genes needed for 
cell division. 



MAPKKK families are specific for Ser or Thr residues, 
and MAPKKs (here, MEK) phosphorylate both a Ser and 
a Tyr residue in their substrate, a MAPK (here, ERK). 

Biochemists now recognize the insulin pathway as 
but one instance of a more general scheme in which 
hormone signals, via pathways similar to that shown in 
Figure 12-15, result in phosphorylation of target en- 
zymes by protein kinases. The target of phosphorylation 
is often another protein kinase, which then phosphory- 
lates a third protein kinase, and so on. The result is a 
cascade of reactions that amplifies the initial signal by 
many orders of magnitude (see Fig. 12-lb). MAPK cas- 
cades (Fig. 12-15) mediate signaling initiated by a vari- 
ety of growth factors, such as platelet-derived growth 
factor (PDGF) and epidermal growth factor (EGF). An- 
other general scheme exemplified by the insulin receptor 
pathway is the use of nonenzymatic adaptor proteins to 
bring together the components of a branched signaling 
pathway, to which we now turn. 



The Membrane Phospholipid PIP 3 Functions 
at a Branch in insulin Signaling 

The signaling pathway from insulin branches at IRS-1 
(Fig. 12-15, step (2)). Grb2 is not the only protein that 
associates with phosphorylated IRS-1. The enzyme 
phosphoinositide 3-kinase (PI-3K) binds IRS-1 through 
PI-3K's SH2 domain (Fig. 12-16). Thus activated, PI- 
3K converts the membrane lipid phosphatidylinositol 
4,5-bisphosphate (PIP 2 ; see Fig. 10-16) to phos- 
phatidylinositol 3,4,5-trisphosphate (PIP 3 ). The multi- 
ply charged head group of PIP 3 , protruding on the 
cytoplasmic side of the plasma membrane, is the start- 
ing point for a second signaling branch involving another 
cascade of protein kinases. When bound to PIP 3 , protein 
kinase B (PKB; also called Akt) is phosphorylated and 
activated by yet another protein kinase, PDK1. The acti- 
vated PKB then phosphorylates Ser or Thr residues in 
its target proteins, one of which is glycogen synthase ki- 
nase 3 (GSK3). In its active, nonphosphorylated form, 
GSK3 phosphorylates glycogen synthase, inactivating it 
and thereby contributing to the slowing of glycogen syn- 
thesis. (This mechanism is only part of the explanation 
for the effects of insulin on glycogen metabolism.) When 
phosphorylated by PKB, GSK3 is inactivated. By thus 
preventing inactivation of glycogen synthase in liver 
and muscle, the cascade of protein phosphorylations 
initiated by insulin stimulates glycogen synthesis (Fig. 
12-16). In a third signaling branch in muscle and fat 
tissue, PKB triggers the movement of glucose trans- 
porters (GLUT4) from internal vesicles to the plasma 
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FIGURE 12-16 Activation of 
glycogen synthase by insulin. 

Transmission of the signal is 
mediated by PI-3 kinase (PI-3K) 
and protein kinase B (PKB). 
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GSK3, inactivated by 
phosphorylation, cannot 
convert glycogen synthase 
(GS) to its inactive form 
by phosphorylation, so 
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IRS-1, phosphorylated 
by the insulin receptor, 
activates PI-3K by binding 
to its SH2 domain. PI-3K 
converts PIP 2 to PIP 3 . 



© 

PKB bound to PIP 3 
is phosphorylated by 
PDK1 (not shown). 
Thus activated, PKB 
phosphorylates GSK3 
on a Ser residue, 
inactivating it. 
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PKB stimulates movement 
of glucose transporter GLUT4 
from internal membrane vesicles 
to the plasma membrane, 
increasing the uptake of glucose. 



membrane, stimulating glucose uptake from the blood 
(Fig. 12-16, step ©; see also Box 11-2). 

I Protein kinase B functions in several other signal- 
HH ing pathways, including that triggered by A 9 - 
tetrahydrocannabinol (THC), the active ingredient of 
marijuana and hashish. THC activates the CB! receptor in 
plasma membranes in the brain, triggering a signaling cas- 
cade that involves MAPKs. One consequence of CBx acti- 
vation is the stimulation of appetite, a well-established 
effect of marijuana use. The normal ligands for the CBx re- 
ceptor are endocannabinoids such as anandamide, which 
serve to protect the brain from the toxicity of excessive 
neuronal activity— as in an epileptic seizure. Hashish has 
for centuries been used in the treatment of epilepsy 
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Anandamide (arachidonylethanolamide, 
an endogenous cannabinoid) 



As in all signaling pathways, there is a mechanism 
for terminating the activity of the PI-3K-PKB pathway. 
A PIP 3 -specific phosphatase (PTEN in humans) re- 
moves the phosphoryl group at the 3 position of PIP 3 to 
produce PIP 2 , which no longer serves as a binding site 
for PKB, and the signaling chain is broken. In various 
types of cancer, it is often found that the PTEN gene has 
undergone mutation, resulting in a defective regulatory 
circuit and abnormally high levels of PIP 3 and of PKB ac- 
tivity. The result seems to be a continuing signal for cell 
division and thus tumor growth. ■ 

In addition to the many receptors that act as protein 
Tyr kinases (the RTKs), several receptorlike plasma 
membrane proteins have protein Tyr phosphatase activ- 
ity. Based on the structures of these proteins, we can 
surmise that their ligands are components of the extra- 
cellular matrix or are surface molecules on other cells. 
Although their signaling roles are not yet as well under- 
stood as those of the RTKs, they clearly have the poten- 
tial to reverse the actions of signals that stimulate RTKs. 

The insulin receptor is the prototype for several re- 
ceptor enzymes with a similar structure and RTK activ- 
ity (Fig. 12-17). The receptors for EGF and PDGF, for 
example, have structural and sequence similarities to 
INS-R, and both have a protein Tyr kinase activity that 
phosphorylates IRS-1. Many of these receptors dimerize 
after binding ligand; INS-R is the exception, as it is 
already an (afi) 2 dimer before insulin binds. (The pro- 
tomer of the insulin receptor is one a/3 unit.) The bind- 
ing of adaptor proteins such as Grb2 to (PKTyr residues 
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FIGURE 12-17 Receptor tyrosine kinases. Growth factor re- 
ceptors that signal through Tyr kinase activity include those 
for insulin (INS-R), vascular epidermal growth factor (VEGF- 
R), platelet-derived growth factor (PDGF-R), epidermal 
growth factor (EGF-R), nerve growth factor (NGF-R), and fi- 
broblast growth factor (FGF-R). All these receptors have a Tyr 
kinase domain on the cytoplasmic side of the plasma mem- 
brane (blue). The extracellular domain is unique to each 
type of receptor, reflecting the different growth-factor speci- 
ficities. These extracellular domains are typically combina- 
tions of structural motifs such as cysteine- or ieucine-rich 
segments and segments containing one of several motifs 
common to immunoglobulins (Ig-like domains; see p. 137). 
Many other receptors of this type are encoded in the human 
genome, each with a different extracellular domain. 



is a common mechanism for promoting protein-protein 
interactions initiated by RTKs, a subject to which we 
return in Section 12.5. 

What spurred the evolution of such complicated 
regulatory machinery? This system allows one activated 
receptor to activate several IRS-1 molecules, amplifying 
the insulin signal, and it provides for the integration of 
signals from different receptors such as EGF-R and 
PDGF-R, each of which can phosphorylate IRS-1. Fur- 
thermore, because IRS-1 can activate any of several pro- 
teins that contain SH2 domains, a single receptor acting 
through IRS-1 can trigger two or more signaling 
pathways; insulin affects gene expression through the 
Grb2-Sos-Ras-MAPK pathway and affects glycogen me- 
tabolism and glucose transport through the PI-3K-PKB 
pathway. Finally, there are several closely related IRS 
proteins (IRS-2, IRS-3), each with its own characteristic 
tissue distribution and function, further enriching the 
signaling possibilities in pathways initiated by RTKs. 

The JAK-STAT Signaling System Also Involves 
Tyrosine Kinase Activity 

A variation on the basic theme of receptor Tyr kinases is 
receptors that have no intrinsic protein kinase activity 
but, when occupied by their ligand, bind a cytosolic Tyr 
kinase. One example is the system that regulates the 
formation of erythrocytes in mammals. The develop- 
mental signal, or cytokine, for this system is erythro- 
poietin (EPO), a 165 amino acid protein produced in the 
kidneys. When EPO binds to its plasma membrane re- 
ceptor (Fig. 12-18) the receptor dimerizes, and the 
dimer can bind and activate the soluble protein kinase 
JAK (Janus /cinase). The activated JAK phosphorylates 
several Tyr residues in the cytoplasmic domain of the 
EPO receptor. A family of transcription factors, collec- 
tively called STATs (signal transducers and activators of 
transcription), are also targets of JAK. An SH2 domain 
in STAT5 binds (PKTyr residues in the EPO receptor, 
positioning the STAT for phosphorylation by JAK in 
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FIGURE 12-18 The JAK-STAT transduction mechanism for the ery- 
thropoietin receptor. Binding of erythropoietin (EPO) causes dimeriza- 
tion of the EPO receptor, which allows JAK, a soluble Tyr kinase, to 
bind to the internal domain of the receptor and phosphorylate it on 
several Tyr residues, (a) In one signaling pathway, the SH2 domain of 
the STAT protein STATS binds to ©-Tyr residues on the receptor, 
bringing it into proximity with JAK. Following phosphorylation of 
STATS by JAK, two STATS molecules dimerize, each binding the 
other's ®-Tyr residue, thus exposing a nuclear localization se- 
quence (NLS) that targets the dimer for transport into the nucleus. In 
the nucleus, STATS turns on the expression of EPO-controlled genes, 
(b) In a second signaling pathway, following EPO binding and 
autophosphorylation of JAK, the adaptor protein Grb2 binds ®-Tyr 
in JAK and triggers the MAPK cascade, as in the insulin system (see 
Fig. 12-15). 
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response to EPO. The phosphorylated STATS forms 
dimers, exposing a signal that causes it to be trans- 
ported into the nucleus. There, STATS induces the ex- 
pression (transcription) of specific genes essential for 
erythrocyte maturation. This JAK-STAT system also op- 
erates in other signaling pathways, including that for the 
hormone leptin, described in detail in Chapter 23 (see 
Fig. 23-37). Activated JAK can also trigger, through 
Grb2, the MAPK cascade (Fig. 12-18b), which leads to 
altered expression of specific genes. 

Src is another soluble protein Tyr kinase that associ- 
ates with certain receptors when they bind their ligands. 
Src was the first protein found to have the characteristic 
(P)-Tyr-binding domain that was subsequently named 
the Src homology (SH2) domain. 

Cross Talk among Signaling Systems 
Is Common and Complex 

Although, for simplicity, we have treated individual 
signaling pathways as separate sequences of events 
leading to separate metabolic consequences, there is in 
fact extensive cross talk among signaling systems. The 
regulatory circuitry that governs metabolism is richly in- 
terwoven and multilayered. We have discussed the sig- 
naling pathways for insulin and epinephrine separately, 
but they do not operate independently. Insulin opposes 
the metabolic effects of epinephrine in most tissues, and 
activation of the insulin signaling pathway directly at- 
tenuates signaling through the /3-adrenergic signaling 
system. For example, the INS-R kinase directly phos- 
phorylates two Tyr residues in the cytoplasmic tail of a 
jS 2 -adrenergic receptor, and PKB, activated by insulin 
(Fig. 12-19), phosphorylates two Ser residues in the 
same region. Phosphorylation of these four residues 



triggers internalization of the /^-adrenergic receptor, 
taking it out of service and lowering the cell's sensitivity 
to epinephrine. A second type of cross talk between 
these receptors occurs when ®-Tvr residues on the /3 2 - 
adrenergic receptor, phosphorylated by INS-R, serve as 
nucleation points for SH2 domain-containing proteins 
such as Grb2 (Fig. 12-19, left side). Activation of the 
MAPK ERK by insulin (see Fig. 12-15) is 5- to 10-fold 
greater in the presence of the /^-adrenergic receptor, 
presumably because of this cross talk. Signaling systems 
that use cAMP and Ca 2+ also show extensive interac- 
tion; each second messenger affects the generation 
and concentration of the other. One of the major chal- 
lenges of systems biology is to sort out the effects of 
such interactions on the overall metabolic patterns in 
each tissue — a daunting task! 

SUMMARY 12.3 Receptor Tyrosine Kinases 

■ The insulin receptor, INS-R, is the prototype of 
receptor enzymes with Tyr kinase activity. When 
insulin binds, each a/3 unit of INS-R phosphorylates 
the p subunit of its partner, activating the receptor's 
Tyr kinase activity. The kinase catalyzes the 
phosphorylation of T^r residues on other proteins 
such as IRS-1. 

■ Phosphotyrosine residues in IRS-1 serve as binding 
sites for proteins with SH2 domains. Some of 
these proteins, such as Grb2, have two or more 
protein-binding domains and can serve as adaptors 
that bring two proteins into proximity. 

■ Sos bound to Grb2 catalyzes GDP-GTP exchange 
on Ras (a small G protein), which in turn activates 
a MAPK cascade that ends with the phosphorylation 



FIGURE 12-19 Cross talk between the insulin receptor and 
the j3 2 -adrenergic receptor (or other GPCR). When INS-R is 
activated by insulin binding, its Tyr kinase directly phosphory- 
lates the /^-adrenergic receptor (right side) on two Tyr residues 
(Tyr 350 and Tyr 364 ) near its carboxyl terminus, and indirectly 
(through activation of protein kinase B (PKB); see Fig. 12-16) 
causes phosphorylation of two Ser residues in the same region. 
The effect of these phosphorylations is internalization of the 
adrenergic receptor, reducing the response to the adrenergic 
stimulus. Alternatively (left side), INS-R-catalyzed phosphory- 
lation of a GPCR (an adrenergic or other receptor) on a carboxyl- 
terminal Tyr creates the point of nucleation for activating the 
MAPK cascade (see Fig. 12-15), with Grb2 serving as the 
adaptor protein. In this case, INS-R has used the GPCR to 
enhance its own signaling. 
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of target proteins in the cytosol and nucleus. The 
result is specific metabolic changes and altered 
gene expression. 

■ The enzyme PI-3K, activated by interaction with 
IRS-1, converts the membrane lipid PIP 2 to PIP 3 , 
which becomes the point of nucleation for proteins 
in a second and third branch of insulin signaling. 

■ In the JAK-STAT signaling system, a soluble protein 
Tyr kinase (JAK) is activated by association with a 
receptor, and then phosphorylates the transcription 
factor STAT, which enters the nucleus and alters 
the expression of a set of genes. 

■ There are extensive interconnections among 
signaling pathways, allowing integration and 
fine-tuning of multiple hormonal effects. 

12.4 Receptor Guanylyl Cyclases, cGMP, 
and Protein Kinase G 

Guanylyl cyclases (Fig. 12-20) are receptor enzymes 
that, when activated, convert GTP to the second 
messenger guanosine 3',5'-cyclic monophosphate 
(cyclic GMP, cGMP): 
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Guanosine 3 ',5 '-cyclic monophosphate 
(cGMP) 

Many of the actions of cGMP in animals are mediated by 
cGMP-dependent protein kinase, also called pro- 
tein kinase G (PKG). On activation by cGMP, PKG 
phosphorylates Ser and Thr residues in target proteins. 
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FIGURE 12-20 Two isozymes of guanylyl cyclase that participate in 
signal transduction, (a) One isozyme exists in two similar membrane- 
spanning forms that are activated by their extracellular ligands: atrial 
natriuretic factor (ANF; receptors in cells of the renal collecting ducts 
and vascular smooth muscle) and guanylin (receptors in intestinal 
epithelial cells). The guanylin receptor is also the target of a bacterial 
endotoxin that triggers severe diarrhea, (b) The other isozyme is a 
soluble heme-containing enzyme that is activated by intracellular ni- 
tric oxide (NO); this form is found in many tissues, including smooth 
muscle of the heart and blood vessels. 

The catalytic and regulatory domains of this enzyme are 
in a single polypeptide (M r -80,000). Part of the regula- 
tory domain fits snugly in the substrate-binding cleft. 
Binding of cGMP forces this pseudosubstrate out of the 
binding site, opening the site to target proteins contain- 
ing the PKG consensus sequence. 

Cyclic GMP carries different messages in different 
tissues. In the kidney and intestine it triggers changes in 
ion transport and water retention; in cardiac muscle (a 
type of smooth muscle) it signals relaxation; in the brain 
it may be involved both in development and in adult 
brain function. Guanylyl cyclase in the kidney is acti- 
vated by the peptide hormone atrial natriuretic fac- 
tor (ANF), which is released by cells in the cardiac 
atrium when the heart is stretched by increased blood 
volume. Carried in the blood to the kidney, ANF acti- 
vates guanylyl cyclase in cells of the collecting ducts 
(Fig. 12-20a). The resulting rise in [cGMP] triggers in- 
creased renal excretion of Na + and consequently of wa- 
ter, driven by the change in osmotic pressure. Water loss 
reduces the blood volume, countering the stimulus that 
initially led to ANF secretion. Vascular smooth muscle 
also has an ANF receptor-guanylyl cyclase; on binding 
to this receptor, ANF causes relaxation (vasodilation) of 
the blood vessels, which increases blood flow while 
decreasing blood pressure. 
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A similar receptor guanylyl cyclase in the plasma 
membrane of epithelial cells lining the intestine is acti- 
vated by the peptide guanylin (Fig. 12-20a), which 
regulates CI" secretion in the intestine. This receptor is 
also the target of a heat-stable peptide endotoxin pro- 
duced by Escherichia coli and other gram-negative 
bacteria. The elevation in [cGMP] caused by the 
endotoxin increases CI* secretion and consequently 
decreases reabsorption of water by the intestinal epithe- 
lium, producing diarrhea. 

A distinctly different type of guanylyl cyclase is a 
cytosolic protein with a tightly associated heme group 
(Fig. 12-20b), an enzyme activated by nitric oxide 
(NO). Nitric oxide is produced from arginine by Ca 2+ - 
dependent NO synthase, present in many mam- 
malian tissues, and diffuses from its cell of origin into 
nearby cells. 
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NO is sufficiently nonpolar to cross plasma membranes 
without a carrier. In the target cell, it binds to the heme 
group of guanylyl cyclase and activates cGMP produc- 
tion. In the heart, cGMP reduces the forcefulness of con- 
tractions by stimulating the ion pump(s) that remove 
Ca 2+ from the cytosol. 

NO-induced relaxation of cardiac muscle is the 
same response brought about by nitroglycerin 
and other nitrovasodilators taken to relieve angina pec- 
toris, the pain caused by contraction of a heart de- 
prived of 0 2 because of blocked coronary arteries. Nitric 
oxide is unstable and its action is brief; within seconds 
of its formation, it undergoes oxidation to nitrite or ni- 
trate. Nitrovasodilators produce long-lasting relaxation 
of cardiac muscle because they break down over several 
hours, yielding a steady stream of NO. The value of ni- 
troglycerin as a treatment for angina was discovered 
serendipitously in factories producing nitroglycerin as 
an explosive in the 1860s. Workers with angina reported 
that their condition was much improved during the work 
week but worsened on weekends. The physicians treat- 
ing these workers heard this story so often that they 
made the connection, and a drug was born. 

The effects of increased cGMP synthesis diminish 
after the stimulus ceases, because a specific phospho- 
diesterase (cGMP PDE) converts cGMP to the inactive 
5'-GMP. Humans have several isoforms of cGMP PDE, 
with different tissue distributions. The isoform in the 
blood vessels of the penis is inhibited by the drug silde- 
nafil (Viagra), which therefore causes [cGMP] to re- 
main elevated once raised by an appropriate stimulus, 



accounting for the usefulness of this drug in the treat- 
ment of erectile dysfunction. ■ 
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Cyclic GMP has another mode of action in the verte- 
brate eye: it causes ion-specific channels to open in the 
retinal rod and cone ceDs. We return to this role of cGMP 
in the discussion of vision in Section 12.10. 



SUMMARY 12.4 
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■ Several signals, including atrial natriuretic factor 
and guanylin, act through receptor enzymes with 
guanylyl cyclase activity. The cGMP so produced is 
a second messenger that activates cGMP-dependent 
protein kinase (PKG). This enzyme alters 
metabolism by phosphorylating specific enzyme 
targets. 

■ Nitric oxide is a short-lived messenger that 
stimulates a soluble guanylyl cyclase, raising 
[cGMP] and stimulating PKG. 

12.5 Multivalent Adaptor Proteins and 
Membrane Rafts 

Two generalizations have emerged from studies of sig- 
naling systems such as those we have discussed so far: 
(1) protein kinases that phosphorylate Tyr, Ser, and Thr 
residues are central to signaling, and (2) reversible 
protein-protein interactions brought about by the re- 
versible phosphorylation of Tyr, Ser, and Thr residues in 
signaling proteins create docking sites for other pro- 
teins. In fact, many signaling proteins are multivalent — 
they can interact with several different proteins 
simultaneously to form multiprotein signaling complexes. 
In this section we present a few examples to illustrate 
the general principles of phosphorylation-dependent 
protein interactions in signaling pathways. 

Protein Modules Bind Phosphorylated Tyr, Ser, 
or Thr Residues in Partner Proteins 

The protein Grb2 in the insulin signaling pathway (Figs 
12-15 and 12-19) binds through its SH2 domain to other 
proteins that have exposed ®-Tyr residues. The human 
genome encodes at least 87 SH2-containing proteins, 
many already known to participate in signaling. The 



12.5 Multivalent Adaptor Proteins and Membrane Rafts I 447 




FIGURE 12-21 Interaction of an SH2 domain with a(P)-Tyr residue in 
a partner protein. (PDB ID 1SHC)The SH2 domain is represented as a 
gray surface contour. The phosphorus of the phosphate group in the in- 
teracting (P)-Tyr is visible as an orange sphere; most of the Tyr residue 
is obscured in this view. The next few residues toward the carboxyl end 
of the partner protein are shown in red. The SH2 domain typically in- 
teracts with a @-Tyr (which is assigned the index position 0) and the 
next three residues toward the carboxyl terminus (designated +1, +2, 
+ 3). Some SH2 domains (Src, Fyn, Hck, Nek) favor negatively charged 
residues in the +1 and +2 positions; others (PLC-yl, SHP-2) have a 
long hydrophobic groove that binds to aliphatic residues in positions 
+ 1 to +5 These differences define subclasses of SH2 domains that 
have different partner specificities. 

©^Tyr residue is bound in a deep pocket in an SH2 do- 
main, with each of its phosphate oxygens participating 
in hydrogen-bonding or electrostatic interactions; the 
positive charges on two Arg residues figure prominently 
in the binding. Subtle differences in the structure of 
SH2 domains account for the specificities of the inter- 
actions of SH2-containing proteins with various ©-Tyr- 
containing proteins. The three to five residues on the 
carboxyl-terminal side of the ©-Tyr residue are critical 
in determining the specificity of these interactions 
(Fig. 12-21). 

Phosphotyrosine-binding domains (PTB domains) 
are another binding partner for © u Tyr proteins, but 
their critical sequences and three-dimensional structure 
distinguish them from SH2 domains . The human 
genome encodes 24 proteins that contain PTB domains, 
including IRS-1, which we have already encountered in 
its role as an adaptor protein in insulin-signal transduc- 
tion (Fig. 12-15). The ©-Tyr binding sites for SH2 
and PTB domains on partner proteins are created by 
Tyr kinases and eliminated by phosphoprotein phos- 
phatases (PTPases). 

Other signaling protein kinases, including PKA, 
PKC, PKG, and members of the MAPK cascade, phos- 
phorylate Ser or Thr residues in their target proteins, 
which in some cases acquire the ability to interact with 
partner proteins through the phosphorylated residue, 
triggering a downstream process. An alphabet soup of 
domains that bind ©^Ser or ©-Thr residues has been 
identified, and more are sure to be found. Each domain 



favors a certain sequence around the phosphorylated 
residue, so the domains represent families of highly spe- 
cific recognition sites, able to bind to a specific subset of 
phosphorylated proteins. 

In some cases, the region on a protein that binds ©^ 
Tyr of a substrate protein is masked by its interaction 
with a (F^Tyr in the same protein. For example, the 
soluble protein Tyr kinase Src, when phosphorylated on 
a critical Tyr residue, is rendered inactive; an SH2 do- 
main needed to bind to the substrate protein instead 
binds to the internal ©^Tyr. When this ©KTyr residue 
is hydrolyzed by a phosphoprotein phosphatase, the Tyr 
kinase activity of Src is activated (Fig. 12-22a). Simi- 
larly, glycogen synthase kinase 3 (GSK3) is inactive 
when phosphorylated on a Ser residue in its autoin- 
hibitory domain (Fig. 12-22b). Dephosphorylation of 
that domain frees the enzyme to bind (and then phos- 
phorylate) its target proteins. 




FIGURE 12-22 Mechanism of autoinhibition of Src and GSK3. (a) In 

the active form of the Tyr kinase Src, an SH2 domain binds a®-Jyr in 
the protein substrate, and an SH3 domain binds a proline-rich region 
of the substrate, lining up the active site of the kinase with several tar- 
get Tyr residues in the substrate (top). When Src is phosphorylated on a 
specific Tyr residue (bottom), the SH2 domain binds the internal (P>-Tyr 
instead of the ®-Tyr of the substrate, preventing productive enzyme- 
substrate binding; the enzyme is thus autoinhibited. (b) In the active 
form of glycogen synthase kinase 3 (GSK3), an internal (P>-Ser-binding 
domain is available to bind (P^Ser in its substrate (glycogen synthase), 
and thus to position the kinase to phosphorylate neighboring Ser 
residues (top). Phosphorylation of an internal Ser residue allows this in- 
ternal kinase segment to occupy the (£>-Ser-binding site, blocking sub- 
strate binding (bottom). 
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In addition to the three commonly phosphorylated 
residues in proteins, there is a fourth structure that 
nucleates the formation of supramolecular complexes of 
signaling proteins: the phosphorylated head group of 
the membrane phosphatidylinositols. Many signaling 
proteins contain domains such as SH3 and PH (plextrin 
homology domain) that bind tightly to PIP 3 protruding 
from the inner leaflet of the plasma membrane. Wher- 
ever the enzyme PI-3K creates this head group (as it 
does in response to the insulin signal), proteins that 
bind it will congregate at the membrane surface. 

Most of the proteins involved in signaling at 
the plasma membrane have one or more protein- or 
phospholipid-binding domains; many have three or 
more, and thus are multivalent in their interactions with 
other signaling proteins. Figure 12-23 shows just a few 
of the multivalent proteins known to participate in sig- 
naling. Many of the complexes include components with 
membrane-binding domains. Given the location of so 
many signaling processes at the inner surface of the 



plasma membrane, the molecules that must collide to 
produce the signaling response are effectively confined 
to two-dimensional space — the membrane surface; colli- 
sions here are far more likely than in the three-dimen- 
sional space of the cytosol. 

In summary, a remarkable picture of signaling path- 
ways has emerged from studies of many signaling pro- 
teins and their multiple binding domains. An initial 
signal results in phosphorylation of the receptor or a 
target protein, triggering the assembly of large multi- 
protein complexes, held together on scaffolds made 
from adaptor proteins with multivalent binding capaci- 
ties. Some of these complexes contain several protein 
kinases that activate each other in turn, producing a 
cascade of phosphorylation and a great amplification of 
the initial signal. The interactions between cascade ki- 
nases are not left to the vagaries of random collisions in 
three-dimensional space. In the MAPK cascade, for ex- 
ample, a family of adaptor proteins binds all three ki- 
nases (MAPK, MAPKK, and MAPKKK), assuring their 
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FIGURE 12-23 Some binding modules of signaling proteins. Each 
protein is represented by a line {with the amino terminus to the left); 
symbols indicate the location of conserved binding domains (with 
specificities as listed in the key; PH denotes plextrin homology; other 
abbreviations explained in the text); green boxes indicate catalytic 
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activities. The name of each protein is given at its carboxyl-terminal 
end. These signaling proteins interact with phosphorylated proteins 
or phospholipids in many permutations and combinations to form 
integrated signaling complexes. 
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proximity and correct orientation and even conferring 
allosteric properties on the interactions among the ki- 
nases, which makes their serial phosphorylation sensi- 
tive to very small stimuli. 

Phosphotyrosine phosphatases remove the phos- 
phate from (P^Tyr residues, reversing the effect of 
phosphorylation. Some of these are receptorlike mem- 
brane proteins, presumably controlled by extracellular 
factors not yet identified; other PTPases are soluble 
and contain SH2 domains. In addition, animal cells have 
protein (P)-Ser and (P)-Thr phosphatases, which re- 
verse the effects of Ser- and Thr-specific protein ki- 
nases. We can see, then, that signaling occurs in protein 
circuits, which are effectively hard-wired from signal 
receptor to response effector and can be switched off 
instantly by the hydrolysis of a single upstream phos- 
phate ester bond. 

The multivalency of signaling proteins allows for the 
assembly of many different combinations of signaling 
modules, each combination suited to particular signals, 
cell types, and metabolic circumstances, yielding diverse 
signaling circuits of extraordinary complexity. 

Membrane Rafts and Caveolae Segregate 
Signaling Proteins 

Membrane rafts (Chapter 11) are regions of the mem- 
brane bilayer enriched in sphingolipids, sterols, and 
certain proteins, including many attached to the bi- 
layer by GPI anchors. The jS-adrenergic receptor is 
segregated in rafts that contain G proteins, adenylyl 
cyclase, PKA, and a specific protein phosphatase, 
PP2, which together provide a highly integrated sig- 
naling unit. By segregating in a small region of the 
plasma membrane all of the elements required for re- 
sponding to and ending the signal, the cell is able to 
produce a highly localized and brief "puff" of second 
messenger. 

Some RTKs (EGF-R and PDGF-R) seem to be local- 
ized in rafts, and this sequestration is very probably 
functionally significant. When cholesterol is removed 
from rafts by treatment of the membrane with cy- 
clodextrin (which binds and removes cholesterol), the 
rafts are disrupted and the RTK signaling pathways 
become defective. 

If an RTK in a raft is phosphorylated, and the only 
locally available PTPase that reverses this phosphory- 
lation is in another raft, then dephosphorylation of the 
RTK is slowed or prevented. Interactions between 
adaptor proteins might be strong enough to recruit 
into a raft a signaling protein not normally located 
there, or might even be strong enough to pull recep- 
tors out of a raft. For example, the EGF-R in isolated 
fibroblasts is normally concentrated in specialized 
rafts called caveolae (see Fig. 11-21), but treatment 
with EGF causes the receptor to leave the raft. This 
migration depends on the receptor's protein kinase 



activity; mutant receptors lacking this activity remain 
in the raft during treatment with EGF. Caveolin, an in- 
tegral membrane protein localized in caveolae, is 
phosphorylated on Tyr residues in response to insulin, 
and the now-activated EGF-R may be able to draw its 
binding partners into the raft. Spatial segregation of 
signaling proteins in rafts adds yet another dimension 
to the already complex processes initiated by extra- 
cellular signals. 

SUMMARY 12.5 Multivalent Adaptor 

Proteins and Membrane 
Rafts 

■ Many signaling proteins have domains that bind 
phosphorylated iyr, Ser, or Thr residues in other 
proteins; the binding specificity for each domain 
is determined by sequences that adjoin the 
phosphorylated residue in the substrate. 

■ SH2 and PTB domains bind to proteins containing 
(B^TVr residues; other domains bind (F^Ser and 
©-Thr residues in various contexts. 

■ SH3 and PH domains bind the membrane 
phospholipid PIP 3 . 

■ Many signaling proteins are multivalent, with 
several different binding modules. By combining 
the substrate specificities of various protein kinases 
with the specificities of domains that bind 
phosphorylated Ser, Thr, or Tyr residues, and with 
phosphatases that can rapidly inactivate a signaling 
pathway, cells create a large number of multiprotein 
signaling complexes. 

■ Membrane rafts and caveolae sequester groups of 
signaling proteins in small regions of the plasma 
membrane, enhancing their interactions and 
making signaling more efficient. 

12.6 Gated Ion Channels 

Ion Channels Underlie Electrical Signaling in Excitable Cells 

Certain cells in multicellular organisms are "excitable": 
they can detect an external signal, convert it into an 
electrical signal (specifically, a change in membrane po- 
tential), and pass it on. Excitable cells play central roles 
in nerve conduction, muscle contraction, hormone se- 
cretion, sensory processes, and learning and memory. 
The excitability of sensory cells, neurons, and myocytes 
depends on ion channels, signal transducers that pro- 
vide a regulated path for the movement of inorganic ions 
such as Na + , K + , Ca 2+ , and CI" across the plasma mem- 
brane in response to various stimuli. Recall from Chap- 
ter 11 that these ion channels are "gated": they may be 
open or closed, depending on whether the associated re- 
ceptor has been activated by the binding of its specific 
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FIGURE 1 2-24 Transmembrane electrical potential, (a) The electro- 
genic Na + K + ATPase produces a transmembrane electrical poten- 
tial of about -60 mV (inside negative), (b) Blue arrows show the 
direction in which ions tend to move spontaneously across the 
plasma membrane in an animal cell, driven by the combination of 
chemical and electrical gradients. The chemical gradient drives 
Na + and Ca 2+ inward (producing depolarization) and K + 
outward (producing hyperpolarization). The electrical gradient 
drives Cl~ outward, against its concentration gradient (producing 
depolarization). 



ligand (a neurotransmitter, for example) or by a change 
in the transmembrane electrical potential, V m . The 
Na + K + ATPase is electrogenic; it creates a charge im- 
balance across the plasma membrane by carrying 3 Na + 
out of the cell for every 2 K + carried in (Fig. 12-24a), 
making the inside negative relative to the outside. The 
membrane is said to be polarized, 
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KEY CONVENTION: V m is negative when the inside of the 
cell is negative relative to the outside. For a typical 
animal cell, V m = -50 to -70 mV. ■ 

Because ion channels generally allow passage of ei- 
ther anions or cations but not both, ion flux through a 
channel causes a redistribution of charge on the two 
sides of the membrane, changing V m . Influx of a posi- 
tively charged ion such as Na + , or efflux of a negatively 
charged ion such as Cr, depolarizes the membrane and 
brings V m closer to zero. Conversely, efflux of K + hyper- 
polarizes the membrane and V m becomes more nega- 
tive. These ion fluxes through channels are passive, in 
contrast to active transport by the Na + K + ATPase. 

The direction of spontaneous ion flow across a 
polarized membrane is dictated by the electrochemical 
potential of that ion across the membrane, which has 
two components: the difference in concentration (C) of 
the ion on the two sides of the membrane, and the dif- 
ference in electrical potential, typically expressed in 
millivolts. The force (AG) that causes a cation (say, 
Na + ) to pass spontaneously inward through an ion 
channel is a function of the ratio of its concentrations on 
the two sides of the membrane (C in /C out ) and of the dif- 
ference in electrical potential (V m or At/0: 

AG = RT In (C in /C out ) + Z JV m (12-1) 

where R is the gas constant, T the absolute temperature, 
Z the charge on the ion, and J the Faraday constant. 
(Note that the sign of the charge on the ion determines 
the sign of the second term in Eqn 12-1 .) In a typical neu- 
ron or myocyte, the concentrations of Na + , K + , Ca 2+ , and 
CI" in the cytosol are very different from those in the ex- 
tracellular fluid (Table 12-6). Given these concentration 
differences, the resting V m of about -60 mV, and the re- 
lationship shown in Equation 12-1, the opening of a Na + 
or Ca 2+ channel will result in a spontaneous inward flow 
of Na + or Ca 2+ (and depolarization), whereas opening 




of a K + channel will result in a spontaneous outward 
flux of K + (and hyperpolarization) (Fig. 12-24b). 

A given ionic species continues to flow through a 
channel only as long as the combination of concentra- 
tion gradient and electrical potential provides a driving 
force. For example, as Na + flows down its concentration 
gradient, it depolarizes the membrane. When the mem- 
brane potential reaches^- 70 mV, the effect of this mem- 
brane potential (resistance to further entry of Na + ) 
exactly equals the effect of the [Na + ] gradient (promotion 
of Na + flow inward). At this equilibrium potential (£*), 
the driving force (AG) tending to move a Na + ion is zero. 
The equilibrium potential is different for each ionic 
species because the concentration gradients differ. 

The number of ions that must flow to produce a 
physiologically significant change in the membrane po- 
tential is negligible relative to the concentrations of Na + , 
K + , and Cl~ in cells and extracellular fluid, so the ion 
fluxes that occur during signaling in excitable cells have 
essentially no effect on the concentrations of these ions. 
With Ca 2+ , the situation is different; because the intra- 
cellular [Ca 2+ ] is generally very low (~ 10^ 7 m), inward 
flow of Ca 2+ can significantly alter the cytosolic [Ca 2+ ]. 

The membrane potential of a cell at a given time is 
the result of the types and numbers of ion channels open 
at that instant. In most cells at rest, more K + channels 
than Na + , CI", or Ca 2+ channels are open and thus the 
resting potential is closer to the E for K + (-98 mV) than 
that for any other ion. When channels for Na + , Ca 2+ , or 
Cl~ open, the membrane potential moves toward the 
E for that ion. The precisely timed opening and closing 
of ion channels and the resulting transient changes in 
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Ion Concentrations in Ceils and Extracellular Fluids (m 
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membrane potential underlie the electrical signaling by 
which the nervous system stimulates the skeletal mus- 
cles to contract, the heart to beat, or secretory cells to 
release their contents. Moreover, many hormones exert 
their effects by altering the membrane potential of their 
target cells. These mechanisms are not limited to ani- 
mals; ion channels play important roles in the responses 
of bacteria, protists, and plants to environmental signals. 

To illustrate the action of ion channels in cell-to-cell 
signaling, we describe the mechanisms by which a neu- 
ron passes a signal along its length and across a synapse 
to the next neuron (or to a myocyte) in a cellular circuit, 
using acetylcholine as the neurotransmitter. 

Voltage-Gated Ion Channels Produce 
Neuronal Action Potentials 

Signaling in the nervous system is accomplished by net- 
works of neurons, specialized cells that carry an electri- 
cal impulse (action potential) from one end of the cell 
(the cell body) through an elongated cytoplasmic exten- 
sion (the axon). The electrical signal triggers release of 
neurotransmitter molecules at the synapse, carrying the 
signal to the next cell in the circuit. Three types of 
voltage-gated ion channels are essential to this sig- 
naling mechanism. Along the entire length of the axon 
are voltage-gated Na + channels (Fig. 12-25), which 
are closed when the membrane is at rest (V m = -60 mV) 
but open briefly when the membrane is depolarized 

FIGURE 12-25 Role of voltage-gated and ligand-gated ion channels in 
neural transmission. Initially, the plasma membrane of the presynaptic 
neuron is polarized (inside negative) through the action of the electro- 
genic Na + K + ATPase, which pumps out 3 Na + for every 2 K + pumped 
in (see Fig. 12-24). © A stimulus to this neuron (not shown) causes an 
action potential to move along the axon (white arrow), away from the 
cell body. The opening of a voltage-gated Na + channel allows Na + en- 
try, and the resulting local depolarization causes the adjacent Na + 
channel to open, and so on. The directionality of movement of the ac- 
tion potential is ensured by the brief refractory period that follows the 
opening of each voltage-gated Na + channel. ©When the wave of de- 
polarization reaches the axon tip, voltage-gated Ca 2+ channels open, 
allowing Ca 2+ entry. ©The resulting increase in internal [Ca 2+ ] trig- 
gers exocytic release of the neurotransmitter acetylcholine into the 
synaptic cleft. ©Acetylcholine binds to a receptor on the postsynap- 
tic neuron (or myocyte), causing its ligand-gated ion channel to open. 
© Extracellular Na + and Ca 2+ enter through this channel, depolariz- 
ing the postsynaptic cell. The electrical signal has thus passed to the cell 
body of the postsynaptic neuron (or myocyte) and will move along its 
axon to a third neuron {or a myocyte) by this same sequence of events. 



locally in response to acetylcholine (or some other 
neurotransmitter). Also distributed along the axon are 
voltage-gated K + channels, which open, a split sec- 
ond later, in response to the depolarization when nearby 
Na + channels open. The depolarizing flow of Na + into 
the axon (influx) is thus rapidly countered by a repolar- 
izing flow of K + out (efflux). At the distal end of the 
axon are voltage-gated Ca 2+ channels, which open 
when the wave of depolarization and repolarization 
caused by the activity of Na + and K + channels arrives, 
triggering release of the neurotransmitter acetylcholine — 
which carries the signal to another neuron (fire an action 
potential!) or to a muscle fiber: (contract!). 
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The voltage-gated Na + channels are very selective 
for Na + over other cations (by a factor of 100 or more) 
and have a very high flux rate (>10 7 ions/s). After being 
opened — activated — by a reduction in transmembrane 
electrical potential, a Na + channel undergoes very rapid 
inactivation — within milliseconds, the channel closes and 
remains inactive for many milliseconds. As voltage-gated 
K + channels open in response to the depolarization 
induced by the opening of Na + channels, the resulting 
efflux of K + repolarizes the membrane locally. A brief 
pulse of depolarization thus traverses the axon as local 
depolarization triggers the brief opening of neighboring 
Na + channels, then K + channels (Fig. 12-25). The short 
refractory period that follows the opening of each Na + 
channel, during which it cannot open again, ensures 
that a unidirectional wave of depolarization — the action 
potential — sweeps from the nerve cell body toward the 
end of the axon (step 0 in Fig. 12-25). 

When the wave of depolarization reaches the 
voltage-gated Ca 2+ channels, they open (step (D), and 
Ca 2+ enters from the extracellular space. The rise in cy- 
toplasmic [Ca 2+ ] then triggers release of acetylcholine 
by exocytosis into the synaptic cleft (step (S)). Acetyl- 



choline diffuses to the postsynaptic cell (another neu- 
ron or a myocyte), where it binds to acetylcholine re- 
ceptors and triggers depolarization. Thus the message is 
passed to the next cell in the circuit. We see, then, that 
gated ion channels convey signals in either of two ways: 
by changing the cytoplasmic concentration of an ion 
(such as Ca 2+ ), which then serves as an intracellular 
second messenger, or by changing V m and affecting 
other membrane proteins that are sensitive to V m . The 
passage of an electrical signal through one neuron and 
on to the next illustrates both types of mechanism. 

We discussed the structure and mechanism of volt- 
age-gated K + channels in some detail in Section 11.3 (see 
Figs 11^8 through 11-50). Here we take a closer look at 
Na + channels. The essential component of a Na + channel 
is a single, large polypeptide (1,840 amino acid residues) 
organized into four domains clustered around a central 
channel (Fig. 12-26a, b), providing a path for Na + 
through the membrane. The path is made Na + -specific by 
a "pore region" composed of the segments between trans- 
membrane helices 5 and 6 of each domain, which fold into 
the channel. Helix 4 of each domain has a high density of 
positively charged Arg residues; this segment is believed 
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FIGURE 12-26 Voltage-gated Na + channels of neurons. Sodium chan- 
nels of different tissues and organisms have a variety of subunits, but 
only the principal subunit (a) is essential, (a) The a subunit is a large 
protein with four homologous domains (I to IV, shown spread out here 
to illustrate the parts), each containing six transmembrane helices (1 to 
6). Helix 4 in each domain (blue) is the voltage sensor; helix 6 (orange) 
is thought to be the activation gate. The segments between helices 5 and 
6, the pore region (red), form the selectivity filter, and the segment con- 
necting domains Ml and IV (green) is the inactivation gate, (b) The four 
domains are wrapped about a central transmembrane channel lined 
with polar amino acid residues. The four pore regions (red) come to- 
gether near the extracellular surface to form the selectivity filter, which 
is conserved in all Na + channels. The filter gives the channel its ability 
to discriminate between Na + and other ions of similar size. The inacti- 
vation gate (green) closes (dotted lines) soon after the activation gate 
opens, (c) The voltage-sensing mechanism involves movement of helix 
4 (blue) perpendicular to the plane of the membrane in response to a 
change in transmembrane potential. As shown at the top, the strong 
positive charge on helix 4 allows it to be pulled inward in response to 
the inside-negative membrane potential (V m ). Depolarization lessens 
this pull, and helix 4 relaxes by moving outward (bottom). This move- 
ment is communicated to the activation gate (orange), inducing confor- 
mational changes that open the channel in response to depolarization. 
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to move within the membrane in response to changes in 
the transmembrane voltage, from the resting potential of 
about -60 mV to about +30 mV. The movement of helix 
4 triggers opening of the channel, and this is the basis for 
the voltage gating (Fig. 12-26c). 

Inactivation of the channel is thought to occur by a 
ball-and-chain mechanism. A protein domain on the cy- 
toplasmic surface of the Na + channel, the inactivation 
gate (the ball) , is tethered to the channel by a short seg- 
ment of the polypeptide (the chain; Fig. 12-26b). This 
domain is free to move about when the channel is 
closed, but when it opens, a site on the inner face of the 
channel becomes available for the tethered ball to bind, 
blocking the channel. The length of the tether seems to 
determine how long an ion channel stays open: the 
longer the tether, the longer the open period. Other 
gated ion channels may be inactivated by a similar 
mechanism. 

The Acetylcholine Receptor Is a Ligand-Gated Ion Channel 

The nicotinic acetylcholine receptor mediates the 
passage of an electrical signal at some types of 
synapses and at a neuromuscular junction (between 
motor neuron and muscle fiber), signaling the muscle 
to contract. (Nicotinic acetylcholine receptors were 
originally distinguished from muscarinic acetylcholine 
receptors by the sensitivity of the former to nicotine, 
the latter to the mushroom alkaloid muscarine. They 
are structurally and functionally different.) Acetyl- 
choline released by the presynaptic neuron or motor 
neuron diffuses a few micrometers to the plasma mem- 
brane of the postsynaptic neuron or myocyte, where it 
binds to the acetylcholine receptor. This forces a con- 
formational change in the receptor, causing its ion 
channel to open. The resulting inward movement of 
cations depolarizes the plasma membrane. In a muscle 
fiber, this triggers contraction. The acetylcholine recep- 
tor allows ready passage to Na + , Ca 2+ , and K + ions, but 
other cations and all anions are unable to pass. Move- 
ment of Na + through an acetylcholine receptor ion chan- 
nel is unsaturable (its rate is linear with respect to 
extracellular [Na + ]) and very fast — about 2 x 10 7 ions/s 
under physiological conditions. 
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Like other gated ion channels, the acetylcholine re- 
ceptor opens in response to stimulation by its signal 
molecule and has an intrinsic timing mechanism that 
closes the gate milliseconds later. Thus the acetylcholine 
signal is transient — as we have seen, an essential feature 
of electrical signal conduction. We understand the struc- 
tural changes underlying gating in the acetylcholine re- 



ceptor, but not the exact mechanism of "desensitization," 
in which the gate remains closed even in the continued 
presence of acetylcholine. 

The nicotinic acetylcholine receptor has five sub- 
units: single copies of subunits /3, y, and 5, and two 
identical a subunits that each contain an acetylcholine- 
binding site. All five subunits are related in sequence 
and tertiary structure, each having four transmembrane 
helical segments (Ml to M4) (Fig. 12-27a). The five 
subunits surround a central pore, which is lined with 
their M2 helices (Fig. 12-27b, c). The pore is about 20 A 
wide in the parts of the channel that protrude on the cy- 
toplasmic and extracellular surfaces, but narrows as it 
passes through the lipid bilayer. Near the center of the 
bilayer is a ring of bulky hydrophobic side chains of Leu 
residues in the M2 helices, positioned so close together 
that they prevent ions from passing through the channel 
(Fig. 12-27d). Allosteric conformational changes in- 
duced by acetylcholine binding to the two a subunits in- 
clude a slight twisting of the M2 helices, which draws 
these hydrophobic side chains away from the center of 
the channel, opening it to the passage of ions. 

Neurons Have Receptor Channels That Respond 
to Different Neurotransmitters 

Animal cells, especially those of the nervous system, con- 
tain a variety of ion channels gated by ligands, voltage, or 
both. We have so far focused on acetylcholine as neuro- 
transmitter, but there are many others. 5-Hydroxytrypta- 
mine (serotonin), glutamate, and glycine all can act 
through receptor channels that are structurally related to 
the acetylcholine receptor. Serotonin and glutamate trig- 
ger the opening of cation (K + , Na + , Ca 2+ ) channels, 
whereas glycine opens CP-specific channels. Cation and 
anion channels are distinguished by subtle differences in 
the amino acid residues that line the hydrophilic channel. 
Cation channels have negatively charged Glu and Asp side 
chains at crucial positions. When a few of these acidic 
residues are experimentally replaced with basic residues, 
the cation channel is converted to an anion channel. 
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Depending on which ion passes through a channel, 
binding of the ligand (neurotransmitter) for that chan- 
nel results in either depolarization or hyperpolarization 
of the target cell. A single neuron normally receives 
input from many other neurons, each releasing its own 
characteristic neurotransmitter with its characteristic 
depolarizing or hyperpolarizing effect. The target cell's 
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FIGURE 12-27 The acetylcholine receptor ion channel, (a) Each of the 
five homologous subunits (a 2 /3yS) has four transmembrane helices, M1 
to M4. The M2 helices are amphipathic; the others have mainly hy- 
drophobic residues, (b) The five subunits are arranged around a central 
transmembrane channel, which is lined with the polar sides of the M2 
helices. At the top and bottom of the channel are rings of negatively 
charged amino acid residues, (c) A model of the acetylcholine recep- 
tor, based on electron microscopy and x-ray structure determination of 
a related protein (the acetylcholine-binding protein from a mollusk). 
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(d) This top view of a cross section through the center of the M2 helices 
shows five Leu side chains (yellow), one from each M2 helix, protrud- 
ing into the channel and constricting it to a diameter too small to allow 
passage of Ca 2+ , Na + , or K + . When both acetylcholine receptor sites 
(one on each a subunit) are occupied, a conformational change oc- 
curs. As the M2 helices twist slightly, the five Leu residues rotate away 
from the channel and are replaced by smaller, polar residues (blue). 
This gating mechanism opens the channel, allowing the passage of 
Ca 2+ , Na + , or K + . 



V m therefore reflects the integrated input (Fig. 12-ld) 
from multiple neurons. The cell responds with an action 
potential only if the integrated input adds up to a net de- 
polarization of sufficient size. 

The receptor channels for acetylcholine, glycine, glu- 
tamate, and y-aminobutyric acid (GABA) are gated by 
extracellular ligands. Intracellular second messengers — 
such as cAMP, cGMP, IP 3 , Ca 2+ , and ATP— regulate ion 
channels of another class, which, as we shall see in 
Section 12.10, participate in the sensory transductions 
of vision, olfaction, and gustation. 

Toxins Target Ion Channels 

Many of the most potent toxins found in nature act on 
ion channels. As we noted in Section 11.3, for example, 
dendrotoxin (from the black mamba snake) blocks 
the action of voltage-gated K + channels, tetrodotoxin 



(produced by puffer fish) acts on voltage-gated Na + 
channels, and cobrotoxin disables acetylcholine recep- 
tor ion channels. Why, in the course of evolution, have 
ion channels become the preferred target of toxins, 
rather than some critical metabolic target such as an en- 
zyme essential in energy metabolism? 

Ion channels are extraordinary amplifiers; opening 
of a single channel can allow the flow of 10 million ions 
per second. Consequently, relatively few molecules of an 
ion channel protein are needed per neuron for signaling 
functions. This means that a relatively small number of 
toxin molecules with high affinity for ion channels, 
acting from outside the cell, can have a very pronounced 
effect on neurosignaling throughout the body. A compa- 
rable effect by way of a metabolic enzyme, typically 
present in cells at much higher concentrations than ion 
channels, would require far more copies of the toxin 
molecule. 
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SUMMARY 12.6 Gated Ion Channels 

■ Ion channels gated by membrane potential or ligands 
are central to signaling in neurons and other cells. 

■ The voltage-gated Na + and K + channels of neuronal 
membranes carry the action potential along the 
axon as a wave of depolarization (Na + influx) 
followed by repolarization (K + efflux). 

■ The gating mechanism for voltage-sensitive 
channels involves the movement, perpendicular to 
the plane of the membrane, of a transmembrane 
peptide with a high charge density, due to the 
presence of Arg or other charged residues. 

■ Arrival of an action potential at the distal end of 
a presynaptic neuron triggers neurotransmitter 
release. The neurotransmitter (acetylcholine, for 
example) diffuses to the postsynaptic neuron (or 
the myocyte, at a neuromuscular junction), binds 
to specific receptors in the plasma membrane, and 
triggers a change in V m . 

■ The acetylcholine receptor of neurons and myocytes 
is a ligand-gated ion channel; acetylcholine binding 
triggers a conformational change that opens the 
channel to Na + and Ca 2+ ions. 

■ Neurotoxins produced by many organisms attack 
neuronal ion channels, and are therefore fast-acting 
and deadly. 

12 J Integrins: Bidirectional Cell 
Adhesion Receptors 

Integrins are proteins of the plasma membrane that me- 
diate the adhesion of cells to each other and to the extra- 
cellular matrix, and carry signals in both directions across 
the membrane (Fig. 12-28). The mammalian genome 
encodes 18 different a subunits and 8 different j3 sub- 
units, which are found in a range of combinations with 
various ligand-binding specificities in various tissues. 
Each of the 24 different integrins found thus far seems to 
have a unique function. Because they can inform cells 
about the extracellular neighborhood, integrins play cru- 
cial roles in processes that require selective cell-cell inter- 
actions, such as embryonic development, blood clotting, 
immune cell function, and tumor growth and metastasis. 

The extracellular ligands that interact with integrins 
include collagen, fibrinogen, fibronectin, and many other 
proteins that have the sequence recognized by integrins: 
-Arg-Gly-Asp- (RGD). The short, cytoplasmic exten- 
sions of the a and (3 subunits interact with cytoskeletal 
proteins just beneath the plasma membrane — talin, a- 
actinin, vinculin, paxillin, and others — modulating the 
assembly of actin-based cytoskeletal structures. The 
dual association of integrins with the extracellular matrix 
and the cytoskeleton allows the cell to integrate infor- 
mation about its extracellular and intracellular environ- 
ments, and to coordinate cytoskeletal positioning with 




FIGURE 12-28 Two-way signaling by integrins. All integrins have one 
a and one jS subunit, each with a short cytoplasmic extension, a single 
transmembrane helix, and a large extracellular domain with the lig- 
and-binding site. The (3 subunit is rich in Cys residues and has exten- 
sive intrachain disulfide bonding. The a subunit in many integrins has 
several binding sites for divalent cations such as Ca 2+ , which are in- 
trinsic to the ligand-binding activity. Ligands in the extracellular matrix 
include proteins such as collagen that have the RGD sequence recog- 
nized by an integrin, or proteoglycan components such as heparan sul- 
fate. The extracellular Ngand binding is communicated to the cytosolic 
domains, producing conformational changes that affect the association 
of integrin with proteins such as talin, which, in turn, connect the 
integrin to actin filaments in the cytoskeleton underlying the plasma 
membrane. Binding of intracellular protein ligands to the cytosolic 
domain can alter the affinity of the integrin for its extracellular binding 
partners, changing the cell's adhesion to the extracellular matrix. 

extracellular adhesion sites. In this capacity, integrins 
govern the shape, motility, polarity, and differentiation of 
many cell types. In "outside-in" signaling, the extracellular 
domains of an integrin undergo dramatic, global conforma- 
tional changes when ligand binds at a site many angstroms 
from the transmembrane helices. These changes somehow 
alter the dispositions of the cytoplasmic tails of the a and (3 
subunits, changing their interactions with intracellular pro- 
teins and thereby conducting the signal inward. 

The conformation and adhesiveness of integrin 
extracellular domains are also dramatically altered by 
signals from inside the cell. In one conformation, the 
extracellular domains have no affinity for the proteins of 
the extracellular matrix, but signals from the cell can fa- 
vor another conformation in which integrins adhere 
tightly to extracellular proteins (Fig. 12-28). 



456 J Biosignaling 



E!3 Regulation of adhesiveness is central to leukocyte 
Efl homing to the site of an infection (see Fig. 7 31), 
interactions between immune cells, and phagocytosis by 
macrophages. During an immune response, for example, 
leukocyte integrins are activated (exposing their extra- 
cellular ligand-binding sites) from inside the cell via a 
signaling pathway triggered by cytokines (extracellular 
developmental signals). Thus activated, the integrins 
can mediate the attachment of leukocytes to other im- 
mune cells or can target cells for phagocytosis. Mutation 
in an integrin gene encoding the /3 subunit known as 
CD18 is the cause of leukocyte adhesion deficiency, a 
rare human genetic disease in which leukocytes fail to 
pass out of blood vessels to reach sites of infection. In- 
fants with a severe defect in CD18 commonly die of in- 
fections before the age of two. 

An integrin specific to platelets (^nbfe) is involved in 
both normal and pathological blood clotting. Local damage 
to blood vessels at a site of injury exposes high-affinity 
binding sites (RGD sequences in thrombin and collagen, 
for example) for the integrins of platelets, which attach 
themselves to the lesion, to other platelets, and to the clot- 
ting protein fibrinogen, leading to clot formation that pre- 
vents further bleeding. Mutations in the a or /3 subunit of 
platelet integrin a m fi 3 lead to a bleeding disorder known 
as Glanzmann thrombasthenia, in which individuals bleed 
excessively after a relatively minor injury. Overly effective 
blood coagulation is also undesirable. Dysregulation of 
platelet adhesion can lead to pathological blood clot for- 
mation, resulting in blockage of the arteries that supply 
blood to the heart and brain and increasing the risk of 
heart attack and stroke. Drugs such as tirofiban and eptifi- 
batide that block the external ligand-binding sites of 
platelet integrin reduce clot formation and are useful in 
treating and preventing heart attacks and strokes. 

When tumors metastasize, tumor cells lose their ad- 
hesion to the originating tissue and invade new loca- 
tions. Both the changes in tumor cell adhesion and the 
development of new blood vessels (angiogenesis) to 
support the tumor at a new location are modulated by 
specific integrins. These proteins are therefore potential 
targets for drugs that suppress the migration and reloca- 
tion of tumor cells. ■ 

SUMMARY 12.7 Integrins: Bidirectional 
Cell Adhesion Receptors 

■ Integrins are a family of dimeric {afS) plasma 
membrane receptors that interact with extracellular 
macromolecules and the cytoskeleton, carrying 
signals in and out of the cell 

■ The active and inactive forms of an integrin differ 
in the conformation of their extracellular domains. 
Intracellular events and signals can interconvert 
the active and inactive forms. 

■ Integrins mediate various aspects of the immune 
response, blood clotting, and angiogenesis, and 
they play a role in tumor metastasis. 



12,8 Regulation of Transcription by 
Steroid Hormones 

The steroid, retinoic acid (retinoid), and thyroid hor- 
mones form a large group of hormones (receptor lig- 
ands) that exert at least part of their effects by a 
mechanism fundamentally different from that of other 
hormones: they act in the nucleus to alter gene expres- 
sion. We discuss their mode of action in detail in Chap- 
ter 28, along with other mechanisms for regulating gene 
expression. Here we give a brief overview. 

Steroid hormones (estrogen, progesterone, and Cor- 
tisol, for example), too hydrophobic to dissolve readily 
in the blood, are transported on specific carrier proteins 
from their point of release to their target tissues. In tar- 
get cells, these hormones pass through the plasma 
membrane by simple diffusion and bind to specific re- 
ceptor proteins in the nucleus (Fig. 12-29). Steroid 
hormone receptors with no bound ligand (aporecep- 
tors) often act to suppress the transcription of target 
genes. Hormone binding triggers changes in the confor- 
mation of a receptor protein so that it becomes capable 
of interacting with specific regulatory sequences in DNA 
called hormone response elements (HREs), thus 
altering gene expression (see Fig. 28-34). The bound 
receptor-hormone complex enhances the expression of 
specific genes adjacent to HREs, with the help of several 
other proteins essential for transcription. Hours or days 
are required for these regulators to have their full 
effect — the time required for the changes in RNA syn- 
thesis and subsequent protein synthesis to become 
evident in altered metabolism. 

I The specificity of the steroid-receptor interac- 
Hfl tion is exploited in the use of the drug tamox- 
ifen to treat breast cancer. In some types of breast 
cancer, division of the cancerous cells depends on the 
continued presence of estrogen. Tamoxifen is an estro- 
gen antagonist; it competes with estrogen for binding to 
the estrogen receptor, but the tamoxifen-receptor com- 
plex has little or no effect on gene expression. Conse- 
quently, tamoxifen administered after surgery or during 
chemotherapy for hormone-dependent breast cancer 
slows or stops the growth of remaining cancerous 
cells. Another steroid analog, the drug mifepristone 
(RU486), binds to the progesterone receptor and 
blocks hormone actions essential to implantation of the 
fertilized ovum in the uterus. ■ 
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Hormone (H), carried to the target 
tissue on serum binding proteins, 
diffuses across the plasma membrane 
and binds to its specific receptor 
protein (Rec) in the nucleus. 

® 

Hormone binding changes the 
conformation of Rec; it forms homo- 
or heterodimers with other hormone- 
receptor complexes and binds to 
specific regulatory regions called 
hormone response elements (HREs) 
in the DNA adjacent to specific genes. 

® 

Receptor attracts coactivator or 
corepressor protein(s) and, with them, 
regulates transcription of the adjacent 
gene(s), increasing or decreasing the rate 
of mRNA formation. 
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FIGURE 12-29 General mechanism by which steroid and thyroid hormones, retinoids, and vitamin D regulate 
gene expression. The details of transcription and protein synthesis are discussed in Chapters 26 and 27. Some 
steroids also act through plasma membrane receptors by a completely different mechanism. 



Certain effects of steroids seem to occur too fast to 
be the result of altered protein synthesis via the classic 
mechanism of steroid hormone action through nuclear 
receptors. For example, the estrogen-mediated dilation 
of blood vessels is known to be independent of gene 
transcription or protein synthesis, as is the steroid- 
induced decrease in cellular [cAMP]. Another transduc- 
tion mechanism involving plasma membrane receptors 
may be responsible for some of these effects. 

SUMMARY 12.8 Regulation of 
Transcription by 
Steroid Hormones 

■ Steroid hormones enter cells and bind to specific 
receptor proteins. 

■ The hormone-receptor complex binds specific 
regions of DNA, the hormone response elements, 
and interacts with other proteins to regulate the 
expression of nearby genes. 

■ Certain effects of steroid hormones may 
occur through a different, faster, signaling 
pathway. 



12.9 Signaling in Microorganisms and Plants 

Much of what we have said here about signaling relates 
to mammalian tissues or cultured cells from such tissues. 
Bacteria, archaea, eukaryotic microorganisms, and vas- 
cular plants must also respond to a variety of external 
signals — 0 2 , nutrients, light, noxious chemicals, and so 
on. We turn here to a brief consideration of the kinds of 
signaling machinery used by microorganisms and plants. 

Bacterial Signaling Entails Phosphorylation 
in a Two-Component System 

Escherichia coli responds to nutrients in its environ- 
ment, including sugars and amino acids, by swimming 
toward them, propelled by one or a few flagella. A fam- 
ily of membrane proteins have binding domains on the 
outside of the plasma membrane to which specific at- 
tractants (sugars or amino acids) bind (Fig. 12-30). 
Ligand binding causes another domain on the inside of 
the plasma membrane to autophosphorylate a His 
residue. This first component of the two-component 
system, the receptor histidine kinase, then cat- 
alyzes transfer of the phosphoryl group from the His 
residue to an Asp residue on a second, soluble protein, 
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FIGURE 12-30 The two-component sig- 
naling mechanism in bacterial chemo- 
taxis. When an attractant ligand (A) binds 
to the receptor domain of the membrane- 
bound receptor, a protein His kinase in 
the cytosolic domain {component 1) is 
activated and autophosphorylates a His 
residue. This phosphoryl group is then 
transferred to an Asp residue on compo- 
nent 2 (in some cases, as shown here, a 
separate protein; in others, another do- 
main of the receptor protein). After phos- 
phorylation, component 2 moves to the 
base of the flagellum, where it reverses 
the direction of rotation of the flagellar 
motor. 
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the response regulator; this phosphoprotein moves 
to the base of the flagellum, carrying the signal from the 
membrane receptor. The flagellum is driven by a rotary 
motor that can propel the cell through its medium or 
cause it to stall, depending on the direction of motor 
rotation. Information from the receptor allows the cell 
to determine whether it is moving toward or away from 
the source of the attractant. If its motion is toward the 
attractant, the response regulator signals the cell to 
continue in a straight line; if away from it, the cell tum- 
bles momentarily, acquiring a new direction. Repeti- 
tion of this behavior results in a random path, biased 
toward movement in the direction of increasing attrac- 
tant concentration. 

E. coli detects not only sugars and amino acids but 
also 0 2 , extremes of temperature, and other environ- 
mental factors, using this basic two-component system. 
Two-component systems have been detected in many 
other bacteria, both gram-positive and gram-negative, 
and in archaea, as well as in protists and fungi. Clearly, 
this signaling mechanism developed early in the course 
of cellular evolution and has been conserved. 

Various signaling systems used by animal cells also 
have analogs in bacteria. As the full genomic sequences 
of more, and more diverse, bacteria become known, re- 
searchers have discovered genes that encode proteins 
similar to protein Ser or Thr kinases, Ras-like proteins 
regulated by GTP binding, and proteins with SH3 do- 
mains. Receptor Tyr kinases have not been detected in 
bacteria, but ®-Tyr residues do occur in some bacte- 
rial proteins, so there must be an enzyme that phospho- 
rylates Tyr residues. 

Signaling Systems of Plants Have Some of the Same 
Components Used by Microbes and Mammals 

Like animals, vascular plants must have a means of com- 
munication between tissues to coordinate and direct 
growth and development; to adapt to conditions of 0 2 , 
nutrients, light, temperature, and water availability; and 



to warn of the presence of noxious chemicals and dam- 
aging pathogens (Fig. 12-31). At least a billion years of 
evolution have passed since the plant and animal 
branches of the eukaryotes diverged, which is reflected 
in the differences in signaling mechanisms: some plant 
mechanisms are conserved — that is, are similar to those 
in animals (protein kinases, adaptor proteins, cyclic 
nucleotides, electrogenic ion pumps, and gated ion 
channels); some are similar to bacterial two-component 
systems; and some are unique to plants (light-sensing 
mechanisms, for example) (Table 12-7). The genome of 
the plant Arabidopsis thaliana, for example, encodes 
about 1,000 protein Ser/Thr kinases, including about 60 
MAPKs and nearly 400 membrane-associated receptor ki- 
nases that phosphorylate Ser or Thr residues; a variety of 
protein phosphatases; adaptor proteins that form scaf- 
folds on which proteins assemble in signaling complexes; 
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FIGURE 12-31 Some stimuli that produce responses in plants. 
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TABLE 12-7 



Signaling Components Present in Mammals, Plants, or Bacteria 

Signaling component Mammals Plants Bacteria 

Ion channels + + + 

Electrogenic ion pumps + + + 

Two-component His kinases + + + 

Adenylyl cyclase + + + 

Guanylyl cyclase + + ? 

Receptor protein kinases (Ser/Thr) + + 

Ca 2+ as second messenger + + ? 

Ca 2+ channels + + ? 

Calmodulin, CaM-binding protein + + 

MAPK cascade + + 

Cyclic nucleotide-gated channels + + 

IP 3 -gated Ca 2+ channels + + 

Phosphatidylinositol kinases + + 

GPCRs + +/- + 

Trimeric G proteins + +/— 

Pi-specific phospholipase C + ? 

Tyrosine kinase receptors + 

SH2 domains + ? I 

Nuclear steroid receptors + — 

Protein kinase A + 

Protein kinase G + 



enzymes for the synthesis and degradation of cyclic nu- 
cleotides; and 100 or more ion channels, including about 
20 gated by cyclic nucleotides. Inositol phospholipids are 
present, as are kinases that interconvert them by phos- 
phorylation of inositol head groups. 

However, some types of signaling proteins common 
in animal tissues are not present in plants, or are repre- 
sented by only a few genes. Cyclic nucleotide-dependent 
protein kinases (PKA and PKG) seem to be absent, for 
example. Heterotrimeric G proteins and protein Tyr 
kinase genes are much less prominent in the plant 
genome, and genes for GPCRs, the largest family of pro- 
teins in the human genome (—1,000 genes), are very 
sparsely represented in the plant genome. DNA-binding 
nuclear steroid receptors are certainly not prominent, 
and may be absent from plants. Although plants lack the 
most widely conserved light-sensing mechanism present 
in animals (rhodopsin, with retinal as pigment), they 
have a rich collection of other light-detecting mecha- 
nisms not found in animal tissues — phytochromes and 
cryptochromes, for example (Chapter 19). 

The kinds of compounds that elicit signals in plants 
are similar to certain signaling molecules in mammals 
(Fig. 12-32). Instead of prostaglandins, plants have jas- 
monate; instead of steroid hormones, brassinosteroids. 
About 100 different small peptides serve as plant signals, 
and both plants and animals use compounds derived 
from aromatic amino acids as signals. 
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FIGURE 12-32 Structural similarities between plant and animal signals. 
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Plants Detect Ethylene through a Two-Component 
System and a MAPK Cascade 

The gaseous plant hormone ethylene (CH 2 =CH 2 ), 
which stimulates the ripening of fruits (among other 
functions), acts through receptors that are related in 
primary sequence to the receptor His kinases of 
the bacterial two-component systems and probably 
evolved from them. In Arabidopsis, the two-compo- 
nent signaling system is contained within a single inte- 
gral membrane protein of the endoplasmic reticulum 
(not the plasma membrane). Ethylene diffuses into 
the cell through the plasma membrane and into the 
ER. The first downstream component affected by eth- 
ylene signaling is a protein Ser/Thr kinase (CTRl; 
Fig. 12-33) with sequence homology to Raf, the pro- 
tein kinase that begins the MAPK cascade in the mam- 
malian response to insulin (see Fig. 12-15). In plants, 
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FIGURE 12-33 Transduction mechanism for detection of ethylene by 
plants. The ethylene receptor (pink) in the endoplasmic reticulum is a 
two-component system contained in a single protein, with a receptor 
domain (component 1) and a response regulator domain (component 
2). The receptor controls (in ways we do not yet understand) the activ- 
ity of CTR1, a protein kinase similar to MAPKKKs and therefore pre- 
sumed to be part of a MAPK cascade. CTRl is a negative regulator of 
the ethylene response; when CTR1 is inactive, the ethylene signal is 
transmitted through the gene product EIN2 (thought to be a nuclear 
envelope protein), which causes increased synthesis of ERF1, a tran- 
scription factor. ERF1 stimulates expression of proteins specific to the 
ethylene response. 



in the absence of ethylene, the CTRl kinase is active 
and inhibits the MAPK cascade, preventing transcrip- 
tion of ethylene-responsive genes. Exposure to ethyl- 
ene inactivates the CTRl kinase, thereby activating 
the MAPK cascade that leads to activation of the tran- 
scription factor EIN3. Active EIN3 stimulates the syn- 
thesis of a second transcription factor (ERF1), which 
in turn activates transcription of ethylene-responsive 
genes; the gene products affect processes ranging from 
seedling development to fruit ripening. Although ap- 
parently derived from the bacterial two-component 
signaling system, the ethylene system in Arabidopsis 
is different in that the His kinase activity that defines 
component 1 in bacteria is not essential to signal trans- 
duction in Arabidopsis. 

Receptorlike Protein Kinases Transduce Signals 
from Peptides and Brassinosteroids 

One common motif in plant signaling involves receptor- 
like kinases (RLKs), which have a single helical seg- 
ment in the plasma membrane that connects a receptor 
domain on the outside with a protein Ser/Thr kinase on 
the cytoplasmic side. This type of receptor participates 
in the defense mechanism triggered by infection with a 
bacterial pathogen (Fig. 12-34a). The signal to turn on 
the genes needed for defense against infection is a pep- 
tide (flg22) released by breakdown of flagellin, the ma- 
jor protein of the bacterial flagellum. Binding of flg22 to 
the FLS2 receptor of Arabidopsis induces receptor 
dimerization and autophosphorylation on Ser and Thr 
residues, and the downstream effect is activation of a 
MAPK cascade like that described above for insulin ac- 
tion (Fig. 12-15). The final kinase in this cascade acti- 
vates a specific transcription factor, triggering synthesis 
of the proteins that defend against the bacterial infec- 
tion. The steps between receptor phosphorylation and 
the MAPK cascade are not yet known. A phosphoprotein 
phosphatase (KAPP) associates with the active receptor 
protein and inactivates it by dephosphorylation to end 
the response. 

The MAPK cascade in the plant's defense against 
bacterial pathogens is remarkably similar to the innate 
immune response in mammals (Fig. 12-34b) that is trig- 
gered by bacterial lipopolysaccharide and mediated by 
the Toll-like receptors (TLRs, a name derived from a 
Drosophila mutant originally called Toll (German, 
"mad"); TLRs were subsequently found in many other 
organisms and were shown to function in embryonic de- 
velopment). Other membrane receptors use similar 
mechanisms to activate a MAPK cascade, ultimately ac- 
tivating transcription factors and turning on the genes 
essential to the defense response. 

Most of the several hundred RLKs in plants are pre- 
sumed to act in similar ways: ligand binding induces 
dimerization and autophosphorylation, and the acti- 
vated receptor kinase triggers downstream responses 
by phosphorylating key proteins at Ser or Thr residues. 
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FIGURE 12-34 Similarities between the signaling pathways that trig- 
ger immune responses in plants and animals, (a) In Arabidopsis 
thaliana, the peptide flg22, derived from the flagella of a bacterial 
pathogen, binds to its receptor (FLS) in the plasma membrane, causing 
the receptor to form dimers and triggering autophosphorylation of the 
cytosolic protein kinase domain on a Ser orThr residue (not aTyr), Thus 
activated, the protein kinase phosphorylates downstream proteins (not 
shown). The activated receptor also activates (by means unknown) a 
MAPK cascade, which leads to phosphorylation of a nuclear protein 
that normally inhibits the transcription factors WRKY22 and 29; this 



phosphorylation triggers proteolytic degradation of the inhibitor and 
frees the transcription factors to stimulate gene expression related to 
the immune response, (b) In mammals, a toxic bacterial lipopolysac- 
charide (LPS; see Fig. 7-30) is detected by plasma membrane recep- 
tors, which then associate with and activate a soluble protein kinase 
(IRAK). The major flagellar protein of pathogenic bacteria acts through 
a similar receptor, also activating IRAK. The activated IRAK initiates 
two distinct MAPK cascades that end in the nucleus, causing the 
synthesis of proteins needed in the immune response. Jun, Fos, and 
NFkB are transcription factors. 



SUMMARY 1 2.9 Signaling in Microorganisms 
and Plants 

■ Bacteria and eukaryotic microorganisms have a 
variety of sensory systems that allow them to 
sample and respond to their environment. In the 
two-component system, a receptor His kinase 
senses the signal and autophosphorylates a His 
residue, then phosphorylates an Asp residue of the 
response regulator. 

■ Plants respond to many environmental stimuli and 
employ hormones and growth factors to coordinate 
the development and metabolic activities of their 
tissues. Plant genomes encode hundreds of signaling 
proteins, including some very similar to those of 
mammals. 

■ Two-component signaling mechanisms common in 
bacteria are found in modified forms in plants, used 
in the detection of chemical signals and light. 



■ Plant receptorlike kinases (RLKs) participate in 
detecting a wide variety of stimuli, including 
brassinosteroids, peptides that originate from 
pathogens, and developmental signals. RLKs 
autophosphorylate Ser/Thr residues, then activate 
downstream proteins, which in some cases are 
MAPK cascades. The end result is increased 
transcription of specific genes. 

12.10 Sensory Transduction in Vision, 
Olfaction, and Gustation 

The detection of light, odors, and tastes (vision, olfaction, 
and gustation, respectively) in animals is accomplished by 
specialized sensory neurons that use signal-transduction 
mechanisms fundamentally similar to those that detect 
hormones, neurotransmitters, and growth factors. An 
initial sensory signal is amplified greatly by mechanisms 
that include gated ion channels and intracellular second 
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messengers; the system adapts to continued stimulation 
by changing its sensitivity to the stimulus (desensitiza- 
tion); and sensory input from several receptors is inte- 
grated before the final signal goes to the brain. 

The Visual System Uses Classic GPCR Mechanisms 

In the vertebrate eye, light entering through the pupil 
is focused on a highly organized collection of light- 
sensitive neurons (Fig. 12-35). The light-sensing cells 
are of two types: rods (about 10 9 per retina), which 
sense low levels of light but cannot discriminate colors, 
and cones (about 3 X 10 6 per retina), which are less 
sensitive to light but can discriminate colors. Both cell 
types are long, narrow, specialized sensory neurons with 
two distinct cellular compartments: the outer segment 
contains dozens of membranous disks loaded with re- 
ceptor proteins and their photosensitive chromophore 
retinal; the inner segment contains the nucleus and 
many mitochondria, which produce the ATP essential to 
phototransduction. 

Like other neurons, rods and cones have a trans- 
membrane electrical potential (7 m ) 3 produced by the 
electrogenic pumping of the Na + K + ATPase in the 
plasma membrane of the inner segment (Fig. 12-36). 
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FIGURE 12-35 Light reception in the vertebrate eye. The lens focuses 
light on the retina, which is composed of layers of neurons. The pri- 
mary photosensory neurons are rod cells (yellow), which are responsi- 
ble for high-resolution and night vision, and cone cells of three 
subtypes (pink), which initiate color vision. The rods and cones form 
synapses with several ranks of interconnecting neurons that convey 
and integrate the electrical signals. The signals eventually pass from 
ganglion neurons through the optic nerve to the brain. 
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FIGURE 12-36 Light-induced hyperpolarization of rod cells. The rod 

cell consists of an outer segment, filled with stacks of membranous 
disks (not shown) containing the photoreceptor rhodopsin, and an in- 
ner segment that contains the nucleus and other organelles (not 
shown). The inner segment synapses with interconnecting neurons (see 
Fig. 12-35). Cones have a similar structure. ATP in the inner segment 
powers the Na + K + ATPase, which creates a transmembrane electrical 
potential by pumping 3 Na + out for every 2 K + pumped in. The mem- 
brane potential is reduced by the inflow of Na + and Ca 2+ through 
cGMP-gated cation channels in the outer-segment plasma membrane. 
When rhodopsin absorbs light, it triggers degradation of cGMP (green 
dots) in the outer segment, causing closure of the ion channel. Without 
cation influx through this channel, the cell becomes hyperpolarized. 
This electrical signal is passed to the brain through the ranks of neurons 
shown in Figure 12-35. 



Also contributing to the membrane potential is an ion 
channel in the outer segment that permits passage of ei- 
ther Na + or Ca 2+ and is gated (opened) by cGMR In the 
dark, rod cells contain enough cGMP to keep this chan- 
nel open. The membrane potential is therefore deter- 
mined by the difference between the amount of Na + and 
K + pumped by the inner segment (which polarizes the 
membrane) and the influx of Na + through the ion chan- 
nels of the outer segment (which tends to depolarize the 
membrane) . 

The essence of signaling in the rod or cone cell is a 
light-induced decrease in [cGMP], which causes the 
cGMP-gated ion channel to close. The plasma membrane 
then becomes hyperpolarized by the Na + K + ATPase. 
Rod and cone cells synapse with interconnecting neu- 
rons (Fig. 12-35) that carry information about the elec- 
trical activity to ganglion neurons near the inner surface 
of the retina. The ganglion neurons integrate the output 
from many rod or cone cells and send the resulting signal 
through the optic nerve to the visual cortex of the brain. 

Visual transduction begins when light falls on 
rhodopsin, many thousands of molecules of which are 
present in each disk of the outer segments of rod and 
cone cells. Rhodopsin (M T 40,000) is an integral 
protein with seven membrane-spanning a helices 
(Fig. 12-37), the characteristic GPCR architecture. 
The light-absorbing pigment (chromophore) 11-cis- 
retinal is covalently attached to opsin, the protein 
component of rhodopsin, through a Schiff base to a 
Lys residue. The retinal molecule lies near the middle 
of the bilayer (Fig. 12-37), oriented with its long axis 
approximately in the plane of the membrane. When a 
photon is absorbed by the retinal component of 
rhodopsin, the energy causes a photochemical change; 
11-eis-retinal is converted to alM rans -retinal (see 
Figs l-18b and 10-21). This change in the structure 
of the chromophore forces conformational changes in 
the rhodopsin molecule — the first stage in visual 
transduction. 

H Retinal is derived from vitamin A : (retinol) , which 
is produced from /3-carotene (see Fig. 10-21). 
Dietary deficiency of vitamin A leads to night blindness 
(the inability to adapt to low light levels), which is 
relatively common in some developing countries. Vita- 
min A supplements or vegetables rich in carotene (such 
as carrots) supply the vitamin and reverse the night 
blindness. ■ 

Excited Rhodopsin Acts through the G Protein Transducin 
to Reduce the cGMP Concentration 

In its excited conformation, rhodopsin interacts with a 
second protein, transducin, which hovers nearby on 
the cytoplasmic face of the disk membrane (Fig. 12-37). 
Transducin (T) belongs to the same family of net- 
erotrimeric GTP-binding proteins as G s and G^. Although 
specialized for visual transduction, transducin shares 
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FIGURE 12-37 Complex of rhodopsin with the G protein transducin. 

(PDB ID 1 BAC) Rhodopsin (red) has seven transmembrane helices em- 
bedded in the disk membranes of rod outer segments and is oriented 
with its carboxyl terminus on the cytosoiic side and its amino terminus 
inside the disk. The chromophore 1 1-c/s- retina I (blue space-filling 
structure), attached through a Schiff base linkage to Lys 256 of the sev- 
enth helix, lies near the center of the bilayer. (This location is similar to 
that of the epinephrine-binding site in the /3-adrenergic receptor.) Sev- 
eral Ser and Thr residues near the carboxyl terminus are substrates for 
phosphorylations that are part of the desensitization mechanism for 
rhodopsin. Cytosoiic loops that interact with the C protein transducin 
are shown in orange; their exact positions are not yet known. The three 
subunits of transducin (green) are shown in their likely arrangement. 
Rhodopsin is palmitoylated at its carboxyl terminus, and both the a 
and y subunits of transducin have attached lipids (yellow) that assist in 
anchoring them to the membrane. 

many functional features with G s and Gi. It can bind 
either GDP or GTP. In the dark, GDP is bound, all three 
subunits of the protein (T a , T^, and T y ) remain together, 
and no signal is sent. When rhodopsin is excited by light, 
it interacts with transducin, catalyzing the replacement 
of bound GDP by GTP from the cytosol (Fig. 12-38, 
steps (D and (2)). Transducin then dissociates into T a 
and Tp y} and the T a -GTP carries the signal from the ex- 
cited receptor to the next element in the transduction 
pathway, a cGMP phosphodiesterase; this enzyme con- 
verts cGMP to 5'-GMP (steps ® and ®). Note that 
this is not the same cyclic nucleotide phosphodiesterase 
that hydrolyzes cAMP to terminate the ^-adrenergic re- 
sponse. One isoform of the cGMP-specihc PDE is unique 
to the visual cells of the retina. 
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FIGURE 12-38 Molecular consequences of photon absorption by rhodopsin in the rod outer segment. 

The top half of the figure (steps © to ©) describes excitation; the bottom (steps © to (§)), recovery and 
adaptation after illumination. 



The PDE of the retina is an integral protein with its 
active site on the cytoplasmic side of the disk membrane. 
In the dark, a tightly bound inhibitory subunit very effec- 
tively suppresses the PDE activity. When T a -GTP en- 
counters the PDE, the inhibitory subunit leaves the 
enzyme and instead binds T a , and the enzyme's activity 
immediately increases by several orders of magnitude. 
Each molecule of the active PDE degrades many mole- 
cules of cGMP to the biologically inactive 5'-GMP, lower- 
ing [cGMP] in the outer segment within a fraction of a 
second. At the new, lower [cGMP], the cGMP-gated ion 
channels close, blocking reentry of Na + and Ca 2+ into 
the outer segment and hyperpolarizing the membrane of 
the rod or cone cell (step©). Through this process, the 
initial stimulus— a photon— changes the V m of the cell. 

Several steps in the visual-transduction process re- 
sult in a huge amplification of the signal. Each excited 
rhodopsin molecule activates at least 500 molecules of 
transducin, each of which can activate a molecule of the 



PDE. This phosphodiesterase has a remarkably high 
turnover number, each activated molecule hydrolyzing 
4,200 molecules of cGMP per second. The binding of 
cGMP to cGMP-gated ion channels is cooperative, and a 
relatively small change in [cGMP] therefore registers as a 
large change in ion conductance. The result of these am- 
plifications is exquisite sensitivity to light. Absorption of 
a single photon closes 1,000 or more ion channels and 
changes the cell's membrane potential by about 1 mV. 

The Visual Signal is Quickly Terminated 

As your eyes move across this line, the retinal images of 
the first words disappear rapidly — before you see the 
next series of words. In that short interval, a great deal 
of biochemistry has taken place. Very shortly after illu- 
mination of the rod or cone cells stops, the photo- 
sensory system shuts off. The a subunit of transducin 
(with bound GTP) has intrinsic GTPase activity. Within 
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milliseconds after the decrease in light intensity, GTP is 
hydrolyzed and T a reassociates with Tjg r The inhibitory 
subunit of the PDE, which had been bound to T a -GTP, is 
released and reassociates with the enzyme, strongly in- 
hibiting its activity. To return [cGMP] to its "dark" level, 
the enzyme guanylyl cyclase converts GTP to cGMP 
(step (7) in Fig. 12-38) in a reaction that is inhibited by 
high [Ca 2+ ] (MOO nM). Calcium levels drop during illu- 
mination, because the steady-state [Ca 2+ ] in the outer 
segment is the result of outward pumping of Ca 2+ 
through the Na + -Ca 2+ exchanger of the plasma mem- 
brane (see Fig. 12-36) and influx of Ca 2+ through open 
cGMP-gated channels. In the dark, this produces a 
[Ca 2+ ] of about 500 nM— enough to inhibit cGMP syn- 
thesis. After brief illumination, Ca 2+ entry slows and 
[Ca 2+ ] declines (step ©). The inhibition of guanylyl 
cyclase by Ca 2+ is relieved, and the cyclase converts 
GTP to cGMP to return the system to its prestimulus 
state (step (7)). 

Rhodopsin itself also undergoes changes in response 
to prolonged illumination. The conformational change in- 
duced by light absorption exposes several Thr and Ser 
residues in the carboxyl-terminal domain. These 
residues are quickly phosphorylated by rhodopsin ki- 
nase (step (§) in Fig. 12-38), which is functionally and 
structurally homologous to the /3-adrenergic kinase 
QSARK) that desensitizes the jS-adrenergic receptor 
(Fig. 12-8). The Ca 2+ -binding protein recoverin in- 
hibits rhodopsin kinase at high [Ca 2+ ], but the inhibition 
is relieved when [Ca 2+ ] drops after illumination, as de- 
scribed above. The phosphorylated carboxyl-terminal 
domain of rhodopsin is bound by the protein arrestin 1, 
preventing further interaction between activated 
rhodopsin and transducin. Arrestin 1 is a close homolog 
of arrestin 2 QSarr; Fig. 12-8). On a relatively long time 
scale (seconds to minutes), the all-£rcms-retinal of an ex- 
cited rhodopsin molecule is removed and replaced by 1 1- 
c^s-retinal, to produce rhodopsin that is ready for 
another round of excitation (step (9) in Fig. 12-38). 

Cone Cells Specialize in Color Vision 

Color vision involves a path of sensory transduction in 
cone cells essentially identical to that described above, 
but triggered by slightly different light receptors. Three 
types of cone cells are specialized to detect light from dif- 
ferent regions of the spectrum, using three related pho- 
toreceptor proteins (opsins). Each cone cell expresses 
only one kind of opsin, but each type is closely related to 
rhodopsin in size, amino acid sequence, and presumably 
three-dimensional structure. The differences among the 
opsins, however, are great enough to place the chro- 
mophore, 11-cis-retinal, in three slightly different envi- 
ronments, with the result that the three photoreceptors 
have different absorption spectra (Fig. 12-39). We 
discriminate colors and hues by integrating the output 
from the three types of cone cells, each containing one of 
the three photoreceptors. 
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FIGURE 12-39 Absorption spectra of purified rhodopsin and the red, 
green, and blue receptors of cone cells. The spectra, obtained from in- 
dividual cone cells isolated from cadavers, peak at about 420, 530, 
and 560 nm, and the maximum absorption for rhodopsin is at about 
500 nm. For reference, the visible spectrum for humans is about 380 to 
750 nm. 

Color blindness, such as the inability to distin- 
guish red from green, is a fairly common, geneti- 
cally inherited trait in humans. The various types of 
color blindness result from different opsin mutations. 
One form is due to loss of the red photoreceptor; af- 
fected individuals are red" dichromats (they see only 
two primary colors). Others lack the green pigment and 
are green" dichromats. In some cases, the red and 
green photoreceptors are present but have a changed 
amino acid sequence that causes a change in their 
absorption spectra, resulting in abnormal color vision. 
Depending on which pigment is altered, such individuals 
are red-anomalous trichromats or green-anomalous 
trichromats. Examination of the genes for the visual 
receptors has allowed the diagnosis of color blindness in 
a famous "patient" more than a century after his death 
(Box 12-4)! ■ 

Vertebrate Olfaction and Gustation Use Mechanisms 
Similar to the Visual System 

The sensory cells that detect odors and tastes have 
much in common with the rod and cone cells. Olfactory 
neurons have long thin cilia extending from one end of 
the cell into a mucous layer that overlays the cell. These 
cilia present a large surface area for interaction with ol- 
factory signals. The receptors for olfactory stimuli are 
ciliary membrane proteins with the familiar GPCR struc- 
ture of seven transmembrane a helices. The olfactory 
signal can be any one of the many volatile compounds 
for which there are specific receptor proteins. Our abil- 
ity to discriminate odors stems from hundreds of differ- 
ent olfactory receptors in the tongue and nasal passages 
and from the brain's ability to integrate input from 
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BOX 12-4 ;^-CKlSI2 Color Blind 

The chemist John Dalton (of atomic theory fame) was 
color-blind. He thought it probable that the vitreous hu- 
mor of his eyes (the fluid that fills the eyeball behind the 
lens) was tinted blue, unlike the colorless fluid of normal 
eyes. He proposed that after his death, his eyes should 
be dissected and the color of the vitreous humor deter- 
mined. His wish was honored. The day after Dalton's 
death in July 1844, Joseph Ransome dissected his eyes 
and found the vitreous humor to be perfectly colorless. 
Ransome, like many scientists, was reluctant to throw 
samples away. He placed Dalton's eyes in ajar of preser- 
vative, where they stayed for a century and a half. 

Then, in the mid-1990s, molecular biologists in En- 
gland took small samples of Dalton's retinas and ex- 
tracted DNA. Using the known gene sequences for the 
opsins of the red and green light receptors, they ampli- 
fied the relevant sequences (using techniques described 



ess: John Dalton's Experiment from the Grave 

in Chapter 9) and determined that Dalton had the opsin 
gene for the red photopigment but lacked the opsin 
gene for the green photopigment. Dalton was a green" 
dichromat. So, 150 years after his death, the experiment 
Dalton started — by hypothesizing about the cause of his 
color blindness — was finally finished, 




FIGURE 1 Dalton's eyes. 



different types of olfactory receptors to recognize a 
"hybrid" pattern, extending our range of discrimination 
far beyond the number of receptors. 

The olfactory stimulus arrives at the sensory cells by 
diffusion through the air. In the mucous layer covering 
the olfactory neurons, the odorant molecule binds di- 



rectly to an olfactory receptor or to a specific binding pro- 
tein that carries the odorant to a receptor (Fig. 12-40). 
Interaction between odorant and receptor triggers a 
change in receptor conformation that results in the 
replacement of bound GDP by GTP on a G protein, G olf , 
analogous to transducin and to G s of the ^-adrenergic 
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FIGURE 12-40 Molecular events of olfaction. These interactions occur in the cilia of olfactory receptor eel 
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system. The activated G olf then activates adenylyl cyclase 
of the ciliary membrane, which synthesizes cAMP from 
ATP, raising the local [cAMP]. The cAMP-gated Na + and 
Ca 2+ channels of the ciliary membrane open, and the 
influx of Na + and Ca 2+ produces a small depolarization 
called the receptor potential. If a sufficient number of 
odorant molecules encounter receptors, the receptor 
potential is strong enough to cause the neuron to fire an 
action potential. This is relayed to the brain in several 
stages and registers as a specific smell. All these events 
occur within 100 to 200 ms. 

When the olfactory stimulus is no longer present, 
the transducing machinery shuts itself off in several 
ways. A cAMP phosphodiesterase returns [cAMP] to the 
prestimulus level. G olf hydrolyzes its bound GTP to GDP, 
thereby inactivating itself. Phosphorylation of the 
receptor by a specific kinase prevents its interaction 
with G olf , by a mechanism analogous to that used to 
desensitize the /3-adrenergic receptor and rhodopsin. 
And lastly, some odorants are enzymatically destroyed 
by oxidases. 

The sense of taste in vertebrates reflects the ac- 
tivity of gustatory neurons clustered in taste buds on 
the surface of the tongue. In these sensory neurons, 
GPCRs are coupled to the heterotrimeric G protein 
gustducin (very similar to the transducin of rod and 
cone cells). Sweet-tasting molecules are those that 
bind receptors in "sweet" taste buds. When the mole- 
cule (tastant) binds, gustducin is activated by replace- 
ment of bound GDP with GTP and then stimulates 
cAMP production by adenylyl cyclase. The resulting 
elevation of [cAMP] activates PKA, which phosphory- 
lates K + channels in the plasma membrane, causing 
them to close. Reduced efflux of K + depolarizes the 



cell (Fig. 12-41). Other taste buds specialize in de- 
tecting bitter, sour, salty, or umami (savory) tastants, 
using various combinations of second messengers and 
ion channels in the transduction mechanisms. 

GPCRs of the Sensory Systems Share Several Features 
with GPCRs of Hormone Signaling Systems 

We have now looked at several types of signaling sys- 
tems (hormone signaling, vision, olfaction, and gusta- 
tion) in which membrane receptors are coupled to 
second messenger-generating enzymes through G pro- 
teins. As we have intimated, signaling mechanisms must 
have arisen early in evolution; genomic studies have re- 
vealed hundreds of genes encoding GPCRs in verte- 
brates, arthropods (Drosophila and mosquito), and the 
roundworm Caenorhabditis elegans. Even the com- 
mon baker's yeast Saccharomyces uses GPCRs and 
G proteins to detect the opposite mating type. Overall 
patterns have been conserved, and the introduction of 
variety has given modern organisms the ability to 
respond to a wide range of stimuli (Table 12-8). Of the 
approximately 29,000 genes in the human genome, as 
many as 1,000 encode GPCRs, including hundreds for 
olfactory stimuli and many "orphan receptors" for which 
the natural ligand is not yet known. 

All well-studied transducing systems that act through 
heterotrimeric G proteins share some common features, 
which reflect their evolutionary relatedness (Fig. 12^2). 
The receptors have seven transmembrane segments, a 
domain (generally the loop between transmembrane he- 
lices 6 and 7) that interacts with a G protein, and a 
carboxyl-terminal cytoplasmic domain that undergoes re- 
versible phosphorylation on several Ser or Thr residues. 
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FIGURE 12-41 Transduction mechanism for sweet tastants. 
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TABLE 12-8 



Acetylcholine (muscarinic) 

Adenosine 

Angiotensin 

ATP (extracellular) 

Bradykinin 

Calcitonin 

Cannabinoids 

Catecholamines 

Cholecystokinin 

Corticotropin-releasing 
factor (CRF) 

Cyclic AMP (Dictyostelium 
discoideum) 

Dopamine 



me Signals Transduced by GPCRs 



Follicle-stimulating 
hormone (FSH) 

GABA (y-aminobutyric acid) 

Glucagon 

Glutamate 

Growth hormone-releasing 
hormone (GHRH) 

Histamine 

Leukotrienes 

Light 

Luteinizing hormone (LH) 

Melatonin 

Odorants 

Opioids 



Oxytocin 

Platelet-activating factor 

Prostaglandins 

Secretin 

Serotonin 

Somatostatin 

Tastants 

Thyrotropin 

Thyrotropin-releasing 
hormone (TRH) 

Vasoactive intestinal 
peptide 

Vasopressin 

Yeast mating factors 



The ligand-binding site (or, in the case of light reception, 
the light receptor) is buried deep in the membrane and in- 
cludes residues from several of the transmembrane seg- 
ments. Ligand binding (or light) induces a conformational 
change in the receptor, exposing a domain that can interact 
with a G protein. Heterotrimeric G proteins activate or 
inhibit effector enzymes (adenylyl cyclase, PDE, or PLC), 
which change the concentration of a second messenger 
(cAMP, cGMP, IP 3 , or Ca 2+ ). In the hormone-detecting sys- 
tems, the final output is an activated protein kinase that 
regulates some cellular process by phosphorylating a pro- 
tein critical to that process. In sensory neurons, the output 



is a change in membrane potential and a consequent elec- 
trical signal that passes to another neuron in the pathway 
connecting the sensory cell to the brain. 

All these systems self-inactivate. Bound GTP is con- 
verted to GDP by the intrinsic GTPase activity of G pro- 
teins, often augmented by GTPase-activating proteins 
(GAPs) or RGS proteins (regulators of G-protein signal- 
ing; see Fig. 12-5). In some cases, the effector enzymes 
that are the targets of modulation by G proteins also serve 
as GAPs. The desensitization mechanism involving phos- 
phorylation of the carboxyl-tenninal region followed by 
arrestin binding is widespread, and may be universal. 
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FIGURE 12-42 Common features of signaling systems that detect 
hormones, light, smells, and tastes. GPCRs provide signal specificity, 
and their interaction with G proteins provides signal amplification. 
Heterotrimeric G proteins activate effector enzymes: adenylyl cy- 
clase (AC), phospholipase C (PLC), and phosphodiesterases (PDEs) 
that degrade cAMP or cGMP. Changes in concentration of the second 
messengers (cAMP, cGMP, IP 3 ) result in alterations of enzymatic ac- 
tivities by phosphorylation or alterations in the permeability (P) of 
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surface membranes to Ca 2+ , Na + , and K + . The resulting depolariza- 
tion or hyperpolarization of the sensory cell (the signal) passes 
through relay neurons to sensory centers in the brain. In the best- 
studied cases, desensitization includes phosphorylation of the recep- 
tor and binding of a protein (arrestin) that interrupts receptor-G 
protein interactions. VR is the vasopressin receptor; £-AR is the 
adrenergic receptor. Other receptor and G-protein abbreviations are 
as used in earlier illustrations. 
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SUMMARY 12.10 



Sensory Transduction in 
Vision, Olfaction, and 
Gustation 



■ Vision, olfaction, and gustation in vertebrates 
employ GPCRs, which act through heterotrimeric 
G proteins to change the F m of a sensory neuron. 

■ In rod and cone cells of the retina, light activates 
rhodopsin, which activates the G protein transducin. 
The freed a subunit of transducin activates a 
cGMP phosphodiesterase, which lowers [cGMP] 
and thus closes cGMP-dependent ion channels in 
the outer segment of the neuron. The resulting 
hyperpolarization of the rod or cone cell carries the 
signal to the next neuron in the pathway, and 
eventually to the brain. 

■ In olfactory neurons, olfactory stimuli, acting 
through GPCRs and G proteins, trigger either an 
increase in [cAMP] (by activating adenylyl cyclase) 
or an increase in [Ca 2+ ] (by activating PLC). These 
second messengers affect ion channels and thus 
the V m . 

u Gustatory neurons have GPCRs that respond to 
tastants by altering levels of cAMP, which changes 
V m by gating ion channels, 

■ There is a high degree of conservation of signaling 
proteins and transduction mechanisms across 
signaling systems and across species. 

12.11 Regulation of the Cell Cycle by 
Protein Kinases 

One of the most dramatic manifestations of signaling 
pathways is the regulation of the eukaryotic cell cycle. 
During embryonic growth and later development, cell 
division occurs in virtually every tissue. In the adult 
organism most tissues become quiescent. A cell's "deci- 
sion" to divide or not is of crucial importance to the or- 
ganism. When the regulatory mechanisms that limit cell 
division are defective and cells undergo unregulated 
division, the result is catastrophic — cancer. Proper cell di- 
vision requires a precisely ordered sequence of bio- 
chemical events that assures every daughter cell a full 
complement of the molecules required for life. Investiga- 
tions into the control of cell division in diverse eukaryotic 
cells have revealed universal regulatory mechanisms. Sig- 
naling mechanisms much like those discussed above are 
central in determining whether and when a cell undergoes 
cell division, and they also ensure orderly passage through 
the stages of the cell cycle. 

The Cell Cycle Has Four Stages 

Cell division accompanying mitosis in eukaryotes occurs 
in four well-defined stages (Fig. 12-43). In the S (syn- 
thesis) phase, the DNA is replicated to produce copies 
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FIGURE 12-43 Eukaryotic cell cycle. The durations (in hours) of the 
four stages vary, but those shown are typical. 

for both daughter cells. In the G2 phase (G indicates the 
gap between divisions), new proteins are synthesized 
and the cell approximately doubles in size. In the M 
phase (mitosis), the maternal nuclear envelope breaks 
down, paired chromosomes are pulled to opposite 
poles of the cell, each set of daughter chromosomes is 
surrounded by a newly formed nuclear envelope, and 
cytokinesis pinches the cell in half, producing two 
daughter cells (see Fig. 24-25). In embryonic or rapidly 
proliferating tissue, each daughter cell divides again, but 
only after a waiting period (Gl). In cultured animal cells 
the entire process takes about 24 hours. 

After passing through mitosis and into Gl, a cell ei- 
ther continues through another division or ceases to di- 
vide, entering a quiescent phase (GO) that may last 
hours, days, or the lifetime of the cell. When a cell in GO 
begins to divide again, it reenters the division cycle 
through the Gl phase. Differentiated cells such as hepa- 
tocytes or adipocytes have acquired their specialized 
function and form; they remain in the GO phase. Stem 
cells retain their potential to divide and to differentiate 
into any of a number of cell types. 

Levels of Cyclin-Dependent Protein Kinases Oscillate 

The timing of the cell cycle is controlled by a family of pro- 
tein kinases with activities that change in response to cel- 
lular signals. By phosphorylating specific proteins at 
precisely timed intervals, these protein kinases orches- 
trate the metabolic activities of the ceU to produce 
orderly cell division. The kinases are heterodimers with 
a regulators subunit, cyclin, and a catalytic subunit, cyclin- 
dependent protein kinase (CDK). In the absence of 
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FIGURE 12-44 Activation of cyclin-dependent protein kinases (CDKs) by cyclin and phos- 
phorylation. CDKs, a family of related enzymes, are active only when associated with cy- 
cling another protein family. The crystal structure of CDK2 with and without cyclin reveals 
the basis for this activation, (a) Without cyclin (PDB ID 1HCK), CDK2 folds so that one 
segment, theT loop, obstructs the binding site for protein substrates and thus inhibits protein 
kinase activity. The binding site for ATP is also near the T loop, (b) When cyclin binds (PDB 
ID 1 FIN), it forces conformational changes that move theT loop away from the active site and 
reorient an amino-terminal helix, bringing a residue critical to catalysis (Clu 51 ) into the ac- 
tive site (c) Phosphorylation of a Thr residue in the T loop produces a negatively charged 
residue that is stabilized by interaction with three Arg residues, holding CDK in its active 
conformation (PDB ID 1JST). 




cyclin, the catalytic subunit is virtually inactive. When 
cyclin binds, the catalytic site opens up, a residue essen- 
tial to catalysis becomes accessible (Fig. 12-44), and 
the activity of the catalytic subunit increases 10,000-fold. 
Animal cells have at least 10 different cyclins (desig- 
nated A, B, and so forth) and at least 8 CDKs (CDK1 
through CDK8), which act in various combinations at 
specific points in the cell cycle. Plants also use a family of 
CDKs to regulate their cell division in root and shoot 
meristems, the principal tissues in which division occurs. 

In a population of animal cells undergoing synchro- 
nous division, some CDK activities show striking oscilla- 
tions (Fig. 12-45). These oscillations are the result of 
four mechanisms for regulating CDK activity: phospho- 
rylation or dephosphorylation of the CDK, controlled 
degradation of the cyclin subunit, periodic synthesis of 
CDKs and cyclins, and the action of specific CDK- 
inhibiting proteins. In general, active CDKs enable a cell 
to enter a stage of cell division. 

Regulation of CDKs by Phosphorylation The ac 

tivity of a CDK is strikingly affected by phosphorylation 
and dephosphorylation of two critical residues in the 
protein (Fig. 12-46a). Phosphorylation of Tyr 15 near 
the amino terminus renders CDK2 inactive; the ®-Tyr 
residue is in the ATP-binding site of the kinase, and the 
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FIGURE 12-45 Variations in the activities of specific CDKs during the 
cell cycle in animals. Cyclin E-CDK2 activity peaks near the G1 
phase-S phase boundary, when the active enzyme triggers synthesis of 
enzymes required for DNA synthesis (see Fig. 12-48). Cyclin A-CDK2 
activity rises during the S and G2 phases, then drops sharply in the M 
phase, as cyclin B-CDK1 peaks. 

negatively charged phosphate group blocks the entry of 
ATP. A specific phosphatase (a PTPase) dephosphory- 
lates this (PKtVr residue, permitting the binding of ATP. 
Phosphorylation of Thr 160 in the "T loop" of CDK, cat- 
alyzed by the CDK-activating kinase, forces the T loop 
out of the substrate-binding cleft, permitting substrate 
binding and catalytic activity (see Fig. 12-44c). 
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FIGURE 12-46 Regulation of CDK by phosphorylation and proteoly- 
sis, (a) The cyclin-dependent protein kinase activated at the time of mi- 
tosis (the M-phase CDK) has a "T loop" that can fold into the 
substrate-binding site, When Thr 160 in the T loop is phosphorylated, 
the loop moves out of the substrate-binding site, activating the CDK 
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manyfold (step®), (b) The active cyclin-CDK complex triggers its own 
inactivation by phosphorylation of DBRP (destruction box recognizing 
protein; step ©). DBRP and ubiquitin ligase then attach several mole- 
cules of ubiquitin (U) to cyclin (step ©), targeting it for destruction by 
proteasomes, proteolytic enzyme complexes (step®). 



One circumstance that triggers this control mecha- 
nism is the presence of single-strand breaks in DNA, 
which leads to arrest of the cell cycle in G2. A specific 
protein kinase (called Rad3 in yeast), which is activated 
by single-strand breaks, triggers a cascade leading to the 
inactivation of the PTPase that dephosphorylates Tvr 15 
of CDK. The CDK remains inactive and the cell is 
arrested in G2. The cell cannot divide until the DNA is 
repaired and the effects of the cascade are reversed. 

Controlled Degradation of Cyclin Highly specific 
and precisely timed proteolytic breakdown of mitotic 
cyclins regulates CDK activity throughout the cell cycle. 
Progress through mitosis requires first the activation 
then the destruction of cyclins A and B, which activate 
the catalytic subunit of the M-phase CDK. These cy- 
clins contain near their amino terminus the sequence 
-Arg-Thr-Ala-Leu-Gly-Asp-Ile-Gly-Asn-, the "destruc- 
tion box," which targets them for degradation. (This 
usage of "box" derives from the common practice, in 



diagramming the sequence of a nucleic acid or protein, 
of enclosing within a box a short sequence of nucleotide 
or amino acid residues with some specific function. It 
does not imply any three-dimensional structure.) The 
protein DBRP (destruction box recognizing protein) rec- 
ognizes this sequence and initiates the process of cyclin 
degradation by bringing together the cyclin and another 
protein, ubiquitin. Cyclin and activated ubiquitin are 
covalently joined by the enzyme ubiquitin ligase (Fig. 
12-46b). Several more ubiquitin molecules are then ap- 
pended, providing the signal for a proteolytic enzyme 
complex, or proteasome, to degrade cyclin. 

What controls the timing of cyclin breakdown? A 
feedback loop occurs in the overall process shown in 
Figure 12-46. Increased CDK activity activates cyclin 
proteolysis. Newly synthesized cyclin associates with 
and activates CDK, which phosphorylates and activates 
DBRP. Active DBRP then causes proteolysis of cyclin. 
The lowered cyclin level causes a decline in CDK activ- 
ity, and the activity of DBRP also drops through slow, 
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constant dephosphorylation and inactivation by a DBRP 
phosphatase. The cyclin level is ultimately restored by 
synthesis of new cyclin molecules. 

The role of ubiquitin and proteasomes is not limited 
to the regulation of cyclin; as we shall see in Chapter 27, 
both also take part in the turnover of cellular proteins, a 
process fundamental to cellular housekeeping. 

Regulated Synthesis of CDKs and Cyclins The 

third mechanism for changing CDK activity is regulation 
of the rate of synthesis of cyclin or CDK or both. For ex- 
ample, cyclin D, cyclin E, CDK2, and CDK4 are synthe- 
sized only when a specific transcription factor, E2F, is 
present in the nucleus to activate transcription of their 
genes. Synthesis of E2F is in turn regulated by extracel- 
lular signals such as growth factors and cytokines 
(developmental signals that induce cell division), com- 
pounds found to be essential for the division of mam- 
malian cells in culture. They induce the synthesis of 
specific nuclear transcription factors essential to the 
production of the enzymes of DNA synthesis. Growth 
factors trigger phosphorylation of the nuclear proteins 
Jun and Fos, transcription factors that promote the syn- 
thesis of a variety of gene products, including cyclins, 
CDKs, and E2F. In turn, E2F controls production of sev- 
eral enzymes essential for the synthesis of deoxynu- 
cleotides and DNA, enabling cells to enter the S phase 
(Fig. 12-47). 
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FIGURE 12-47 Regulation of cell division by growth factors. The path 
from growth factors to cell division leads through the enzyme cascade 
that activates MAPK; phosphorylation of the nuclear transcription fac- 
tors Jun and Fos; and the activity of the transcription factor E2F, which 
promotes synthesis of several enzymes essential for DNA synthesis. 



Inhibition of CDKs Finally, specific protein inhibitors 
bind to and inactivate specific CDKs. One such protein is 
p21, which we discuss below. 

These four control mechanisms modulate the activity of 
specific CDKs that, in turn, control whether a cell will 
divide, differentiate, become permanently quiescent, or 
begin a new cycle of division after a period of quies- 
cence. The details of cell cycle regulation, such as the 
number of different cyclins and kinases and the combi- 
nations in which they act, differ from species to species, 
but the basic mechanism has been conserved in the 
evolution of all eukaryotic cells. 

CDKs Regulate Cell Division by Phosphoryiating 
Critical Proteins 

We have examined how cells maintain close control of 
CDK activity, but how does the activity of CDK control 
the cell cycle? The list of target proteins that CDKs are 
known to act upon continues to grow, and much remains 
to be learned. But we can see a general pattern behind 
CDK regulation by inspecting the effect of CDKs on the 
structures of lamin and myosin and on the activity of 
retinoblastoma protein. 

The structure of the nuclear envelope is maintained 
in part by highly organized meshworks of intermediate 
filaments composed of the protein lamin. Breakdown of 
the nuclear envelope before segregation of the sister 
chromatids in mitosis is partly due to the phosphoryla- 
tion of lamin by a CDK, which causes lamin filaments to 
depolymerize. 

A second kinase target is the ATP-driven contractile 
machinery (actin and myosin) that pinches a dividing 
cell into two equal parts during cytokinesis. After the di- 
vision, CDK phosphorylates a small regulatory subunit 
of myosin, causing dissociation of myosin from actin fil- 
aments and inactivating the contractile machinery. Sub- 
sequent dephosphorylation allows reassembly of the 
contractile apparatus for the next round of cytokinesis. 

A third and very important CDK substrate is the 
retinoblastoma protein, pRb; when DNA damage is 
detected, this protein participates in a mechanism that ar- 
rests cell division in Gl (Fig. 12^8). Named for the retinal 
tumor cell line in which it was discovered, pRb functions 
in most, perhaps all, cell types to regulate cell division in re- 
sponse to a variety of stimuli. Unphosphorylated pRb binds 
the transcription factor E2F; while bound to pRb, E2F can- 
not promote transcription of a group of genes necessary for 
DNA synthesis (the genes for DNA polymerase a, ribonu- 
cleotide reductase, and other proteins; see Chapter 25). In 
this state, the cell cycle cannot proceed from the Gl to the 
S phase, the step that commits a cell to mitosis and cell 
division. The pRb-E2F blocking mechanism is relieved 
when pRb is phosphorylated by cyclin E-CDK2, which oc- 
curs in response to a signal for cell division to proceed. 

When the protein kinases ATM and ATR detect dam- 
age to DNA (signaled by the presence of the protein MRN 
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FIGURE 12-48 Regulation of passage from Gl to S by phosphoryla- 
tion of pRb. Transcription factor E2F promotes transcription of genes 
for certain enzymes essential to DNA synthesis. The retinoblastoma 
protein, pRb, can bind E2F (lower left), inactivating it and preventing 
transcription of these genes. Phosphorylation of pRb by CDK2 prevents 
it from binding and inactivating E2F, and the genes are transcribed, al- 
lowing cell division. Damage to the cell's DNA (upper left) triggers a 
series of events that inactivate CDK2, blocking cell division. When the 
protein MRN detects damage to the DNA, it activates two protein ki- 
nases, ATM and ATR, and they phosphorylate and activate the tran- 
scription factor p53. Active p53 promotes the synthesis of another 
protein, p21, an inhibitor of CDK2. Inhibition of CDK2 stops the phos- 
phorylation of pRb, which therefore continues to bind and inhibit E2F. 
With E2F inactivated, genes essential to cell division are not tran- 
scribed and cell division is blocked. When DNA has been repaired, 
this series of events is reversed, and the cell divides. 

at a double-strand break site), they phosphorylate p53, 
activating it to serve as a transcription factor that stimu- 
lates the synthesis of the protein p21 (Fig. 12-48). This 
protein inhibits the protein kinase activity of cyclin 
E-CDK2. In the presence of p21, pRb remains unphos- 
phorylated and bound to E2F, blocking the activity of this 
transcription factor, and the cell cycle is arrested in Gl. 
This gives the cell time to repair its DNA before entering 
the S phase, thereby avoiding the potentially disastrous 
transfer of a defective genome to one or both daughter 



cells. When the damage is too severe to allow effective re- 
pair, this same machinery triggers a process (apoptosis ; 
described below) that leads to the death of the cell, pre- 
venting the possible development of a cancer. 



SUMMARY 12.11 



Regulation of the Cell 
Cycle by Protein Kinases 

■ Progression through the cell cycle is regulated by the 
cyclin-dependent protein kinases (CDKs), which 
act at specific points in the cycle, phosphorylating 
key proteins and modulating their activities. The 
catalytic subunit of CDKs is inactive unless 
associated with the regulatory cyclin subunit. 

■ The activity of a cyclin-CDK complex changes 
during the cell cycle through differential synthesis 
of CDKs, specific degradation of cyclin, 
phosphorylation and dephosphorylation of critical 
residues in CDKs, and binding of inhibitory proteins 
to specific cyclin-CDKs. 

12.12 Oncogenes, Tumor Suppressor Genes, 
and Programmed Cell Death 

Tumors and cancer are the result of uncontrolled cell di- 
vision. Normally, cell division is regulated by a family of 
extracellular growth factors, proteins that cause resting 
cells to divide and, in some cases, differentiate. The re- 
sult is a precise balance between the formation of new 
cells (such as skin cells that die and are replaced every 
few weeks, or white blood cells that are replaced every 
few days) and cell destruction. When this balance is 
disturbed by defects in regulatory proteins, the result 
is sometimes the formation of a clone of cells that di- 
vide repeatedly and without regulation (a tumor) until 
their presence interferes with the function of normal 
tissues— cancer. The direct cause is almost always a ge- 
netic defect in one or more of the proteins that regulate 
cell division. In some cases, a defective gene is inherited 
from one parent; in other cases, the mutation occurs 
when a toxic compound from the environment (a muta- 
gen or carcinogen) or high-energy radiation interacts 
with the DNA of a single cell to damage it and introduce 
a mutation. In most cases there is both an inherited and 
an environmental contribution, and in most cases, more 
than one mutation is required to cause completely un- 
regulated division and full-blown cancer. 

Oncogenes Are Mutant Forms of the Genes for 
Proteins That Regulate the Cell Cycle 

Oncogenes were originally discovered in tumor- 
causing viruses, then later found to be derived 
from genes in the animal host cells, proto-oncogenes, 

which encode growth-regulating proteins. During a vi- 
ral infection, the host DNA sequence of a proto-onco- 
gene is sometimes copied into the viral genome, where 
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it proliferates with the virus. In subsequent viral infec- 
tion cycles, the proto-oncogenes can become defective 
by truncation or mutation. Viruses, unlike animal cells, 
do not have effective mechanisms for correcting mis- 
takes during DNA replication, so they accumulate mu- 
tations rapidly. When a virus carrying an oncogene 
infects a new host cell, the viral DNA (and oncogene) 
can be incorporated into the host cell's DNA, where it 
can now interfere with the regulation of cell division in 
the host cell. In an alternative, nonviral mechanism, a 
single cell in a tissue exposed to carcinogens may suf- 
fer DNA damage that renders one of its regulatory 
proteins defective, with the same effect as the onco- 
genic mechanism: failed regulation of cell division. 

The mutations that produce oncogenes are geneti- 
cally dominant; if either of a pair of chromosomes con- 
tains a defective gene, that gene product sends the 
signal "divide" and a tumor will result. The oncogenic 
defect can be in any of the proteins involved in commu- 
nicating the "divide" signal. Oncogenes discovered thus 
far include those that encode secreted proteins, growth 
factors, transmembrane proteins (receptors), cytoplas- 
mic proteins (G proteins and protein kinases), and the 
nuclear transcription factors that control the expression 
of genes essential for cell division (Jun, Fos). 

Some oncogenes encode surface receptors with de- 
fective or missing signal-binding sites, such that their 
intrinsic Tyr kinase activity is unregulated. For example, 
the oncoprotein ErbB is essentially identical to the 
normal receptor for epidermal growth factor, except 
that ErbB lacks the amino -terminal domain that nor- 
mally binds EGF (Fig. 12-49) and as a result sends the 
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FIGURE 12-49 Oncogene-encoded defective EGF receptor. The prod- 
uct of the erbB oncogene (the ErbB protein) is a truncated version of 
the normal receptor for epidermal growth factor (EGF). Its intracellular 
domain has the structure normally induced by EGF binding, but the 
protein lacks the extracellular binding site for EGF. Unregulated by 
EGF, ErbB continuously signals cell division. 



"divide" signal whether EGF is present or not. Mutations 
in erbB2, the gene for a receptor Tyr kinase related to 
ErbB, are commonly associated with cancers of the 
glandular epithelium in breast, stomach, and ovary. (For 
an explanation of the use of abbreviations in naming 
genes and their products, see Chapter 25.) 

The prominent role played by protein kinases in sig- 
naling processes related to normal and abnormal cell divi- 
sion has made them a prime target in the development of 
drugs for the treatment of cancer (Box 12-5). Mutant 
forms of the G protein Ras are common in tumor cells. 
The ras oncogene encodes a protein with normal GTP 
binding but no GTPase activity. The mutant Ras protein is 
therefore always in its activated (GTP-bound) form, re- 
gardless of the signals arriving through normal receptors. 
The result can be unregulated growth. Mutations in ras 
are associated with 30% to 50% of lung and colon carcino- 
mas and more than 90% of pancreatic carcinomas. ■ 

Defects in Certain Genes Remove Normal 
Restraints on Cel! Division 

r Tumor suppressor genes encode proteins that 
normally restrain cell division. Mutation in one or 
more of these genes can lead to tumor formation. Unreg- 
ulated growth due to defective tumor suppressor genes, 
unlike that due to oncogenes, is genetically recessive; 
tumors form only if both chromosomes of a pair contain 
a defective gene. This is because the function of these 
genes is to prevent cell division, and if either copy of the 
gene for such a protein is normal, the normal inhibition 
of division will take place. In a person who inherits one 
correct copy and one defective copy, every cell begins 
with one defective copy of the gene. If any one of those 
10 12 somatic cells undergoes mutation in the one good 
copy, a tumor may grow from that doubly mutant cell. 
Mutations in both copies of the genes for pRb, p53, or 
p21 yield cells in which the normal restraint on cell divi- 
sion is lost and a tumor forms. 

Retinoblastoma occurs in children and causes blind- 
ness if not surgically treated. The cells of a retinoblas- 
toma have two defective versions of the Rb gene (two 
defective alleles). Very young children who develop 
retinoblastoma commonly have multiple tumors in both 
eyes. These children have inherited one defective copy 
of the Rb gene, which is present in every cell; each tu- 
mor is derived from a single retinal cell that has under- 
gone a mutation in its one good copy of the Rb gene. (A 
fetus with two mutant alleles in every cell is nonviable.) 
People with retinoblastoma who survive childhood also 
have a high incidence of cancers of the lung, prostate, 
and breast later in life. 

A far less likely event is that a person born with two 
good copies of the Rb gene will have independent muta- 
tions in both copies in the same cell Some individuals 
do develop retinoblastomas later in childhood, usually 
with only one tumor in one eye. These individuals were 
presumably born with two good copies (alleles) of Rb in 
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BOX 12-5 Development of Protein Kinase Inhibitors for Cancer Treatment 



When a single cell divides without any regulatory limita- 
tion, it eventually gives rise to a clone of cells so large 
that it interferes with normal physiological functions 
(Fig. 1). This is cancer, a leading cause of death in the 
developed world, and increasingly so in the developing 
world. In all types of cancer, the normal regulation of 
cell division has become dysfunctional due to defects in 
one or more genes. For example, genes encoding pro- 
teins that normally send intermittent signals for cell 
division become oncogenes, producing constitutively 
active signaling proteins; or genes encoding proteins 
that normally restrain cell division (tumor suppressor 
genes) mutate to produce proteins that lack this brak- 
ing function. In many tumors, both kinds of mutation 
have occurred. 

Many oncogenes and tumor suppressor genes en- 
code protein kinases or proteins that act in pathways up- 
stream from protein kinases. It is therefore reasonable to 
hope that specific inhibitors of protein kinases could 
prove valuable in the treatment of cancer. For example, 
a mutant form of the EGF receptor is a constantly active 
receptor Tyr kinase (RTK), signaling cell division 
whether EGF is present or not (see Fig. 12-49). In about 
30% of all women with invasive breast cancer, a mutation 
in the receptor gene HER2/neu yields an RTK with activ- 
ity increased up to 100-fold. Another RTK, vascular en- 
dothelial growth factor receptor (VEGF-R), must 
be activated for the formation of new blood vessels (an- 
giogenesis) to provide a solid tumor with its own blood 
supply, and inhibition of VEGF-R might starve a tumor of 
essential nutrients. Nonreceptor Tyr kinases can also 
mutate, resulting in constant signaling and unregulated 
cell division. For example, the oncogene Abl (from the 
ylbelson leukemia virus) is associated with acute myeloid 
leukemia, a relatively rare blood disease (-5,000 cases a 
year in the United States). Another group of oncogenes 




FIGURE 1 Unregulated division of a single cell in the colon led to a pri- 
mary cancer that metastasized to the liver. Secondary cancers are seen 
as white patches in this liver obtained at autopsy. 



encode unregulated cyclin-dependent protein kinases. 
In each of these cases, specific protein kinase inhibitors 
might be valuable chemotherapeutic agents in the treat- 
ment of disease. Not surprisingly, huge efforts are under 
way to develop such inhibitors. How should one ap- 
proach this challenge? 

Protein kinases of all types show striking conserva- 
tion of structure at the active site. All share with the pro- 
totypical PKA structure the features shown in Figure 2: 
two lobes that enclose the active site, with a P loop that 
helps to align and bind the phosphoryl groups of ATP, an 
activation loop that moves to open the active site to the 
protein substrate, and a C helix that changes position as 
the enzyme is activated, bringing the residues in the 
substrate-binding cleft into their binding positions. 

The simplest protein kinase inhibitors are ATP 
analogs that occupy the ATP-binding site but cannot 
serve as phosphoryl group donors. Many such com- 
pounds are known, but their clinical usefulness is limited 
by their lack of selectivity— they inhibit virtually all pro- 
tein kinases and would produce unacceptable side 

(continued on next page) 




FIGURE 2 Conserved features of the active site of protein kinases (PDB 
ID 1S9I). The amino-terminal and carboxyl-terminal lobes surround 
the active site of the enzyme, near the catalytic loop and the site where 
ATP binds. The activation loop of this and many other kinases under- 
goes phosphorylation, then moves away from the active site to expose 
the substrate-binding cleft, which in this image is occupied by a spe- 
cific inhibitor of this enzyme, PD31 8088. The P loop is essential in the 
binding of ATP, and the C helix must also be correctly aligned for ATP 
binding and kinase activity. 
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Development of Protein Kinase Inhibitors for Cancer Treatment 
(continued from previous page) 



effects. More selectivity is seen with compounds that 
fill part of the ATP-binding site but also interact out- 
side this site, with parts of the protein unique to the 
target protein kinase. A third possible strategy is 
based on the fact that although the active conforma- 
tions of all protein kinases are similar, their inactive 
conformations are not. Drugs that target the inactive 
conformation of a specific protein kinase and prevent 
its conversion to the active form may have a higher 
specificity of action. A fourth approach employs the 
great specificity of antibodies. For example, mono- 
clonal antibodies (p. 173) that bind the extracellular 
portions of specific RTKs could eliminate the recep- 
tors' kinase activity by preventing dimerization or by 
causing their removal from the cell surface. In some 
cases, an antibody selectively binding to the surface 
of cancer cells could cause the immune system to at- 
tack those cells. 

Between 1998 and mid-2006, only eight new drugs 
were approved in the United States for use in cancer 
therapy: five small molecules and three monoclonal an- 
tibodies, each having shown efficacy in clinical trials. 
For example, imatinib mesylate (Gleevec; Fig. 3a), one 
of the small molecule inhibitors, has proved nearly 
100% effective in bringing about remission in patients 
with early-stage chronic myeloid leukemia. Erlotinib 
(Tarceva; Fig. 3b), which targets EGF-R, is effective 
against advanced non-small-cell lung cancer (NSCLC). 
Because many cell-division signaling systems involve 
more than one protein kinase, inhibitors that act on sev- 
eral protein kinases may be useful in the treatment of 
cancer. Sunitinib (Sutent) and sorafenib (Nexavar) tar- 
get several protein kinases, including VEGF-R and 
PDGF-R. These two drugs are in clinical use for pa- 
tients with gastrointestinal stromal tumors and ad- 
vanced renal cell carcinoma, respectively. Trastuzumab 
(Herceptin), cetuximab (Erbitux), and bevacizumab 
(Avastin) are monoclonal antibodies that target 
HER2/neu, EGF-R, and VEGF-R, respectively; all three 
drugs are in clinical use for certain types of cancer. 

At least a hundred more compounds are in pre- 
clinical trials. Among the drugs being evaluated are 
some obtained from natural sources and some pro- 
duced by synthetic chemistry. Indirubin is a compo- 
nent of a Chinese herbal preparation traditionally used 
to treat certain leukemias; it inhibits CDK2 and CDK5. 
Flavopiridol (Fig. 3d), a synthetic analog of an alkaloid 
extracted from the stem bark of the Indian plant 
Amoora rohituka, is a general CDK inhibitor. With 
several hundred potential anticancer drugs heading 
toward clinical testing, it is realistic to hope that some 
will prove more effective or more target-specific than 
those now in use. 



(a) Imatinib (Gleevec) 
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FIGURE 3 Some protein kinase inhibitors now in clinical trials or clini- 
cal use, showing their binding to the target protein, (a) Imatinib binds to 
the Abl oncogene kinase active site (PDB ID 1 1EP); it occupies both the 
ATP-binding site and a region adjacent to that site, (b) Erlotinib binds 
to the active site of EGF-R (PDB ID 1M17). (c), (d) Flavopiridol is an 
inhibitor of the cyclin-dependent kinase CDK2; shown here are nor- 
mal ATP binding (c) at the active site (PDB ID 1S9I) and flavopiridol 
binding (d), which prevents the binding of ATP (PDB ID 2A4L). 
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every cell, but both Rb alleles in a single retinal cell have 
undergone mutation, leading to a tumor. After about age 
three, retinal cells stop dividing, and retinoblastomas at 
later ages are quite rare. 

Stability genes (also called caretaker genes) en- 
code proteins that function in the repair of major 
genetic defects that result from aberrant DNA replica- 
tion, ionizing radiation, or environmental carcinogens. 
Mutations in these genes lead to a high frequency of un- 
repaired damage (mutations) in other genes, including 
proto-oncogenes and tumor suppressor genes, and thus 
to cancer. Among the stability genes are ATM (see Fig. 
12-48); the XP gene family, in which mutations lead to 
xeroderma pigmentosum; and the BRCA1 genes associ- 
ated with some types of breast cancer (see Box 25-1). 
Mutations in the gene for p53 also cause tumors; in more 
than 90% of human cutaneous squamous cell carcinomas 
(skin cancers) and in about 50% of all other human can- 
cers, p53 is defective. Those very rare individuals who 
inherit one defective copy of p53 commonly have the Li- 
Fraumeni cancer syndrome, with multiple cancers (of 
the breast, brain, bone, blood, lung, and skin) occurring 
at high frequency and at an early age. The explanation 
for multiple tumors in this case is the same as that for Rb 
mutations: an individual born with one defective copy of 
p53 in every somatic cell is likely to suffer a second p53 
mutation in more than one cell during his or her lifetime. 

In summary, then, three classes of defects can con- 
tribute to the development of cancer: oncogenes, in 
which the defect is the equivalent of a car's accelerator 
pedal being stuck down, with the engine racing; mu- 
tated tumor suppressor genes, in which the defect leads 
to the equivalent of brake failure; and mutated stability 
genes, with the defect leading to unrepaired damage to 
the cell's replication machinery, the equivalent of an 
unskilled car mechanic. 

Mutations in oncogenes and tumor suppressor genes 
do not have an all-or-none effect. In some cancers, per- 
haps in all, the progression from a normal cell to a malig- 
nant tumor requires an accumulation of mutations 
(sometimes over several decades), none of which, alone, 
is responsible for the end effect. For example, the devel- 
opment of colorectal cancer has several recognizable 
stages, each associated with a mutation (Fig. 12-50). If 
an epithelial cell in the colon undergoes mutation of both 
copies of the tumor suppressor gene APC (adenomatous 
polyposis coli), it begins to divide faster than normal and 
produces a clone of itself, a benign polyp (early ade- 
noma), For reasons not yet known, the APC mutation re- 
sults in chromosomal instability, and whole regions of a 
chromosome are lost or rearranged during cell division. 
This instability can lead to another mutation, commonly 
in ras, that converts the clone into an intermediate ade- 
noma. A third mutation (often in the tumor suppressor 
gene DCC) leads to a late adenoma. Only when both 
copies of p53 become defective does this cell mass 
become a carcinoma — a malignant, life-threatening 
tumor. The full sequence therefore requires at least 
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FIGURE 12-50 From normal epithelial cell to colorectal cancer. In the 

colon, mutations in both copies of the tumor suppressor gene APC lead 
to benign clusters of epithelial cells that multiply too rapidly (early ade- 
noma). If a cell already defective in APC undergoes a second mutation 
in the proto-oncogene ras, the doubly mutant cell gives rise to an inter- 
mediate adenoma, forming a benign polyp. When one of these cells un- 
dergoes further mutations in the tumor suppressor genes DCC and p53, 
increasingly aggressive tumors form. Finally, mutations in genes not yet 
characterized lead to a malignant tumor and finally to a metastatic 
tumor that can spread to other tissues. Most malignant tumors of other 
tissues probably result from a series of mutations such as this. 

seven genetic "hits": two on each of three tumor sup- 
pressor genes (APC, DCC, and p53~) and one on the 
proto-oncogene ras. There are probably several other 
routes to colorectal cancer as well, but the principle 
that full malignancy results only from multiple muta- 
tions is likely to hold true for all of them. When a polyp 
is detected in the early adenoma stage and the cells 
containing the first mutations are removed surgically 
late adenomas and carcinomas will not develop; hence 
the importance of early detection. Cells and organisms, 
too, have their early detection systems. For example, 
the ATM and ATR proteins described in Section 12.11 
can detect DNA damage too extensive to be repaired ef- 
fectively. They then trigger, through a pathway that in- 
cludes p53, the process of apoptosis, in which a cell that 
has become dangerous to the organism kills itself. ■ 

Apoptosis Is Programmed Cell Suicide 

Many cells can precisely control the time of their own 
death by the process of programmed cell death, or 
apoptosis (app'-a-toe'-sis; from the Greek for "drop- 
ping off," as in leaves dropping in the fall). One trigger 
for apoptosis is irreparable damage to DNA. Programmed 
cell death also occurs during the development of an 
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embryo, when some cells must die to give a tissue or or- 
gan its final shape. Carving fingers from stubby limb 
buds requires the precisely timed death of cells between 
developing finger bones. During development of the 
nematode C. elegans from a fertilized egg, exactly 131 
cells (of a total of 1,090 somatic cells in the embryo) 
must undergo programmed death in order to construct 
the adult body 

Apoptosis also has roles in processes other than de- 
velopment. If a developing antibody-producing cell gen- 
erates antibodies against a protein or glycoprotein 
normally present in the body, that cell undergoes pro- 
grammed death in the thymus gland — an essential 
mechanism for eliminating anti-self antibodies. The 
monthly sloughing of cells of the uterine wall (menstru- 
ation) is another case of apoptosis mediating normal cell 
death. The dropping of leaves in the fall is the result of 
apoptosis in specific cells of the stem. Sometimes cell 
suicide is not programmed but occurs in response to bi- 
ological circumstances that threaten the rest of the or- 
ganism. For example, a virus-infected cell that dies 
before completion of the infection cycle prevents spread 
of the virus to nearby cells. Severe stresses such as heat, 
hyperosmolarity, UV light, and gamma irradiation also 
trigger cell suicide; presumably the organism is better 
off with any aberrant, potentially mutated cells dead. 

The regulatory mechanisms that trigger apoptosis 
involve some of the same proteins that regulate the cell 
cycle. The signal for suicide often comes from outside, 
through a surface receptor. Tumor necrosis factor 
(TNF), produced by cells of the immune system, inter- 
acts with cells through specific TNF receptors. These re- 
ceptors have TNF-binding sites on the outer face of the 
plasma membrane and a "death domain" (-80 amino 
acid residues) that carries the self-destruct signal through 
the membrane to cytosolic proteins such as TRADD 
(TNF receptor-associated death domain) (Fig. 12-51). 
Another receptor, Fas, has a similar death domain that 
allows it to interact with the cytosolic protein FADD 
(Fas-associated death domain), which activates the cy- 
tosolic protease caspase 8. This enzyme belongs to a 
family of proteases that participate in apoptosis; all are 
synthesized as inactive proenzymes, all have a critical 
Cys residue at the active site, and all hydrolyze their tar- 
get proteins on the carboxyl-terminal side of specific Asp 
residues (hence the name caspase, from Cys and Asp). 

When caspase 8, an "initiator" caspase, is activated 
by an apoptotic signal carried through FADD, it further 
self-activates by cleaving its own proenzyme form. Mito- 
chondria are one target of active caspase 8. The pro- 
tease causes the release of certain proteins contained 
between the inner and outer mitochondrial membranes: 
cytochrome c (Chapter 19) and several "effector" cas- 
pases. Cytochrome c binds to the proenzyme form of the 
effector enzyme caspase 9 and stimulates its proteolytic 
activation. The activated caspase 9 in turn catalyzes 
wholesale destruction of cellular proteins— a major 
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FIGURE 12-51 Initial events of apoptosis. Receptors in the plasma mem- 
brane (Fas,TNF-R1) receive signals from outside the cell (the Fas ligand or 
tumor necrosis factor (TNF), respectively). Activated receptors foster inter- 
action between the "death domain" (an 80 amino acid sequence) of Fas 
orTNF-R1 and a similar death domain in the cytosolic proteins FADD or 
TRADD. FADD activates a cytosolic protease, caspase 8, that proteolyti- 
cally activates other cellular proteases. TRADD also activates proteases. 
The resulting proteolysis is a primary factor in cell death. 

cause of apoptotic cell death. One specific target of cas- 
pase action is a caspase-activated deoxyribonuclease. 

In apoptosis, the monomeric products of protein 
and DNA degradation (amino acids and nucleotides) are 
released in a controlled process that allows them to be 
taken up and reused by neighboring cells. Apoptosis 
thus allows the organism to eliminate a cell that is un- 
needed or potentially dangerous without wasting its 
components. 

SUMMARY 12.12 Oncogenes, Tumor 

Suppressor Genes, and 
Programmed Cell Death 

■ Oncogenes encode defective signaling proteins. By 
continually giving the signal for cell division, they 
lead to tumor formation. Oncogenes are genetically 
dominant and may encode defective growth factors, 
receptors, G proteins, protein kinases, or nuclear 
regulators of transcription. 

■ Tumor suppressor genes encode regulatory proteins 
that normally inhibit cell division; mutations in 
these genes are genetically recessive but can lead 
to tumor formation. 
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Cancer is generally the result of an accumulation of 
mutations in oncogenes and tumor suppressor 
genes. 

When stability genes, which encode proteins 
necessary for the repair of genetic damage, are 
mutated, other mutations go unrepaired, including 
mutations in proto-oncogenes and tumor suppressor 
genes that can lead to cancer. 

Apoptosis can be triggered by extracellular signals 
such as TNF, acting through plasma membrane 
receptors. 
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Problems — 

1. Hormone Experiments in Cell-Free Systems In the 

1950s, Earl W. Sutherland, Jr., and his colleagues carried out 
pioneering experiments to elucidate the mechanism of action 
of epinephrine and glucagon. Given what you have learned 



in this chapter about hormone action, interpret each of the 
experiments described below. Identify substance X and indi- 
cate the significance of the results. 

(a) Addition of epinephrine to a homogenate of normal 
liver resulted in an increase in the activity of glycogen phos- 
phorylase. However, when the homogenate was first cen- 
trifuged at a high speed and epinephrine or glucagon was 
added to the clear supernatant fraction that contains phos- 
phorylase, no increase in the phosphorylase activity occurred. 

(b) When the particulate fraction from the centrifugation 
in (a) was treated with epinephrine, substance X was pro- 
duced. The substance was isolated and purified. Unlike epi- 
nephrine, substance X activated glycogen phosphorylase 
when added to the clear supernatant fraction of the cen- 
trifuged homogenate. 

(c) Substance X was heat-stable; that is, heat treatment 
did not affect its capacity to activate phosphorylase. (Hint: 
Would this be the case if substance X were a protein?) Sub- 
stance X was nearly identical to a compound obtained when 
pure ATP was treated with barium hydroxide. (Fig. 8-6 will 
be helpful) 

2. Effect of Dibutyryl cAMP versus cAMP on Intact 
Cells The physiological effects of epinephrine should in prin- 
ciple be mimicked by addition of cAMP to the target cells. In 
practice, addition of cAMP to intact target cells elicits only a 
minimal physiological response. Why? When the structurally 
related derivative dibutyryl cAMP (shown below) is added to 
intact cells, the expected physiological response is readily ap- 
parent. Explain the basis for the difference in cellular re- 
sponse to these two substances. Dibutyryl cAMP is widely 
used in studies of cAMP function. 



C-(CH 2 ) 2 CH ; 
I 

NH 




0=P O O 

^ ^C-(CH 2 ) 2 CH 3 
O 

Dibutyryl cAMP 
(N 6 ,0 2 ' -Dibutyryl adenosine 3 ',5 '-cyclic monophosphate) 

HH 3. Effect of Cholera Toxin on Adenylyl Cyclase 

la The gram-negative bacterium Vibrio cholerae pro- 
duces a protein, cholera toxin (M T 90,000), that is responsi- 
ble for the characteristic symptoms of cholera: extensive 
loss of body water and Na + through continuous, debilitating 
diarrhea. If body fluids and Na + are not replaced, severe de- 
hydration results; untreated, the disease is often fatal. 
When the cholera toxin gains access to the human intestinal 
tract it binds tightly to specific sites in the plasma mem- 
brane of the epithelial cells lining the small intestine, 
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causing adenylyl cyclase to undergo prolonged activation 
(hours or days). 

(a) What is the effect of cholera toxin on [cAMP] in the in- 
testinal cells? 

(b) Based on the information above, suggest how cAMP 
normally functions in intestinal epithelial cells. 

(c) Suggest a possible treatment for cholera. 

4. Mutations in PKA Explain how mutations in the R or C 
subunit of cAMP-dependent protein kinase (PKA) might lead 
to (a) a constantly active PKA or (b) a constantly inactive PKA. 

EH 5. Therapeutic Effects of Albuterol The respira- 
Ei tory symptoms of asthma result from constriction of 
the bronchi and bronchioles of the lungs, caused by contrac- 
tion of the smooth muscle of their walls. This constriction can 
be reversed by raising [cAMP] in the smooth muscle. Explain 
the therapeutic effects of albuterol, a /3-adrenergic agonist 
taken (by inhalation) for asthma. Would you expect this drug 
to have any side effects? How might one design a better drug 
that did not have these effects? 

6. Termination of Hormonal Signals Signals carried by 
hormones must eventually be terminated. Describe several 
different mechanisms for signal termination. 

7. Using FRET to Explore Protein-Protein Interac- 
tions in Vivo Figure 12-8 shows the interaction between 
/3-arrestin and the /3-adrenergic receptor. How would you 
use FRET (see Box 12-3) to demonstrate this interaction in 
living cells? Which proteins would you fuse? Which wave- 
lengths would you use to illuminate the cells, and which 
would you monitor? What would you expect to observe if the 
interaction occurred? If it did not occur? How might you ex- 
plain the failure of this approach to demonstrate this inter- 
action? 

8. EGTA Injection EGTA (ethylene glycol-bis(^-aminoethyl 
ether) -AWV^'-tetraacetie acid) is a chelating agent with 
high affinity and specificity for Ca 2+ . By microinjecting a cell 
with an appropriate Ca 2+ -EGTA solution, an experimenter can 
prevent cytosolic [Ca 2+ ] from rising above 1CT 7 m. How would 
EGTA microinjection affect a cell's response to vasopressin 
(see Table 12-4)? To glucagon? 

9. Amplification of Hormonal Signals Describe all the 
sources of amplification in the insulin receptor system. 

10. Mutations in ras How would a mutation in ras that 
leads to formation of a Ras protein with no GTPase activity 
affect a cell's response to insulin? 

11. Differences among G Proteins Compare the G proteins 
G s , which acts in transducing the signal from ^-adrenergic 
receptors, and Ras. What properties do they share? How do they 
differ? What is the functional difference between G s and Gi? 

12. Mechanisms for Regulating Protein Kinases Identify 
eight general types of protein kinases found in eukaryotic 
cells, and explain what factor is directly responsible for acti- 
vating each type. 



13. Nonhydrolyzable GTP Analogs Many enzymes can hy- 
drolyze GTP between the (3 and y phosphates. The GTP analog 
/3,y-iirridoguanosine 5 '-triphosphate (Gpp(NH)p), shown be- 
low, cannot be hydrolyzed between the 0 and y phosphates. 



O 




OH OH 

Gpp(NH)p 
(/3,y-imidoguanosine 5 ' -triphosphate) 



Predict the effect of microinjection of Gpp(NH)p into a my- 
ocyte on the cell's response to /3-adrenergic stimulation. 

14. Use of Toxin Binding to Purify a Channel Protein 

a-Bungarotoxin is a powerful neurotoxin found in the venom 
of a poisonous snake {Bungarus multicinctus) . It binds with 
high specificity to the nicotinic acetylcholine receptor (AChR) 
protein and prevents the ion channel from opening. This inter- 
action was used to purify AChR from the electric organ of tor- 
pedo fish. 

(a) Outline a strategy for using a-bungarotoxin covalently 
bound to chromatography beads to purify the AChR protein. 
(Hint: See Fig. 3-17c.) 

(b) Outline a strategy for the use of [ 125 I]a-bungarotoxin 
to purify the AChR protein 

15. Resting Membrane Potential A variety of unusual in- 
vertebrates, including giant clams, mussels, and polychaete 
worms, live on the fringes of deep-sea hydrothermal vents, 
where the temperature is 60 °C. 

(a) The adductor muscle of a giant clam has a resting 
membrane potential of -95 mV. Given the intracellular and 
extracellular ionic compositions shown below, would you have 
predicted this membrane potential? Why or why not? 



Concentration (him) 



Ion 


Intracellular 


Extracellular 


Na + 


50 


440 


K + 


400 


20 


cr 


21 


560 


Ca 2+ 


0.4 


10 



(b) Assume that the adductor muscle membrane is per- 
meable to only one of the ions listed above. Which ion could 
determine the 7 m ? 

16. Membrane Potentials in Frog Eggs Fertilization of a frog 
oocyte by a sperm cell triggers ionic changes similar to those ob- 
served in neurons (during movement of the action potential) 
and initiates the events that result in cell division and develop- 
ment of the embryo. Oocytes can be stimulated to divide 
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without fertilization, by suspending them in 80 ium KC1 (normal 
pond water contains 9 mM KC1). 

(a) Calculate how much the change in extracellular [KC1] 
changes the resting membrane potential of the oocyte. (Hint: 
Assume the oocyte contains 120 ium K + and is permeable only 
to K + .) Assume a temperature of 20 °C. 

(b) When the experiment is repeated in Ca 2+ -free water, 
elevated [KC1] has no effect. What does this suggest about the 
mechanism of the KC1 effect? 

17. Excitation Triggered by Hyperpolarization In 

most neurons, membrane depolarization leads to the 
opening of voltage-dependent ion channels, generation of 
an action potential, and ultimately an influx of Ca 2+ , which 
causes release of neurotransmitter at the axon terminus. 
Devise a cellular strategy by which hyperpolarization in 
rod cells could produce excitation of the visual pathway 
and passage of visual signals to the brain. (Hint: The neu- 
ronal signaling pathway in higher organisms consists of a 
series of neurons that relay information to the brain (see 
Fig. 12-35). The signal released by one neuron can be ei- 
ther excitatory or inhibitory to the following, postsynaptic 
neuron.) 

B9 18. Genetic "Channelopathies" There are many 

Efll genetic diseases that result from defects in ion chan- 
nels. For each of the following, explain how the molecular de- 
fect might lead to the symptoms described. 

(a) A loss-of-function mutation in the gene encoding the a 
subunit of the cGMP-gated cation channel of retinal cone cells 
leads to a complete inability to distinguish colors. 

(b) Loss-of-function alleles of the gene encoding the a 
subunit of the ATP-gated K + channel shown in Figure 23-29 
lead to a condition known as congenital hyperinsulinism — 
persistently high levels of insulin in the blood. 

(c) Mutations affecting the (3 subunit of the ATP-gated K + 
channel that prevent ATP binding lead to neonatal diabetes — 
persistently low levels of insulin in the blood in newborn 
babies. 

■B 19- Visual Desensitization Oguchi's disease is an 

m'M inherited form of night blindness. Affected individuals 
are slow to recover vision after a flash of bright light against a 
dark background, such as the headlights of a car on the free- 
way. Suggest what the molecular defect(s) might be in 
Oguchi's disease. Explain in molecular terms how this defect 
would account for night blindness. 

20. Effect of a Permeant cGMP Analog on Rod Cells An 

analog of cGMP, 8-Br-cGMP, will permeate cellular mem- 
branes, is only slowly degraded by a rod cell's PDE activity, and 
is as effective as cGMP in opening the gated channel in the 
cell's outer segment. If you suspended rod cells in a buffer con- 
taining a relatively high [8-Br-cGMP], then illuminated the 
cells while measuring their membrane potential, what would 
you observe? 

21. Hot and Cool Taste Sensations The sensations of 
heat and cold are transduced by a group of temperature- 



gated cation channels. For example, TRPV1, TRPV3, and 
TRPM8 are usually closed, but open under the following 
conditions: TRPV1 at >43 °C; TRPV3 at >33 °C; and 
TRPM8 at <25 °C. These channels are expressed in sen- 
sory neurons known to be responsible for temperature sen- 
sation. 

(a) Propose a reasonable model to explain how exposing a 
sensory neuron containing TRPV1 to high temperature leads 
to a sensation of heat. 

(b) Capsaicin, one of the active ingredients in "hot" 
peppers, is an agonist of TRPV1. Capsaicin shows 50% ac- 
tivation of the TRPV1 response at a concentration (i.e., it 
has an EC 50 ) of 32 nM. Explain why even a very few drops 
of hot pepper sauce can taste very "hot" without actually 
burning you. 

(c) Menthol, one of the active ingredients in mint, is an ag- 
onist of TRPM8 (EC 50 = 30 /am) and TRPV3 (EC 50 = 20 mM). 
What sensation would you expect from contact with low levels 
of menthol? With high levels? 

■a 22. Oncogenes, Tumor-Suppressor Genes, and 
EjI Tumors For each of the following situations, provide a 
plausible explanation for how it could lead to unrestricted cell 
division. 

(a) Colon cancer cells often contain mutations in the gene 
encoding the prostaglandin E 2 receptor, PGE 2 is a growth 
factor required for the division of cells in the gastrointestinal 
tract. 

(b) Kaposi sarcoma, a common tumor in people with un- 
treated AIDS, is caused by a virus carrying a gene for a protein 
similar to the chemokine receptors CXCR1 and CXCR2. 
Chemokines are cell-specific growth factors. 

(c) Adenovirus, a tumor virus, carries a gene for the pro- 
tein El A, which binds to the retinoblastoma protein, pRb. 
(Hint: See Fig. 12-48.) 

An important feature of many oncogenes and tumor 
suppressor genes is their cell-type specificity. For example, 
mutations in the PGE 2 receptor are not typically found in lung 
tumors. 

(d) Explain this observation. (Note that PGE 2 acts through 
a GPCR in the plasma membrane.) 

23. Mutations in TYunor Suppressor Genes and Onco- 
genes Explain why mutations in tumor suppressor genes are 
recessive (both copies of the gene must be defective for the 
regulation of cell division to be defective) whereas mutations 
in oncogenes are dominant. 

24. Retinoblastoma in Children Explain why some 
m'M children with retinoblastoma develop multiple tumors 
of the retina in both eyes, whereas others have a single tumor 
in only one eye. 

25. Specificity of a Signal for a Single Cell Type Discuss 
the validity of the following proposition. A signaling molecule 
(hormone, growth factor, or neurotransmitter) elicits identical 
responses in different types of target cells if they contain 
identical receptors. 
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Data Analysis Problem 

26. Exploring Taste Sensation in Mice Figure 12—41 shows 
the signal- transduction pathway for sweet taste in mammals. 
Pleasing tastes are an evolutionary adaptation to encourage an- 
imals to consume nutritious foods. Zhao and coauthors (2003) 
examined the two major pleasurable taste sensations: sweet and 
umami. Umami is a "distinct savory taste" triggered by amino 
acids, especially aspartate and glutamate, and probably encour- 
ages animals to consume protein-rich foods. Monosodium gluta- 
mate (MSG) is a flavor enhancer that exploits this sensitivity. 

At the time the article was published, specific taste recep- 
tor proteins (labeled SR in Fig. 12-41) for sweet and umami 
had been tentatively characterized. Three such proteins were 
known— T1R1, T1R2, and T1R3— which function as het- 
erodimeric receptor complexes: T1R1-T1R3 was tentatively 
identified as the umami receptor, and T1R2-T1R3 as the sweet 
receptor It was not clear how taste sensation was encoded and 
sent to the brain, and two possible models had been suggested. 
In the cell-based model, individual taste-sensing cells express 
only one kind of receptor; that is, there are "sweet cells," "bit- 
ter cells," "umami cells," and so on, and each type of cell sends 
its information to the brain via a different nerve. The brain 
"knows" which taste is detected by the identity of the nerve 
fiber that transmits the message. In the receptor-based model, 
individual taste-sensing cells have several kinds of receptors 
and send different messages along the same nerve fiber to the 
brain, the message depending on which receptor is activated. 
Also unclear at the time was whether there was any interaction 
between the different taste sensations, or whether parts of one 
taste-sensing system were required for other taste sensations, 

(a) Previous work had shown that different taste receptor 
proteins are expressed in nonoverlapping sets of taste receptor 
cells. Which model does this support? Explain your reasoning. 

Zhao and colleagues constructed a set of "knockout 
mice" — mice homozygous for loss-of-function alleles for one of 
the three receptor proteins, T1R1, T1R2, or T1R3— and 
double-knockout mice with nonfunctioning T1R2 and T1R3. 
The researchers measured the taste perception of these mice 
by measuring their "lick rate" of solutions containing different 
taste molecules. Mice will lick the spout of a feeding bottle 
with a pleasant-tasting solution more often than one with an 
unpleasant-tasting solution. The researchers measured rela- 
tive lick rates: how often the mice licked a sample solution 
compared with water A relative lick rate of 1 indicated no 
preference; <1, an aversion; and >1, a preference, 

(b) All four types of knockout strains had the same re- 
sponses to salt and bitter tastes as did wild-type mice. Which 
of the above issues did this experiment address? What do you 
conclude from these results? 

The researchers then studied umami taste reception by 
measuring the relative lick rates of the different mouse strains 
with different quantities of MSG in the feeding solution Note 
that the solutions also contained inosine monophosphate 
(IMP), a strong potentiator of umami taste reception (and a 
common ingredient in ramen soups, along with MSG), and 
ameloride, which suppresses the pleasant salty taste imparted 
by the sodium of MSG. The results are shown in the graph. 
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(c) Are these data consistent with the umami taste recep- 
tor consisting of aheterodimer of T1R1 and T1R3? Why or why 

not? 

(d) Which model (s) of taste encoding does this result sup- 
port? Explain your reasoning. 

Zhao and coworkers then performed a series of similar ex- 
periments using sucrose as a sweet taste. These results are 
shown below. 
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(e) Are these data consistent with the sweet taste recep- 
tor consisting of a heterodimer of T1R2 and T1R3? Why or why 
not? 

(f) There were some unexpected responses at very high 
sucrose concentrations. How do these complicate the idea of a 
heterodimeric system as presented above? 

In addition to sugars, humans also taste other compounds 
(e.g., the peptides monellin and aspartame) as sweet; mice do 
not taste these as sweet. Zhao and coworkers inserted into 
TIR2 knockout mice a copy of the human T1R2 gene under the 
control of the mouse T1R2 promoter. These modified mice 
now tasted monellin and saccharin as sweet. The researchers 
then went further, adding to T1R1 knockout mice the RASSL 
protein — a G protein-linked receptor for the synthetic opiate 
spiradoline; the RASSL gene was under the control of a pro- 
moter that could be induced by feeding the mice tetracycline. 
These mice did not prefer spiradoline in the absence of tetra- 
cycline; in the presence of tetracycline, they showed a strong 
preference for nanomolar concentrations of spiradoline. 

(g) How do these results strengthen Zhao and coauthors' 
conclusions about the mechanism of taste sensation? 

Reference 

Zhao, G.Q., Zhang, Y., Hoon, M.A., Chandrashekar, J., Erlen- 
bach, I., Ryba, N.J.P., & Zuker, C. (2003) The receptors for mam- 
malian sweet and umami taste. Cell 115, 255-266. 
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Metabolism is a highly coordinated cellular activ- 
ity in which many multienzyme systems (meta- 
bolic pathways) cooperate to (1) obtain chemical 
energy by capturing solar energy or degrading energy- 
rich nutrients from the environment; (2) convert nutrient 
molecules into the cell's own characteristic molecules, 
including precursors of macromolecules; (3) polymerize 
monomeric precursors into macromolecules: proteins, 
nucleic acids, and polysaccharides; and (4) synthesize 
and degrade biomolecules required for specialized cellu- 
lar functions, such as membrane lipids, intracellular 
messengers, and pigments. 

Although metabolism embraces hundreds of differ- 
ent enzyme-catalyzed reactions, our major concern in 



Part II is the central metabolic pathways, which are few 
in number and remarkably similar in all forms of life. Liv- 
ing organisms can be divided into two large groups 
according to the chemical form in which they obtain car- 
bon from the environment. Autotrophs (such as photo- 
synthetic bacteria, green algae, and vascular plants) can 
use carbon dioxide from the atmosphere as their sole 
source of carbon, from which they construct all their 
carbon-containing biomolecules (see Fig. 1-5). Some 
autotrophic organisms, such as cyanobacteria, can 
also use atmospheric nitrogen to generate all their 
nitrogenous components. Heterotrophs cannot use 
atmospheric carbon dioxide and must obtain carbon 
from their environment in the form of relatively complex 

[485] 
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organic molecules such as glucose. Multicellular animals 
and most microorganisms are heterotrophic. Autotrophic 
cells and organisms are relatively self-sufficient, whereas 
heterotrophic cells and organisms, with their require- 
ments for carbon in more complex forms, must subsist 
on the products of other organisms. 

Many autotrophic organisms are photosynthetic 
and obtain their energy from sunlight, whereas het- 
erotrophic organisms obtain their energy from the 
degradation of organic nutrients produced by au- 
totrophs. In our biosphere, autotrophs and heterotrophs 
live together in a vast, interdependent cycle in which au- 
totrophic organisms use atmospheric carbon dioxide to 
build their organic biomolecules, some of them generat- 
ing oxygen from water in the process. Heterotrophs in 
turn use the organic products of autotrophs as nutrients 
and return carbon dioxide to the atmosphere. Some of 
the oxidation reactions that produce carbon dioxide also 
consume oxygen, converting it to water. Thus carbon, 
oxygen, and water are constantly cycled between the 
heterotrophic and autotrophic worlds, with solar energy 
as the driving force for this global process (Fig. 1). 

All living organisms also require a source of nitro- 
gen, which is necessary for the synthesis of amino acids, 
nucleotides, and other compounds. Bacteria and plants 
can generally use either ammonia or nitrate as their sole 
source of nitrogen, but vertebrates must obtain nitrogen 
in the form of amino acids or other organic compounds. 
Only a few organisms — the cyanobacteria and many 
species of soil bacteria that live symbiotically on the 
roots of some plants — are capable of converting 
("fixing") atmospheric nitrogen (N 2 ) into ammonia. 
Other bacteria (the nitrifying bacteria) oxidize ammonia 
to nitrites and nitrates; yet others convert nitrate to N 2 . 
The anammox bacteria convert ammonia and nitrite to 



FIGURE 1 Cycling of carbon dioxide and oxy- 
gen between the autotrophic (photosynthetic) 
and heterotrophic domains in the biosphere. 

The flow of mass through this cycle is enor- 
mous; about 4 x 1 0 1 1 metric tons of carbon are 
turned over in the biosphere annually. 
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FIGURE 2 Cycling of nitrogen in the biosphere. Gaseous nitrogen (N 2 ) 
makes up 80% of the earth's atmosphere. 

N 2 . Thus, in addition to the global carbon and oxygen 
cycle, a nitrogen cycle operates in the biosphere, turn- 
ing over huge amounts of nitrogen (Fig. 2). The cycling 
of carbon, oxygen, and nitrogen, which ultimately in- 
volves all species, depends on a proper balance between 
the activities of the producers (autotrophs) and con- 
sumers (heterotrophs) in our biosphere. 

These cycles of matter are driven by an enormous 
flow of energy into and through the biosphere, beginning 
with the capture of solar energy by photosynthetic organ- 
isms and use of this energy to generate energy-rich car- 
bohydrates and other organic nutrients; these nutrients 
are then used as energy sources by heterotrophic organ- 
isms. In metabolic processes, and in all energy transfor- 
mations, there is a loss of useful energy (free energy) and 
an inevitable increase in the amount of unusable energy 
(heat and entropy). In contrast to the cycling of matter, 
therefore, energy flows one way through the biosphere; or- 
ganisms cannot regenerate useful energy from energy dis- 
sipated as heat and entropy. Carbon, oxygen, and nitrogen 
recycle continuously, but energy is constantly trans- 
formed into unusable forms such as heat. 

Metabolism, the sum of all the chemical transfor- 
mations taking place in a cell or organism, occurs 
through a series of enzyme-catalyzed reactions that con- 
stitute metabolic pathways. Each of the consecutive 
steps in a metabolic pathway brings about a specific, 
small chemical change, usually the removal, transfer, or 
addition of a particular atom or functional group. The 
precursor is converted into a product through a series of 
metabolic intermediates called metabolites. The term 
intermediary metabolism is often applied to the com- 
bined activities of all the metabolic pathways that inter- 
convert precursors, metabolites, and products of low 
molecular weight (generally, M r < 1,000). 
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Catabolism is the degradative phase of metabolism 
in which organic nutrient molecules (carbohydrates, 
fats, and proteins) are converted into smaller, simpler 
end products (such as lactic acid, C0 2 , NH 3 ). Catabolic 
pathways release energy, some of which is conserved in 
the formation of ATP and reduced electron carriers 
(NADH, NADPH, and FADH 2 ); the rest is lost as heat. In 
anabolism, also called biosynthesis, small, simple pre- 
cursors are built up into larger and more complex mole- 
cules, including lipids, polysaccharides, proteins, and 
nucleic acids. Anabolic reactions require an input of en- 
ergy, generally in the form of the phosphoryl group 
transfer potential of ATP and the reducing power of 
NADH, NADPH, and FADH 2 (Fig. 3). 

Some metabolic pathways are linear, and some are 
branched, yielding multiple useful end products from a 
single precursor or converting several starting materials 
into a single product. In general, catabolic pathways are 
convergent and anabolic pathways divergent (Fig. 4). 
Some pathways are cyclic: one starting component of 
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FIGURE 3 Energy relationships between catabolic and anabolic path- 
ways. Catabolic pathways deliver chemical energy in the form of ATP, 
NADH, NADPH, and FADH 2 . These energy carriers are used in ana- 
bolic pathways to convert small precursor molecules into cellular 
macromolecules. 



the pathway is regenerated in a series of reactions that 
converts another starting component into a product. We 
shall see examples of each type of pathway in the follow- 
ing chapters. 

Most cells have the enzymes to carry out both the 
degradation and the synthesis of the important categories 
of biomolecules — fatty acids, for example. The simultane- 
ous synthesis and degradation of fatty acids would be 
wasteful, however, and this is prevented by reciprocally 
regulating the anabolic and catabolic reaction sequences: 
when one sequence is active, the other is suppressed. 
Such regulation could not occur if anabolic and catabolic 
pathways were catalyzed by exactly the same set of en- 
zymes, operating in one direction for anabolism, the oppo- 
site direction for catabolism: inhibition of an enzyme 
involved in catabolism would also inhibit the reaction se- 
quence in the anabolic direction. Catabolic and anabolic 
pathways that connect the same two end points (glucose 
— > -> pyruvate, and pyruvate — > glucose, for example) 
may employ many of the same enzymes, but invariably at 
least one of the steps is catalyzed by different enzymes in 
the catabolic and anabolic directions, and these enzymes 
are the sites of separate regulation. Moreover, for both an- 
abolic and catabolic pathways to be essentially irre- 
versible, the reactions unique to each direction must 
include at least one that is thermodynamically very favor- 
able — in other words, a reaction for which the reverse re- 
action is very unfavorable. As a further contribution to the 
separate regulation of catabolic and anabolic reaction se- 
quences, paired catabolic and anabolic pathways com- 
monly take place in different cellular compartments: for 
example, fatty acid catabolism in mitochondria, fatty acid 
synthesis in the cytosol. The concentrations of intermedi- 
ates, enzymes, and regulators can be maintained at differ- 
ent levels in these different compartments. Because 
metabolic pathways are subject to kinetic control by sub- 
strate concentration, separate pools of anabolic and cata- 
bolic intermediates also contribute to the control of 
metabolic rates. Devices that separate anabolic and cata- 
bolic processes will be of particular interest in our discus- 
sions of metabolism. 

Metabolic pathways are regulated at several levels, 
from within the cell and from outside. The most immedi- 
ate regulation is by the availability of substrate; when the 
intracellular concentration of an enzyme's substrate is 
near or below K m (as is commonly the case), the rate of 
the reaction depends strongly upon substrate concentra- 
tion (see Fig. 6-11). A second type of rapid control from 
within is allosteric regulation (p. 220) by a metabolic in- 
termediate or coenzyme — an amino acid or ATP, for ex- 
ample — that signals the cell's internal metabolic state. 
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FIGURE 4 Three types of nonlinear metabolic pathways. 

(a) Converging, catabolic, (b) diverging, anabolic, and (c) 
C0 2 cyclic pathways. In (c), one of the starting materials (ox- 
aloacetate in this case) is regenerated and reenters the 
pathway. Acetate, a key metabolic intermediate, is the 
breakdown product of a variety of fuels (a), serves as the 
precursor for an array of products (b), and is consumed in 
the catabolic pathway known as the citric acid cycle (c). 



When the cell contains an amount of, say, aspartate suffi- 
cient for its immediate needs, or when the cellular level 
of ATP indicates that further fuel consumption is unnec- 
essary at the moment, these signals allosterically inhibit 
the activity of one or more enzymes in the relevant path- 
way In multicellular organisms the metabolic activities 
of different tissues are regulated and integrated by 
growth factors and hormones that act from outside the 
cell. In some cases this regulation occurs virtually instan- 
taneously (sometimes in less than a millisecond) through 
changes in the levels of intracellular messengers that 
modify the activity of existing enzyme molecules by al- 
losteric mechanisms or by covalent modification such as 
phosphorylation. In other cases, the extracellular signal 
changes the cellular concentration of an enzyme by al- 
tering the rate of its synthesis or degradation, so the ef- 
fect is seen only after minutes or hours. 

We begin Part II with a discussion of the basic ener- 
getic principles that govern all metabolism (Chapter 13). 
We then consider the major catabolic pathways by 
which cells obtain energy from the oxidation of various 
fuels (Chapters 14 through 19). Chapter 19 is the pivotal 
point of our discussion of metabolism; it concerns 
chemiosmotic energy coupling, a universal mechanism 
in which a transmembrane electrochemical potential, 



produced either by substrate oxidation or by light 
absorption, drives the synthesis of ATP. 

Chapters 20 through 22 describe the major anabolic 
pathways by which cells use the energy in ATP to pro- 
duce carbohydrates, lipids, amino acids, and nucleotides 
from simpler precursors. In Chapter 23 we step back 
from our detailed look at the metabolic pathways — as 
they occur in all organisms, from Escherichia coli to 
humans — and consider how they are regulated and inte- 
grated in mammals by hormonal mechanisms. 

As we undertake our study of intermediary metabo- 
lism, a final word. Keep in mind that the myriad reac- 
tions described in these pages take place in, and play 
crucial roles in, living organisms. As you encounter each 
reaction and each pathway ask, What does this chemical 
transformation do for the organism? How does this path- 
way interconnect with the other pathways operating si- 
multaneously in the same cell to produce the energy and 
products required for cell maintenance and growth? 
How do the multilayered regulatory mechanisms coop- 
erate to balance metabolic and energy inputs and out- 
puts, achieving the dynamic steady state of life? Studied 
with this perspective, metabolism provides fascinating 
and revealing insights into life, with countless applica- 
tions in medicine, agriculture, and biotechnology. 



The total energy of the universe is constant; the total entropy is contin- 
ual ly Increasing. 

— Rudolf Clausius, The Mechanical Theory of Heat wilh Its 
Applications to the Steam- Engine and to the Physical 
Properties of Bodies, 1865 i trans. 1H()~: 

The i so m o rph i s rn of e n tro p\ ■ and r r it b r m L j t i ( ■> n e s to b I i s h es c\ I i 1 1 k I -> e f \ v t mi 
the two forms of power; the power to do and the power lo direct what 
is done. 

— Francois Jacob, La logiquc du vivant: une histoire de I'heredite 
(The Logic of Life: A History of Heredity), 1970 
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iving cells and organisms must perform work to stay 
alive, to grow, and to reproduce. The ability to har- 
mm ness energy and to channel it into biological work is a 
fundamental property of all living organisms; it must have 
been acquired very early in cellular evolution. Modern or- 
ganisms carry out a remarkable variety of energy trans- 
ductions, conversions of one form of energy to another. 
They use the chemical energy in fuels to bring about the 
synthesis of complex, highly ordered macromolecules 
from simple precursors. They also convert the chemical 
energy of fuels into concentration gradients and electrical 
gradients, into motion and heat, 
and, in a few organisms such as 
fireflies and deep-sea fish, into 
light. Photosynthetic organisms 
transduce light energy into all 
these other forms of energy. 

The chemical mechanisms 
that underlie biological energy 
transductions have fascinated 
and challenged biologists for 
centuries. The French chemist 
Antoine Lavoisier recognized 
Antoine Lavoisier, that animals somehow trans- 

1743-1794 form chemical fuels (foods) 




into heat and that this process of respiration is essential 
to life. He observed that 

... in general, respiration is nothing but a slow 
combustion of carbon and hydrogen, which is en- 
tirely similar to that which occurs in a lighted lamp 
or candle, and that, from this point of view, animals 
that respire are true combustible bodies that burn 
and consume themselves . . . One may say that this 
analogy between combustion and respiration has 
not escaped the notice of the poets, or rather the 
philosophers of antiquity, and which they had ex- 
pounded and interpreted. This fire stolen from 
heaven, this torch of Prometheus, does not only 
represent an ingenious and poetic idea, it is a faith- 
ful picture of the operations of nature, at least for 
animals that breathe; one may therefore say, with 
the ancients, that the torch of life lights itself at the 
moment the infant breathes for the first time, and it 
does not extinguish itself except at death.* 

In the twentieth century, we began to understand 
much of the chemistry underlying that "torch of life." Bi- 
ological energy transductions obey the same chemical 
and physical laws that govern all other natural processes. 
It is therefore essential for a student of biochemistry to 
understand these laws and how they apply to the flow of 
energy in the biosphere. 

In this chapter we first review the laws of thermody- 
namics and the quantitative relationships among free en- 
ergy, enthalpy, and entropy. We then review the common 
types of biochemical reactions that occur in living cells, 
reactions that harness, store, transfer, and release the 

*From a memoir by Armand Seguin and Antoine Lavoisier, dated 1789, 
quoted in Lavoisier, A. (1862) Oeuvres de Lavoisier, Imprimerie Imperi- 
ale, Paris 
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energy taken up by organisms from their surroundings. 
Our focus then shifts to reactions that have special roles 
in biological energy exchanges, particularly those involv- 
ing ATP. We finish by considering the importance of oxi- 
dation-reduction reactions in living cells, the energetics 
of biological electron transfers, and the electron carriers 
commonly employed as cofactors in these processes. 

13.1 Bioenergetics and Thermodynamics 

Bioenergetics is the quantitative study of energy 
transductions — changes of one form of energy into an- 
other — that occur in living cells, and of the nature and 
function of the chemical processes underlying these 
transductions. Although many of the principles of ther- 
modynamics have been introduced in earlier chapters 
and may be familiar to you, a review of the quantitative 
aspects of these principles is useful here. 

Biological Energy Transformations Obey 
the Laws of Thermodynamics 

Many quantitative observations made by physicists and 
chemists on the interconversion of different forms of en- 
ergy led, in the nineteenth century, to the formulation of 
two fundamental laws of thermodynamics. The first law 
is the principle of the conservation of energy: for any 
physical or chemical change, the total amount of 
energy in the universe remains constant; energy may 
change form or it may be transported from one region 
to another, but it cannot be created or destroyed. The 
second law of thermodynamics, which can be stated in 
several forms, says that the universe always tends toward 
increasing disorder: in all natural processes, the en- 
tropy of the universe increases. 




"Now, in the second law of thermodynainics . , , " 



Living organisms consist of collections of molecules 
much more highly organized than the surrounding ma- 
terials from which they are constructed, and organisms 
maintain and produce order, seemingly oblivious to the 



second law of thermodynamics. But living organisms do 
not violate the second law; they operate strictly within 
it. To discuss the application of the second law to biolog- 
ical systems, we must first define those systems and 
their surroundings. 

The reacting system is the collection of matter that 
is undergoing a particular chemical or physical process; 
it may be an organism, a cell, or two reacting com- 
pounds. The reacting system and its surroundings to- 
gether constitute the universe. In the laboratory, some 
chemical or physical processes can be carried out in iso- 
lated or closed systems, in which no material or energy 
is exchanged with the surroundings. Living cells and or- 
ganisms, however, are open systems, exchanging both 
material and energy with their surroundings; living sys- 
tems are never at equilibrium with their surroundings, 
and the constant transactions between system and sur- 
roundings explain how organisms can create order 
within themselves while operating within the second law 
of thermodynamics. 

In Chapter 1 (p. 22) we defined three thermody- 
namic quantities that describe the energy changes oc- 
curring in a chemical reaction: 

Gibbs free energy, G, expresses the amount of en- 
ergy capable of doing work during a reaction at con- 
stant temperature and pressure. When a reaction 
proceeds with the release of free energy (that is, 
when the system changes so as to possess less free 
energy), the free-energy change, AG, has a negative 
value and the reaction is said to be exergonic. In en- 
dergonic reactions, the system gains free energy 
and AG is positive. 

Enthalpy, H, is the heat content of the reacting 
system. It reflects the number and kinds of chemical 
bonds in the reactants and products. When a chem- 
ical reaction releases heat, it is said to be exother- 
mic; the heat content of the products is less than 
that of the reactants and AH has, by convention, a 
negative value. Reacting systems that take up heat 
from their surroundings are endothermic and have 
positive values of AH. 

Entropy, S, is a quantitative expression for the ran- 
domness or disorder in a system (see Box 1-3). 
When the products of a reaction are less complex 
and more disordered than the reactants, the reac- 
tion is said to proceed with a gain in entropy. 

The units of AG and AH are joules/mole or calories/mole 
(recall that 1 cal = 4.184 J); units of entropy are joules/ 
mole * Kelvin (J/mol • K) (Table 13-1). 

Under the conditions existing in biological systems 
(including constant temperature and pressure), changes 
in free energy, enthalpy, and entropy are related to each 
other quantitatively by the equation 

AG = AH - TAS (13-1) 
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TABLE 13-1 



me Physical Constants and Units 
sed in Thermodynamics 



Boltzmann constant, k = 1.381 X 10"^ J/K 
Avogadro's number, N = 6.022 X 10 23 moF 1 
Faraday constant, J = 96,480 J/V • mol 
Gas constant, R = 8.315 J/mol • K 

(= 1.987 cal/mol -K) 

Units of AG and AH are J/mol (or cal/mol) 
Units of AS are J/mol ■ K (or cal/mol • K) 
leal = 4.184 J 

Units of absolute temperature, T, are Kelvin, K 
25 °C = 298 K 
At 25 °C,RT = 2.478 kJ/mol 

(= 0.592 kcal/mol) 



in which AG is the change in Gibbs free energy of the re- 
acting system, AH is the change in enthalpy of the sys- 
tem, Tis the absolute temperature, and AS is the change 
in entropy of the system. By convention, AS has a posi- 
tive sign when entropy increases and AH, as noted 
above, has a negative sign when heat is released by the 
system to its surroundings. Either of these conditions, 
which are typical of energetically favorable processes, 
tend to make AG negative. In fact, AG of a sponta- 
neously reacting system is always negative. 

The second law of thermodynamics states that 
the entropy of the universe increases during all 
chemical and physical processes, but it does not re- 
quire that the entropy increase take place in the re- 
acting system itself. The order produced within cells 
as they grow and divide is more than compensated for 
by the disorder they create in their surroundings in 
the course of growth and division (see Box 1-3, case 2). 
In short, living organisms preserve their internal or- 
der by taking from the surroundings free energy in 
the form of nutrients or sunlight, and returning to 
their surroundings an equal amount of energy as heat 
and entropy. 

Cells Require Sources of Free Energy 

Cells are isothermal systems — they function at essen- 
tially constant temperature (and also function at con- 
stant pressure). Heat flow is not a source of energy for 
cells, because heat can do work only as it passes to a 
zone or object at a lower temperature. The energy that 
cells can and must use is free energy, described by the 
Gibbs free-energy function G, which allows prediction of 
the direction of chemical reactions, their exact equilib- 
rium position, and the amount of work they can (in 
theory) perform at constant temperature and pressure. 
Heterotrophic cells acquire free energy from nutrient 
molecules, and photosynthetic cells acquire it from ab- 
sorbed solar radiation. Both kinds of cells transform this 
free energy into ATP and other energy-rich compounds 



capable of providing energy for biological work at con- 
stant temperature. 

Standard Free-Energy Change Is Directly Related 
to the Equilibrium Constant 

The composition of a reacting system (a mixture of 
chemical reactants and products) tends to continue 
changing until equilibrium is reached. At the equilib- 
rium concentration of reactants and products, the 
rates of the forward and reverse reactions are exactly 
equal and no further net change occurs in the sys- 
tem. The concentrations of reactants and products at 
equilibrium define the equilibrium constant, K e(l (p. 
24). In the general reaction aA + bB ^^cC + dD, 
where a, b, c, and d are the number of molecules of 
A, B, C, and D participating, the equilibrium constant 
is given by 



_ [C] c [D] d 
eq [A] a [B] b 



(13-2) 



where [A], [B], [C], and [D] are the molar concentrations 
of the reaction components at the point of equilibrium. 

When a reacting system is not at equilibrium, the 
tendency to move toward equilibrium represents a driv- 
ing force, the magnitude of which can be expressed as 
the free-energy change for the reaction, AG. Under stan- 
dard conditions (298 K = 25 °C), when reactants and 
products are initially present at 1 m concentrations or, for 
gases, at partial pressures of 101.3 kilopascals (kPa), or 
1 atm, the force driving the system toward equilibrium is 
defined as the standard free-energy change, AG°. By this 
definition, the standard state for reactions that involve 
hydrogen ions is [H + ] = 1 m, or pH 0. Most biochemical 
reactions, however, occur in well-buffered aqueous solu- 
tions near pH 7; both the pH and the concentration of 
water (55.5 m) are essentially constant. 

KEY CONVENTION: For convenience of calculations, bio- 
chemists define a standard state different from that 
used in chemistry and physics: in the biochemical stan- 
dard state, [H + ] is 10" 7 m (pH 7) and [H 2 0] is 55.5 M. For 
reactions that involve Mg 2+ (which include most of 
those with ATP as a reactant), [Mg 2+ ] in solution is com- 
monly taken to be constant at 1 mM. ■ 

Physical constants based on this biochemical stan- 
dard state are called standard transformed con- 
stants and are written with a prime (such as AG'° and 
JT eq ) to distinguish them from the untransformed con- 
stants used by chemists and physicists. (Note that most 
other textbooks use the symbol AG 0 ' rather than AG'°. 
Our use of AG'°, recommended by an international com- 
mittee of chemists and biochemists, is intended to 
emphasize that the transformed free energy, G\ is the 
criterion for equilibrium.) For simplicity, we will here- 
after refer to these transformed constants as standard 
free-energy changes. 
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KEY CONVENTION: In another simplifying convention used 
by biochemists, when H 2 0, H + , and/or Mg 2+ are reac- 
tants or products, their concentrations are not included 
in equations such as Equation 13-2 but are instead 
incorporated into the constants K' eq and AG' 0 . ■ 

Just as K' eq is a physical constant characteristic for 
each reaction, so too is AG'° a constant. As we noted in 
Chapter 6, there is a simple relationship between K' 
and AG'°: 



AG'° = -RTlnKL 



(13-3) 



The standard free-energy change of a chemical reac- 
tion is simply an alternative mathematical way of 
expressing its equilibrium constant. Table 13-2 
shows the relationship between AG'° and JT eq . If the 
equilibrium constant for a given chemical reaction is 1.0, 
the standard free-energy change of that reaction is 0.0 
(the natural logarithm of 1.0 is zero). If K' eq of a reaction 
is greater than 1.0, its AG'° is negative. If JT eq is less than 
1.0, AG'° is positive. Because the relationship between 
AG'° and K' eq is exponential, relatively small changes in 
AG'° correspond to large changes in K f eq . 

It may be helpful to think of the standard free- 
energy change in another way AG'° is the difference be- 
tween the free-energy content of the products and the 
free-energy content of the reactants, under standard 
conditions. When AG'° is negative, the products contain 
less free energy than the reactants and the reaction will 
proceed spontaneously under standard conditions; all 
chemical reactions tend to go in the direction that re- 
sults in a decrease in the free energy of the system. A 



TABLE 13-2 



ft'™ 



Relationship between Equilibrium 
Constants and Standard Free-Energy 
Changes of Chemical Reactions 



(kJ/mol) 



(kcal/mol)* 



10 3 


-17.1 


-4.1 


10 2 


-11 4 


-2.7 


10 1 


-5.7 


-14 


1 


0.0 


0.0 


10" 1 


5.7 


1.4 


10" 2 


11.4 


2.7 


10~ 3 


17.1 


4.1 


10" 4 


22.8 


5.5 


10~ 5 


28.5 


6.8 


10" 6 


34.2 


8.2 



♦Although joules and kilojoules are the standard units of energy and are used through 
out this text, biochemists and nutritionists sometimes express AG'° values in kilocalo- 
ries per mole. We have therefore included values in both kilojoules and kilocalories in 
this table and in Tables 13-4 and 13-6. To convert kilojoules to kilocalories, divide the 
number of kilojoules by 4.184. 



TABLE 13-3 


Relationships among K'^ AG' 0 , and 
the Direction of Chemical Reactions 


WheniT eq is 


. AG'°is... 


Starting with all 
components at 1 m, 
the reaction . . . 


>1.0 


negative 


proceeds forward 


1.0 


zero 


is at equilibrium 


<1.0 


positive 


proceeds in reverse 



positive value of AG' 0 means that the products of the re- 
action contain more free energy than the reactants, and 
this reaction will tend to go in the reverse direction if we 
start with 1.0 M concentrations of all components (stan- 
dard conditions). Table 13-3 summarizes these points. 

Hi WORKED EXAMPLE 13-1 Calculation of AG' 0 

Calculate the standard free-energy change of the reac- 
tion catalyzed by the enzyme phosphoglucomutase 

Glucose 1-phosphate glucose 6-phosphate 

given that, starting with 20 mM glucose 1-phosphate and 
no glucose 6-phosphate, the final equilibrium mixture at 
25 °C and pH 7.0 contains 1.0 mM glucose 1-phosphate 
and 19 mM glucose 6-phosphate. Does the reaction in 
the direction of glucose 6-phosphate formation proceed 
with a loss or a gain of free energy? 

Solution: First we calculate the equilibrium constant: 

[glucose 6-phosphate] 19 mM 
eq [glucose 1-phosphate] 1.0 mM 

We can now calculate the standard free-energy change: 

AG' 0 = -RT In K f eq 

= -(8.315J/mol-K)(298K)(lnl9) 
= -7.3kJ/mol 

Because the standard free-energy change is negative, 
the conversion of glucose 1-phosphate to glucose 
6-phosphate proceeds with a loss (release) of free 
energy (For the reverse reaction, AG'° has the same 
magnitude but the opposite sign.) 



Table 13-4 gives the standard free-energy changes 
for some representative chemical reactions. Note that 
hydrolysis of simple esters, amides, peptides, and glyco- 
sides, as well as rearrangements and eliminations, pro- 
ceed with relatively small standard free-energy changes, 
whereas hydrolysis of acid anhydrides is accompanied 
by relatively large decreases in standard free energy. 
The complete oxidation of organic compounds such as 
glucose or palmitate to C0 2 and H 2 0, which in cells re- 
quires many steps, results in very large decreases in 
standard free energy. However, standard free-energy 
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TABLE 13-4 



Reaction type 



Standard Free-Energy Changes of Some Chemical Reactions 



AG' 0 



(kJ/mol) (kcal/mol) 



- 2 acetate 



glucose 1 -phosphate 



Hydrolysis reactions 

Acid anhydrides 

Acetic anhydride + H 2 0 - 

ATP + H 2 0 > ADP + Pi 

ATP + H 2 0 > AMP + PPi 

PPi + H 2 0 > 2Pi 

UDP-glucose + H 2 0 > UMP 

Esters 

Ethyl acetate + H 2 0 > ethanol + acetate 

Glucose 6-phosphate + H 2 0 > glucose + Pi 

Amides and peptides 

Glutamine + H 2 0 > glutamate + NH4 

Glycylglycine + H 2 0 > 2 glycine 

Glycosides 

► 2 glucose 

> glucose + galactose 



glucose 6-phosphate 
> glucose 6-phosphate 



Maltose + H 2 0 
Lactose + H 2 0 

Rearrangements 

Glucose 1 -phosphate - 
Fructose 6-phosphate 

Elimination of water 

Malate > fumarate + H 2 0 

Oxidations with molecular oxygen 

Glucose + 60 2 > 6C0 2 + 6H 2 0 

Palmitate + 230 2 > 16C0 2 + 16H 2 0 



-91.1 
-30.5 
-45.6 
-19.2 
-43.0 

-19.6 
-13.8 

-14.2 
-9.2 

-15.5 
-15.9 

-7.3 
-1.7 

3.1 

-2,840 
-9,770 



-21.8 
-7.3 

-10.9 
-4.6 

-10.3 

-4.7 
-3.3 

-3.4 
-2.2 

-3.7 
-3.8 

-1.7 
-0.4 

0.8 

-686 
-2,338 



changes such as those in Table 13-4 indicate how much 
free energy is available from a reaction under standard 
conditions. To describe the energy released under the 
conditions existing in cells, an expression for the actual 
free-energy change is essential. 

Actual Free-Energy Changes Depend on Reactant 
and Product Concentrations 

We must be careful to distinguish between two different 
quantities: the actual free-energy change, AG, and the 
standard free-energy change, AG'°. Each chemical reac- 
tion has a characteristic standard free-energy change, 
which may be positive, negative, or zero, depending on 
the equilibrium constant of the reaction. The standard 
free-energy change tells us in which direction and how 
far a given reaction must go to reach equilibrium when 
the initial concentration of each component is 1 0 m, 
the pH is 7.0, the temperature is 25 °C, and the pressure 
is 101.3 kPa (1 atm). Thus AG'° is a constant: it has a 
characteristic, unchanging value for a given reaction. 
But the actual free-energy change, AG, is a function of 
reactant and product concentrations and of the temper- 
ature prevailing during the reaction, none of which will 



necessarily match the standard conditions as defined 
above. Moreover, the AG of any reaction proceeding 
spontaneously toward its equilibrium is always negative, 
becomes less negative as the reaction proceeds, and is 
zero at the point of equilibrium, indicating that no more 
work can be done by the reaction. 

AG and AG' 0 for any reaction aA + bB cC + dD 
are related by the equation 

A0 -AG" + OTm|jg §(13-4) 

in which the terms in red are those actually prevailing 
in the system under observation. The concentration 
terms in this equation express the effects commonly 
called mass action, and the term [C] c [D] d /[A] a [B] b is called 
the mass-action ratio, Q Thus Equation 13^ can be 
expressed as AG = AG'° + RT\nQ. As an example, let us 
suppose that the reaction A + B C + D is taking place 
under the standard conditions of temperature (25 °C) and 
pressure (101.3 kPa) but that the concentrations 
of A, B, C, and D are not equal and none of the compo- 
nents is present at the standard concentration of 1.0 m. 
To determine the actual free-energy change, AG, under 
these nonstandard conditions of concentration as the 
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reaction proceeds from left to right, we simply enter the 
actual concentrations of A, B, C, and D in Equation 
13-4; the values of R, T, and AG' 0 are the standard val- 
ues. AG is negative and approaches zero as the reaction 
proceeds, because the actual concentrations of A and B 
decrease and the concentrations of C and D increase. 
Notice that when a reaction is at equilibrium — when 
there is no force driving the reaction in either direction 
and AG is zero — Equation 13-4 reduces to 



0 = AG = AG'° + RT In 



[C] eq [D] e 
[A] ep [B] e 



: 



AG' 0 



-RTlnK' 



which is the equation relating the standard free-energy 
change and equilibrium constant (Eqn 13-3). 

The criterion for spontaneity of a reaction is the 
value of AG, not AG' 0 . A reaction with a positive AG'° 
can go in the forward direction if AG is negative. This is 
possible if the term RT In ([products]/[reactants]) in 
Equation 13-4 is negative and has a larger absolute 
value than AG' 0 . For example, the immediate removal of 
the products of a reaction can keep the ratio [products]/ 
[reactants] well below 1, such that the term RT In 
Qproducts]/[reactants]) has a large, negative value. AG'° 
and AG are expressions of the maximum amount of 
free energy that a given reaction can theoretically 
deliver— an amount of energy that could be realized 
only if a perfectly efficient device were available to trap 
or harness it. Given that no such device is possible 
(some energy is always lost to entropy during any 
process), the amount of work done by the reaction at 
constant temperature and pressure is always less than 
the theoretical amount. 

Another important point is that some thermody- 
namically favorable reactions (that is, reactions for 
which AG'° is large and negative) do not occur at mea- 
surable rates. For example, combustion of firewood to 
C0 2 and H 2 0 is very favorable thermodynamically, but 
firewood remains stable for years because the activation 
energy (see Figs 6-2 and 6-3) for the combustion reac- 
tion is higher than the energy available at room temper- 
ature. If the necessary activation energy is provided 
(with a lighted match, for example), combustion will be- 
gin, converting the wood to the more stable products 
C0 2 and H 2 0 and releasing energy as heat and light. The 
heat released by this exothermic reaction provides the 
activation energy for combustion of neighboring regions 
of the firewood; the process is self -perpetuating. 

In living cells, reactions that would be extremely slow 
if uncatalyzed are caused to proceed not by supplying 
additional heat but by lowering the activation energy 
through use of an enzyme. An enzyme provides an alter- 
native reaction pathway with a lower activation energy 
than the uncatalyzed reaction, so that at room tempera- 
ture a large fraction of the substrate molecules have 
enough thermal energy to overcome the activation barrier, 



and the reaction rate increases dramatically. The free- 
energy change for a reaction is independent of the path- 
way by which the reaction occurs; it depends only on 
the nature and concentration of the initial reactants and 
the final products. Enzymes cannot, therefore, change 
equilibrium constants; but they can and do increase the 
rate at which a reaction proceeds in the direction dictated 
by thermodynamics (see Section 6.2). 

Standard Free-Energy Changes Are Additive 

In the case of two sequential chemical reactions, A B 
and B C, each reaction has its own equilibrium con- 
stant and each has its characteristic standard free- 
energy change, AGi°and AG 2 °. As the two reactions 
are sequential, B cancels out to give the overall reaction 
A C, which has its own equilibrium constant and thus 
its own standard free-energy change, AG[° tal . The AG' 0 
values of sequential chemical reactions are additive. 
For the overall reaction A C, AG[° ta i is the sum of the 
individual standard free-energy changes, AGi°and AG 2 °, 
of the two reactions: AG' t ° tal = AG{°+ AG£°. 



(1) 

(2) 



A- 
B- 



B 

■C 



AG{° 
AG? 



Sum: A - 



AG{° + AG 2 ° 



This principle of bioenergetics explains how a thermody- 
namically unfavorable (endergonic) reaction can be 
driven in the forward direction by coupling it to a highly 
exergonic reaction through a common intermediate. For 
example, the synthesis of glucose 6-phosphate is the first 
step in the utilization of glucose by many organisms: 

Glucose + Pi > glucose 6-phosphate + H 2 0 

AG'° = 13.8 kJ/mol 

The positive value of AG'° predicts that under standard 
conditions the reaction will tend not to proceed sponta- 
neously in the direction written. Another cellular reac- 
tion, the hydrolysis of ATP to ADP and P i; is very 
exergonic: 

ATP + H 2 0 > ADP + ^ AG'° - -30.5 kJ/mol 

These two reactions share the common intermediates P t 
and H 2 0 and may be expressed as sequential reactions: 



(1) Glucose + Pi- 

(2) ATP + H 2 0 - 



-» glucose 6-phosphate + H 2 0 
► ADP + Pi 



Sum: ATP + glucose > ADP + glucose 6-phosphate 

The overall standard free-energy change is obtained by 
adding the AG'° values for individual reactions: 

AG'° = 13.8 kJ/mol + (-30.5 kJ/mol) = -16.7 kJ/mol 

The overall reaction is exergonic. In this case, energy 
stored in ATP is used to drive the synthesis of glucose 
6-phosphate, even though its formation from glucose and 
inorganic phosphate (PJ is endergonic. The pathway 
of glucose 6-phosphate formation from glucose by 
phosphoryl transfer from ATP is different from reactions 
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(1) and (2) above, but the net result is the same as the 
sum of the two reactions. In thermodynamic calcula- 
tions, all that matters is the state of the system at the be- 
ginning of the process and its state at the end; the route 
between the initial and final states is immaterial. 

We have said that AG'° is a way of expressing 
the equilibrium constant for a reaction. For reaction 
(1) above, 



[glucose 6-phosphate] 
[glucose] [PJ 



= 3.9 X 10~ 3 M _1 



Notice that H 2 0 is not included in this expression, as its 
concentration (55.5 m) is assumed to remain unchanged 
by the reaction. The equilibrium constant for the hydro- 
lysis of ATP is 



[ADPHPil 
[ATP] 



2.0 X 10 5 M 



The equilibrium constant for the two coupled reactions is 
[glucose 6-phosphate] [ADP][PJ 



[glucose] [PJ [ATP] 



= (X' eq KX' eq2 ) = (3-9 x 10" 3 m- 1 )(2.0 x 10 5 m) 
= 7.8 X 10 2 

This calculation illustrates an important point about 
equilibrium constants: although the AG'° values for two 
reactions that sum to a third, overall reaction are addi- 
tive, the iTeq for the overall reaction is the product of 
the individual K' eq values for the two reactions. Equilib- 
rium constants are multiplicative. By coupling ATP hy- 
drolysis to glucose 6-phosphate synthesis, the K' e(i for 
formation of glucose 6-phosphate from glucose has been 
raised by a factor of about 2 x 10 5 . 

This common-intermediate strategy is employed by 
all living cells in the synthesis of metabolic intermediates 
and cellular components. Obviously, the strategy works 
only if compounds such as ATP are continuously avail- 
able. In the following chapters we consider several of the 
most important cellular pathways for producing ATP. 

SUMMARY 13.1 Bioenergetics and 
Thermodynamics 

■ Living cells constantly perform work. They require 
energy for maintaining their highly organized 
structures, synthesizing cellular components, 
generating electric currents, and many other 
processes. 

■ Bioenergetics is the quantitative study of energy 
relationships and energy conversions in biological 
systems. Biological energy transformations obey 
the laws of thermodynamics. 

■ All chemical reactions are influenced by two forces: 
the tendency to achieve the most stable bonding 
state (for which enthalpy, H, is a useful expression) 
and the tendency to achieve the highest degree of 



randomness, expressed as entropy, S. The net 
driving force in a reaction is AG, the free-energy 
change, which represents the net effect of these 
two factors: AG = Atf - 7AS. 

m The standard transformed free-energy change, AG'°, 
is a physical constant that is characteristic for a given 
reaction and can be calculated from the equilibrium 
constant for the reaction: AG'° = -RT In K' eq . 

■ The actual free-energy change, AG, is a 
variable that depends on AG' 0 and on the 
concentrations of reactants and products: 
AG = AG' 0 + RT In ([products]/[reactants]). 

■ When AG is large and negative, the reaction tends 
to go in the forward direction; when AG is large and 
positive, the reaction tends to go in the reverse 
direction; and when AG = 0, the system is at 
equilibrium. 

■ The free-energy change for a reaction is 
independent of the pathway by which the reaction 
occurs. Free-energy changes are additive; the net 
chemical reaction that results from successive 
reactions sharing a common intermediate has an 
overall free-energy change that is the sum of the 
AG values for the individual reactions. 

13.2 Chemical Logic and Common 
Biochemical Reactions 

The biological energy transductions we are concerned 
with in this book are chemical reactions. Cellular chem- 
istry does not encompass every kind of reaction learned 
in a typical organic chemistry course. Which reactions 
take place in biological systems and which do not is de- 
termined by (1) their relevance to that particular meta- 
bolic system and (2) their rates. Both considerations 
play major roles in shaping the metabolic pathways we 
consider throughout the rest of the book. A relevant re- 
action is one that makes use of an available substrate 
and converts it to a useful product. However, even a po- 
tentially relevant reaction may not occur. Some chemi- 
cal transformations are too slow (have activation 
energies that are too high) to contribute to living sys- 
tems even with the aid of powerful enzyme catalysts. 
The reactions that do occur in cells represent a toolbox 
that evolution has used to construct metabolic pathways 
that circumvent the "impossible" reactions. Learning to 
recognize the plausible reactions can be a great aid in 
developing a command of biochemistry. 

Even so, the number of metabolic transformations 
taking place in a typical cell can seem overwhelming. 
Most cells have the capacity to carry out thousands of 
specific, enzyme-catalyzed reactions: for example, trans- 
formation of a simple nutrient such as glucose into amino 
acids, nucleotides, or lipids; extraction of energy from fu- 
els by oxidation; and polymerization of monomeric sub- 
units into macromolecules. 
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To study these reactions, some organization is es- 
sential. There are patterns within the chemistry of life; 
you do not need to learn every individual reaction 
to comprehend the molecular logic of biochemistry. 
Most of the reactions in living cells fall into one of five 
general categories: (1) reactions that make or break 
carbon-carbon bonds; (2) internal rearrangements, iso- 
merizations, and eliminations; (3) free-radical reactions; 
(4) group transfers; and (5) oxidation-reductions. We 
discuss each of these in more detail below and refer to 
some examples of each type in later chapters. Note that 
the five reaction types are not mutually exclusive; for 
example, an isomerization reaction may involve a free- 
radical intermediate. 

Before proceeding, however, we should review two 
basic chemical principles. First, a covalent bond consists 
of a shared pair of electrons, and the bond can be broken 
in two general ways (Fig. 13-1). In homolytic cleav- 
age, each atom leaves the bond as a radical, carrying 
one unpaired electron. In heterolytic cleavage, which 
is more common, one atom retains both bonding elec- 
trons. The species most often generated when C— C and 
C — H bonds are cleaved are illustrated in Figure 13-1. 
Carbanions, carbocations, and hydride ions are highly 
unstable; this instability shapes the chemistry of these 
ions, as we shall see. 

The second basic principle is that many biochemical 
reactions involve interactions between nucleophiles 
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FIGURE 13-1 Two mechanisms for cleavage of a C— C or C— H bond. 

In a homolytic cleavage, each atom keeps one of the bonding electrons, 
resulting in the formation of carbon radicals (carbons having unpaired 
electrons) or uncharged hydrogen atoms. In a heterolytic cleavage, one 
of the atoms retains both bonding electrons. This can result in the for- 
mation of carbanions, carbocations, protons, or hydride ions. 
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FIGURE 13-2 Common nucleophiles and electrophiles in biochemi- 
cal reactions. Chemical reaction mechanisms, which trace the for- 
mation and breakage of covalent bonds, are communicated with dots 
and curved arrows, a convention known informally as "electron push- 
ing/' A covalent bond consists of a shared pair of electrons. Non- 
bonded electrons important to the reaction mechanism are designated 
by dots (:). Curved arrows (^ N ) represent the movement of electron 
pairs. For movement of a single electron (as in a free radical reaction), 
a single-headed (fishhook-type) arrow is used (^ N ). Most reaction 
steps involve an unshared electron pair. 

(functional groups rich in and capable of donating elec- 
trons) and electrophiles (electron-deficient functional 
groups that seek electrons). Nucleophiles combine with 
and give up electrons to electrophiles. Common biological 
nucleophiles and electrophiles are shown in Figure 13-2 . 
Note that a carbon atom can act as either a nucleophile or 
an electrophile, depending on which bonds and functional 
groups surround it. 

Reactions That Make or Break Carbon-Carbon 
Bonds Heterolytic cleavage of a C — C bond yields a 
carbanion and a carbocation (Fig. 13-1). Con- 
versely, the formation of a C — C bond involves the 
combination of a nucleophilic carbanion and an elec- 
trophilic carbocation. Carbanions and carbocations are 
generally so unstable that their formation as reaction 
intermediates can be energetically inaccessible even 
with enzyme catalysts. For the purpose of cellular bio- 
chemistry they are impossible reactions — unless 
chemical assistance is provided in the form of func- 
tional groups containing electronegative atoms (0 and 
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N) that can alter the electronic structure of adjacent 
carbon atoms so as to stabilize and facilitate the forma- 
tion of carbanion and carbocation intermediates. 

Carbonyl groups are particularly important in the 
chemical transformations of metabolic pathways. The car- 
bon of a carbonyl group has a partial positive charge due 
to the electron-withdrawing property of the carbonyl oxy- 
gen, and thus is an electrophilic carbon (Fig. 13-3a). A 
carbonyl group can thus facilitate the formation of a car- 
banion on an adjoining carbon by delocalizing the carban- 
ion's negative charge (Fig. 13-3b). An imine (C=NH 2 ) 
group can serve a similar function (Fig. 13-3c). The ca- 
pacity of carbonyl and imine groups to delocalize elec- 
trons can be further enhanced by a general acid catalyst 
or by a metal ion such as Mg 2+ (Fig. 13-3d; see also Figs 
6-21 and 6-23). 

The importance of a carbonyl group is evident in 
three major classes of reactions in which C — C bonds 
are formed or broken (Fig. 13-4): aldol condensations, 
Claisen ester condensations, and decarboxylations. In 
each type of reaction, a carbanion intermediate is stabi- 
lized by a carbonyl group, and in many cases another 
carbonyl provides the electrophile with which the nucle- 
ophilic carbanion reacts. 

An aldol condensation is a common route to the 
formation of a C — C bond; the aldolase reaction, which 
converts a six-carbon compound to two three-carbon 
compounds in glycolysis, is an aldol condensation in re- 
verse (see Fig. 14-5). In a Claisen condensation, the 
carbanion is stabilized by the carbonyl of an adjacent 
thioester; an example is the synthesis of citrate in the cit- 
ric acid cycle (see Fig. 16-9). Decarboxylation also com- 
monly involves the formation of a carbanion stabilized 
by a carbonyl group; the acetoacetate decarboxylase 
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FI6URE 13-3 Chemical properties of carbonyl groups, (a) The carbon 
atom of a carbonyl group is an electrophile by virtue of the electron- 
withdrawing capacity of the electronegative oxygen atom, which re- 
sults in a resonance hybrid structure in which the carbon has a partial 
positive charge, (b) Within a molecule, derealization of electrons 
into a carbonyl group stabilizes a carbanion on an adjacent carbon, 
facilitating its formation, (c) Imines function much like carbonyl 
groups in facilitating electron withdrawal, (d) Carbonyl groups do not 
always function alone; their capacity as electron sinks often is aug- 
mented by interaction with either a metal ion (Me 2+ , such as Mg 2+ ) or 
a general acid (HA). 
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FIGURE 13-4 Some common reactions that form and break C— C 
bonds in biological systems. For both the aldol condensation and the 
Claisen condensation, a carbanion serves as nucleophile and the car- 
bon of a carbonyl group serves as electrophile. The carbanion is stabi- 
lized in each case by another carbonyl at the adjoining carbon. In the 
decarboxylation reaction, a carbanion is formed on the carbon shaded 
blue as the C0 2 leaves. The reaction would not occur at an apprecia- 
ble rate without the stabilizing effect of the carbonyl adjacent to the 
carbanion carbon. Wherever a carbanion is shown, a stabilizing reso- 
nance with the adjacent carbonyl, as shown in Figure 13-3b, is as- 
sumed. An imine (Fig. 13-3c) or other electron-withdrawing group 
(including certain enzymatic cofactors such as pyridoxal) can replace 
the carbonyl group in the stabilization of carbanions. 



reaction that occurs in the formation of ketone bodies 
during fatty acid catabolism provides an example (see 
Fig. 17-18). Entire metabolic pathways are organized 
around the introduction of a carbonyl group in a particu- 
lar location so that a nearby carbon-carbon bond can be 
formed or cleaved. In some reactions, an imine or a spe- 
cialized cofactor such as pyridoxal phosphate plays the 
electron-withdrawing role of the carbonyl group. 

The carbocation intermediate occurring in some re- 
actions that form or cleave C — C bonds is generated by 
the elimination of a very good leaving group, such as py- 
rophosphate (see Group Transfer Reactions below). An 
example is the prenyltransf erase reaction (Fig. 13-5), 
an early step in the pathway of cholesterol biosynthesis. 

Internal Rearrangements, Isomerizations, and 
Eliminations Another common type of cellular reac- 
tion is an intramolecular rearrangement in which redis- 
tribution of electrons results in alterations of many 
different types without a change in the overall oxidation 
state of the molecule. For example, different groups in a 
molecule may undergo oxidation-reduction, with no net 
change in oxidation state of the molecule; groups at a 
double bond may undergo a cis-trans rearrangement; or 
the positions of double bonds may be transposed. An ex- 
ample of an isomerization entailing oxidation-reduction 
is the formation of fructose 6-phosphate from glucose 
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FIGURE 13-5 Carbocations in carbon-carbon bond formation. In 

one of the early steps in cholesterol biosynthesis, the enzyme 
prenyl transferase catalyzes condensation of isopentenyl py- 
rophosphate and dimethylallyl pyrophosphate to form geranyl py- 
rophosphate (see Fig. 21-36). The reaction is initiated by 
elimination of pyrophosphate from the dimethylallyl pyrophos- 
phate to generate a carbocation, stabilized by resonance with the 
adjacent C=C bond. 



6-phosphate in glycolysis (Fig. 13-6; this reaction is 
discussed in detail in Chapter 14): C-l is reduced (alde- 
hyde to alcohol) and C-2 is oxidized (alcohol to ketone). 
Figure 13-6b shows the details of the electron move- 
ments in this type of isomerization. A cis-trans re- 
arrangement is illustrated by the prolyl cis-trans 
isomerase reaction in the folding of certain proteins (see 
Fig. 4-7b). A simple transposition of a C=C bond oc- 
curs during metabolism of oleic acid, a common fatty 
acid (see Fig. 17-9). Some spectacular examples of 
double-bond repositioning occur in the biosynthesis of 
cholesterol (see Fig. 21-33). 

An example of an elimination reaction that does not 
affect overall oxidation state is the loss of water from an 
alcohol, resulting in the introduction of a C=C bond: 
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Similar reactions can result from eliminations in amines. 



Free-Radical Reactions Once thought to be rare, the 
homolytic cleavage of covalent bonds to generate free 
radicals has now been found in a wide range of biochem- 
ical processes. These include: isomerizations that make 
use of adenosylcobalamin (vitamin B 12 ) or S-adenosyl- 
methionine, which are initiated with a 5'-deoxyadenosyl 
radical (see the methylmalonyl-CoA mutase reaction in 
Box 17-2); certain radical-initiated decarboxylation re- 
actions (Fig. 13-7); some reductase reactions, such as 
that catalyzed by ribonucleotide reductase (see Fig. 
22-41); and some rearrangement reactions, such as that 
catalyzed by DNA photolyase (see Fig. 25-27). 
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FIGURE 13-6 Isomerization and elimination reactions, (a) The con- 
version of glucose 6-phosphate to fructose 6-phosphate, a reaction of 
sugar metabolism catalyzed by phosphohexose isomerase. (b) This 
reaction proceeds through an enediol intermediate. The curved blue 
arrows represent movement of bonding electron pairs. Pink screens 
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follow the path of oxidation from left to right. and B 2 are ionizable 
groups on the enzyme; they are capable of donating and accepting 
protons (acting as general acids or general bases) as the reaction pro- 
ceeds. Pink screens indicate nucleophilic groups; blue, electrophilic. 
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FIGURE 13-7 A free radical-initiated decarboxylation reaction. The 

biosynthesis of heme (see Fig. 22-24) in Escherichia coli includes a de- 
carboxylation step in which propionyl side chains on the copropor- 
phyrinogen III intermediate are converted to the vinyl side chains of 
protoporphyrinogen IX. When the bacteria are grown anaerobically, 
the enzyme oxygen-independent coproporphyrinogen 111 oxidase, also 
called HemN protein, promotes decarboxylation via the free-radical 



mechanism shown here. The acceptor of the released electron is not 
known. For simplicity, only the relevant portions of the large copropor- 
phyrinogen III and protoporphyrinogen molecules are shown; the en- 
tire structures are given in Figure 22-24. When E. coli are grown in the 
presence of oxygen, this reaction is an oxidative decarboxylation and 
is catalyzed by a different enzyme. 



Group Transfer Reactions The transfer of acyl, gly- 
ceryl, and phosphoryl groups from one nucleophile to 
another is common in living cells. Acyl group transfer 
generally involves the addition of a nucleophile to the 
carbonyl carbon of an acyl group to form a tetrahedral 
intermediate: 
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The chymotrypsin reaction is one example of acyl group 
transfer (see Fig. 6-21). Glycosyl group transfers in- 
volve nucleophilic substitution at C-l of a sugar ring, 
which is the central atom of an acetal. In principle, the 
substitution could proceed by an S N 1 or S N 2 pathway, as 
described in Figure 6-25 for the enzyme lysozyme. 

Phosphoryl group transfers play a special role in 
metabolic pathways, and these transfer reactions are 
discussed in detail in Section 13.3. A general theme in 
metabolism is the attachment of a good leaving group 
to a metabolic intermediate to "activate" the interme- 
diate for subsequent reaction. Among the better leav- 
ing groups in nucleophilic substitution reactions are 
inorganic orthophosphate (the ionized form of H 3 P0 4 
at neutral pH, a mixture of H2PO4 and HPO^ - , com- 
monly abbreviated PJ and inorganic pyrophosphate 
(P 2 OV> abbreviated PPJ; esters and anhydrides of 
phosphoric acid are effectively activated for reaction. 
Nucleophilic substitution is made more favorable by 
the attachment of a phosphoryl group to an otherwise 
poor leaving group such as — OH. Nucleophilic substi- 
tutions in which the phosphoryl group ( — PO3") 
serves as a leaving group occur in hundreds of meta- 
bolic reactions. 

Phosphorus can form five covalent bonds. The con- 
ventional representation of P^ (Fig. 13-8a), with three 
P — 0 bonds and one P=0 bond, is a convenient but 
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FIGURE 13-8 Alternative ways of showing the structure of inorganic 
orthophosphate. (a) In one (inadequate) representation, three oxygens 
are single-bonded to phosphorus, and the fourth is double-bonded, al- 
lowing the four different resonance structures shown here, (b) The res- 
onance structures can be represented more accurately by showing all 
four phosphorus-oxygen bonds with some double-bond character; the 
hybrid orbitals so represented are arranged in a tetrahedron with P at 
its center (c) When a nucleophile Z (in this case, the —OH on C-6 of 
glucose) attacks ATP, it displaces ADP (W). In this S N 2 reaction, a pen- 
tacovalent intermediate (d) forms transiently. 
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inaccurate picture. In P i? four equivalent phosphorus- 
oxygen bonds share some double-bond character, and 
the anion has a tetrahedral structure (Fig. 13-8b). Be- 
cause oxygen is more electronegative than phosphorus, 
the sharing of electrons is unequal: the central phospho- 
rus bears a partial positive charge and can therefore act 
as an electrophile. In a great many metabolic reactions, 
a phosphoryl group (— PO|~) is transferred from ATP 
to an alcohol, forming a phosphate ester (Fig. 13-8c), or 
to a carboxylic acid, forming a mixed anhydride. When a 
nucleophile attacks the electrophilic phosphorus atom 
in ATP, a relatively stable pentacovalent structure forms 
as a reaction intermediate (Fig. 13-8d). With departure 
of the leaving group (ADP), the transfer of a phosphoryl 
group is complete. The large family of enzymes that 
catalyze phosphoryl group transfers with ATP as donor 
are called kinases (Greek kinein, "to move"). Hexoki- 
nase, for example, "moves" a phosphoryl group from 
ATP to glucose. 

Phosphoryl groups are not the only groups that ac- 
tivate molecules for reaction. Thioalcohols (thiols), in 
which the oxygen atom of an alcohol is replaced with a 
sulfur atom, are also good leaving groups. Thiols acti- 
vate carboxylic acids by forming thioesters (thiol es- 
ters). In later chapters we discuss several reactions, 
including those catalyzed by the fatty acyl synthases in 
lipid synthesis (see Fig. 21-2), in which nucleophilic 
substitution at the carbonyl carbon of a thioester results 
in transfer of the acyl group to another moiety. 

Oxidation-Reduction Reactions Carbon atoms can 
exist in five oxidation states, depending on the elements 
with which they share electrons (Fig. 13-9), and tran- 
sitions between these states are of crucial importance in 
metabolism (oxidation-reduction reactions are the topic 
of Section 13.4). In many biological oxidations, a com- 
pound loses two electrons and two hydrogen ions (that 
is, two hydrogen atoms); these reactions are commonly 
called dehydrogenations and the enzymes that catalyze 
themarecaJIrd dHivdm^ unses (Fig, 13-10). In some, 
but not all, biological oxidations, a carbon atom becomes 
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FIGURE 13-9 The oxidation states of carbon in biomolecules. Each 
compound is formed by oxidation of the red carbon in the compound 
shown immediately above. Carbon dioxide is the most highly oxidized 
form of carbon found in living systems. 
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FIGURE 13-10 An oxidation-reduction reaction. Shown here is the 
oxidation of lactate to pyruvate. In this dehydrogenation, two electrons 
and two hydrogen ions (the equivalent of two hydrogen atoms) are re- 
moved from C-2 of lactate, an alcohol, to form pyruvate, a ketone. In 
cells the reaction is catalyzed by lactate dehydrogenase and the elec- 
trons are transferred to the cofactor nicotinamide adenine dinucleotide 
(NAD). This reaction is fully reversible; pyruvate can be reduced by 
electrons transferred from the cofactor. 

covalently bonded to an oxygen atom. The enzymes that 
catalyze these oxidations are generally called oxidases 
or, if the oxygen atom is derived directly from molecular 
oxygen (0 2 ), oxygenases. 

Every oxidation must be accompanied by a reduc- 
tion, in which an electron acceptor acquires the elec- 
trons removed by oxidation. Oxidation reactions 
generally release energy (think of camp fires: the 
compounds in wood are oxidized by oxygen mole- 
cules in the air). Most living cells obtain the energy 
needed for cellular work by oxidizing metabolic fuels 
such as carbohydrates or fat (photosynthetic organ- 
isms can also trap and use the energy of sunlight). 
The catabolic (energy-yielding) pathways described 
in Chapters 14 through 19 are oxidative reaction se- 
quences that result in the transfer of electrons from 
fuel molecules, through a series of electron carriers, 
to oxygen. The high affinity of 0 2 for electrons makes 
the overall electron-transfer process highly exer- 
gonic, providing the energy that drives ATP synthe- 
sis — the central goal of catabolism. 

Many of the reactions within these five classes are facil- 
itated by cofactors, in the form of coenzymes and metals 
(vitamin B 12 , S-adenosylmetMonine, folate, nicotinamide, 
and iron are some examples). Cofactors bind to en- 
zymes—in some cases reversibly, in other cases almost 
irreversibly— and confer on them the capacity to pro- 
mote a particular kind of chemistry (p. 184). Most cofac- 
tors participate in a narrow range of closely related 
reactions. In the following chapters, we will introduce 
and discuss each important cofactor at the point where 
we first encounter it. The cofactors provide another way 
to organize the study of biochemical processes, since 
the reactions facilitated by a given cofactor generally are 
mechanistically related. 

Biochemical and Chemical Equations Are Not Identical 

Biochemists write metabolic equations in a simplified 
way, and this is particularly evident for reactions involv- 
ing ATP. Phosphorylated compounds can exist in several 
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ionization states and, as we have noted, the different 
species can bind Mg 2+ . For example, at pH 7 and 2 mM 
Mg 2+ , ATP exists in the forms ATP 4 ", HATP 3 ", 
H 2 ATP 2 ~, MgHATP", and Mg 2 ATP. In thinking about the 
biological role of ATP, however, we are not always inter- 
ested in all this detail, and so we consider ATP as an en- 
tity made up of a sum of species, and we write its 
hydrolysis as the biochemical equation 

ATP + H 2 0 > ADP + Pi 

where ATP, ADP, and P^ are sums of species. The corre- 
sponding standard transformed equilibrium constant, 
K f eq = [ADP][PJ/[ATP], depends on the pH and the con- 
centration of free Mg 2+ . Note that H + and Mg 2+ do not 
appear in the biochemical equation because they are 
held constant. Thus a biochemical equation does not 
necessarily balance H, Mg, or charge, although it does 
balance all other elements involved in the reaction (C, 
N, 0, and P in the equation above). 

We can write a chemical equation that does balance 
for all elements and for charge. For example, when ATP 
is hydrolyzed at a pH above 8.5 in the absence of Mg 2+ , 
the chemical reaction is represented by 

ATP 4 " + H 2 0 > ADP 3 " + HPOl" + H + 

The corresponding equilibrium constant, K' eq = [ADP 3- ] 
[HP0 4 2 "][H + ]/[ATP 4 ~], depends only on temperature, 
pressure, and ionic strength. 

Both ways of writing a metabolic reaction have 
value in biochemistry. Chemical equations are needed 
when we want to account for all atoms and charges in 
a reaction, as when we are considering the mecha- 
nism of a chemical reaction. Biochemical equations 
are used to determine in which direction a reaction 
will proceed spontaneously, given a specified pH and 
[Mg 2+ ], or to calculate the equilibrium constant of 
such a reaction. 

Throughout this book we use biochemical equa- 
tions, unless the focus is on chemical mechanism, and 
we use values of AG' 0 and K r eq as determined at pH 7 and 
1 mMMg 2+ . 

SUMMARY 13.2 Chemical Logic and 
Common Biochemical 
Reactions 

■ Living systems make use of a large number of 
chemical reactions that can be classified into five 
general types. 

■ Carbonyl groups play a special role in reactions 
that form or cleave C — C bonds. Carbanion 
intermediates are common and are stabilized by 
adjacent carbonyl groups or, less often, by imines 
or certain cofactors. 

■ A redistribution of electrons can produce internal 
rearrangements, isomerizations, and eliminations. 



Such reactions include intramolecular oxidation- 
reduction, change in cis-trans arrangement 
at a double bond, and transposition of double 
bonds. 

■ Homolytic cleavage of covalent bonds to generate 
free radicals occurs in some pathways, such as in 
certain isomerization, decarboxylation, reductase, 
and rearrangement reactions. 

■ Phosphoryl transfer reactions are an especially 
important type of group transfer in cells, required 
for the activation of molecules for reactions that 
would otherwise be highly unfavorable. 

■ Oxidation-reduction reactions involve the loss or 
gain of electrons: one reactant gains electrons and 
is reduced, while the other loses electrons and is 
oxidized. Oxidation reactions generally release 
energy and are important in catabolism. 

13.3 Phosphoryl Group Transfers 
and ATP 

Having developed some fundamental principles of en- 
ergy changes in chemical systems and reviewed the com- 
mon classes of reactions, we can now examine the 
energy cycle in cells and the special role of ATP as the 
energy currency that links catabolism and anabolism 
(see Fig. 1-28). Heterotrophic cells obtain free energy 
in a chemical form by the catabolism of nutrient mole- 
cules, and they use that energy to make ATP from 
ADP and P it ATP then donates some of its chemical 
energy to endergonic processes such as the synthesis 
of metabolic intermediates and macromolecules from 
smaller precursors, the transport of substances across 
membranes against concentration gradients, and me- 
chanical motion. This donation of energy from ATP 
generally involves the covalent participation of ATP in 
the reaction that is to be driven, with the eventual re- 
sult that ATP is converted to ADP and Pi or, in some 
reactions, to AMP and 2 Pi. We discuss here the chem- 
ical basis for the large free-energy changes that ac- 
company hydrolysis of ATP and other high-energy 
phosphate compounds, and we show that most cases 
of energy donation by ATP involve group transfer, not 
simple hydrolysis of ATP To illustrate the range of en- 
ergy transductions in which ATP provides the energy, 
we consider the synthesis of information-rich macro- 
molecules, the transport of solutes across membranes, 
and motion produced by muscle contraction. 

The Free-Energy Change for ATP Hydrolysis Is 
Large and Negative 

Figure 13-11 summarizes the chemical basis for the 
relatively large, negative, standard free energy of hy- 
drolysis of ATP The hydrolytic cleavage of the terminal 
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phosphoric acid anhydride (phosphoanhydride) bond 
in ATP separates one of the three negatively charged 
phosphates and thus relieves some of the electrostatic 
repulsion in ATP; the Pi released is stabilized by the 
formation of several resonance forms not possible in 
ATP; and ADP 2- , the other direct product of hydrolysis, 



FIGURE 13-11 Chemical basis for the large free -energy change as- 
sociated with ATP hydrolysis. ® The charge separation that results 
from hydrolysis relieves electrostatic repulsion among the four nega- 
tive charges on ATP. © The product inorganic phosphate (P ; ) is sta- 
bilized by formation of a resonance hybrid, in which each of the four 
phosphorus-oxygen bonds has the same degree of double-bond 
character and the hydrogen ion is not permanently associated with 
any one of the oxygens. (Some degree of resonance stabilization also 
occurs in phosphates involved in ester or anhydride linkages, but 
fewer resonance forms are possible than for Pj.) (5) The product 
ADP 2 " immediately ionizes, releasing a proton into a medium of 
very low [H + ] (pH 7). A fourth factor (not shown) that favors ATP hy- 
drolysis is the greater degree of solvation (hydration) of the products 
Pi and ADP relative to ATP, which further stabilizes the products rel- 
ative to the reactants. 



immediately ionizes, releasing H + into a medium of 
very low [H + ] (~1CT 7 m). Because the concentrations 
of the direct products of ATP hydrolysis are, in the cell, 
far below the concentrations at equilibrium (Table 
13-5), mass action favors the hydrolysis reaction in 
the cell. 

The free-energy change for ATP hydrolysis is 
-30.5 kJ/mol under standard conditions, but the ac- 
tual free energy of hydrolysis (AG) of ATP in living 
cells is very different: the cellular concentrations of 
ATP, ADP, and Pi are not identical and are much lower 
than the 1.0 M of standard conditions (Table 13-5). Fur- 
thermore, Mg 2+ in the cytosol binds to ATP and ADP 
(Fig. 13-12), and for most enzymatic reactions that in- 
volve ATP as phosphoryl group donor, the true sub- 
strate is MgATP 2 ". The relevant AG' 0 is therefore that 
for MgATP 2 " hydrolysis. We can calculate AG for ATP 
hydrolysis using data such as those in Table 13-5. The 
actual free energy of hydrolysis of ATP under intracel- 
lular conditions is often called its phosphorylation 
potential, A(t p . 



TABLE 13-5 



Adenine Nucleotide, Inorganic Phosphate, and 







Concentration (him)* 






ATP 


ADP 1 


AMP 


Pi 


PCr 


Rat hepatocyte 


3.38 


1.32 


0.29 


4.8 


0 


Rat myocyte 


8.05 


0.93 


0.04 


8.05 


28 


Rat neuron 


2.59 


0.73 


0.06 


2.72 


4.7 


Human erythrocyte 


2.25 


0.25 


0.02 


1.65 


0 


E. coli cell 


7.90 


1.04 


0.82 


7.9 


0 



♦For erythrocytes the concentrations are those of the cytosol (human erythrocytes lack a nucleus and 
mitochondria). In the other types of cells the data are for the entire cell contents, although the cytosol and the 
mitochondria have very different concentrations of ADR PCr is phosphocreatine, discussed on p. 510. 
f This value reflects total concentration; the true value for free ADP may be much lower (p. 503). 
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FIGURE 13-12 Mg 2+ and ATP. Formation of Mg 2 + 
complexes partially shields the negative charges 
and influences the conformation of the phosphate 
groups in nucleotides such as ATP and ADP. 
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mm WORKED EXAMPLE 13-2 Calculation of AG P 

Calculate the actual free energy of hydrolysis of ATP, AG P , in human erythrocytes. 
The standard free energy of hydrolysis of ATP is -30.5 kJ/mol, and the concentra- 
tions of ATP, ADP, and Pi in erythrocytes are as shown in Table 13-5. Assume that 
the pH is 7.0 and the temperature is 37 °C (body temperature). What does this re- 
veal about the amount of energy required to synthesize ATP under the same cellu- 
lar conditions? 

Solution: The concentrations of ATP, ADP, and Pi in human erythrocytes are 2.25, 
0.25, and 1.65 mM, respectively. The actual free energy of hydrolysis of ATP under 
these conditions is given by the relationship (see Eqn 13-4) 



[ADPHPJ 

;iin 

Substituting the appropriate values we get 

r , , (0.25 X 10 _3 )(1.65 X 10" 3 ) 

AG P = -30.5 kJ/mol + (8.315 J/mol - K)(310 K) In — — T 

(2.25 X 10 ) 

= -30.5 kJ/mol + (2.58 kJ/mol) In 1.8 X 10" 4 

= -30.5kJ/mol + (2.58kJ/mol)(-8.6) 

= -30.5 kJ/mol - 22 kJ/mol 

= -52 kJ/mol 

(Note that the final answer has been rounded to the correct number of significant 
figures (52.5 rounded to 52), following rules for rounding 5 down to the nearest even 
number to avoid the bias inherent in rounding "up.") Thus AG P , the actual free- 
energy change for ATP hydrolysis in the intact erythrocyte (-52 kJ/mol), is much 
larger than the standard free-energy change (-30.5 kJ/mol). By the same token, the 
free energy required to synthesize ATP from ADP and Pi under the conditions pre- 
vailing in the erythrocyte would be 52 kJ/mol. 



Because the concentrations of ATP, ADP, and Pi differ 
from one cell type to another, AG P for ATP likewise differs 
among cells. Moreover, in any given cell, AG P can vary from 
time to time, depending on the metabolic conditions and 
how they influence the concentrations of ATP, ADP, P i} and 
H + (pH). We can calculate the actual free-energy change 
for any given metabolic reaction as it occurs in a cell, pro- 
viding we know the concentrations of all the reactants and 
products and other factors (such as pH, temperature, and 
[Mg 2+ ]) that may affect the actual free-energy change. 

To further complicate the issue, the total concentra- 
tions of ATP, ADP, Pi, and H + in a cell may be substan- 
tially higher than the free concentrations, which are the 
thermodynamically relevant values. The difference is 
due to tight binding of ATP, ADP, and Pj to cellular pro- 
teins. For example, the free [ADP] in resting muscle has 
been variously estimated at between 1 and 37 jam. Using 
the value 25 pM in Worked Example 13-2, we would get 
a AG P of -64 kJ/mol. Calculation of the exact value of 
AG P , however, is perhaps less instructive than the gen- 
eralization we can make about actual free-energy 
changes: in vivo, the energy released by ATP hydrolysis 
is greater than the standard free-energy change, AG'°. 

In the following discussions we use the AG'° value 
for ATP hydrolysis because this allows comparison, on 



the same basis, with the energetics of other cellular re- 
actions. Always keep in mind, however, that in living 
cells AG is the relevant quantity— for ATP hydrolysis 
and all other reactions— and may be quite different 
from AG' 0 . 

Here we must make an important point about cellu- 
lar ATP levels. We have shown (and will discuss further) 
how the chemical properties of ATP make it a suitable 
form of energy currency in cells. But it is not merely the 
molecule's intrinsic chemical properties that give it this 
ability to drive metabolic reactions and other energy- 
requiring processes. Even more important is that, in the 
course of evolution, there has been a very strong selec- 
tive pressure for regulatory mechanisms that hold cel- 
lular ATP concentrations far above the equilibrium 
concentrations for the hydrolysis reaction. When the 
ATP level drops, not only does the amount of fuel 
decrease, but the fuel itself loses its potency: AG for 
its hydrolysis (that is, its phosphorylation potential, 
AG P ) is diminished. As our discussions of the metabolic 
pathways that produce and consume ATP will show, 
living cells have developed elaborate mechanisms— 
often at what might seem to us the expense of efficiency 
and common sense— to maintain high concentrations 
of ATP. 
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FIGURE 13-13 Hydrolysis of phosphoenol pyru- 
vate (PEP). Catalyzed by pyruvate kinase, this re- 
action is followed by spontaneous tautomerization 
of the product, pyruvate. Tautomerization is not 
possible in PEP, and thus the products of hydroly- 
sis are stabilized relative to the reactants. Reso- 
nance stabilization of Pj also occurs, as shown in 
Figure 13-11. 
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Other Phosphorylated Compounds and Thioesters 
Also Have Large Free Energies of Hydrolysis 

Phosphoenolpyruvate (PEP; Fig. 13-13) contains a 
phosphate ester bond that undergoes hydrolysis to 
yield the enol form of pyruvate, and this direct product 
can immediately tautomerize to the more stable keto 
form. Because the reactant (PEP) has only one form 
(enol) and the product (pyruvate) has two possible 
forms, the product is stabilized relative to the reactant. 
This is the greatest contributing factor to the high stan- 
dard free energy of hydrolysis of phosphoenolpyruvate: 
AG'° = -61.9kJ/mol. 

Another three-carbon compound, 1,3-bisphospho- 
glycerate (Fig. 13-14), contains an anhydride bond 
between the C-l carboxyl group and phosphoric acid. 
Hydrolysis of this acyl phosphate is accompanied by a 
large, negative, standard free-energy change (AG'° = 
-49.3 kJ/mol), which can, again, be explained in terms 



of the structure of reactant and products. When H 2 0 is 
added across the anhydride bond of 1,3-bisphospho- 
glycerate, one of the direct products, 3-phosphoglyc- 
eric acid, can immediately lose a proton to give the 
carboxylate ion, 3-phosphoglycerate, which has two 
equally probable resonance forms (Fig. 13-14). Re- 
moval of the direct product (3-phosphoglyceric acid) 
and formation of the resonance-stabilized ion favor the 
forward reaction. 

In phosphocreatine (Fig. 13-15), the P— N bond 
can be hydrolyzed to generate free creatine and P it The 
release of P { and the resonance stabilization of creatine 
favor the forward reaction. The standard free-energy 
change of phosphocreatine hydrolysis is again large, 
-43.0 kJ/mol. 

In all these phosphate-releasing reactions, the sev- 
eral resonance forms available to Pi (Fig. 13-11) stabi- 
lize this product relative to the reactant, contributing to 
an already negative free-energy change. Table 13-6 lists 
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FIGURE 13-14 Hydrolysis of 1,3- 
bisphosphoglycerate. The direct prod- 
uct of hydrolysis is 3-phosphoglyceric 
acid, with an undissociated car- 
boxylic acid group, but dissociation 
occurs immediately. This ionization 
and the resonance structures it makes 
possible stabilize the product relative 
to the reactants. Resonance stabiliza- 
tion of Pj further contributes to the 
negative free-energy change. 
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FIGURE 13-15 Hydrolysis of phospho- 
creatine. Breakage of the P— N bond in 
phosphocreatine produces creatine, 
which is stabilized by formation of a 
resonance hybrid. The other product, 
Pi, is also resonance stabilized. 
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TABLE 13-6 


Standard Free Energies of Hydrolysis of 
Some Phosphorylated Compounds and 
AcetyKoA (aThioester) 








(kJ/mol) (kcal/mol) 



f 

CH 3 — C Acetyl-CoA 
S-CoA 



Phosphoenolpyruvate 


-61.9 


-14.8 


1 ,3-bisphosphoglycerate 






(-> 3-phosphoglycerate + Pj) 


-49.3 


-11.8 


Phosphocreatine 


-43.0 


-10.3 


AFiP f \ AMP -1- P ^ 




/ .0 


ATP (-> ADP + Pi) 


-30.5 


-7.3 


ATP (-> AMP + PPO 


-45.6 


-10.9 


AMP (-> adenosine + 


-14.2 


-3.4 


PPi (-> 2P0 


-19.2 


-4.0 


Glucose 3-phosphate 


-20.9 


-5.0 


Fructose 6-phosphate 


-15.9 


-3.8 


Glucose 6-phosphate 


-13.8 


-3.3 


Glycerol 3-phosphate 


-9.2 


-2.2 


Acetyl-CoA 


-31.4 


-7.5 



Source: Data mostly from Jencks, W.R (1976) in Handbook of Biochemistry and 
Molecular Biology, 3rd edn (Fasman, G.D., ed.), Physical and Chemical Data, Vol. 1, pp. 
296-304, CRC Press, Boca Raton, FL. The value for the free energy of hydrolysis of PP, 
is from Frey, RA. & Arabshahi T A. (1995) Standard free-energy change for the hydrolysis 
of the o:-/3-phosphoanhydride bridge in ATP Biochemistry 34, 11,307-11,310. 



the standard free energies of hydrolysis for some biolog- 
ically important phosphorylated compounds. 

Thioesters, in which a sulfur atom replaces the 
usual oxygen in the ester bond, also have large, negative, 
standard free energies of hydrolysis. Acetyl-coenzyme A, 
or acetyl-CoA (Fig. 13-16), is one of many thioesters 
important in metabolism. The acyl group in these com- 
pounds is activated for transacylation, condensation, 
or oxidation-reduction reactions. Thioesters undergo 
much less resonance stabilization than do oxygen es- 
ters; consequently, the difference in free energy be- 
tween the reactant and its hydrolysis products, which 
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FIGURE 13-16 Hydrolysis of acetyl-coenzyme A. Acetyl-CoA is a 
thioester with a large, negative, standard free energy of hydrolysis. 
Thioesters contain a sulfur atom in the position occupied by an oxy- 
gen atom in oxygen esters. The complete structure of coenzyme A 
(CoA, or CoASH) is shown in Figure 8-38. 



are resonance-stabilized, is greater for thioesters than 
for comparable oxygen esters (Fig. 13-17). In both 
cases, hydrolysis of the ester generates a carboxylic 
acid, which can ionize and assume several resonance 
forms. Together, these factors result in the large, nega- 
tive AG'° (-31.4 kJ/mol) for acetyl-CoA hydrolysis. 

To summarize, for hydrolysis reactions with large, 
negative, standard free-energy changes, the products 
are more stable than the reactants for one or more of the 
following reasons: (1) the bond strain in reactants due to 
electrostatic repulsion is relieved by charge separation, 
as for ATP; (2) the products are stabilized by ionization, 
as for ATP, acyl phosphates, and thioesters; (3) the prod- 
ucts are stabilized by isomerization (tautomerization), as 
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FIGURE 13-17 Free energy of hydrolysis for 
thioesters and oxygen esters. The products of 
both types of hydrolysis reaction have about the 
same free-energy content (C), but the thioester 
has a higher free-energy content than the oxygen 
ester. Orbital overlap between the O and C atoms 
allows resonance stabilization in oxygen esters; 
orbital overlap between S and C atoms is poorer 
and provides little resonance stabilization. 
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for PEP; and/or (4) the products are stabilized by reso- 
nance, as for creatine released from phosphocreatine, 
carboxylate ion released from acyl phosphates and 
thioesters, and phosphate (PO released from anhydride 
or ester linkages. 

ATP Provides Energy by Group Transfers, 
Not by Simple Hydrolysis 

Throughout this book you will encounter reactions or 
processes for which ATP supplies energy, and the con- 
tribution of ATP to these reactions is commonly indi- 
cated as in Figure 13-18a, with a single arrow showing 
the conversion of ATP to ADP and Pj (or, in some cases, 
of ATP to AMP and pyrophosphate, PPJ. When written 
this way, these reactions of ATP seem to be simple hy- 
drolysis reactions in which water displaces Pj (or PPO, 
and one is tempted to say that an ATP-dependent reac- 
tion is "driven by the hydrolysis of ATP" This is not the 
case. ATP hydrolysis per se usually accomplishes noth- 
ing but the liberation of heat, which cannot drive a 
chemical process in an isothermal system. A single re- 
action arrow such as that in Figure 13-18a almost in- 
variably represents a two-step process (Fig. 13-18b) 
in which part of the ATP molecule, a phosphoryl or 

(a) Written as a one-step reaction 



COO" coo- 
H 3 N-CH ATP ADP * P. H 3 N-CH 




Enzyme-bound 
glutamyl phosphate 

(b) Actual two-step reaction 

FIGURE 13-18 ATP hydrolysis in two steps, (a) The contribution of ATP 
to a reaction is often shown as a single step, but is almost always a two- 
step process, (b) Shown here is the reaction catalyzed by ATP-dependent 
glutamine synthetase. (T) A phosphoryl group is transferred from ATP 
to glutamate, then © the phosphoryl group is displaced by NH 3 and 
released as P ; . 



pyrophosphoryl group or the adenylate moiety (AMP), 
is first transferred to a substrate molecule or to an 
amino acid residue in an enzyme, becoming covalently 
attached to the substrate or the enzyme and raising its 
free-energy content. Then, in a second step, the phos- 
phate-containing moiety transferred in the first step is 
displaced, generating P i5 PP i3 or AMP. Thus ATP partic- 
ipates covalently in the enzyme-catalyzed reaction to 
which it contributes free energy. 

Some processes do involve direct hydrolysis of ATP 
(or GTP), however. For example, noncovalent binding 
of ATP (or GTP), followed by its hydrolysis to ADP (or 
GDP) and P t , can provide the energy to cycle some pro- 
teins between two conformations, producing mechani- 
cal motion. This occurs in muscle contraction (see 
Fig. 5-31), and in the movement of enzymes along DNA 
(see Fig. 25-35) or of ribosomes along messenger RNA 
(see Fig. 27-30). The energy-dependent reactions cat- 
alyzed by helicases, RecA protein, and some topoiso- 
merases (Chapter 25) also involve direct hydrolysis of 
phosphoanhydride bonds. The AAA+ ATPases involved 
in DNA replication and other processes described in 
Chapter 25 use ATP hydrolysis to cycle associated pro- 
teins between active and inactive forms. GTP-binding 
proteins that act in signaling pathways directly hy- 
drolyze GTP to drive conformational changes that termi- 
nate signals triggered by hormones or by other 
extracellular factors (Chapter 12). 

The phosphate compounds found in living organisms 
can be divided somewhat arbitrarily into two groups, 
based on their standard free energies of hydrolysis 
(Fig. 13-19). "High-energy" compounds have a AG'° of 
hydrolysis more negative than -25 kJ/mol; "low-energy" 
compounds have a less negative AG'°. Based on this cri- 
terion, ATP, with a AG'° of hydrolysis of -30.5 kJ/mol 
(-7.3 kcal/mol), is a high-energy compound; glucose 
6-phosphate, with a AG'° of hydrolysis of -13.8 kJ/mol 
(-3.3 kcal/mol), is a low-energy compound. 

The term "high-energy phosphate bond," long used 
by biochemists to describe the P — 0 bond broken in 
hydrolysis reactions, is incorrect and misleading as it 
wrongly suggests that the bond itself contains the en- 
ergy. In fact, the breaking of all chemical bonds re- 
quires an input of energy. The free energy released by 
hydrolysis of phosphate compounds does not come 
from the specific bond that is broken; it results from 
the products of the reaction having a lower free-energy 
content than the reactants. For simplicity, we will 
sometimes use the term "high-energy phosphate com- 
pound" when referring to ATP or other phosphate 
compounds with a large, negative, standard free en- 
ergy of hydrolysis. 

As is evident from the additivity of free-energy 
changes of sequential reactions (see Section 13.1), any 
phosphorylated compound can be synthesized by cou- 
pling the synthesis to the breakdown of another phos- 
phorylated compound with a more negative free energy 
of hydrolysis. For example, because cleavage of P^ from 
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FIGURE 13-19 Ranking of biological phosphate 
compounds by standard free energies of hydrolysis. 

This shows the flow of phosphoryl groups, repre- 
sented by (p), from high-energy phosphoryl group 
donors via ATP to acceptor molecules (such as glu- 
cose and glycerol) to form their low-energy phos- 
phate derivatives. This flow of phosphoryl groups, 
catalyzed by kinases, proceeds with an overall loss 
of free energy under intracellular conditions. Hy- 
drolysis of low-energy phosphate compounds re- 
leases Pi, which has an even lower phosphoryl 
group transfer potential (as defined in the text). 



phosphoenolpyruvate releases more energy than is 
needed to drive the condensation of P { with ADP, the di- 
rect donation of a phosphoryl group from PEP to ADP is 
thermodynamically feasible: 



(1) 

(2) 



PEP + H 2 0 
ADP + P: — 



— > pyruvate + P { 
ATP + H 2 0 



AG'°(kJ/mol) 

-61.9 
+30.5 



Sum: 



PEP + ADP 



pyruvate + ATP 



-31.4 



Notice that while the overall reaction is represented as 
the algebraic sum of the first two reactions, the overall 
reaction is actually a third, distinct reaction that does 
not involve P^; PEP donates a phosphoryl group di- 
rectly to ADP. We can describe phosphorylated com- 
pounds as having a high or low phosphoryl group 
transfer potential, on the basis of their standard free en- 
ergies of hydrolysis (as listed in Table 13-6). The phos- 
phoryl group transfer potential of PEP is very high, that 
of ATP is high, and that of glucose 6-phosphate is low 
(Fig. 13-19). 

Much of catabolism is directed toward the synthesis 
of high-energy phosphate compounds, but their forma- 
tion is not an end in itself; they are the means of activat- 
ing a very wide variety of compounds for further 
chemical transformation. The transfer of a phosphoryl 
group to a compound effectively puts free energy into 
that compound, so that it has more free energy to give 
up during subsequent metabolic transformations. We 
described above how the synthesis of glucose 6-phos- 
phate is accomplished by phosphoryl group transfer 
from ATP. In the next chapter we see how this phospho- 
rylation of glucose activates, or "primes," the glucose for 
catabolic reactions that occur in nearly every living cell. 
Because of its intermediate position on the scale of 



group transfer potential, ATP can carry energy from 
high-energy phosphate compounds produced by catabo- 
lism to compounds such as glucose, converting them 
into more reactive species. ATP thus serves as the uni- 
versal energy currency in all living cells. 

One more chemical feature of ATP is crucial to its 
role in metabolism: although in aqueous solution ATP is 
thermodynamically unstable and is therefore a good 
phosphoryl group donor, it is kinetically stable. Because 
of the huge activation energies (200 to 400 kJ/mol) re- 
quired for uncatalyzed cleavage of its phosphoanhy- 
dride bonds, ATP does not spontaneously donate 
phosphoryl groups to water or to the hundreds of other 
potential acceptors in the cell. Only when specific en- 
zymes are present to lower the energy of activation does 
phosphoryl group transfer from ATP proceed. The cell is 
therefore able to regulate the disposition of the energy 
carried by ATP by regulating the various enzymes that 
act on it. 

ATP Donates Phosphoryl, Pyrophosphoryl, 
and Adenylyi Groups 

The reactions of ATP are generally S N 2 nucleophilic 
displacements (see Section 13.2) in which the nucle- 
ophile may be, for example, the oxygen of an alcohol or 
carboxylate, or a nitrogen of creatine or of the side 
chain of arginine or histidine. Each of the three phos- 
phates of ATP is susceptible to nucleophilic attack 
(Fig. 13-20), and each position of attack yields a dif- 
ferent type of product. 

Nucleophilic attack by an alcohol on the y phosphate 
(Fig. 13-20a) displaces ADP and produces a new phos- 
phate ester. Studies with 18 0-labeled reactants have 
shown that the bridge oxygen in the new compound is 
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FIGURE 13-20 Nucleophilic displacement reactions of 
ATP. Any of the three P atoms (a, p, or y) may serve as the 
electrophilic target for nucleophilic attack— in this case, 
by the labeled nucleophile R — 18 0: The nucleophile 
may be an alcohol (ROH), a carboxyl group (RCOCT), or 
a phosphoanhydride (a nucleoside mono- or diphos- 
phate, for example), (a) When the oxygen of the nucleo- 
phile attacks the y position, the bridge oxygen of the 
product is labeled, indicating that the group transferred 
from ATP is a phosphoryl (— POf"), not a phosphate 
(— OPOiH. (b) Attack on the p position displaces AMP 
and leads to the transfer of a pyrophosphoryl (not py- 
rophosphate) group to the nucleophile. (c) Attack on the 
a position displaces PPj and transfers the adenylyl group 
to the nucleophile. 
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derived from the alcohol, not from ATP; the group trans- 
ferred from ATP is therefore a phosphoryl (— P0§"), not 
a phosphate (— OPOg - ). Phosphoryl group transfer from 
ATP to glutamate (Fig. 13-18) or to glucose (p. 212) in- 
volves attack at the y position of the ATP molecule. 

Attack at the phosphate of ATP displaces AMP and 
transfers a pyrophosphoryl (not pyrophosphate) group to 
the attacking nucleophile (Fig. 13-20b). For example, the 
formation of 5-phosphoribosyl-l -pyrophosphate (p. 861), 
a key intermediate in nucleotide synthesis, results from 
attack of an — OH of the ribose on the /3 phosphate. 

Nucleophilic attack at the a position of ATP dis- 
places PPi and transfers adenylate (5 '-AMP) as an 
adenylyl group (Fig. 13-20c); the reaction is an adeny- 
lylation (a-den'-i-li-la'-shun, one of the most ungainly 
words in the biochemical language). Notice that hydro- 
lysis of the a~/3 phosphoanhydride bond releases con- 
siderably more energy (-46 kJ/mol) than hydrolysis of 
the p-y bond (-31 kJ/mol) (Table 13-6). Furthermore, 
the PPi formed as a byproduct of the adenylylation is hy- 
drolyzed to two Pi by the ubiquitous enzyme inorganic 
pyrophosphatase, releasing 19 kJ/mol and thereby 
providing a further energy "push" for the adenylylation 
reaction. In effect, both phosphoanhydride bonds of 
ATP are split in the overall reaction. Adenylylation reac- 
tions are therefore thermodynamically very favorable. 
When the energy of ATP is used to drive a particularly 
unfavorable metabolic reaction, adenylylation is often 
the mechanism of energy coupling. Fatty acid activation 
is a good example of this energy-coupling strategy. 

The first step in the activation of a fatty acid— 
either for energy-yielding oxidation or for use in the 
synthesis of more complex lipids— is the formation of 
its thiol ester (see Fig. 17-5). The direct condensation 
of a fatty acid with coenzyme A is endergonic, but the 
formation of fatty acyl-CoA is made exergonic by step- 
wise removal of two phosphoryl groups from ATP. 
First, adenylate (AMP) is transferred from ATP to the 
carboxyl group of the fatty acid, forming a mixed anhy- 



dride (fatty acyl adenylate) and liberating PPi. The 
thiol group of coenzyme A then displaces the adenylyl 
group and forms a thioester with the fatty acid. The 
sum of these two reactions is energetically equivalent 
to the exergonic hydrolysis of ATP to AMP and PPi 
(AG'° = -45.6 kJ/mol) and the endergonic formation 
of fatty acyl-CoA (AG'° = 31.4 kJ/mol). The formation 
of fatty acyl-CoA is made energetically favorable by 
hydrolysis of the PPi by inorganic pyrophosphatase. 
Thus, in the activation of a fatty acid, both phosphoan- 
hydride bonds of ATP are broken. The resulting AG'° is 
the sum of the AG'° values for the breakage of these 
bonds, or -45.6 kJ/mol + (-19.2) kJ/mol: 

ATP + 2H 2 0 > AMP + 2Pi AG' 0 = -64.8kJ/mol 

The activation of amino acids before their polymer- 
ization into proteins (see Fig. 27-19) is accomplished by 
an analogous set of reactions in which a transfer RNA 
molecule takes the place of coenzyme A. An interesting 
use of the cleavage of ATP to AMP and PPi occurs in the 
firefly, which uses ATP as an energy source to produce 
light flashes (Box 13-1). 

Assembly of Informational Macromolecules 
Requires Energy 

When simple precursors are assembled into high molec- 
ular weight polymers with defined sequences (DNA, 
RNA, proteins), as described in detail in Part III, energy 
is required both for the condensation of monomeric 
units and for the creation of ordered sequences. The 
precursors for DNA and RNA synthesis are nucleoside 
triphosphates, and polymerization is accompanied by 
cleavage of the phosphoanhydride linkage between the 
a and /3 phosphates, with the release of PPi (Fig. 13-20). 
The moieties transferred to the growing polymer in 
these reactions are adenylate (AMP), guanylate (GMP), 
cytidylate (CMP), or uridylate (UMP) for RNA synthe- 
sis, and their deoxy analogs (with TMP in place of UMP) 
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BOX 13-1 Firefly Flashes: Glowing Reports of AT 



Bioluminescence requires considerable amounts of en- 
ergy. In the firefly, ATP is used in a set of reactions that 
converts chemical energy into light energy. In the 1950s, 
from many thousands of fireflies collected by children in 
and around Baltimore, William McElroy and his col- 
leagues at The Johns Hopkins University isolated the 
principal biochemical components: luciferin, a complex 
carboxylic acid, and luciferase, an enzyme. The genera- 
tion of a light flash requires activation of luciferin by an 
enzymatic reaction involving pyrophosphate cleavage of 
ATP to form luciferyl adenylate (Fig. 1). In the presence 
of molecular oxygen and luciferase, the luciferin under- 
goes a multistep oxidative decarboxylation to oxyluci- 



ferin. This process is accompanied by emission of light. 
The color of the light flash differs with the firefly species 
and seems to be determined by differences in the struc- 
ture of the luciferase. Luciferin is regenerated from oxy- 
luciferin in a subsequent series of reactions. 

In the laboratory, pure firefly luciferin and luciferase 
are used to measure minute quantities of ATP by the in- 
tensity of the light flash produced. As little as a few pico- 
moles (10~ 12 mol) of ATP can be measured in this way. 
An enlightening extension of the studies in luciferase 
was the cloning of the luciferase gene into tobacco 
plants. When watered with a solution containing lu- 
ciferin, the plants glowed in the dark (see Fig. 9-29). 




The firefly, a beetle of the Lampyridae family. 
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FIGURE 1 Important components in the firefly bioluminescence cycle. 



for DNA synthesis. As noted above, the activation of 
amino acids for protein synthesis involves the donation 
of adenylyl groups from ATP, and we shall see in Chap- 
ter 27 that several steps of protein synthesis on the ribo- 
some are also accompanied by GTP hydrolysis. In all 
these cases, the exergonic breakdown of a nucleoside 
triphosphate is coupled to the endergonic process of 
synthesizing a polymer of a specific sequence. 

ATP Energizes Active Transport and Muscle Contraction 

ATP can supply the energy for transporting an ion or a 
molecule across a membrane into another aqueous 
compartment where its concentration is higher (see 
Fig. 11-38). Transport processes are major consumers 
of energy; in human kidney and brain, for example, 
as much as two-thirds of the energy consumed at rest is 
used to pump Na + and K + across plasma membranes 



via the Na + K + ATPase. The transport of Na + and K + 
is driven by cyclic phosphorylation and dephosphoryla- 
tion of the transporter protein, with ATP as the phos- 
phoryl group donor (see Fig. 11-37). Na + -dependent 
phosphorylation of the Na + K + ATPase forces a change 
in the protein's conformation, and K + -dependent de- 
phosphorylation favors return to the original conforma- 
tion. Each cycle in the transport process results in the 
conversion of ATP to ADP and Pj, and it is the free- 
energy change of ATP hydrolysis that drives the cyclic 
changes in protein conformation that result in the elec- 
trogenic pumping of Na + and K + . Note that in this case 
ATP interacts covalently by phosphoryl group transfer 
to the enzyme, not the substrate. 

In the contractile system of skeletal muscle cells, 
myosin and actin are specialized to transduce the chem- 
ical energy of ATP into motion (see Fig. 5-31). ATP 
binds tightly but noncovalently to one conformation of 
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myosin, holding the protein in that conformation. When 
myosin catalyzes the hydrolysis of its bound ATP, the 
ADP and P^ dissociate from the protein, allowing it to re- 
lax into a second conformation until another molecule of 
ATP binds. The binding and subsequent hydrolysis of 
ATP (by myosin ATPase) provide the energy that forces 
cyclic changes in the conformation of the myosin head. 
The change in conformation of many individual myosin 
molecules results in the sliding of myosin fibrils along 
actin filaments (see Fig. 5-30), which translates into 
macroscopic contraction of the muscle fiber. As we noted 
earlier, this production of mechanical motion at the ex- 
pense of ATP is one of the few cases in which ATP hy- 
drolysis per se, rather than group transfer from ATP, is 
the source of the chemical energy in a coupled process. 

Transphosphorylations between Nucleotides 
Occur in All Cell Types 

Although we have focused on ATP as the cell's energy 
currency and donor of phosphoryl groups, all other nu- 
cleoside triphosphates (GTP, UTP, and OTP) and all de- 
oxynucleoside triphosphates (dATP, dGTP, dTTP, and 
dCTP) are energetically equivalent to ATP. The stan- 
dard free-energy changes associated with hydrolysis of 
their phosphoanhydride linkages are very nearly identi- 
cal with those shown in Table 13-6 for ATP. In prepara- 
tion for their various biological roles, these other 
nucleotides are generated and maintained as the nucle- 
oside triphosphate (NTP) forms by phosphoryl group 
transfer to the corresponding nucleoside diphosphates 
(NDPs) and monophosphates (NMPs). 

ATP is the primary high-energy phosphate com- 
pound produced by catabolism, in the processes of glyco- 
lysis, oxidative phosphorylation, and, in photosynthetic 
cells, photophosphorylation. Several enzymes then carry 
phosphoryl groups from ATP to the other nucleotides. 
Nucleoside diphosphate kinase, found in all cells, 
catalyzes the reaction 

Mg 2 ^ 

ATP + NDP (or dNDP) ; ADP + NTP (or dNTP) 

AG'° « 0 

Although this reaction is fully reversible, the relatively 
high [ATP]/[ADP] ratio in cells normally drives the reac- 



tion to the right, with the net formation of NTPs and 
dNTPs. The enzyme actually catalyzes a two-step phos- 
phoryl group transfer, which is a classic case of a double- 
displacement (Ping-Pong) mechanism (Fig. 13-21; see 
also Fig. 6-13b). First, phosphoryl group transfer from 
ATP to an active-site His residue produces a phosphoen- 
zyme intermediate; then the phosphoryl group is trans- 
ferred from the(P^His residue to an NDP acceptor. 
Because the enzyme is nonspecific for the base in the 
NDP and works equally well on dNDPs and NDPs, it can 
synthesize all NTPs and dNTPs, given the correspon- 
ding NDPs and a supply of ATP 

Phosphoryl group transfers from ATP result in an 
accumulation of ADP; for example, when muscle is con- 
tracting vigorously, ADP accumulates and interferes 
with ATP-dependent contraction. During periods of in- 
tense demand for ATP, the cell lowers the ADP concen- 
tration, and at the same time replenishes ATP, by the 
action of adenylate kinase: 

Mg 2 + 

2 ADP ; ATP + AMP AG'° « 0 

This reaction is fully reversible, so after the intense de- 
mand for ATP ends, the enzyme can recycle AMP by 
converting it to ADP, which can then be phosphorylated 
to ATP in mitochondria. A similar enzyme, guanylate ki- 
nase, converts GMP to GDP at the expense of ATP. By 
pathways such as these, energy conserved in the cata- 
bolic production of ATP is used to supply the cell with all 
required NTPs and dNTPs. 

Phosphocreatine (PCr; Fig. 13-15), also called cre- 
atine phosphate, serves as a ready source of phosphoryl 
groups for the quick synthesis of ATP from ADP. The 
PCr concentration in skeletal muscle is approximately 
30 mM, nearly 10 times the concentration of ATP, and in 
other tissues such as smooth muscle, brain, and kidney 
[PCr] is 5 to 10 mM. The enzyme creatine kinase cat- 
alyzes the reversible reaction 

ADP + PCr ; - ATP + Cr AG' 0 = -12.5kJ/mol 

When a sudden demand for energy depletes ATP, 
the PCr reservoir is used to replenish ATP at a rate 
considerably faster than ATP can be synthesized by 
catabolic pathways. When the demand for energy 
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FIGURE 13-21 Ping-Pong mechanism of nucleoside diphosphate kinase. The enzyme 
binds its first substrate (ATP in our example), and a phosphoryl group is transferred to the 
side chain of a His residue. ADP departs, and another nucleoside (or deoxynucleoside) 
diphosphate replaces it, and this is converted to the corresponding triphosphate by trans- 
fer of the phosphoryl group from the phosphohistidine residue. 
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slackens, ATP produced by catabolism is used to re- 
plenish the PCr reservoir by reversal of the creatine ki- 
nase reaction. Organisms in the lower phyla employ 
other PCr-like molecules (collectively called phospha- 
gens) as phosphoryl reservoirs. 

Inorganic Polyphosphate Is a Potential 
Phosphoryl Group Donor 

Inorganic polyphosphate, polyP (or (po]yP) TC , where n is 
the number of orthophosphate residues), is a linear 
polymer composed of many tens or hundreds of Pi 
residues linked through phosphoanhydride bonds. This 
polymer, present in all organisms, may accumulate to 
high levels in some cells. In yeast, for example, the 
amount of polyP that accumulates in the vacuoles would 
represent, if distributed uniformly throughout the cell, a 
concentration of 200 ium! (Compare this with the con- 
centrations of other phosphoryl group donors listed in 
Table 13-5.) 

0 0 0 0 0 

II II II II II 

-0— P-O-P— 0-P— O— P— 0-P— o— 

1 I I I I 

o- o- o- o- o- 

Inorganic polyphosphate (polyP) 

One potential role for polyP is to serve as a phos- 
phagen, a reservoir of phosphoryl groups that can be 
used to generate ATP, as creatine phosphate is used in 
muscle. PolyP has about the same phosphoryl group 
transfer potential as PPj. The shortest polyphosphate, 
PPi (n = 2), can serve as the energy source for active 
transport of H + across the vacuolar membrane in plant 
cells. For at least one form of the enzyme phosphofruc- 
tokinase in plants, PP[ is the phosphoryl group donor, a 
role played by ATP in animals and microbes (p. 533). 
The finding of high concentrations of polyP in volcanic 
condensates and steam vents suggests that it could have 
served as an energy source in prebiotic and early cellu- 
lar evolution. 

In bacteria, the enzyme polyphosphate kinase-1 
(PPK-1) catalyzes the reversible reaction 

ATP + polyP„ ; ADP + polyP„ +1 

AG'° = -20kJ/mol 

by a mechanism involving an enzyme-bound ©-His in- 
termediate (recall the mechanism of nucleoside diphos- 
phate kinase, described in Fig. 13-22). A second 
enzyme, polyphosphate kinase-2 (PPK-2), catalyzes 
the reversible synthesis of GTP (or ATP) from polyphos- 
phate and GDP (or ADP): 

Mn 2 + 

GDP + polyP, + i ; GTP + polyP„ 

PPK-2 is believed to act primarily in the direction of 
GTP and ATP synthesis, and PPK-1 in the direction 
of polyphosphate synthesis. PPK-1 and PPK-2 are 



present in a wide variety of bacteria, including many 
pathogenic species. 

In bacteria, elevated levels of polyP have been 
shown to promote expression of genes involved in adap- 
tation of the organism to conditions of starvation or 
other threats to survival. In Escherichia coli, for exam- 
ple, polyP accumulates when cells are starved for amino 
acids or P i} and this accumulation confers a survival ad- 
vantage. Deletion of the genes for polyphosphate ki- 
nases diminishes the ability of certain pathogenic 
bacteria to invade animal tissues. The enzymes may 
therefore prove to be suitable targets in the develop- 
ment of new antimicrobial drugs. 

No yeast gene encodes a PPK-like protein, but four 
genes— unrelated to bacterial PPK genes — are neces- 
sary for the synthesis of polyphosphate. The mechanism 
for polyphosphate synthesis in eukaryotes seems to be 
quite different from that in bacteria. 

SUMMARY 13.3 Phosphoryl Group 
Transfers and ATP 

■ ATP is the chemical link between catabolism and 
anabolism. It is the energy currency of the living 
cell. The exergonic conversion of ATP to ADP 
and Pi, or to AMP and PP b is coupled to many 
endergonic reactions and processes. 

■ Direct hydrolysis of ATP is the source of energy 
in some processes driven by conformational 
changes, but in general it is not ATP hydrolysis 
but the transfer of a phosphoryl, pyrophosphoryl, 
or adenylyl group from ATP to a substrate or 
enzyme that couples the energy of ATP 
breakdown to endergonic transformations of 
substrates. 

■ Through these group transfer reactions, ATP 
provides the energy for anabolic reactions, including 
the synthesis of informational macromolecules, and 
for the transport of molecules and ions across 
membranes against concentration gradients and 
electrical potential gradients. 

■ To maintain its high group transfer potential, ATP 
concentration must be held far above the equilibrium 
concentration by energy-yielding reactions of 
catabolism. 

■ Cells contain other metabolites with large, 
negative, free energies of hydrolysis, including 
phosphoenolpyruvate, 1,3-bisphosphoglycerate, and 
phosphocreatine. These high-energy compounds, 
like ATP, have a high phosphoryl group transfer 
potential. Thioesters also have high free energies of 
hydrolysis. 

■ Inorganic polyphosphate, present in all cells, may 
serve as a reservoir of phosphoryl groups with high 
group transfer potential. 
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13.4 Biological Oxidation-Reduction 
Reactions 

The transfer of phosphoryl groups is a central feature of 
metabolism. Equally important is another kind of trans- 
fer, electron transfer in oxidation-reduction reactions. 
These reactions involve the loss of electrons by one 
chemical species, which is thereby oxidized, and the gain 
of electrons by another, which is reduced. The flow of 
electrons in oxidation-reduction reactions is responsible, 
directly or indirectly, for all work done by living orga- 
nisms. In nonphotosynthetic organisms, the sources of 
electrons are reduced compounds (foods); in photosyn- 
thetic organisms, the initial electron donor is a chemical 
species excited by the absorption of light. The path of 
electron flow in metabolism is complex. Electrons move 
from various metabolic intermediates to specialized elec- 
tron carriers in enzyme-catalyzed reactions. The carriers 
in turn donate electrons to acceptors with higher elec- 
tron affinities, with the release of energy. Cells contain a 
variety of molecular energy transducers, which convert 
the energy of electron flow into useful work. 

We begin by discussing how work can be accom- 
plished by an electromotive force (emf), then consider 
the theoretical and experimental basis for measuring 
energy changes in oxidation reactions in terms of emf 
and the relationship between this force, expressed in 
volts, and the free-energy change, expressed in joules. 
We conclude by describing the structures and oxidation- 
reduction chemistry of the most common of the special- 
ized electron carriers, which you will encounter 
repeatedly in later chapters. 

The Flow of Electrons Can Do Biological Work 

Every time we use a motor, an electric light or heater, or 
a spark to ignite gasoline in a car engine, we use the flow 
of electrons to accomplish work. In the circuit that pow- 
ers a motor, the source of electrons can be a battery con- 
taining two chemical species that differ in affinity for 
electrons. Electrical wires provide a pathway for elec- 
tron flow from the chemical species at one pole of the 
battery, through the motor, to the chemical species at 
the other pole of the battery. Because the two chemical 
species differ in their affinity for electrons, electrons 
flow spontaneously through the circuit, driven by a force 
proportional to the difference in electron affinity, the 
electromotive force, emf. The emf (typically a few 
volts) can accomplish work if an appropriate energy 
transducer — in this case a motor— is placed in the cir- 
cuit. The motor can be coupled to a variety of mechani- 
cal devices to do useful work. 

Living cells have an analogous biological "circuit," 
with a relatively reduced compound such as glucose as 
the source of electrons. As glucose is enzymatically oxi- 
dized, the released electrons flow spontaneously through 
a series of electron-carrier intermediates to another 
chemical species, such as 0 2 . This electron flow is exer- 



gonic, because 0 2 has a higher affinity for electrons than 
do the electron-carrier intermediates. The resulting emf 
provides energy to a variety of molecular energy trans- 
ducers (enzymes and other proteins) that do biological 
work. In the mitochondrion, for example, membrane- 
bound enzymes couple electron flow to the production of 
a transmembrane pH difference and a transmembrane 
electrical potential, accomplishing osmotic and electrical 
work. The proton gradient thus formed has potential en- 
ergy, sometimes called the proton-motive force by anal- 
ogy with electromotive force. Another enzyme, ATP 
synthase in the inner mitochondrial membrane, uses the 
proton-motive force to do chemical work: synthesis of 
ATP from ADP and Pj as protons flow spontaneously 
across the membrane. Similarly, membrane-localized 
enzymes in E. coli convert emf to proton-motive force, 
which is then used to power flagellar motion. The princi- 
ples of electrochemistry that govern energy changes in 
the macroscopic circuit with a motor and battery apply 
with equal validity to the molecular processes accompa- 
nying electron flow in living cells. 

Oxidation-Reductions Can Be Described as Half-Reactions 

Although oxidation and reduction must occur together, 
it is convenient when describing electron transfers to 
consider the two halves of an oxidation-reduction reac- 
tion separately. For example, the oxidation of ferrous 
ion by cupric ion, 

Fe 2+ + Cu 2 + Fe 3+ + Cu + 

can be described in terms of two half-reactions: 

(1) Fe 2+ Fe 3+ + e" 

(2) Cu 2+ +e~ ^ Cu + 

The electron-donating molecule in an oxidation-reduction 
reaction is called the reducing agent or reductant; the 
electron-accepting molecule is the oxidizing agent or 
oxidant. A given agent, such as an iron cation existing in 
the ferrous (Fe 2+ ) or ferric (Fe 3+ ) state, functions as a 
conjugate reductant-oxidant pair (redox pair), just as 
an acid and corresponding base function as a conjugate 
acid-base pair. Recall from Chapter 2 that in acid-base 
reactions we can write a general equation: proton donor 
H + + proton acceptor. In redox reactions we can 
write a similar general equation: electron donor (reduc- 
tant) =f=^ e~ + electron acceptor (oxidant). In the 
reversible half-reaction (1) above, Fe 2+ is the electron 
donor and Fe 3+ is the electron acceptor; together, Fe 2+ 
and Fe 3+ constitute a conjugate redox pair. 

The electron transfers in the oxidation-reduction 
reactions of organic compounds are not fundamentally 
different from those of inorganic species. In Chapter 7 
we considered the oxidation of a reducing sugar (an 
aldehyde or ketone) by cupric ion (see Fig. 7-10): 

P 9 P 

R-C x + 40H~ + 2Cu 2+ ;==s R-C + Cu 2 0 + 2H 2 0 

H OH 
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This overall reaction can be expressed as two half- 
reactions: 



,0 



(1) R-C + 20H" ^ R-C + 2e" + H 2 0 

H OH 

(2) 2Cu 2+ + 2e~ + 20H" ^ Cu 2 0 + H 2 0 



Methane 



Ethane 
(alkane) 



H 

H:C:H 
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H H 
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H H 



Because two electrons are removed from the aldehyde 
carbon, the second half-reaction (the one-electron re- 
duction of cupric to cuprous ion) must be doubled to 
balance the overall equation. 

Biological Oxidations Often Involve Dehydrogenation 

The carbon in living cells exists in a range of oxidation 
states (Fig. 13-22). When a carbon atom shares an 
electron pair with another atom (typically H, C, S, N, or 
0), the sharing is unequal in favor of the more elec- 
tronegative atom. The order of increasing electronega- 
tivity isH<C<S<N<0. In oversimplified but useful 
terms, the more electronegative atom "owns" the bond- 
ing electrons it shares with another atom. For example, 
in methane (CH 4 ), carbon is more electronegative than 
the four hydrogens bonded to it, and the C atom there- 
fore "owns" all eight bonding electrons (Fig. 13-22). In 
ethane, the electrons in the C — C bond are shared 
equally, so each C atom "owns" only seven of its eight 
bonding electrons. In ethanol, C-l is less electronegative 
than the oxygen to which it is bonded, and the 0 atom 
therefore "owns" both electrons of the C — 0 bond, leav- 
ing C-l with only five bonding electrons. With each for- 
mal loss of "owned" electrons, the carbon atom has 
undergone oxidation — even when no oxygen is in- 
volved, as in the conversion of an alkane 
(— CH 2 — CH 2 — ) to an alkene (— CH=CH— ). In this 
case, oxidation (loss of electrons) is coincident with the 
loss of hydrogen. In biological systems, as we noted ear- 
lier in the chapter, oxidation is often synonymous with 
dehydrogenation and many enzymes that catalyze 
oxidation reactions are dehydrogenases. Notice that 
the more reduced compounds in Figure 13-22 (top) are 
richer in hydrogen than in oxygen, whereas the more 
oxidized compounds (bottom) have more oxygen and 
less hydrogen. 

Not all biological oxidation-reduction reactions in- 
volve carbon. For example, in the conversion of molecu- 
lar nitrogen to ammonia, 6H + + 6e~ 4- N 2 — > 2NH 3 , the 
nitrogen atoms are reduced. 

Electrons are transferred from one molecule (elec- 
tron donor) to another (electron acceptor) in one of 
four ways: 

1. Directly as electrons. For example, the Fe 2+ /Fe 3+ 
redox pair can transfer an electron to the 
Cu + /Cu 2+ redox pair: 

Fe 2+ + Cu 2+ ^ Fe 3+ + Cu + 
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FIGURE 13-22 Oxidation states of carbon in the biosphere. The oxi- 
dation states are illustrated with some representative compounds. Fo- 
cus on the red carbon atom and its bonding electrons. When this 
carbon is bonded to the less electronegative H atom, both bonding 
electrons (red) are assigned to the carbon. When carbon is bonded to 
another carbon, bonding electrons are shared equally, so one of the 
two electrons is assigned to the red carbon. When the red carbon is 
bonded to the more electronegative O atom, the bonding electrons are 
assigned to the oxygen. The number to the right of each compound is 
the number of electrons "owned" by the red carbon, a rough expres- 
sion of the oxidation state of that carbon. As the red carbon undergoes 
oxidation (loses electrons), the number gets smaller. Thus the oxidation 
state increases from top to bottom of the list. 
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As hydrogen atoms. Recall that a hydrogen atom 
consists of a proton (H + ) and a single electron 

In this case we can write the general equation 



AH 2 



A + 2e~ 



2H + 



where AH 2 is the hydrogen/electron donor. (Do not 
mistake the above reaction for an acid dissociation, 
which involves a proton and no electron.) AH 2 and 
A together constitute a conjugate redox pair 
(A/AH 2 ), which can reduce another compound B 
(or redox pair, B/BH 2 ) by transfer of hydrogen 
atoms: 



AHo + B 



A + BH 2 



4. 



As a hydride ion (:H~), which has two electrons. 
This occurs in the case of NAD-linked dehydroge- 
nases, described below. 

Through direct combination with oxygen. In this 
case, oxygen combines with an organic reductant 
and is covalently incorporated in the product, as in 
the oxidation of a hydrocarbon to an alcohol: 



R — CH 2 — OH 



The hydrocarbon is the electron donor and the oxy- 
gen atom is the electron acceptor. 

All four types of electron transfer occur in cells. The 
neutral term reducing equivalent is commonly used 
to designate a single electron equivalent participating 
in an oxidation-reduction reaction, no matter whether 
this equivalent is an electron per se or part of a hydro- 
gen atom or a hydride ion, or whether the electron 
transfer takes place in a reaction with oxygen to yield 
an oxygenated product. Because biological fuel mole- 
cules are usually enzymatically dehydrogenated to lose 
two reducing equivalents at a time, and because each 
oxygen atom can accept two reducing equivalents, bio- 
chemists by convention regard the unit of biological ox- 
idations as two reducing equivalents passing from 
substrate to oxygen. 
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FIGURE 13-23 Measurement of the standard reduction potential (F°) 
of a redox pair. Electrons flow from the test electrode to the reference 
electrode, or vice versa. The ultimate reference half-cell is the hydro- 
gen electrode, as shown here, at pH 0. The electromotive force (emf) of 
this electrode is designated 0.00 V. At pH 7 in the test cell (and 25 °C), 
f'° for the hydrogen electrode is -0.414 V. The direction of electron 
flow depends on the relative electron "pressure" or potential of the two 
cells. A salt bridge containing a saturated KCI solution provides a path 
for counter-ion movement between the test cell and the reference cell. 
From the observed emf and the known emf of the reference cell, the 
experimenter can find the emf of the test cell containing the redox pair. 
The cell that gains electrons has, by convention, the more positive re- 
duction potential. 



Reduction Potentials Measure Affinity for Electrons 

When two conjugate redox pairs are together in solu- 
tion, electron transfer from the electron donor of one 
pair to the electron acceptor of the other may proceed 
spontaneously. The tendency for such a reaction de- 
pends on the relative affinity of the electron acceptor of 
each redox pair for electrons. The standard reduction 
potential, E°, a measure (in volts) of this affinity, can 
be determined in an experiment such as that described 
in Figure 13-23. Electrochemists have chosen as a 
standard of reference the half-reaction 

H + + e~ > \n 2 

The electrode at which this half-reaction occurs 
(called a half-cell) is arbitrarily assigned an E° of 0.00 V. 



When this hydrogen electrode is connected through an 
external circuit to another half-cell in which an oxi- 
dized species and its corresponding reduced species 
are present at standard concentrations (25 °C, each 
solute at 1 m, each gas at 101.3 kPa), electrons tend to 
flow through the external circuit from the half-cell of 
lower E° to the half-cell of higher E°. By convention, a 
half-cell that takes electrons from the standard hydro- 
gen cell is assigned a positive value of E°, and one that 
donates electrons to the hydrogen cell, a negative 
value. When any two half-cells are connected, that with 
the larger (more positive) E° will get reduced; it has the 
greater reduction potential. 

The reduction potential of a half-cell depends not 
only on the chemical species present but also on their 
activities, approximated by their concentrations. About 
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a century ago, Walther Nernst derived an equation that 
relates standard reduction potential (E°') to the actual 
reduction potential (E) at any concentration of oxidized 
and reduced species in a living cell: 



RT [electron acceptor] 

E = E° + — -In 

nJ [electron donor] 



(13-5) 



where R and T have their usual meanings, n is the num- 
ber of electrons transferred per molecule, and J is the 
Faraday constant (Table 13-1). At 298 K (25 °C), this 
expression reduces to 



0.026V [electron acceptor] 

E° + In —— — 

n [electron donor] 



(13-6) 



KEY CONVENTION: Many half-reactions of interest to bio- 
chemists involve protons. As in the definition of AG'°, 
biochemists define the standard state for oxidation- 
reduction reactions as pH 7 and express a standard trans- 
formed reduction potential, E'°, the standard reduction 
potential at pH 7 and 25 °C. By convention, A£"° for any 
redox reaction is given as E'° of the electron acceptor 
minus E'° of the electron donor. ■ 

The standard reduction potentials given in Table 13-7 
and used throughout this book are values for£"° and are 
therefore valid only for systems at neutral pH. Each 
value represents the potential difference when the 
conjugate redox pair, at 1 m concentrations, 25 °C, and 
pH 7, is connected with the standard (pH 0) hydrogen 
electrode. Notice in Table 13-7 that when the conjugate 
pair 2H + /H 2 at pH 7 is connected with the standard hy- 
drogen electrode (pH 0), electrons tend to flow from the 
pH 7 cell to the standard (pH 0) cell; the measured E r0 
for the 2H + /H 2 pair is -0.414 V. 

Standard Reduction Potentials Can Be Used 
to Calculate Free-Energy Change 

Why are reduction potentials so useful to the bio- 
chemist? When E values have been determined for any 
two half-cells, relative to the standard hydrogen elec- 
trode, we also know their reduction potentials relative 
to each other. We can then predict the direction in 
which electrons will tend to flow when the two half-cells 
are connected through an external circuit or when 
components of both half-cells are present in the same 
solution. Electrons tend to flow to the half-cell with the 
more positive E, and the strength of that tendency is 
proportional to AE, the difference in reduction poten- 
tial. The energy made available by this spontaneous 
electron flow (the free-energy change, AG, for the oxi- 
dation-reduction reaction) is proportional to AE: 



AG 



1 3AE 



AG'° 



-nJ AE'° (13-7) 



TABLE 13-7 


Standard Reduction Potentials of Some 
Biologically Important Half-Reactions 


Half-reaction 


E'° (V) 



|0 2 + 2H + + 2e~ 



>H 2 0 



^Fe^ 



Fe 3+ + e~ 

NO3 + 2H + + 2e~ > NO2 + H 2 0 

Cytochrome / (Fe 3+ ) + e~ > 

cytochrome/ (Fe 2+ ) 
Fe (GN)l~ (ferricyanide) + e~ — » Fe(CN)|~ 

Cytochrome a s (Fe 3+ ) + e ~ > 

cytochrome a 3 (Fe 2+ ) 

0 2 + 2H + + 2e~ > H 2 0 2 

Cytochrome a (Fe 3+ ) + e~ > 

cytochrome a (Fe 2+ ) 

Cytochrome c (Fe 3+ ) + e > 

cytochrome c (Fe 2+ ) 

Cytochrome c x (Fe 3+ ) + e > 

cytochrome c x (Fe 2+ ) 

Cytochrome b (Fe 3+ ) + e > 

cytochrome b (Fe 2+ ) 

Ubiquinone + 2H + + 2e~ > ubiquinol + H 2 

Fumarate 2 " + 2H + + 2e~ > succinate 2 " 

2H + + 2e~ > H 2 (at standard conditions, pH 0) 

Crotonyl-CoA + 2H + + 2e~ > butyryl-CoA 

Oxaloacetate 2 " + 2H + + 2e~ > malate 2 " 

Pyruvate™ + 2H + + 2e~ > lactate" 

Acetaldehyde + 2H + + 2e~ > ethanol 

FAD + 2H + + 2e~ > FADH 2 

Glutathione + 2H + + 2e > 

2 reduced glutathione 
S + 2H + + 2e~ »H 2 S 

Lipoic acid + 2H + + 2e~ > dihydrolipoic acid 

NAD + + H + + 2e~ > NADH 

NADP + + H + + 2e~ > NADPH 

Acetoacetate + 2H + + 2e~ > 

j3-hydroxybutyrate 
a-Ketoglutarate + C0 2 + 2H + + 2e~ > 

isocitrate 

2H + + 2e~ > H 2 (at pH 7) 

Ferredoxin (Fe 3+ ) + e~ »ferredoxin (Fe 2+ ) 



0.816 
0.771 
0.421 

0.365 
0.36 

0.35 
0.295 

0.29 

0,254 

0.22 

0.077 
0.045 
0.031 
0.000 
-0.015 
-0.166 
-0.185 
-0.197 
-0.219* 

-0.23 

-0.243 

-0.29 

-0.320 

-0.324 

-0.346 

-0.38 

-0.414 

-0.432 



where n is the number of electrons transferred in the 
reaction. With this equation we can calculate the actual 



Source: Data mostly from Loach, R.A. (1976) In Handbook of Biochemistry and Molecular 
Biology, 3rd edn (Fasman, G.D., ed.), Physical and Chemical Data, Vol. 1, pp. 122-130, x 
CRC Press, Boca Raton, FL 

* This is the value for free FAD; FAD bound to a specific flavoprotein (e.g., succinate dehy- 
drogenase) has a different E'° that depends on its protein environment. 



free-energy change for any oxidation-reduction reaction 
from the values of E'° in a table of reduction potentials 
(Table 13-7) and the concentrations of reacting species. 



[_51 6j Bioenergetics and Biochemical Reaction Types 



■ WORKED EXAMPLE 13-3 Calculation of AG'°and 

AG of a Redox Reaction 

Calculate the standard free-energy change, AG'°, for the 
reaction in which acetaldehyde is reduced by the biolog- 
ical electron carrier NADH: 



Acetaldehyde + NADH + H + 



ethanol + NAD + 



Then calculate the actual free-energy change, AG, 
when [acetaldehyde] and [NADH] are 1.00 m, and 
[ethanol] and [NAD + ] are 0.100 m. The relevant half-re- 
actions and their E'° values are: 

(1) Acetaldehyde + 2H + + 2e~ > ethanol 

E'° = -0.197 V 

(2) NAD+ + 2H + + 2e~ ► NADH + H + 

E'° = -0.320V 

Remember that, by convention, AE'° is E'° of the elec- 
tron acceptor minus E'° of the electron donor. 

Solution: Because acetaldehyde is accepting electrons 
(n = 2) from NADH, A£"° = -0.197 V - (-0.320 V) = 
0.123 V. Therefore, 

AG' 0 = -nJ AE'° = -2(96.5kJ/V-mol)(0.123V) 
= -23.7kJ/mol 

This is the free-energy change for the oxidation-reduction 
reaction at 25 °C and pH 7, when acetaldehyde, ethanol, 
NAD + , and NADH are all present at 1.00 m concentrations. 

To calculate AG when [acetaldehyde] and [NADH] 
are 1.00 m, and [ethanol] and [NAD + ] are 0.100 m, we first 
determine E for each reductant, using Equation 13-5: 



acetaldehyde 



RT [acetaldehyde] 

E° H — In 

nJ [ethanol] 

-0.197 V + M|6V ln .l-00 



2 0.100 
-0.197 V + 0.013 V (2.303) = 

fio + ^ ln [NAD^ 
nJ [NADH] 

-0.320 V + ^ln 0100 



-0.167 V 



2 *" 1.00 
= -0.320 V + 0.013 V(-2.303) = -0.350 V 

From this we can calculate AE, then use Equation 13-6 
to calculate AG: 

A£ = -0.167 V - (-0.350) V = 0.183 V 
AG = -nJ AE 

= -2(96.5 kJ/V • mol)(0.183 V) 

= -35.3kJ/mol 

This is the actual free-energy change at the specified 
concentrations of the redox pairs. 



Cellular Oxidation of Glucose to Carbon Dioxide 
Requires Specialized Electron Carriers 

The principles of oxidation-reduction energetics de- 
scribed above apply to the many metabolic reactions that 



involve electron transfers. For example, in many organ- 
isms, the oxidation of glucose supplies energy for the 
production of ATP. The complete oxidation of glucose: 



60 2 



6C0 2 + 6H 2 0 



has a AG'° of -2,840 kJ/mol. This is a much larger re- 
lease of free energy than is required for ATP synthesis in 
cells (50 to 60 kJ/mol; see Worked Example 13-2). Cells 
convert glucose to C0 2 not in a single, high-energy- 
releasing reaction but rather in a series of controlled re- 
actions, some of which are oxidations. The free energy 
released in these oxidation steps is of the same order of 
magnitude as that required for ATP synthesis from ADP, 
with some energy to spare. Electrons removed in these 
oxidation steps are transferred to coenzymes special- 
ized for carrying electrons, such as NAD + and FAD 
(described below). 

A Few Types of Coenzymes and Proteins Serve 
as Universal Electron Carriers 

The multitude of enzymes that catalyze cellular oxida- 
tions channel electrons from their hundreds of different 
substrates into just a few types of universal electron car- 
riers. The reduction of these carriers in catabolic 
processes results in the conservation of free energy re- 
leased by substrate oxidation. NAD, NADP, FMN, and 
FAD are water-soluble coenzymes that undergo re- 
versible oxidation and reduction in many of the elec- 
tron-transfer reactions of metabolism. The nucleotides 
NAD and NADP move readily from one enzyme to an- 
other; the flavin nucleotides FMN and FAD are usually 
very tightly bound to the enzymes, called flavoproteins, 
for which they serve as prosthetic groups. Lipid-soluble 
quinones such as ubiquinone and plastoquinone act as 
electron carriers and proton donors in the nonaqueous 
environment of membranes. Iron-sulfur proteins and cy- 
tochromes, which have tightly bound prosthetic groups 
that undergo reversible oxidation and reduction, also 
serve as electron carriers in many oxidation-reduction 
reactions. Some of these proteins are water-soluble, but 
others are peripheral or integral membrane proteins 
(see Fig. 11-6). 

We conclude this chapter by describing some chem- 
ical features of nucleotide coenzymes and some of the 
enzymes (dehydrogenases and flavoproteins) that use 
them. The oxidation-reduction chemistry of quinones, 
iron-sulfur proteins, and cytochromes is discussed in 
Chapter 19. 

NADH and NADPH Act with Dehydrogenases 
as Soluble Electron Carriers 

Nicotinamide adenine dinucleotide (NAD; NAD + in its 
oxidized form) and its close analog nicotinamide ade- 
nine dinucleotide phosphate (NADP; NADP + when oxi- 
dized) are composed of two nucleotides joined through 
their phosphate groups by a phosphoanhydride bond 
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FIGURE 13-24 NAD and NADP. (a) Nicotinamide adenine dinu- 
cleotide, NAD + , and its phosphorylated analog NADP + undergo re- 
duction to NADH and NADPH, accepting a hydride ion (two electrons 
and one proton) from an oxidizable substrate. The hydride ion is added 
to either the front {the A side) or the back (the B side) of the planar 
nicotinamide ring (see Table 1 3—8) (b) The UV absorption spectra of 
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NAD + and NADH. Reduction of the nicotinamide ring produces a 
new, broad absorption band with a maximum at 340 nm. The produc- 
tion of NADH during an enzyme-catalyzed reaction can be conve- 
niently followed by observing the appearance of the absorbance at 
340 nm (molar extinction coefficient e 340 = 6,200 M~ 1 cm~ 1 )* 



(Fi£. 13-24a). Because the nicotinamide ring resem- 
bles pyridine, these compounds are sometimes called 
pyridine nucleotides. The vitamin niacin is the source 
of the nicotinamide moiety in nicotinamide nucleotides. 

Both coenzymes undergo reversible reduction of 
the nicotinamide ring (Fig. 13-24). As a substrate mole- 
cule undergoes oxidation (dehydrogenation) , giving up 
two hydrogen atoms, the oxidized form of the nu- 
cleotide (NAD + or NADP + ) accepts a hydride ion (:H~, 
the equivalent of a proton and two electrons) and is re- 
duced (to NADH or NADPH). The second proton re- 
moved from the substrate is released to the aqueous 
solvent. The half-reactions for these nucleotide cofac- 
tors are 

NAD + + 2e~ + 2H + > NADH + H + 

NADP + 4- 2e~ + 2H + > NADPH + H + 

Reduction of NAD + or NADP + converts the benzenoid 
ring of the nicotinamide moiety (with a fixed positive 
charge on the ring nitrogen) to the quinonoid form 
(with no charge on the nitrogen). The reduced nu- 
cleotides absorb light at 340 nm; the oxidized forms do 
not (Fig. 13-24b); this difference in absorption is used 
by biochemists to assay reactions involving these coen- 
zymes. Note that the plus sign in the abbreviations 
NAD + and NADP + does not indicate the net charge on 
these molecules (in fact, both are negatively charged); 
rather, it indicates that the nicotinamide ring is in its 



oxidized form, with a positive charge on the nitrogen 
atom. In the abbreviations NADH and NADPH, the "H" 
denotes the added hydride ion. To refer to these nu- 
cleotides without specifying their oxidation state, we 
use NAD and NADP 

The total concentration of NAD + + NADH in most 
tissues is about 10" 5 M; that of NADP + 4- NADPH is 
about 1CT 6 m. In many cells and tissues, the ratio of 
NAD + (oxidized) to NADH (reduced) is high, favoring 
hydride transfer from a substrate to NAD + to form 
NADH. By contrast, NADPH is generally present at a 
higher concentration than NADP + , favoring hydride 
transfer from NADPH to a substrate. This reflects the 
specialized metabolic roles of the two coenzymes: NAD + 
generally functions in oxidations — usually as part of a 
catabolic reaction; NADPH is the usual coenzyme in 
reductions — nearly always as part of an anabolic reac- 
tion. A few enzymes can use either coenzyme, but most 
show a strong preference for one over the other. Also, 
the processes in which these two cofactors function are 
segregated in eukaryotic cells: for example, oxidations 
of fuels such as pyruvate, fatty acids, and a-keto acids 
derived from amino acids occur in the mitochondrial 
matrix, whereas reductive biosynthetic processes such 
as fatty acid synthesis take place in the cytosol. This 
functional and spatial specialization allows a cell to 
maintain two distinct pools of electron carriers, with two 
distinct functions. 
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More than 200 enzymes are known to catalyze reac- 
tions in which NAD 4 " (or NADP + ) accepts a hydride ion 
from a reduced substrate, or NADPH (or NADH) do- 
nates a hydride ion to an oxidized substrate. The general 
reactions are 

AH 2 + NAD + ► A + NADH + H + 

A 4- NADPH + H + > AH 2 + NADP + 

where AH 2 is the reduced substrate and A the oxidized 
substrate. The general name for an enzyme of this type 
is oxidoreductase; they are also commonly called de- 
hydrogenases. For example, alcohol dehydrogenase cat- 
alyzes the first step in the catabolism of ethanol, in 
which ethanol is oxidized to acetaldehyde: 

CH 3 CH 2 OH + NAD + ► CH 3 CHO + NADH + H + 

Ethanol Acetaldehyde 

Notice that one of the carbon atoms in ethanol has lost a 
hydrogen; the compound has been oxidized from an al- 
cohol to an aldehyde (refer again to Fig. 13-22 for the 
oxidation states of carbon). 

When NAD + or NADP + is reduced, the hydride ion 
could in principle be transferred to either side of the 
nicotinamide ring: the front (A side) or the back (B 
side), as represented in Figure 13-24a. Studies with iso- 
topically labeled substrates have shown that a given en- 
zyme catalyzes either an A-type or a B-type transfer, but 
not both. For example, yeast alcohol dehydrogenase and 
lactate dehydrogenase of vertebrate heart transfer a hy- 
dride ion to (or remove a hydride ion from) the A side of 
the nicotinamide ring; they are classed as type A dehy- 
drogenases to distinguish them from another group of 
enzymes that transfer a hydride ion to (or remove a hy- 
dride ion from) the B side of the nicotinamide ring 
(Table 13-8). The specificity for one side or another can 
be very striking; lactate dehydrogenase, for example, 
prefers the A side over the B side by a factor of 5 X 10 7 ! 
The basis for this preference lies in the exact positioning 
of the enzyme groups involved in hydrogen bonding 
with the — CONH 2 group of the nicotinamide. 



■ fiinffBl Stereospecificity of Dehydrogenases That Employ NAD 


+ or NADP + as(oen; 


symes 


Enzyme 


Coenzyme 


Stereochemical 
specificity for 
nicotinamide 
ring (A or B) 


Text page(s) 


Isocitrate dehydrogenase 


NAD + 


A 


624 


a-Ketoglutarate dehydrogenase 


NAD + 


B 


625 


Glucose 6-phosphate dehydrogenase 


NADP + 


B 


560 


Malate dehydrogenase 


NAD + 


A 


628 


Glutamate dehydrogenase 


NAD + or NADP + 


B 


680 


Glyceraldehyde 3-phosphate dehydrogenase 


NAD + 


B 


535 


Lactate dehydrogenase 


NAD + 


A 


547 


Alcohol dehydrogenase 


NAD + 


A 


547 




FIGURE 13-25 The Rossman fold. This structural motif is found in the 
NAD-binding site of many dehydrogenases, (a) It consists of a pair of 
structurally similar motifs, each having three parallel p sheets and two 
a helices {p-a-p-a-p), (b) The nucleotide-binding domain of the en- 
zyme lactate dehydrogenase (derived from PDB ID 3LDH) with NAD 
(ball-and-stick structure) bound in an extended conformation through 
hydrogen bonds and salt bridges to the paired p-a-p-a-p motifs of the 
Rossman fold (shades of green). 

Most dehydrogenases that use NAD or NADP bind 
the cofactor in a conserved protein domain called the 
Rossmann fold (named for Michael Rossmann, who de- 
duced the structure of lactate dehydrogenase and first 
described this structural motif). The Rossmann fold typ- 
ically consists of a six-stranded parallel f3 sheet and four 
associated a helices (Fig. 13-25). 

The association between a dehydrogenase and NAD 
or NADP is relatively loose; the coenzyme readily diffuses 
from one enzyme to another, acting as a water-soluble 
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carrier of electrons from one metabolite to another. For 
example, in the production of alcohol during fermenta- 
tion of glucose by yeast cells, a hydride ion is removed 
from glyceraldehyde 3-phosphate by one enzyme (glycer- 
aldehyde 3-phosphate dehydrogenase, a type B enzyme) 
and transferred to NAD + . The NADH produced then 
leaves the enzyme surface and diffuses to another en- 
zyme (alcohol dehydrogenase, a type A enzyme), which 
transfers a hydride ion to acetaldehyde, producing 
ethanol: 

(1) Glyceraldehyde 3-phosphate + NAD + > 

3-phosphoglycerate 4- NADH + H + 

(2) Acetaldehyde + NADH + H + > ethanol + NAD 

Sum: Glyceraldehyde 3-phosphate + acetaldehyde > 

3-phosphoglycerate 4- ethanol 

Notice that in the overall reaction there is no net produc- 
tion or consumption of NAD + or NADH; the coenzymes 
function catalytically and are recycled repeatedly with- 
out a net change in the concentration of NAD + + NADH. 

Dietary Deficiency of Niacin, the Vitamin Form of 
NAD and NADP, Causes Pellagra 

As we noted in Chapter 6, and will discuss further 
in the chapters to follow, most coenzymes are de- 
rived from the substances we call vitamins. The pyridine- 
like rings of NAD and NADP are derived from the vitamin 
niacin (nicotinic acid; Fig. 13-26), which is synthesized 
from tryptophan. Humans generally cannot synthesize 
sufficient quantities of niacin, and this is especially so for 
individuals with diets low in tryptophan (maize, for exam- 
ple, has a low tryptophan content). Niacin deficiency, 
which affects all the NAD (P) -dependent dehydroge- 
nases, causes the serious human disease pellagra (Italian 
for "rough skin") and a related disease in dogs, black- 
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FIGURE 13-26 Niacin (nicotinic acid) and its derivative nicotinamide. 

The biosynthetic precursor of these compounds is tryptophan. In the 
laboratory, nicotinic acid was first produced by oxidation of the natu- 
ral product nicotine — thus the name. Both nicotinic acid and nicoti- 
namide cure pellagra, but nicotine (from cigarettes or elsewhere) has 
no curative activity. 



tongue. These diseases are 
characterized by the "three 
Ds": dermatitis, diarrhea, and 
dementia, followed in many 
cases by death. A century ago, 
pellagra was a common human 
disease; in the southern United 
States, where maize was a di- 
etary staple, about 100,000 
people were afflicted and 
about 10,000 died as a result of 
this disease between 1912 and 
1916. In 1920 Joseph Gold- 
berger showed pellagra to be 
caused by a dietary insuffi- 
ciency, and in 1937 Frank 
Strong, D. Wayne Woolley, and 
Conrad Elvehjem identified 
niacin as the curative agent for 
blacktongue. Supplementation 
of the human diet with this 
inexpensive compound has 
eradicated pellagra in the pop- 
ulations of the developed 
world, with one significant ex- 
ception: people with alco- 
holism, or who drink excessive 
amounts of alcohol. In these in- 
dividuals, intestinal absorption 
of niacin is much reduced, and 
caloric needs are often met 
with distilled spirits that are 
virtually devoid of vitamins, in- 
cluding niacin. In some parts of 
the world, including the Dec- 
can Plateau in India, pellagra 
still occurs in the general pop- 
ulation, especially among peo- 
ple living in poverty. ■ 
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Flavin Nucleotides Are Tightly Bound in Flavoproteins 

Flavoproteins (Table 13-9) are enzymes that catalyze 
oxidation-reduction reactions using either flavin mononu- 
cleotide (FMN) or flavin adenine dinucleotide (FAD) as 
coenzyme (Fig. 13-27). These coenzymes, the flavin 
nucleotides, are derived from the vitamin riboflavin. 
The fused ring structure of flavin nucleotides (the isoal- 
loxazine ring) undergoes reversible reduction, accept- 
ing either one or two electrons in the form of one or two 
hydrogen atoms (each atom an electron plus a proton) 
from a reduced substrate. The fully reduced forms are 
abbreviated FADH 2 and FMNH 2 . When a fully oxidized 
flavin nucleotide accepts only one electron (one hydro- 
gen atom), the semiquinone form of the isoalloxazine 
ring is produced, abbreviated FADH* and FMNH*. Be- 
cause flavin nucleotides have a slightly different chemical 
specialty from that of the nicotinamide coenzymes— 
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Some Enzymes (Flavoproteins) That 
Employ Flavin Nucleotide Coenzymes 



Enzyme 


171 oinn 

nucleotide 


iexi 
pagefs) 


Acyl-CoA dehydrogenase 


FAD 


653 


Dihydrolipoyl dehydrogenase 


FAD 


619 


Succinate dehydrogenase 


FAD 


628 


Glycerol 3-phosphate 
dehydrogenase 


FAD 


732 


Thioredoxin reductase 


FAD 


888 


NADH dehydrogenase 
(Complex I) 


FMN 


712-714 


Glycolate oxidase 


FMN 


787 



the ability to participate in either one- or two-electron 
transfers — flavoproteins are involved in a greater diversity 
of reactions than the NAD (P) -linked dehydrogenases. 

Like the nicotinamide coenzymes (Fig. 13-24), the 
flavin nucleotides undergo a shift in a major absorption 
band on reduction (again, useful to biochemists who 
want to monitor reactions involving these coenzymes. 
Flavoproteins that are fully reduced (two electrons ac- 
cepted) generally have an absorption maximum near 360 
nm. When partially reduced (one electron), they acquire 
another absorption maximum at about 450 nm; when 
fully oxidized, the flavin has maxima at 370 and 440 nm. 

The flavin nucleotide in most flavoproteins is bound 
rather tightly to the protein, and in some enzymes, such 



as succinate dehydrogenase, it is bound covalently. Such 
tightly bound coenzymes are properly called prosthetic 
groups. They do not transfer electrons by diffusing from 
one enzyme to another; rather, they provide a means by 
which the flavoprotein can temporarily hold electrons 
while it catalyzes electron transfer from a reduced sub- 
strate to an electron acceptor. One important feature of 
the flavoproteins is the variability in the standard reduc- 
tion potential (£"°) of the bound flavin nucleotide. Tight 
association between the enzyme and prosthetic group 
confers on the flavin ring a reduction potential typical of 
that particular flavoprotein, sometimes quite different 
from the reduction potential of the free flavin nucleotide. 
FAD bound to succinate dehydrogenase, for example, has 
an E f ° close to 0.0 V, compared with -0.219 V for free 
FAD; E'° for other flavoproteins ranges from -0.40 V to 
+0.06 V. Flavoproteins are often very complex; some 
have, in addition to a flavin nucleotide, tightly bound inor- 
ganic ions (iron or molybdenum, for example) capable of 
participating in electron transfers. 

Certain flavoproteins act in a quite different role, as 
light receptors. Cryptochromes are a family of flavo- 
proteins, widely distributed in the eukaryotic phyla, 
that mediate the effects of blue light on plant develop- 
ment and the effects of light on mammalian circadian 
rhythms (oscillations in physiology and biochemistry, 
with a 24-hour period). The cryptochromes are ho- 
mologs of another family of flavoproteins, the pho- 
tolyases. Found in both bacteria and eukaryotes, 
photolyases use the energy of absorbed light to repair 
chemical defects in DNA. 
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FIGURE 13-27 Oxidized and reduced FAD and FMN. FMN consists of 
the structure above the dashed line on the FAD (oxidized form). The 
flavin nucleotides accept two hydrogen atoms (two electrons and two 
protons), both of which appear in the flavin ring system. When FAD or 
FMN accepts only one hydrogen atom, the semiquinone, a stable free 
radical, forms. 
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We examine the function of fiavoproteins as elec- 
tron carriers in Chapter 19, when we consider their roles 
in oxidative phosphorylation (in mitochondria) and pho- 
tophosphorylation (in chloroplasts), and we describe 
the photolyase reactions in Chapter 25. 

SUMMARY 13.4 Biological Oxidation- 
Reduction Reactions 

■ In many organisms, a central energy-conserving 
process is the stepwise oxidation of glucose to 
C0 2 , in which some of the energy of oxidation is 
conserved in ATP as electrons are passed to 0 2 . 

■ Biological oxidation-reduction reactions can be 
described in terms of two half-reactions, each with 
a characteristic standard reduction potential, E'°. 

■ When two electrochemical half-cells, each 
containing the components of a half-reaction, are 
connected, electrons tend to flow to the half-cell 
with the higher reduction potential. The strength 
of this tendency is proportional to the difference 
between the two reduction potentials (A£T) and is 
a function of the concentrations of oxidized and 
reduced species. 

■ The standard free-energy change for an oxidation- 
reduction reaction is directly proportional to the 
difference in standard reduction potentials of the 
two half-cells: AG'° - —n3 A£7'°. 

■ Many biological oxidation reactions are 
dehydrogenations in which one or two hydrogen 
atoms (H + + e~) are transferred from a substrate 
to a hydrogen acceptor. Oxidation-reduction 
reactions in living cells involve specialized 
electron carriers. 

■ NAD and NADP are the freely diffusible coenzymes 
of many dehydrogenases. Both NAD + and NADP + 
accept two electrons and one proton. 

■ FAD and FMN, the flavin nucleotides, serve as 
tightly bound prosthetic groups of fiavoproteins. 
They can accept either one or two electrons 
and one or two protons. Fiavoproteins also serve 
as light receptors in cryptochromes and 
photolyases. 

Key Terms 

Terms in bold are defined in the glossary. 

autotroph 485 anabolism 487 

heterotroph 485 standard transformed 

metabolism 486 constants 491 

metabolic pathways 486 homolytic cleavage 496 

metabolite 486 radical 496 

intermediary heterolytic cleavage 496 

metabolism 486 nucleophile 496 

catabolism 487 electrophile 496 



carbanion 496 
carbocation 496 

aldol condensation 497 
Claisen condensation 497 
kinase 500 

phosphorylation potential 

(AG P ) 502 
thioester 505 
adenylylation 508 
inorganic 

pyrophosphatase 508 
nucleoside diphosphate 

kinase 510 
adenylate kinase 510 
creatine kinase 510 
phosphagens 511 



polyphosphate kinase- 1, 

kinase-2 511 
electromotive force 

(emf) 512 
conjugate redox pair 512 
dehydrogenation 513 
dehydrogenases 513 
reducing equivalent 514 
standard reduction 

potential (E'°) 514 
pyridine nucleotide 517 
oxidoreductase 518 
flavoprotein 519 
flavin nucleotides 519 
cryptochrome 520 
photolyase 520 
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Problems 

1. Entropy Changes during Egg Development Consider a 
system consisting of an egg in an incubator. The white and yolk 
of the egg contain proteins, carbohydrates, and lipids. If fertil- 
ized, the egg is transformed from a single cell to a complex or- 
ganism. Discuss this irreversible process in terms of the 
entropy changes in the system, surroundings, and universe. Be 
sure that you first clearly define the system and surroundings. 

2. Calculation of AG' 0 from an Equilibrium Constant 

Calculate the standard free-energy change for each of the fol- 
lowing metabolically important enzyme-catalyzed reactions, 
using the equilibrium constants given for the reactions at 25 °C 
and pH 7 0 
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(a) Glutamate + oxaloacetate ; 



aspartate 
aminotransferase 



aspartate + a-ketoglutarate !T eq = 6.8 



(b) Dihydroxyacetone phosphate ; 



triose phosphate 
isomerase 



glyceraldehyde 3-phosphate K' eq = 0.0475 

phosphofruc tokinase 



(c) Fructose 6-phosphate + ATP = 



fructose 1,6-bisphosphate + ADP K' e(l = 254 

3. Calculation of the Equilibrium Constant from AG' 0 

Calculate the equilibrium constant K' eq for each of the follow- 
ing reactions at pH 7.0 and 25 °C, using the AG' 0 values in 
Table 13-4. 



(a) Glucose 6-phosphate + H 2 0 



glucose 
6-phosphatase 



glucose + Pi 



(b) Lactose + H 2 0 =± glucose + galactose 

flllUQI'drSG 

(c) Malate ; fumarate + H 2 0 

4. Experimental Determination of !T eq and AG'° If a 

0.1 M solution of glucose 1 -phosphate at 25 °C is incubated 
with a catalytic amount of phosphoglucomutase, the glucose 
1 -phosphate is transformed to glucose 6-phosphate. At equi- 
librium, the concentrations of the reaction components are 



Glucose 1-phosphate 
4.5 x 10" 3 M 



glucose 6-phosphate 
9.6 X 10" 2 M 



Calculate K r eci and AG'° for this reaction. 

5. Experimental Determination of AG' 0 for ATP Hydro- 
lysis A direct measurement of the standard free-energy 
change associated with the hydrolysis of ATP is technically 
demanding because the minute amount of ATP remaining at 
equilibrium is difficult to measure accurately. The value of 
AG' 0 can be calculated indirectly, however, from the equilib- 
rium constants of two other enzymatic reactions having less 
favorable equilibrium constants: 



Glucose 6-phosphate + H 2 0 > glucose + Pi 

ATP + glucose > ADP + glucose 6-phosphate 



= 270 
K' e(l = 890 



Using this information for equilibrium constants determined at 
25 °C, calculate the standard free energy of hydrolysis of ATP. 

6. Difference between AG' 0 and AG Consider the follow- 
ing interconversion, which occurs in glycolysis (Chapter 14): 

Fructose 6-phosphate glucose 6-phosphate 

K' eq = 1.97 

(a) What is AG'° for the reaction (K' eq measured at 25 °C)? 

(b) If the concentration of fructose 6-phosphate is ad- 
justed to 1.5 m and that of glucose 6-phosphate is adjusted to 
0.50 m, what is AG? 

(c) Why are AG'° and AG different? 



7. Free Energy of Hydrolysis of CTP Compare the struc- 
ture of the nucleoside triphosphate CTP with the structure 
of ATP 




OH OH 
Cytidine triphosphate (CTP) 



Nil, 



0 



O 
I 



CO 




G-P-O-P-O-P-O— CH^O. 
II II II 

o o o 

OH OH 

Adenosine triphosphate (ATP) 



Now predict the K' e(l and AG'° for the following reaction: 
ATP + CDP > ADP + CTP 

8. Dependence of AG on pH The free energy released by 
the hydrolysis of ATP under standard conditions is -30.5 kJ/mol. 
If ATP is hydrolyzed under standard conditions except at pH 
5.0, is more or less free energy released? Explain. Use the 
Living Graph to explore this relationship. 0 

9. The AG' 0 for Coupled Reactions Glucose 1-phosphate is 
converted into fructose 6-phosphate in two successive reactions: 

Glucose 1-phosphate > glucose 6-phosphate 

Glucose 6-phosphate ► fructose 6-phosphate 

Using the AG' 0 values in Table 13-4, calculate the equilibrium 
constant, K' ecL , for the sum of the two reactions: 

Glucose 1-phosphate > fructose 6-phosphate 

10. Effect of [ATP]/[ADP] Ratio on Free Energy of 
Hydrolysis of ATP Using Equation 13-4, plot AG against 
In Q (mass-action ratio) at 25 °C for the concentrations of 
ATP, ADP, and Pi in the table below. AG'° for the reaction is 
-30.5 kJ/mol. Use the resulting plot to explain why metabolism 
is regulated to keep the ratio [ATP]/[ADP] high. 

Concentration (him) 



ATP 


5 


3 


1 


0.2 


5 


ADP 


0.2 


2.2 


4.2 


5.0 


25 


Pi 


10 


12.1 


14.1 


14.9 


10 



11. Strategy for Overcoming an Unfavorable Reaction: 
ATP-Dependent Chemical Coupling The phosphorylation 
of glucose to glucose 6-phosphate is the initial step in the 
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catabolism of glucose. The direct phosphorylation of glucose 
by Pi is described by the equation 

Glucose + Pi > glucose 6-phosphate + H 2 0 

AG'° = 13.8 kJ/mol 

(a) Calculate the equilibrium constant for the above reac- 
tion at 37 °C. In the rat hepatocyte the physiological concen- 
trations of glucose and Pi are maintained at approximately 
4.8 mM. What is the equilibrium concentration of glucose 
6-phosphate obtained by the direct phosphorylation of glucose 
by Pi? Does this reaction represent a reasonable metabolic 
step for the catabolism of glucose? Explain. 

(b) In principle, at least, one way to increase the concentra- 
tion of glucose 6-phosphate is to drive the equilibrium reaction 
to the right by increasing the intracellular concentrations of glu- 
cose and Pi. Assuming a fixed concentration of P i at 4.8 mM, 
how high would the intracellular concentration of glucose have 
to be to give an equilibrium concentration of glucose 6-phos- 
phate of 250 jj,M (the normal physiological concentration)? 
Would this route be physiologically reasonable, given that the 
maximum solubility of glucose is less than 1 m? 

(c) The phosphorylation of glucose in the cell is coupled 
to the hydrolysis of ATP; that is, part of the free energy of ATP 
hydrolysis is used to phosphorylate glucose: 



(1) Glucose + Pi- 

(2) ATP + H 2 Q — 



-» glucose 6-phosphate + H 2 0 

AG'°= 13.8kJ/mol 
►ADP + ^ AG'°= -30.5 kJ/mol 



Sum: Glucose + ATP > glucose 6-phosphate + ADP 

Calculate if' eq at 37 °C for the overall reaction. For the ATP- 
dependent phosphorylation of glucose, what concentration of 
glucose is needed to achieve a 250 fjiM intracellular concentra- 
tion of glucose 6-phosphate when the concentrations of ATP 
and ADP are 3.38 mM and 1.32 mM, respectively? Does this 
coupling process provide a feasible route, at least in principle, 
for the phosphorylation of glucose in the cell? Explain. 

(d) Although coupling ATP hydrolysis to glucose phos- 
phorylation makes thermodynamic sense, we have not yet 
specified how this coupling is to take place. Given that cou- 
pling requires a common intermediate, one conceivable route 
is to use ATP hydrolysis to raise the intracellular concentra- 
tion of Pi and thus drive the unfavorable phosphorylation of 
glucose by Pi. Is this a reasonable route? (Think about the sol- 
ubility products of metabolic intermediates.) 

(e) The ATP-coupled phosphorylation of glucose is cat- 
alyzed in hepatocytes by the enzyme glucokinase. This en- 
zyme binds ATP and glucose to form a glucose-ATP-enzyme 
complex, and the phosphoryl group is transferred directly 
from ATP to glucose. Explain the advantages of this route. 

12. Calculations of \G ro for ATP-Coupled Reactions 

From data in Table 13-6 calculate the AG'° value for the 
following reactions. 

(a) Phosphocreatine + ADP > creatine + ATP 

(b) ATP + fructose > ADP + fructose 6-phosphate 

13. Coupling ATP Cleavage to an Unfavorable Reaction 

To explore the consequences of coupling ATP hydrolysis 



under physiological conditions to a thermodynamically unfa- 
vorable biochemical reaction, consider the hypothetical trans- 
formation X -» Y, for which AG'° = 20 kJ/mol. 

(a) What is the ratio [Y]/[X] at equilibrium? 

(b) Suppose X and Y participate in a sequence of reactions 
during which ATP is hydrolyzed to ADP and Pi. The overall re- 
action is 



X + ATP + H 2 0 



Y + ADP + P n 



Calculate [Y]/[X] for this reaction at equilibrium. Assume that 
the temperature is 25 °C and the equilibrium concentrations of 
ATP, ADP, and Pj are 1 M. 

(c) We know that [ATP], [ADP], and [Pi] are not 1 m under 
physiological conditions. Calculate [Y]/[X] for the ATP-coupled 
reaction when the values of [ATP], [ADP], and [Pi] are those 
found in rat myocytes (Table 13-5). 

14. Calculations of AG at Physiological Concentrations 

Calculate the actual, physiological AG for the reaction 



Phosphocreatine + ADP 



creatine + ATP 



at 37 °C, as it occurs in the cytosol of neurons, with phospho- 
creatine at 4.7 mM, creatine at 1.0 mM, ADP at 0.73 mM, and 
ATP at 2.6 mM. 

15. Free Energy Required for ATP Synthesis under 
Physiological Conditions In the cytosol of rat hepatocytes, 
the temperature is 37 °C and the mass-action ratio, Q, is 



[ATP] 
[ADP] [Pi] 



5.33 x 10 2 M _1 



Calculate the free energy required to synthesize ATP in a rat 
hepatocyte. 

16. Chemical Logic In the glycolytic pathway, a six-carbon 
sugar (fructose 1,6-bisphosphate) is cleaved to form two 
three-carbon sugars, which undergo further metabolism 
(see Fig. 14-5). In this pathway, an isomerization of glucose 
6-phosphate to fructose 6-phosphate (shown below) occurs 
two steps before the cleavage reaction (the intervening step 
is phosphorylation of fructose 6-phosphate to fructose 1,6- 
bisphosphate (p. 532)). 



v 

I 

H-C— OH 

I 

HO— C— H 

I 

H-C-OH 

I 

H-C-OH 

CH 2 OPOf 
Glucose 6-phosphate 



iiiu>p!ioJu so 



II 

H-C-OH 

I 

C=0 

I 

HO— C-H 

I 

H— C— OH 

I 

H— C— OH 

I 

CH 2 OPO^ 
Fructose 6-phosphate 



What does the isomerization step accomplish from a chemical 
perspective? (Hint: Consider what might happen if the C— C 
bond cleavage were to proceed without the preceding isomer- 
ization.) 
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17. Enzymatic Reaction Mechanisms I Lactate dehydro- 
genase is one of the many enzymes that require NADH as 
coenzyme. It catalyzes the conversion of pyruvate to lactate: 



°v/ r + \/° 

C NADH + H NAD C 

r ^ ^ . i 

c=o : — HO— C-H 



dehydrogenase 



Pyruvate 



i 

CH 3 
L-Lactate 



Draw the mechanism of this reaction (show electron-pushing 
arrows) , (Hint: This is a common reaction throughout metabo- 
lism; the mechanism is similar to that catalyzed by other dehy- 
drogenases that use NADH, such as alcohol dehydrogenase.) 

18. Enzymatic Reaction Mechanisms II Biochemical 
reactions often look more complex than they really are. In 
the pentose phosphate pathway (Chapter 14), sedoheptulose 
7-phosphate and glyceraldehyde 3-phosphate react to form 
erythrose 4-phosphate and fructose 6-phosphate in a reaction 
catalyzed by transaldolase. 





CH 2 OH 




C = 0 


HO- 


C — H 


H- 


C-OH 


H- 


1 

C-OH 


H- 


1 

C-OH 




CH 2 OPO| 



Sedoheptulose 
7-phosphate 



V 

I 

H— C — OH 

CH 2 OPOl" 

Glyceraldehyde 
3-phosphate 



V 

I 

H— C — OH 

I 

H— C— OH 

CH 2 OPOi" 

Erythrose 
4-phosphate 



CH 2 OH 

C = 0 

I 

HO — C — H 

H— C — OH 

I 

H— C — OH 

CH 2 OPO|" 

Fructose 
6-phosphate 



Draw a mechanism for this reaction (show electron-pushing 
arrows) , (Hint: Take another look at aldol condensations, then 
consider the name of this enzyme.) 

19. Daily ATP Utilization by Human Adults 

(a) A total of 30,5 kJ/mol of free energy is needed to syn- 
thesize ATP from ADP and Pj when the reactants and products 
are at 1 m concentrations and the temperature is 25 °C (stan- 
dard state). Because the actual physiological concentrations 
of ATP, ADP, and Pi are not 1 m, and the temperature is 37 °C, 
the free energy required to synthesize ATP under physiologi- 
cal conditions is different from AG'°. Calculate the free energy 
required to synthesize ATP in the human hepatocyte when the 



physiological concentrations of ATP, ADP, and Pi are 3.5, 1.50, 
and 5.0 mM, respectively. 

(b) A 68 kg (150 lb) adult requires a caloric intake of 
2,000 kcal (8,360 kJ) of food per day (24 hours). The food is 
metabolized and the free energy is used to synthesize ATP, 
which then provides energy for the body's daily chemical and 
mechanical work. Assuming that the efficiency of converting 
food energy into ATP is 50% , calculate the weight of ATP used 
by a human adult in 24 hours. What percentage of the body 
weight does this represent? 

(c) Although adults synthesize large amounts of ATP daily, 
their body weight, structure, and composition do not change sig- 
nificantly during this period. Explain this apparent contradiction. 

20. Rates of Turnover of y and p Phosphates of ATP If a 

small amount of ATP labeled with radioactive phosphorus in 
the terminal position, [y- 32 P]ATP, is added to a yeast extract, 
about half of the 32 P activity is found in Pi within a few min- 
utes, but the concentration of ATP remains unchanged. Ex- 
plain. If the same experiment is carried out using ATP labeled 
with 32 P in the central position, [£- 32 P]ATP, the 32 P does not 
appear in P A within such a short time. Why? 

21. Cleavage of ATP to AMP and PP S during Metabolism 

Synthesis of the activated form of acetate (acetyl-CoA) is car- 
ried out in an ATP-dependent process: 



Acetate + CoA + ATP 



acetyl-CoA + AMP + PPi 



(a) The AG'° for hydrolysis of acetyl-CoA to acetate and 
CoA is -32.2 kJ/mol and that for hydrolysis of ATP to AMP and 
PPi is -30.5 kJ/mol. Calculate AG'° for the ATP-dependent 
synthesis of acetyl- CoA, 

(b) Almost all cells contain the enzyme inorganic py- 
rophosphatase, which catalyzes the hydrolysis of PPi to P t . 
What effect does the presence of this enzyme have on the syn- 
thesis of acetyl-CoA? Explain. 

22. Energy for H + Pumping The parietal cells of the stom- 
ach lining contain membrane "pumps" that transport hydro- 
gen ions from the cytosol (pH 7.0) into the stomach, 
contributing to the acidity of gastric juice (pH 1.0). Calculate 
the free energy required to transport 1 mol of hydrogen ions 
through these pumps. (Hint: See Chapter 11.) Assume a 
temperature of 37 °C. 

23. Standard Reduction Potentials The standard reduc- 
tion potential, E'°, of any redox pair is denned for the half-cell 
reaction: 

Oxidizing agent + n electrons ► reducing agent 

The E f ° values for the NAD + /NADH and pyruvate/lactate con- 
jugate redox pairs are -0.32 V and -0.19 V, respectively. 

(a) Which redox pair has the greater tendency to lose 
electrons? Explain. 

(b) Which pair is the stronger oxidizing agent? Explain. 

(c) Beginning with 1 M concentrations of each reactant 
and product at pH 7 and 25 °C, in which direction will the fol- 
lowing reaction proceed? 

Pyruvate + NADH + H + lactate + NAD + 
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(d) What is the standard free-energy change (AG'°) for 
the conversion of pyruvate to lactate? 

(e) What is the equilibrium constant (iT eq ) for this reaction? 

24. Energy Span of the Respiratory Chain Electron trans- 
fer in the mitochondrial respiratory chain may be represented 
by the net reaction equation 

NADH + H + + \0 2 ^ H 2 0 + NAD + 

(a) Calculate isE'° for the net reaction of mitochondrial 
electron transfer. Use E'° values from Table 13-7. 

(b) Calculate AG'° for this reaction. 

(c) How many ATP molecules can theoretically be gener- 
ated by this reaction if the free energy of ATP synthesis under 
cellular conditions is 52 kJ/mol? 

25. Dependence of Electromotive Force on Concentra- 
tions Calculate the electromotive force (in volts) registered 
by an electrode immersed in a solution containing the follow- 
ing mixtures of NAD + and NADH at pH 7.0 and 25 °C, with ref- 
erence to a half-cell of E'° 0.00 V. 

(a) 1 0 mM NAD + and 10 mM NADH 

(b) 1 0 mM NAD + and 1.0 mM NADH 

(c) 10 mM NAD + and 10 mM NADH 

26. Electron Affinity of Compounds List the following in 
order of increasing tendency to accept electrons: a-ketoglu- 
tarate + C0 2 (yielding isocitrate); oxaloacetate; 0 2 ; NADP + . 

27. Direction of Oxidation-Reduction Reactions Which 
of the following reactions would you expect to proceed in the 
direction shown, under standard conditions, in the presence of 
the appropriate enzymes? 

(a) Malate + NAD + > oxaloacetate + NADH + H + 

(b) Acetoacetate + NADH + H + — > 

/3-hydroxybutyrate + NAD + 

(c) Pyruvate + NADH + H + — > lactate + NAD + 

(d) Pyruvate + 0-hydroxybutyrate — > 

lactate + acetoacetate 

(e) Malate + pyruvate — > oxaloacetate + lactate 

(f) Acetaldehyde + succinate > ethanol + fumarate 

Data Analysis Problem 

28. Thermodynamics Can Be Tricky Thermodynamics is a 
challenging area of study and one with many opportunities for 
confusion. An interesting example is found in an article by 
Robinson, Hampson, Munro, and Vaney, published in Science in 
1993. Robinson and colleagues studied the movement of small 
molecules between neighboring cells of the nervous system 
through cell-to-cell channels (gap junctions) . They found that 
the dyes Lucifer yellow (a small, negatively charged molecule) 



and biocytin (a small zwitterionic molecule) moved in only one 
direction between two particular types of glia (nonneuronal 
cells of the nervous system). Dye injected into astrocytes would 
rapidly pass into adjacent astrocytes, oligodendrocytes, or 
Muller cells, but dye injected into oligodendrocytes or Muller 
cells passed slowly if at all into astrocytes. All of these cell types 
are connected by gap junctions. 

Although it was not a central point of their article, the au- 
thors presented a molecular model for how this unidirectional 
transport might occur, as shown in their Figure 3: 

(A) Astrocyte Oligodendrocyte 




(B) Astrocyte Oligodendrocyte 




The figure legend reads: "Model of the unidirectional diffusion 
of dye between coupled oligodendrocytes and astrocytes, 
based on differences in connection pore diameter. Like a fish 
in a fish trap, dye molecules (black circles) can pass from an 
astrocyte to an oligodendrocyte (A) but not back in the other 
direction (B) ." 

Although this article clearly passed review at a well- 
respected journal, several letters to the editor (1994) followed, 
showing that Robinson and coauthors' model violated the sec- 
ond law of thermodynamics. 

(a) Explain how the model violates the second law. Hint: 
Consider what would happen to the entropy of the system if 
one started with equal concentrations of dye in the astrocyte 
and oligodendrocyte connected by the "fish trap" type of gap 
junctions. 

(b) Explain why this model cannot work for small mole- 
cules, although it may allow one to catch fish. 

(c) Explain why a fish trap does work for fish. 

(d) Provide two plausible mechanisms for the unidirec- 
tional transport of dye molecules between the cells that do not 
violate the second law of thermodynamics. 
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The problem of alcoholic fermentation, of the origin and nature of 
that mysterious and apparently spontaneous change, which converted 
the insipid juice of the grape into stimulating wine, seems to have ex- 
erted a fascination over the minds of natural philosophers from the 
very earliest times. 

— Arthur Harden, Alcoholic Fermentation, 1923 
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Glucose occupies a central position in the metabo- 
lism of plants, animals, and many microorganisms. 
It is relatively rich in potential energy, and thus a 
good fuel; the complete oxidation of glucose to carbon 
dioxide and water proceeds with a standard free-energy 
change of -2,840 kJ/mol. By storing glucose as a high 
molecular weight polymer such as starch or glycogen, a 
cell can stockpile large quantities of hexose units while 
maintaining a relatively low cytosolic osmolarity. When 
energy demands increase, glucose can be released from 
these intracellular storage polymers and used to pro- 
duce ATP either aerobically or anaerobically. 

Glucose is not only an excellent fuel, it is also a re- 
markably versatile precursor, capable of supplying a 
huge array of metabolic intermediates for biosynthetic 
reactions. A bacterium such as Escherichia coli can ob- 
tain from glucose the carbon skeletons for every amino 
acid, nucleotide, coenzyme, fatty acid, or other meta- 
bolic intermediate it needs for growth. A comprehensive 
study of the metabolic fates of glucose would encompass 
hundreds or thousands of transformations. In animals 
and vascular plants, glucose has four major fates: it may 
be used in the synthesis of complex polysaccharides des- 
tined for the extracellular space; stored in cells (as a 
polysaccharide or as sucrose) ; oxidized to a three-carbon 
compound (pyruvate) via glycolysis to provide ATP and 



metabolic intermediates; or oxidized via the pentose 
phosphate (phosphogluconate) pathway to yield ribose 
5-phosphate for nucleic acid synthesis and NADPH for 
reductive biosynthetic processes (Fig. 14-1). 

Organisms that do not have access to glucose from 
other sources must make it. Photosynthetic organisms 
make glucose by first reducing atmospheric C0 2 to 
trioses, then converting the trioses to glucose. Nonpho- 
tosynthetic cells make glucose from simpler three- and 
four-carbon precursors by the process of gluconeogene- 
sis, effectively reversing glycolysis in a pathway that; 
uses many of the glycolytic enzymes. 

In this chapter we describe the individual reactions 
of glycolysis, gluconeogenesis, and the pentose phos- 
phate pathway and the functional significance of each 
pathway. We also describe the various metabolic fates of 
the pyruvate produced by glycolysis. They include the 
fermentations that are used by many organisms in 
anaerobic niches to produce ATP and that are exploited 
industrially as sources of ethanol, lactic acid, and other 
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FIGURE 14-1 Major pathways of glucose utilization. Although not the 
only possible fates for glucose, these four pathways are the most signif- 
icant in terms of the amount of glucose that flows through them in most 
cells. 
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commercially useful products. And we look at the path- 
ways that feed various sugars from mono-, di-, and poly- 
saccharides into the glycolytic pathway. The discussion 
of glucose metabolism continues in Chapter 15, where 
we use the processes of carbohydrate synthesis and 
degradation to illustrate the many mechanisms by which 
organisms regulate metabolic pathways. The biosyn- 
thetic pathways from glucose to extracellular matrix 
and cell wall polysaccharides and storage polysaccha- 
rides are discussed in Chapter 20. 



14.1 Glycolysis 



In glycolysis (from the Greek glykys, "sweet" or 
"sugar," and lysis, "splitting"), a molecule of glucose is 
degraded in a series of enzyme-catalyzed reactions to 
yield two molecules of the three-carbon compound 
pyruvate. During the sequential reactions of glycolysis, 
some of the free energy released from glucose is con- 
served in the form of ATP and NADH. Glycolysis was the 
first metabolic pathway to be elucidated and is probably 
the best understood. From Eduard Buchner's discovery 
in 1897 of fermentation in broken extracts of yeast cells 
until the elucidation of the whole pathway in yeast (by 
Otto Warburg and Hans von Euler-Chelpin) and in mus- 
cle (by Gustav Embden and Otto Meyerhof) in the 
1930s, the reactions of glycolysis in extracts of yeast and 
muscle were a major focus of biochemical research. The 
philosophical shift that accompanied these discoveries 
was announced by Jacques Loeb in 1906: 

Through the discovery of Buchner, Biology was 
relieved of another fragment of mysticism. The 
splitting up of sugar into C0 2 and alcohol is no more 
the effect of a "vital principle" than the splitting up 
of cane sugar by invertase. The history of this prob- 
lem is instructive, as it warns us against considering 
problems as beyond our reach because they have 
not yet found their solution. 

The development of methods of enzyme purifica- 
tion, the discovery and recognition of the importance of 
coenzymes such as NAD, and the discovery of the piv- 
otal metabolic role of ATP and other phosphorylated 




Hans von Euler-Chelpin 
1873-1964 



Gustav Embden, 
1874-1933 



compounds all came out of studies of glycolysis. The gly- 
colytic enzymes of many species have long since been 
purified and thoroughly studied. 

Glycolysis is an almost universal central pathway of 
glucose catabolism, the pathway with the largest flux of 
carbon in most cells. The glycolytic breakdown of glu- 
cose is the sole source of metabolic energy in some 
mammalian tissues and cell types (erythrocytes, renal 
medulla, brain, and sperm, for example). Some plant 
tissues that are modified to store starch (such as potato 
tubers) and some aquatic plants (watercress, for 
example) derive most of their energy from glycolysis; 
many anaerobic microorganisms are entirely dependent 
on glycolysis. 

Fermentation is a general term for the anaerobic 
degradation of glucose or other organic nutrients to ob- 
tain energy, conserved as ATP. Because living organisms 
first arose in an atmosphere without oxygen, anaerobic 
breakdown of glucose is probably the most ancient bio- 
logical mechanism for obtaining energy from organic 
fuel molecules. And as genome sequencing of a wide va- 
riety of organisms has revealed, some archaea and some 
parasitic microorganisms lack one or more of the enzymes 
of glycolysis but retain the core of the pathway; they pre- 
sumably carry out variant forms of glycolysis. In the course 
of evolution, the chemistry of this reaction sequence has 
been completely conserved; the glycolytic enzymes of ver- 
tebrates are closely similar, in amino acid sequence and 
three-dimensional structure, to their homologs in yeast 
and spinach. Glycolysis differs among species only in the 
details of its regulation and in the subsequent metabolic 
fate of the pyruvate formed. The thermodynamic princi- 
ples and the types of regulatoiy mechanisms that govern 
glycolysis are common to all pathways of cell metabolism. 
The glycolytic pathway, of central importance in itself, can 
also serve as a model for many aspects of the pathways dis- 
cussed throughout this book. 

Before examining each step of the pathway in some 
detail, we take a look at glycolysis as a whole. 

An Overview: Glycolysis Has Two Phases 

The breakdown of the six-carbon glucose into two mole- 
cules of the three-carbon pyruvate oc- 
curs in 10 steps, the first 5 of which 
constitute the preparatory phase 
(Fig. 14-2a). In these reactions, glu- 
cose is first phosphorylated at the hy- 
droxyl group on C-6 (step ®). The 
D-glucose 6-phosphate thus formed is 
converted to D-fructose 6-phosphate 
*J^^^ ( ste P ®), which is again phosphory- 
lated, this time at C-l, to yield D-fruc- 
Wm tose 1,6-bisphosphate (step (3)). For 

both phosphorylations, ATP is the phos- 
phoryl group donor. As all sugar deriva- 
tives in glycolysis are the d isomers, we 
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FIGURE 14-2 The two phases of glycolysis. For each molecule of glu- 
cose that passes through the preparatory phase (a), two molecules of 
glyceraldehyde 3-phosphate are formed; both pass through the payoff 
phase (b). Pyruvate is the end product of the second phase of glycolysis. 
For each glucose molecule, two ATP are consumed in the preparatory 
phase and four ATP are produced in the payoff phase, giving a net yield 



of two ATP per molecule of glucose converted to pyruvate. The num- 
bered reaction steps are catalyzed by the enzymes listed on the right, 
and also correspond to the numbered headings in the text discussion. 
Keep in mind that each phosphoryl group, represented here as (p), has 
two negative charges ( — PO3I. 
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will usually omit the d designation except when emphasiz- 
ing stereochemistry. 

Fructose 1,6-bisphosphate is split to yield two 
three-carbon molecules, dihydroxyacetone phosphate 
and glyceraldehyde 3-phosphate (step @); this is the "ly- 
sis" step that gives the pathway its name. The dihydroxy- 
acetone phosphate is isomerized to a second molecule of 
glyceraldehyde 3-phosphate (step (5)), ending the first 
phase of glycolysis. From a chemical perspective, the iso- 
merization in step (2) is critical for setting up the phos- 
phorylation and C — C bond cleavage reactions in steps 
® and (43), as detailed later. Note that two molecules of 
ATP are invested before the cleavage of glucose into two 
three-carbon pieces; there will be a good return on this 
investment. To summarize: in the preparatory phase of 
glycolysis the energy of ATP is invested, raising the free- 
energy content of the intermediates, and the carbon 
chains of all the metabolized hexoses are converted to a 
common product, glyceraldehyde 3-phosphate. 

The energy gain comes in the payoff phase of gly- 
colysis (Fig. 14-2b). Each molecule of glyceraldehyde 
3-phosphate is oxidized and phosphorylated by inor- 
ganic phosphate (not by ATP) to form 1,3-bisphospho- 
glycerate (step ©). Energy is then released as the two 
molecules of 1,3-bisphosphoglycerate are converted to 
two molecules of pyruvate (steps (7) through Much 
of this energy is conserved by the coupled phosphoryla- 
tion of four molecules of ADP to ATP. The net yield is 
two molecules of ATP per molecule of glucose used, be- 
cause two molecules of ATP were invested in the 
preparatory phase. Energy is also conserved in the pay- 
off phase in the formation of two molecules of the elec- 
tron carrier NADH per molecule of glucose. 

In the sequential reactions of glycolysis, three types 
of chemical transformations are particularly notewor- 
thy: (1) degradation of the carbon skeleton of glucose to 
yield pyruvate; (2) phosphorylation of ADP to ATP by 
compounds with high phosphoryl group transfer poten- 
tial, formed during glycolysis; and (3) transfer of a hy- 
dride ion to NAD + , forming NADH. 

Fates of Pyruvate With the exception of some inter- 
esting variations in the bacterial realm, the pyruvate 
formed by glycolysis is further metabolized via one of 
three catabolic routes. In aerobicorganisms or tissues, 
under aerobic conditions, glycolysis is only the first stage 
in the complete degradation of glucose (Fig. 14-3). 
Pyruvate is oxidized, with loss of its carboxyl group as 
C0 2) to yield the acetyl group of acetyl-coenzyme A; the 
acetyl group is then oxidized completely to C0 2 by the 
citric acid cycle (Chapter 16). The electrons from these 
oxidations are passed to 0 2 through a chain of carriers 
in mitochondria, to form H 2 0. The energy from the 
electron- transfer reactions drives the synthesis of ATP in 
mitochondria (Chapter 19). 

The second route for pyruvate is its reduction to 
lactate via lactic acid fermentation. When vigorously 
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FIGURE 14-3 Three possible catabolic fates of the pyruvate formed in 
glycolysis. Pyruvate also serves as a precursor in many anabolic reac- 
tions, not shown here. 

contracting skeletal muscle must function under low- 
oxygen conditions (hypoxia), NADH cannot be reoxi- 
dized to NAD + , but NAD + is required as an electron 
acceptor for the further oxidation of pyruvate. Under 
these conditions pyruvate is reduced to lactate, accept- 
ing electrons from NADH and thereby regenerating the 
NAD + necessary for glycolysis to continue. Certain tis- 
sues and cell types (retina and erythrocytes, for exam- 
ple) convert glucose to lactate even under aerobic 
conditions, and lactate is also the product of glycolysis 
under anaerobic conditions in some microorganisms 
(Fig. 14-3). 

The third major route of pyruvate catabolism leads 
to ethanol. In some plant tissues and in certain inverte- 
brates, protists, and microorganisms such as brewer's or 
baker's yeast, pyruvate is converted under hypoxic or 
anaerobic conditions to ethanol and C0 2 , a process 
called ethanol (alcohol) fermentation (Fig. 14-3). 

The oxidation of pyruvate is an important catabolic 
process, but pyruvate has anabolic fates as well. It can, 
for example, provide the carbon skeleton for the synthe- 
sis of the amino acid alanine or for the synthesis of fatty 
acids. We return to these anabolic reactions of pyruvate 
in later chapters. 

ATP and NADH Formation Coupled to Glycolysis 

During glycolysis some of the energy of the glucose mol- 
ecule is conserved in ATP, while much remains in the 
product, pyruvate. The overall equation for glycolysis is 

Glucose + 2NAD+ + 2ADP + 2Pi > 

2 pyruvate + 2NADH + 2H + + 2ATP + 2H 2 0 (14-1) 



14.1 Glycolysis 531 



For each molecule of glucose degraded to pyruvate, two 
molecules of ATP are generated from ADP and P i? and 
two molecules of NADH are produced by the reduction 
of NAD + . The hydrogen acceptor in this reaction is 
NAD 4 " (see Fig. 13-24), bound to a Rossmann fold as 
shown in Figure 13-25. The reduction of NAD + proceeds 
by the enzymatic transfer of a hydride ion (:H~) from the 
aldehyde group of glyceraldehyde 3-phosphate to the 
nicotinamide ring of NAD + , yielding the reduced coen- 
zyme NADH. The other hydrogen atom of the substrate 
molecule is released to the solution as H + . 

We can now resolve the equation of glycolysis into 
two processes — the conversion of glucose to pyruvate, 
which is exergonic: 

Glucose + 2NAD + — > 2 pyruvate + 2NADH + 2H + (14-2) 

AG{° = -146kJ/mol 

and the formation of ATP from ADP and P i5 which is 
endergonic: 

2 ADP + 2Pi — > 2ATP + 2H 2 0 (14-3) 
AG'2 = 2(30.5 kJ/mol) = 61.0 kJ/mol 

The sum of Equations 14-2 and 14-3 gives the overall 
standard free-energy change of glycolysis, AG f s °: 

/±G' S ° = AG{° + AG£° = -146 kJ/mol + 61 kJ/mol 
= -85kJ/mol 

Under standard conditions, and under the (nonstan- 
dard) conditions that prevail in a cell, glycolysis is an es- 
sentially irreversible process, driven to completion by a 
large net decrease in free energy. 

Energy Remaining in Pyruvate Glycolysis releases 
only a small fraction of the total available energy of the 
glucose molecule; the two molecules of pyruvate formed 
by glycolysis still contain most of the chemical potential 
energy of glucose, energy that can be extracted by ox- 
idative reactions in the citric acid cycle (Chapter 16) 
and oxidative phosphorylation (Chapter 19). 

Importance of Phosphorylated Intermediates 

Each of the nine glycolytic intermediates between glu- 
cose and pyruvate is phosphorylated (Fig. 14-2). The 
phosphoryl groups seem to have three functions. 

1 . Because the plasma membrane generally lacks 
transporters for phosphorylated sugars, the phos- 
phorylated glycolytic intermediates cannot leave 
the cell. After the initial phosphorylation, no fur- 
ther energy is necessary to retain phosphorylated 
intermediates in the cell, despite the large differ- 
ence in their intracellular and extracellular 
concentrations. 

2. Phosphoryl groups are essential components in the 
enzymatic conservation of metabolic energy. Energy 
released in the breakage of phosphoanhydride 
bonds (such as those in ATP) is partially conserved 



in the formation of phosphate esters such as glucose 
6-phosphate. High-energy phosphate compounds 
formed in glycolysis (1,3-bisphosphoglycerate and 
phosphoenolpyruvate) donate phosphoryl groups 
to ADP to form ATP. 
3. Binding energy resulting from the binding of phos- 
phate groups to the active sites of enzymes lowers 
the activation energy and increases the specificity 
of the enzymatic reactions (Chapter 6). The phos- 
phate groups of ADP, ATP, and the glycolytic inter- 
mediates form complexes with Mg 2+ , and the 
substrate binding sites of many glycolytic enzymes 
are specific for these Mg 2+ complexes. Most gly- 
colytic enzymes require Mg 2+ for activity. 

The Preparatory Phase of Glycolysis 
Requires ATP 

In the preparatory phase of glycolysis, two molecules of 
ATP are invested and the hexose chain is cleaved into 
two triose phosphates. The realization that phosphory- 
lated hexoses were intermediates in glycolysis came 
slowly and serendipitously. In 1906, Arthur Harden and 
William Young tested their hypothesis that inhibitors 
of proteolytic enzymes would stabilize the glucose- 
fermenting enzymes in yeast extract. They added blood 
serum (known to contain inhibitors of proteolytic en- 
zymes) to yeast extracts and observed the predicted 
stimulation of glucose metabolism. However, in a control 
experiment intended to show that boiling the serum de- 
stroyed the stimulatory activity, they discovered that 
boiled serum was just as effective at stimulating glycoly- 
sis! Careful examination and testing of the contents of 
the boiled serum revealed that inorganic phosphate was 
responsible for the stimulation. Harden and Young soon 
discovered that glucose added to their yeast extract was 
converted to a hexose bisphosphate (the "Harden- 
Young ester," eventually identified as fructose 1,6-bis- 
phosphate). This was the beginning of a long series of 
investigations on the role of organic esters and anhy- 
drides of phosphate in biochemistry, which has led to 
our current understanding of the central role of phos- 
phoryl group transfer in biology. 
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® Phosphorylation of Glucose In the first step of 
glycolysis, glucose is activated for subsequent reactions 
by its phosphorylation at C-6 to yield glucose 6-phos- 
phate, with ATP as the phosphoryl donor: 



CHs-OH 



HO 




ATP ADP 



OH 



H OH 
Glucose 



1 


-OFOf" 


& 


0 


H 


\ H 


OH 


H J* 




X/^OH 


U 


OH 



Glucose 6-phosphate 



AG' 0 



-16.7 kJ/mol 



This reaction, which is irreversible under intracellular 
conditions, is catalyzed by hexokinase. Recall that ki- 
nases are enzymes that catalyze the transfer of the ter- 
minal phosphoryl group from ATP to an acceptor 
nucleophile (see Fig. 13-20). Kinases are a subclass of 
transferases (see Table 6-3). The acceptor in the case of 
hexokinase is a hexose, normally D-glucose, although 
hexokinase also catalyzes the phosphorylation of other 
common hexoses, such as D-fructose and D-mannose, in 
some tissues. 

Hexokinase, like many other kinases, requires 
Mg 2+ for its activity, because the true substrate of the 
enzyme is not ATP 4 ^ but the MgATP 2 " complex (see 
Fig. 13-12). Mg 2+ shields the negative charges of the 
phosphoryl groups in ATP, making the terminal phos- 
phorus atom an easier target for nucleophilic attack by 
an — OH of glucose. Hexokinase undergoes a profound 
change in shape, an induced fit, when it binds glucose; 
two domains of the protein move about 8 A closer to 
each other when ATP binds (see Fig. 6-22). This 
movement brings bound ATP closer to a molecule of 
glucose also bound to the enzyme and blocks the ac- 
cess of water (from the solvent), which might other- 
wise enter the active site and attack (hydrolyze) the 
phosphoanhydride bonds of ATP. Like the other nine 
enzymes of glycolysis, hexokinase is a soluble, cytoso- 
lic protein. 

Hexokinase is present in nearly all organisms. 
The human genome encodes four different hexoki- 
nases (I to IV), all of which catalyze the same reac- 
tion. Two or more enzymes that catalyze the same 
reaction but are encoded by different genes are 
called isozymes (see Box 15-2). One of the isozymes 
present in hepatocytes, hexokinase IV (also called 
glucokinase), differs from other forms of hexokinase 
in kinetic and regulatory properties, with important 
physiological consequences that are described in 
Section 15.3. 

(D Conversion of Glucose 6-Phosphate to Fructose 
6-Phosphate The enzyme phosphohexose isomerase 
(phosphoglucose isomerase) catalyzes the reversible 



isomerization of glucose 6-phosphate, an aldose, to fruc- 
tose 6-phosphate, a ketose: 




Mg 2+ 



phossphohexose 
Ik< unerase 



H OH 
Glucose 6-phosphate 



CH 2 OPOf- 
^^°\CH 2 OH 

h\_~l/oh 

OH H 
Fructose 6-phosphate 

AG' 0 = 1.7 kJ/mol 



The mechanism for this reaction involves an enediol in- 
termediate (Fig. 14-4). The reaction proceeds readily 
in either direction, as might be expected from the rela- 
tively small change in standard free energy This isomer- 
ization has a critical role in the overall chemistry of the 
glycolytic pathway, as the rearrangement of the car- 
bonyl and hydroxyl groups at C-l and C-2 is a necessary 
prelude to the next two steps. The phosphorylation that 
occurs in the next reaction (step ®) requires that the 
group at C-l first be converted from a carbonyl to an al- 
cohol, and in the subsequent reaction (step @) cleav- 
age of the bond between C-3 and C-4 requires a carbonyl 
group at C-2 (p. 497). 

(D Phosphorylation of Fructose 6-Phosphate to 
Fructose 1,6-Bisphosphate In the second of the two 
priming reactions of glycolysis, phosphofructokinase-1 

(PFK-1) catalyzes the transfer of a phosphoryl group 
from ATP to fructose 6-phosphate to yield fructose 1,6- 
bisphosphate: 
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KEY CONVENTION: Compounds that contain two phos- 
phate or phosphoryl groups attached at different po- 
sitions in the molecule are named bisphosphates 
(or bisphospho compounds); for example, fructose 
1,6-bisphosphate and 1,3-bisphosphoglycerate. Com- 
pounds with two phosphates linked together as a py- 
rophosphoryl group are named diphosphates; for 
example, adenosine diphosphate (ADP). Similar 
rules apply for the naming of trisphosphates (such 
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MECHANISM FIGURE 14-4 The phosphohexose isomerase reaction. 

The ring opening and closing reactions (steps ©and @) are catalyzed 
by an active-site His residue, by mechanisms omitted here for simplic- 
ity. The proton (pink) initially at C-2 is made more easily abstractable 
by electron withdrawal by the adjacent carbonyl and nearby hydroxyl 
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group. After its transfer from C-2 to the active-site Glu residue 
(a weak acid), the proton is freely exchanged with the surrounding 
solution; that is, the proton abstracted from C-2 in step @ is not 
necessarily the same one that is added to C-1 in step (3). 
9 Phosphohexose Isomerase Mechanism 



as inositol 1,4,5-trisphosphate; see p. 432) and triphos- 
phates (such as adenosine triphosphate, ATP). ■ 

The enzyme that forms fructose 1,6-bisphosphate 
is called PFK-1 to distinguish it from a second enzyme 
(PFK-2) that catalyzes the formation of fructose 
2,6-bisphosphate from fructose 6-phosphate in a sepa- 
rate pathway (the roles of PFK-2 and fructose 2,6- 
bisphosphate are discussed in Chapter 15). The PFK-1 
reaction is essentially irreversible under cellular con- 
ditions, and it is the first "committed" step in the 
glycolytic pathway; glucose 6-phosphate and fructose 
6-phosphate have other possible fates, but fructose 
1,6-bisphosphate is targeted for glycolysis. 

Some bacteria and protists and perhaps all plants 
have a phosphofructokinase that uses pyrophosphate 
(PPO, not ATP, as the phosphoryl group donor in the 
synthesis of fructose 1,6-bisphosphate: 



Fructose 6-phosphate + PPj 



Mg 2 



fructose 1,6-bisphosphate + Pi 
AG'° - -2.9kJ/mol 

Phosphofructokinase- 1 is subject to complex al- 
losteric regulation; its activity is increased whenever the 
cell's ATP supply is depleted or when the ATP breakdown 
products, ADP and AMP (particularly the latter), accu- 
mulate. The enzyme is inhibited whenever the cell has 
ample ATP and is well supplied by other fuels such as 
fatty acids. In some organisms, fructose 2,6-bisphosphate 



(not to be confused with the PFK-1 reaction product, 
fructose 1,6-bisphosphate) is a potent allosteric activator 
of PFK-1. Ribulose 5-phosphate, an intermediate in the 
pentose phosphate pathway discussed later in this chap- 
ter, also activates phosphofructokinase indirectly. The 
multiple layers of regulation of this step in glycolysis are 
discussed in greater detail in Chapter 15. 

(J) Cleavage of Fructose 1,6-Bisphosphate The en- 
zyme fructose 1,6-bisphosphate aldolase, often called 
simply aldolase, catalyzes a reversible aldol condensa- 
tion (see Fig. 13-4). Fructose 1,6-bisphosphate is cleaved 
to yield two different triose phosphates, glyceraldehyde 
3-phosphate, an aldose, and dihydroxyacetone phos- 
phate, a ketose: 



gH 2 opor CH 2 OPO|- 




Fructose 1,6-bisphosphate 



O H 

i,cH 2 opor hmb 

.3>CH 2 OH .gCH,OPO|- 
Dihydroxyacetone Glyceraldehyde 
phosphate 3-phosphate 

AG'°= 23.8 kJ/mol 



0 



~~1 
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There are two classes of aldolases. Class I aldolases, 
found in animals and plants, use the mechanism shown 
in Figure 14-5. Class II enzymes, in fungi and bacteria, 
do not form the Schiff base intermediate. Instead, a zinc 
ion at the active site is coordinated with the carbonyl 
oxygen at C-2; the Zn 2+ polarizes the carbonyl group 
and stabilizes the enolate intermediate created in the 
C — C bond cleavage step. 

Although the aldolase reaction has a strongly posi- 
tive standard free-energy change in the direction of 
fructose 1,6-bisphosphate cleavage, at the lower con- 
centrations of reactants present in cells the actual free- 
energy change is small and the aldolase reaction is 
readily reversible. We shall see later that aldolase acts in 
the reverse direction during the process of gluconeoge- 
nesis (see Fig. 14-16). 



(5)Interconversion of the Triose Phosphates Only 
one of the two triose phosphates formed by aldolase, 
glyceraldehyde 3-phosphate, can be directly degraded 
in the subsequent steps of glycolysis. The other prod- 
uct, dihydroxyacetone phosphate, is rapidly and re- 
versibly converted to glyceraldehyde 3-phosphate by 
the fifth enzyme of the glycolytic sequence, triose 
phosphate isomer ase: 
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AG'°= 7.5 kJ/mol 
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MECHANISM FIGURE 14-5 The class I aldolase reaction. The reaction 
shown here is the reverse of an aldol condensation. Note that cleavage 
between C-3 and C-4 depends on the presence of the carbonyl group 



Covalent enzyme- 
enamine intermediate 



at C-2. A and B represent amino acid residues that serve as general 
acid (A) or base (B). 
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glucose 
carbon 
1 

% 

3 




cHg) 



A 



CH 2 — O— (?) H-C-0 
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CH 2 OH 

Dihydroxyacetone 
phosphate 
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H-C-OH 
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triose; phosphate isumeiase 

(a) 

FIGURE 14-6 Fate of the glucose carbons in the formation of glycer- 
aldehyde 3-phosphate. (a) The origin of the carbons in the two three-car- 
bon products of the aldolase and triose phosphate isomerase reactions. 
The end product of the two reactions is glyceraldehyde 3-phosphate 
(two molecules), (b) Each carbon of glyceraldehyde 3-phosphate is 
derived from either of two specific carbons of glucose. Note that the 

The reaction mechanism is similar to the reaction pro- 
moted by phosphohexose isomerase in step (2) of glyco- 
lysis (Fig. 14-4). After the triose phosphate isomerase 
reaction, the carbon atoms derived from C-l, C-2, and 
C-3 of the starting glucose are chemically indistinguish- 
able from C-6, C-5, and C-4, respectively (Fig. 14-6); 
the two "halves" of glucose have both yielded glycer- 
aldehyde 3-phosphate. 

This reaction completes the preparatory phase of 
glycolysis. The hexose molecule has been phosphory- 
lated at C-l and C-6 and then cleaved to form two mole- 
cules of glyceraldehyde 3-phosphate. 

The Payoff Phase of Glycolysis Yields ATP and NADH 

The payoff phase of glycolysis (Fig. 14-2b) includes the 
energy-conserving phosphorylation steps in which some 
of the chemical energy of the glucose molecule is con- 
served in the form of ATP and NADH. Remember that 
one molecule of glucose yields two molecules of glycer- 
aldehyde 3-phosphate, and both halves of the glucose 
molecule follow the same pathway in the second phase 
of glycolysis. The conversion of two molecules of glycer- 
aldehyde 3-phosphate to two molecules of pyruvate is 
accompanied by the formation of four molecules of ATP 
from ADR However, the net yield of ATP per molecule of 
glucose degraded is only two, because two ATP were in- 
vested in the preparatory phase of glycolysis to phos- 
phorylate the two ends of the hexose molecule. 
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H— C-OH 
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(b) 

numbering of the carbon atoms of glyceraldehyde 3-phosphate differs 
from that of the glucose from which it is derived. In glyceraldehyde 
3-phosphate, the most complex functional group (the carbonyl) is spec- 
ified as C-1 . This numbering change is important for interpreting experi- 
ments with glucose in which a single carbon is labeled with a 
radioisotope. (See Problems 6 and 9 at the end of this chapter.) 

(6) Oxidation of Glyceraldehyde 3-Phosphate to 
1,3-Bisphosphoglycerate The first step in the payoff 
phase is the oxidation of glyceraldehyde 3-phosphate to 
1,3-bisphosphoglycerate, catalyzed by glyceralde- 
hyde 3-phosphate dehydrogenase: 



O H 

V 

I 

HCOH 

I 2 _ 
CH.OPOa 

Glyceraldehyde 
3-phosphate 




NADH + H + 



g'lyeei aldehyde 

3-phosphate 
dehydrogenase 



Inorganic 
phosphate 




CH 2 OPC>3 
1,3-Bisphosphoglycerate 

AG'° = 6.3 kJ/mol 

This is the first of the two energy-conserving reactions 
of glycolysis that eventually lead to the formation of 
ATP. The aldehyde group of glyceraldehyde 3-phos- 
phate is oxidized, not to a free carboxyl group but to a 
carboxylic acid anhydride with phosphoric acid. This 
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MECHANISM FIGURE 14-7 The glyceraldehyde 3-phosphate dehydrogenase reaction. 



type of anhydride, called an acyl phosphate, has a very 
high standard free energy of hydrolysis (AG'° = -49.3 
kJ/mol; see Fig. 13-14, Table 13-6). Much of the free en- 
ergy of oxidation of the aldehyde group of glyceralde- 
hyde 3-phosphate is conserved by formation of the acyl 
phosphate group at C-l of 1,3-bisphosphoglycerate. 

Glyceraldehyde 3-phosphate is covalently bound to 
the dehydrogenase during the reaction (Fig. 14-7). 
The aldehyde group of glyceraldehyde 3-phosphate re- 
acts with the — SH group of an essential Cys residue in 
the active site, in a reaction analogous to the formation 
of a hemiacetal (see Fig. 7-5), in this case producing a 
^ohemiacetal. Reaction of the essential Cys residue 
with a heavy metal such as Hg 2+ irreversibly inhibits the 
enzyme. 

The amount of NAD + in a cell (<10~ 5 m) is far 
smaller than the amount of glucose metabolized in a 
few minutes. Glycolysis would soon come to a halt if 
the NADH formed in this step of glycolysis were not 
continuously reoxidized and recycled. We return to 
a discussion of this recycling of NAD + later in the 
chapter. 

(7) Phosphoryl Transfer from 1,3-Bisphosphoglyc- 
erate to ADP The enzyme phosphoglycerate ki- 
nase transfers the high-energy phosphoryl group from 



the carboxyl group of 1,3-bisphosphoglycerate to ADP, 
forming ATP and 3-phosphoglycerate: 
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Notice that phosphoglycerate kinase is named for the 
reverse reaction, in which it transfers a phosphoryl 
group from ATP to 3-phosphoglycerate. Like all en- 
zymes, it catalyzes the reaction in both directions. This 
enzyme acts in the direction suggested by its name dur- 
ing gluconeogenesis (see Fig. 14-16) and during photo- 
synthetic C0 2 assimilation (see Fig. 20-4). In glycolysis, 
the reaction it catalyzes proceeds as shown above, in the 
direction of ATP synthesis. 

Steps (6) and (7) of glycolysis together constitute an 
energy-coupling process in which 1,3-bisphosphoglycer- 
ate is the common intermediate; it is formed in the first 
reaction (which would be endergonic in isolation), and 
its acyl phosphate group is transferred to ADP in the 
second reaction (which is strongly exergonic). The sum 
of these two reactions is 

Glyceraldehyde 3-phosphate + ADP + Pi + NAD + 

3-phosphoglycerate + ATP + NADH + H + 
AG' 0 = -12.2kJ/mol 

Thus the overall reaction is exergonic 

Recall from Chapter 13 that the actual free-energy 
change, AG, is determined by the standard free-energy 
change, AG' 0 , and the mass-action ratio, Q, which is the 
ratio [products]/[reactants] (see Eqn 13-4). For step (6) 



AG = AG' 0 + RT\n Q 
= AG' 0 + RT In 



[1,3-bisphosphoglycerate] [NADH] 



[glyceraldehyde 3-phosphate] [Pi] [NAD ] 

Notice that [H + ] is not included in Q. In biochemical cal- 
culations, [H + ] is assumed to be a constant (10~ 7 m), and 
this constant is included in the definition of AG' 0 (p. 491). 

When the mass-action ratio is less than 1.0, its natu- 
ral logarithm has a negative sign. Step (7), by consuming 
the product of step (6) (1,3-bisphosphoglycerate), keeps 
[1,3-bisphosphoglycerate] relatively low in the steady 
state and thereby keeps Q for the overall energy-coupling 
process small. When Q is small, the contribution of In Q 
can make AG strongly negative. This is simply another 
way of showing how the two reactions, steps (6) and (7), 
are coupled through a common intermediate. 

The outcome of these coupled reactions, both re- 
versible under cellular conditions, is that the energy re- 
leased on oxidation of an aldehyde to a carboxylate group is 
conserved by the coupled formation of ATP from ADP and 
Pi. The formation of ATP by phosphoryl group transfer from 
a substrate such as 1,3-bisphosphoglycerate is referred to 
as a substrate-level phosphorylation, to distinguish this 
mechanism from respiration-linked phosphorylation. 
Substrate-level phosphorylations involve soluble enzymes 
and chemical intermediates (1,3-bisphosphoglycerate in 
this case). Respiration-linked phosphorylations, on the 
other hand, involve membrane-bound enzymes and trans- 
membrane gradients of protons (Chapter 19). 



group between C-2 and C-3 of glycerate; Mg 2+ is essential 
for this reaction: 
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The reaction occurs in two steps (Fig. 14-8). A phos- 
phoryl group initially attached to a His residue of the 
mutase is transferred to the hydroxyl group at C-2 of 
3-phosphoglycerate, forming 2,3-bisphosphoglycerate 
(2,3-BPG). The phosphoryl group at C-3 of 2,3-BPG is 
then transferred to the same His residue, producing 
2-phosphoglycerate and regenerating the phosphorylated 
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(8) Conversion of 3-Phosphoglycerate to 2- 
Phosphoglycerate The enzyme phosphoglycerate 
mutase catalyzes a reversible shift of the phosphoryl 



FIGURE 14-8 The phosphoglycerate mutase reaction. 
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enzyme. Phosphoglycerate mutase is initially phospho- 
rylated by phosphoryl transfer from 2,3-BPG, which is 
required in small quantities to initiate the catalytic cycle 
and is continuously regenerated by that cycle. 

(D Dehydration of 2-Phosphoglycerate to Phos- 
phoenolpyruvate In the second glycolytic reaction 
that generates a compound with high phosphoryl group 
transfer potential (the first was step ©), enolase 
promotes reversible removal of a molecule of water from 
2-phosphoglycerate to yield phosphoenolpyruvate 
(PEP): 
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H 2 0 



2-Phosphoglycerate 
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c-opof- 
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CH 2 

Phosphoenolpyruvate 



AG'° = 7.5 kJ/mol 



The mechanism of the enolase reaction involves an eno- 
lic intermediate stabilized by Mg 2+ (see Fig. 6-23). The 
reaction converts a compound with a relatively low 
phosphoryl group transfer potential (AG'° for hydrolysis 
of 2-phosphoglycerate is - 17.6 kJ/mol) to one with high 
phosphoryl group transfer potential (AG'° for PEP hy- 
drolysis is -61.9 kJ/mol) (see Fig. 13-13, Table 13-6). 

@ Transfer of the Phosphoryl Group from Phos- 
phoenolpyruvate to ADP The last step in glycolysis 
is the transfer of the phosphoryl group from phospho- 
enolpyruvate to ADP, catalyzed by pyruvate kinase, 
which requires K + and either Mg 2+ or Mn 2+ : 
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In this substrate-level phosphorylation, the product 
pyruvate first appears in its enol form, then tauto- 
merizes rapidly and nonenzymatically to its keto form, 
which predominates at pH 7: 
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The overall reaction has a large, negative standard free- 
energy change, due in large part to the spontaneous 
conversion of the enol form of pyruvate to the keto form 
(see Fig. 13-13). About half of the energy released by 
PEP hydrolysis (AG'° = -61.9 kJ/mol) is conserved in 
the formation of the phosphoanhydride bond of ATP 
(AG'° = -30.5 kJ/mol), and the rest (-31.4 kJ/mol) 
constitutes a large driving force pushing the reaction to- 
ward ATP synthesis. We discuss the regulation of pyru- 
vate kinase in Chapter 15. 

The Overall Balance Sheet Shows a Net Gain of ATP 

We can now construct a balance sheet for glycolysis to 
account for (1) the fate of the carbon skeleton of glu- 
cose, (2) the input of Pi and ADP and output of ATP, and 
(3) the pathway of electrons in the oxidation-reduction 
reactions. The left-hand side of the following equation 
shows all the inputs of ATP, NAD + , ADP, and P { (consult 
Fig. 14-2), and the right-hand side shows all the outputs 
(keep in mind that each molecule of glucose yields two 
molecules of pyruvate): 

Glucose + 2ATP + 2NAD + + 4ADP + 2P { > 

2 pyruvate + 2ADP + 2NADH + 2H + + 4ATP + 2H 2 0 

Canceling out common terms on both sides of the equa- 
tion gives the overall equation for glycolysis under aero- 
bic conditions: 

Glucose + 2NAD + + 2ADP + 2Pj > 

2 pyruvate + 2NADH + 2H + + 2ATP + 2H 2 0 

The two molecules of NADH formed by glycolysis in 
the cytosol are, under aerobic conditions, reoxidized to 
NAD + by transfer of their electrons to the electron- 
transfer chain, which in eukaryotic cells is located in the 
mitochondria. The electron-transfer chain passes these 
electrons to their ultimate destination, 0 2 : 



2NADH + 2H + + 0 9 



2NAD + + 2H 2 0 



-31.4 kJ/mol 



Electron transfer from NADH to 0 2 in mitochondria pro- 
vides the energy for synthesis of ATP by respiration- 
linked phosphorylation (Chapter 19). 

In the overall glycolytic process, one molecule of 
glucose is converted to two molecules of pyruvate (the 
pathway of carbon). Two molecules of ADP and two of 
Pi are converted to two molecules of ATP (the pathway 
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of phosphoryl groups). Four electrons, as two hydride 
ions, are transferred from two molecules of glyceralde- 
hyde 3-phosphate to two of NAD + (the pathway of 
electrons). 

Glycolysis Is under Tight Regulation 

During his studies on the fermentation of glucose by 
yeast, Louis Pasteur discovered that both the rate and 
the total amount of glucose consumption were many 
times greater under anaerobic than aerobic conditions. 
Later studies of muscle showed the same large differ- 
ence in the rates of anaerobic and aerobic glycolysis. 
The biochemical basis of this "Pasteur effect" is now 
clear. The ATP yield from glycolysis under anaerobic 
conditions (2 ATP per molecule of glucose) is much 
smaller than that from the complete oxidation of glu- 
cose to C0 2 under aerobic conditions (30 or 32 ATP per 
glucose; see Table 19-5). About 15 times as much glu- 
cose must therefore be consumed anaerobically as aero- 
bically to yield the same amount of ATP 

The flux of glucose through the glycolytic pathway 
is regulated to maintain nearly constant ATP levels (as 
well as adequate supplies of glycolytic intermediates 
that serve biosynthetic roles). The required adjust- 
ment in the rate of glycolysis is achieved by a complex 
interplay among ATP consumption, NADH regenera- 
tion, and allosteric regulation of several glycolytic en- 
zymes — including hexokinase, PFK-1, and pyruvate 
kinase — and by second-to-second fluctuations in the 
concentration of key metabolites that reflect the cellu- 
lar balance between ATP production and consumption. 
On a slightly longer time scale, glycolysis is regulated 
by the hormones glucagon, epinephrine, and insulin, 
and by changes in the expression of the genes for sev- 
eral glycolytic enzymes. An especially interesting case 
of abnormal regulation of glycolysis is seen in cancer. 
The German biochemist Otto Warburg first observed in 
1928 that tumors of nearly all types carry out glycoly- 
sis at a much higher rate than normal tissue, even 
when oxygen is available. This "Warburg effect" is 
the basis for several methods of detecting and treating 
cancer (Box 14-1). 

Warburg is generally considered the preeminent 
biochemist of the first half of the twentieth century. He 

made seminal contributions to 
many other areas of biochem- 
istry, including respiration, 
photosynthesis, and the enzy- 
mology of intermediary metab- 
olism. Beginning in 1930, 
Warburg and his associates pu- 
rified and crystallized seven of 
the enzymes of glycolysis. They 
developed an experimental tool 
that revolutionized biochemical 
Otto Warburg, studies of oxidative metabo- 

1883-1970 lism: the Warburg manometer, 




which directly measured the oxygen consumption of tis- 
sues by monitoring changes in gas volume, and thus al- 
lowed quantitative measurement of any enzyme with 
oxidase activity. 

Trained in carbohydrate chemistry in the laboratory 
of the great Emil Fischer (who won the Nobel Prize in 
Chemistry in 1902), Warburg himself won the Nobel 
Prize in Physiology or Medicine in 1931. Several of War- 
burg's students and colleagues also were awarded Nobel 
Prizes: Otto Meyerhof in 1922, Hans Krebs and Fritz 
Lipmann in 1953, and Hugo Theorell in 1955. Meyerhof 's 
laboratory provided training for Lipmann, and for sev- 
eral other Nobel Prize winners: Severo Ochoa (1959), 
Andre Lwoff (1965), and George Wald (1967). 

Glucose Uptake Is Deficient in Type 1 Diabetes Mellitus 

The metabolism of glucose in mammals is limited 
by the rate of glucose uptake into cells and its 
phosphorylation by hexokinase. Glucose uptake from 
the blood is mediated by the GLUT family of glucose 
transporters (see Table 11-3). The transporters of 
hepatocytes (GLUT1, GLUT2) and of brain neurons 
(GLUT3) are always present in plasma membranes. In 
contrast, the main glucose transporter in the cells of 
skeletal muscle, cardiac muscle, and adipose tissue 
(GLUT4) is sequestered in small intracellular vesicles 
and moves into the plasma membrane only in response 
to an insulin signal (Fig. 14-9). We discussed this in- 
sulin signaling mechanism in Chapter 12 (see Fig. 
12-16). Thus in skeletal muscle, heart, and adipose tis- 
sue, glucose uptake and metabolism depend on the nor- 
mal release of insulin by pancreatic /3 cells in response 
to elevated blood glucose (see Fig. 23-27). 

Individuals with type 1 diabetes mellitus (also 
called insulin-dependent diabetes) have too few j3 cells 
and cannot release sufficient insulin to trigger glucose 
uptake by the cells of skeletal muscle, heart, or adipose 
tissue. Thus, after a meal containing carbohydrates, glu- 
cose accumulates to abnormally high levels in the 
blood, a condition known as hyperglycemia. Unable to 
take up glucose, muscle and fat tissue use the fatty 
acids of stored triacylglycerols as their principal fuel. In 
the liver, acetyl-CoA derived from this fatty acid break- 
down is converted to "ketone bodies" — acetoacetate 
and jS-hydroxybutyrate— which are exported and car- 
ried to other tissues to be used as fuel (Chapter 17). 
These compounds are especially critical to the brain, 
which uses ketone bodies as alternative fuel when glu- 
cose is unavailable. (Fatty acids cannot pass through 
the blood-brain barrier and thus are not a fuel for brain 
neurons.) 

In untreated type 1 diabetes, overproduction of ace- 
toacetate and jS-hydroxybutyrate leads to their accumu- 
lation in the blood, and the consequent lowering of 
blood pH produces ketoacidosis, a life-threatening 
condition. Insulin injection reverses this sequence of 
events: GLUT4 moves into the plasma membranes of 
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BOX 14-1 



M E D I C I N 



High Rate of Glycolysis in Tumors Suggests Targets for Chemotherapy 
and Facilitates Diagnosis 



In many types of tumors found in humans and other 
animals, glucose uptake and glycolysis proceed about 
10 times faster than in normal, noncancerous tissues. 
Most tumor cells grow under hypoxic conditions (i.e., 
with limited oxygen supply) because, at least initially, 
they lack the capillary network to supply sufficient 
oxygen. Cancer cells located more than 100 to 200 fim 
from the nearest capillaries must depend on glycolysis 
alone (without further oxidation of pyruvate) for 
much of their ATP production. The energy yield (2 
ATP per glucose) is far lower than can be obtained by 
the complete oxidation of pyruvate to C0 2 in mito- 
chondria (about 30 ATP per glucose; Chapter 19). So, 
to make the same amount of ATP, tumor cells must 
take up much more glucose than do normal cells, con- 
verting it to pyruvate and then to lactate as they recy- 
cle NADH. It is likely that two early steps in the 
transformation of a normal cell into a tumor cell are 
(1) the change to dependence on glycolysis for ATP 
production, and (2) the development of tolerance to a 
low pH in the extracellular fluid (caused by release of 
the end product of glycolysis, lactic acid). In general, 
the more aggressive the tumor, the greater is its rate 
of glycolysis. 

This increase in glycolysis is achieved at least in 
part by increased synthesis of the glycolytic enzymes 
and of the plasma membrane transporters GLUT1 and 
GLUT3 (see Table 11-3) that carry glucose into cells. 
(Recall that GLUT1 and GLUT3 are not dependent on 
insulin.) The hypoxia-inducible transcription fac- 
tor (HIF-1) is a protein that acts at the level of mRNA 
synthesis to stimulate the production of at least eight 
glycolytic enzymes and the glucose transporters when 
oxygen supply is limited (Fig. 1). With the resulting high 
rate of glycolysis, the tumor cell can survive anaerobic 
conditions until the supply of blood vessels has caught 
up with tumor growth. Another protein induced by HIF- 
1 is the peptide hormone VEGF (vascular endothelial 
growth factor), which stimulates the outgrowth of blood 
vessels (angiogenesis) toward the tumor. 



FIGURE 1 The anaerobic metabolism of glucose in tumor cells yields 
far less ATP (2 per glucose) than the complete oxidation to C0 2 that 
takes place in healthy ceils under aerobic conditions (-30 ATP per glu- 
cose), so a tumor cell must consume much more glucose to produce 
the same amount of ATP. Glucose transporters and most of the gly- 
colytic enzymes are overproduced in tumors. Compounds that inhibit 
hexokinase, glucose 6-phosphate dehydrogenase, or transketolase 
block ATP production by glycolysis, thus depriving the cancer cell of 
energy and killing it 



There is also evidence that the tumor suppressor 
protein p53, which is mutated in most types of can- 
cer (p. 477), controls the synthesis and assembly of 
mitochondrial proteins essential to the passage of 
electrons to 0 2 . Cells with mutant p53 are defective 
in mitochondrial electron transport and are forced to 
rely more heavily on glycolysis for ATP production 
(Fig. 1). 

This heavier reliance of tumors than of normal tissue 
on glycolysis suggests a possibility for anticancer therapy: 
inhibitors of glycolysis might target and kill tumors by de- 
pleting their supply of ATP. Three inhibitors of hexoki- 
nase have shown promise as chemotherapeutic agents: 
2-deoxyglucose, lonidamine, and 3-bromopyruvate. By 
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[ 18 F]2-Fluoro-2-deoxyglucose [i8F]6-Phospho-2-fluoro-2-deoxyglucose 
(FdG) (6-Phospho-FdG) 

FIGURE 2 Phosphorylation of 18 F-labeled 2-fluoro-2-deoxyglucose by hexokinase traps the FdG in 
cells (as 6-phospho-FdG), where its presence can be detected by positron emission from 18 F. 



preventing the formation of glucose 6-phosphate, these 
compounds not only deprive tumor cells of glycolyti- 
cally produced ATP but also prevent the formation of 
pentose phosphates via the pentose phosphate path- 
way, which also begins with glucose 6-phosphate. With- 
out pentose phosphates, a cell cannot synthesize the 
nucleotides essential to DNA and RNA synthesis and 
thus cannot grow or divide. Another anticancer drug 
already approved for clinical use is imatinib (Gleevec), 
described in Box 12-5. It inhibits a specific tyrosine ki- 
nase, preventing the increased synthesis of hexokinase 
normally triggered by that kinase. The thiamine analog 
oxythiamine, which blocks the action of a transketo- 
lase-like enzyme that converts xylulose 5-phosphate to 
glyceraldehyde 3-phosphate (Fig. 1), is in preclinical 
trials as an antitumor drug. 




The high glycolytic rate in tumor cells also has diag- 
nostic usefulness. The relative rates at which tissues 
take up glucose can be used in some cases to pinpoint 
the location of tumors. In positron emission tomography 
(PET), individuals are injected with a harmless, isotopi- 
cally labeled glucose analog that is taken up but not 
metabolized by tissues. The labeled compound is 2-flu- 
oro-2-deoxyglucose (FdG), in which the hydroxyl group 
at the C-2 of glucose is replaced with 18 F (Fig. 2). This 
compound is taken up via GLUT transporters and is a 
good substrate for hexokinase, but it cannot be con- 
verted to the enediol intermediate in the phosphohex- 
ose isomerase reaction (see Fig. 14-4) and therefore 
accumulates as 6-phospho-FdG. The extent of its accu- 
mulation depends on its rate of uptake and phosphory- 
lation, which as noted above is typically 10 or more 
times higher in tumors than in normal tissue. Decay of 
18 F yields positrons (two per 18 F atom) that can be de- 
tected by a series of sensitive detectors positioned 
around the body, which allows accurate localization of 
accumulated 6-phospho-FdG (Fig. 3). 

FIGURE 3 Detection of cancerous tissue by positron emission tomogra- 
phy (PET). The adult male patient had undergone surgical removal of a 
primary skin cancer (malignant melanoma). The image on the left, ob- 
tained by whole-body computed tomography (CT scan), shows the lo- 
cation of the soft tissues and bones. The central panel is a PET scan after 
the patient had ingested 18 F-labeled 2-fluoro-2-deoxyglucose (FdG). 
Dark spots indicate regions of high glucose utilization. As expected, the 
brain and bladder are heavily labeled— the brain because it uses most 
of the glucose consumed in the body, and the bladder because the 18 F- 
labeled 6-phospho-FdG is excreted in the urine. When the intensity of 
the label in the PET scan is translated into false color (the intensity in- 
creases from green to yellow to red) and the image is superimposed on 
the CT scan, the fused image (right) reveals cancer in the bones of the 
upper spine, in the liver, and in some regions of muscle, all the result of 
cancer spreading from the primary malignant melanoma. 
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FIGURE 14-9 Effect of type 1 diabetes on carbohydrate and 
mJM fat metabolism in an adipocyte. Normally, insulin triggers the 
insertion of GLUT4 transporters into the plasma membrane by the 
fusion of GLUT4-containing vesicles with the membrane, allowing 
glucose uptake from the blood. When blood levels of insulin drop, 
GLUT4 is resequestered in vesicles by endocytosis. In type 1 (insulin- 
dependent) diabetes mellitus, these normal processes are inhibited as 
indicated by X. The lack of insulin prevents glucose uptake via GLUT4; 
as a consequence, cells are deprived of glucose and blood glucose is 

hepatocytes and adipocytes, glucose is taken up into 
the cells and phosphorylated, and the blood glucose 
level falls, greatly reducing the production of ketone 
bodies. 

Diabetes mellitus has profound effects on the me- 
tabolism of both carbohydrates and fats. We return to 
this topic in Chapter 23, after considering lipid metabo- 
lism (Chapters 17 and 21). ■ 

SUMMARY 14.1 Glycolysis 

■ Glycolysis is a near-universal pathway by which a 
glucose molecule is oxidized to two molecules of 



elevated. Lacking glucose for energy supply, adipocytes break down 
triacylglycerols stored in fat droplets and supply the resulting fatty acids 
to other tissues for mitochondrial ATP production. Two byproducts of 
fatty acid oxidation in the liver (acetoacetate and /3-hydroxybutyrate, 
see p. 666) accumulate and are released into the blood, providing fuel 
for the brain but also decreasing blood pH, causing ketoacidosis. The 
same sequence of events takes place in muscle, except that myocytes 
do not store triacylglycerols and instead take up fatty acids that are re- 
leased into the blood by adipocytes, 

pyruvate, with energy conserved as ATP and 
NADH. 

■ All 10 glycolytic enzymes are in the cytosol, and all 
10 intermediates are phosphorylated compounds 
of three or six carbons. 

■ In the preparatory phase of glycolysis, ATP 
is invested to convert glucose to fructose 
1,6-bisphosphate. The bond between C-3 and 
C-4 is then broken to yield two molecules of those 
phosphate. 

In the payoff phase, each of the two molecules of 
glyceraldehyde 3-phosphate derived from glucose 



14.2 Feeder Pathways for Glycolysis 543 



undergoes oxidation at C-l; the energy of this 
oxidation reaction is conserved in the form of one 
NADH and two ATP per those phosphate oxidized. 
The net equation for the overall process is 

Glucose + 2NAD + + 2ADP + 2P { > 

2 pyruvate + 2NADH + 2H + 4- 2ATP + 2H 2 0 

■ Glycolysis is tightly regulated in coordination with 
other energy-yielding pathways to assure a steady 
supply of ATP. 

■ In type 1 diabetes, defective uptake of glucose by 
muscle and adipose tissue has profound effects on 
the metabolism of carbohydrates and fats. 

14.2 Feeder Pathways for Glycolysis 

Many carbohydrates besides glucose meet their catabolic 
fate in glycolysis, after being transformed into one of the 
glycolytic intermediates. The most significant are the stor- 
age polysaccharides glycogen and starch, either within cells 
(endogenous) or obtained in the diet; the disaccharides 
maltose, lactose, trehalose, and sucrose; and the monosac- 
charides fructose, mannose, and galactose (Fig. 14-10). 



Dietary Polysaccharides and Disaccharides Undergo 
Hydrolysis to Monosaccharides 

For most humans, starch is the major source of carbo- 
hydrates in the diet (Fig. 14-10). Digestion begins in 
the mouth, where salivary a-amylase hydrolyzes the 
internal (al->4) glycosidic linkages of starch, producing 
short polysaccharide fragments or oligosaccharides. 
(Note that in this hydrolysis reaction, water, not P i5 is 
the attacking species.) In the stomach, salivary a-amy- 
lase is inactivated by the low pH, but a second form of 
a -amylase, secreted by the pancreas into the small in- 
testine, continues the breakdown process. Pancreatic 
a-amylase yields mainly maltose and maltotriose (the 
di- and trisaccharides of glucose) and oligosaccharides 
called limit dextrins, fragments of amylopectin contain- 
ing (al— >6) branch points. Maltose and dextrins are de- 
graded to glucose by enzymes of the intestinal brush 
border (the fingerlike microvilli of intestinal epithelial 
cells, which greatly increase the area of the intestinal 
surface). Dietary glycogen has essentially the same 
structure as starch, and its digestion proceeds by the 
same pathway. 



Trehalose 



Sucrose 




Dietary 



Endogenous 




^2. glycogen; starch glycogen 



•amy I iter 



H OH 

o-Galactose 



H OH 

D-Glucose 



.o. 

H HO 



H 




hexokinase 

CH 2 OH 

OH v 




Glucose 
1 -phosphate 



UDP-galactose 
if 

- UDP-glucose 



phosptinglueo- 
mutag 



OH H 

D-Fructose 

ATP^lVnctokinase 

Fructose 1-phosphate 

fructose 1- 
phosphate 
aldolase 




Glucose 

6-phosphate 



Fructose 
6-phosphate - 





CH 2 OH 


H 


J— -0 

/I \ TT 

/h \| 


HO 


. OH HO A. 

xi \y oh 




H H 




D-Mannose 


ATP 





Mannose 6-phosphate 



phosphomannose 
isom erase 



; 1 

Glycer aldehyde + Dihydroxy acetone 
phosphate 



Fructose 1,6- 
bisphosphate 

I 



ATP 




Glyceraldehyde 
3-phosphate 



FIGURE 14-10 Entry of dietary glycogen, 
starch, disaccharides, and hexoses into 
the preparatory stage of glycolysis. 
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Endogenous Glycogen and Starch Are Degraded by 
Phosphorolysis 

Glycogen stored in animal tissues (primarily liver and 
skeletal muscle), in microorganisms, or in plant tissues 
can be mobilized for use within the same cell by a 
phosphorolytic reaction catalyzed by glycogen 
phosphorylase (starch phosphorylase in plants) 
(Fig, 14-1 1 ). These enzymes catalyze an attack by Pi 
on the (a 1— »4) glycosidic linkage that joins the last 
two glucose residues at a nonreducing end, generating 
glucose 1 -phosphate and a polymer one glucose unit 
shorter. Phosphorolysis preserves some of the energy 
of the glycosidic bond in the phosphate ester glucose 
1 -phosphate. Glycogen phosphorylase (or starch 
phosphorylase) acts repetitively until it approaches an 
(a l-»6) branch point (see Fig. 7-14), where its action 
stops. A debranching enzyme removes the branches. 
The mechanisms and control of glycogen degradation 
are described in greater detail in Chapter 15. 

Glucose 1-phosphate produced by glycogen phospho- 
rylase is converted to glucose 6-phosphate by phospho- 
glucomutase, which catalyzes the reversible reaction 
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Phosphoglucomutase employs essentially the same 
mechanism as phosphoglycerate mutase (p. 537): both 
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FIGURE 14-11 Breakdown of intracellular glycogen by glycogen 
phosphorylase. The enzyme catalyzes attack by inorganic phosphate 
(pink) on the terminal glucosyl residue (blue) at the nonreducing end 
of a glycogen molecule, releasing glucose 1-phosphate and generating 
a glycogen molecule shortened by one glucose residue. The reaction is 
a phosphorolysis (not hydrolysis). 



entail a bisphosphate intermediate, and the enzyme is 
transiently phosphorylated in each catalytic cycle. The 
general name mutase is given to enzymes that catalyze 
the transfer of a functional group from one position to 
another in the same molecule. Mutases are a subclass of 
isomerases, enzymes that interconvert stereoisomers 
or structural or positional isomers (see Table 6-3). The 
glucose 6-phosphate formed in the phosphoglucomu- 
tase reaction can enter glycolysis or another pathway 
such as the pentose phosphate pathway, described in 
Section 14.5. 

mm WORKED EXAMPLE 14-1 Energy Savings for 

Glycogen Breakdown 
by Phosphorolysis 

Calculate the energy savings (in ATP molecules per glu- 
cose monomer) achieved by breaking down glycogen by 
phosphorolysis rather than hydrolysis to begin the 
process of glycolysis. 

Solution: Phosphorolysis produces a phosphorylated glu- 
cose (glucose 1-phosphate), which is then converted to 
glucose 6-phosphate — without expenditure of the cellu- 
lar energy (1 ATP) needed for formation of glucose 6- 
phosphate from free glucose. Thus only 1 ATP is 
consumed per glucose monomer in the preparatory 
phase, compared with 2 ATP when glycolysis starts with 
free glucose. The cell therefore gains 3 ATP per glucose 
monomer (4 ATP produced in the payoff phase minus 1 
ATP used in the preparatory phase), rather than 2 — a 
saving of 1 ATP per glucose monomer. 



Breakdown of dietary polysaccharides such as glyco- 
gen and starch in the gastrointestinal tract by phospho- 
rolysis rather than hydrolysis would produce no energy 
gain: sugar phosphates are not transported into the cells 
that line the intestine, but must first be dephosphory- 
lated to the free sugar. 

Disaccharides must be hydrolyzed to monosaccha- 
rides before entering cells. Intestinal disaccharides and 
dextrins are hydrolyzed by enzymes attached to the 
outer surface of the intestinal epithelial cells: 



Dextrin + nH 2 0 - 



► n D-glucose 



Maltose + H 2 0 ■ 



► 2 D-glucose 



Lactose + H 2 0 



Sucrose + H 2 0 



► D-galactose + D-glucose 

> D-fructose 4- D-glucose 



Trehalose + H 2 0 



► 2 D-glucose 



The monosaccharides so formed are actively trans- 
ported into the epithelial cells (see Fig. 11-44), then 
passed into the blood to be carried to various tissues, 
where they are phosphorylated and funneled into the 
glycolytic sequence. 
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I Lactose intolerance, common among adults of 
mm most human populations except those originating 
in Northern Europe and some parts of Africa, is due to 
the disappearance after childhood of most or all of the 
lactase activity of the intestinal epithelial cells. Without 
intestinal lactase, lactose cannot be completely digested 
and absorbed in the small intestine, and it passes into 
the large intestine, where bacteria convert it to toxic 
products that cause abdominal cramps and diarrhea. 
The problem is further complicated because undigested 
lactose and its metabolites increase the osmolarity of 
the intestinal contents, favoring retention of water in 
the intestine. In most parts of the world where lactose 
intolerance is prevalent, milk is not used as a food by 
adults, although milk products predigested with lactase 
are commercially available in some countries. In cer- 
tain human disorders, several or all of the intestinal 
disaccharidases are missing. In these cases, the diges- 
tive disturbances triggered by dietary disaccharides can 
sometimes be minimized by a controlled diet. ■ 

Other Monosaccharides Enter the Glycolytic Pathway at 
Several Points 

In most organisms, hexoses other than glucose can un- 
dergo glycolysis after conversion to a phosphorylated de- 
rivative. D-Fructose, present in free form in many fruits 
and formed by hydrolysis of sucrose in the small intes- 
tine of vertebrates, is phosphorylated by hexokinase: 

Fructose + ATP > fructose 6-phosphate + ADP 

This is a major pathway of fructose entry into glycolysis 
in the muscles and kidney. In the liver, fructose enters 
by a different pathway. The liver enzyme fructokinase 
catalyzes the phosphorylation of fructose at C-l rather 
than C-6: 

Fructose + ATP > fructose 1-phosphate + ADP 

The fructose 1-phosphate is then cleaved to glycer- 
aldehyde and dihydroxyacetone phosphate by fructose 
1-phosphate aldolase: 

CH 2 OPO|- 

C=0 
I 

iCHsOPOi" CH 2 OH 
I Dihydroxyacetone 
9~ ® phosphate 

HOCH + 

HCOH 

HCOH 1 

6CH 2 OH H ^ 0H 
Fructose 1-phosphate CH 2 OH 

Glyceraldehyde 

Dihydroxyacetone phosphate is converted to glycer- 
aldehyde 3-phosphate by the glycolytic enzyme triose 
phosphate isomerase. Glyceraldehyde is phosphory- 



lated by ATP and triose kinase to glyceraldehyde 
3-phosphate: 

Glyceradehyde + ATP ► 

glyceraldehyde 3-phosphate + ADP 

Thus both products of fructose 1-phosphate hydrolysis 
enter the glycolytic pathway as glyceraldehyde 3-phos- 
phate. 

BD-Galactose, a product of the hydrolysis of lactose 
(milk sugar), passes in the blood from the intes- 
tine to the liver, where it is first phosphorylated at C-l , at 
the expense of ATP, by the enzyme galactokinase: 

Mg 2+ 

Galactose + ATP > galactose 1-phosphate + ADP 

The galactose 1-phosphate is then converted to its 
epimer at C-4, glucose 1-phosphate, by a set of reactions 
in which uridine diphosphate (UDP) functions as a 
coenzyme-like carrier of hexose groups (Fig. 14-12). 
The epimerization involves first the oxidation of the C-4 
— OH group to a ketone, then reduction of the ketone to 
an — OH, with inversion of the configuration at C-4. 
NAD is the cofactor for both the oxidation and the 
reduction. 

A defect in any of the three enzymes in this pathway 
causes galactosemia in humans. In galactokinase- 
deficiency galactosemia, high galactose concentrations 
are found in blood and urine. Affected individuals de- 
velop cataracts in infancy, caused by deposition of the 
galactose metabolite galactitol in the lens. 

CH 2 OH 

H— C — OH 

I 

HO-C-H 

I 

HO-C— H 

I 

H-C — OH 

I 

CH 2 OH 
D-Galactitol 

The other symptoms in this disorder are relatively mild, 
and strict limitation of galactose in the diet greatly di- 
minishes their severity. 

Transferase-deficiency galactosemia is more seri- 
ous; it is characterized by poor growth in childhood, 
speech abnormality, mental deficiency, and liver damage 
that may be fatal, even when galactose is withheld from 
the diet. Epimerase-deficiency galactosemia leads to 
similar symptoms, but is less severe when dietary galac- 
tose is carefully controlled. ■ 

D-Mannose, released in the digestion of various 
polysaccharides and glycoproteins of foods, can be 
phosphorylated at C-6 by hexokinase: 

Mg 2+ 

Mannose + ATP > mannose 6-phosphate + ADP 

Mannose 6-phosphate is isomerized by phosphoman- 
nose isomerase to yield fructose 6-phosphate, an in- 
termediate of glycolysis. 
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FIGURE 14-12 Conversion of galactose to glucose 1-phosphate. The 

conversion proceeds through a sugar-nucleotide derivative, UDP- 
galactose, which is formed when galactose 1 -phosphate displaces glu- 
cose 1 -phosphate from UDP-glucose. UDP-galactose is then converted 
by UDP-glucose 4-epimerase to UDP-glucose, in a reaction that in- 
volves oxidation of C-4 (pink) by NAD + , then reduction of C-4 by 
NADH; the result is inversion of the configuration at C-4. The UDP- 
glucose is recycled through another round of the same reaction The 
net effect of this cycle is the conversion of galactose 1-phosphate to 
glucose 1-phosphate; there is no net production or consumption of 
UDP-galactose or UDP-glucose. 

SUMMARY 14.2 Feeder Pathways for 
Glycolysis 

■ Endogenous glycogen and starch, storage forms 
of glucose, enter glycolysis in a two-step process. 



Phosphorolytic cleavage of a glucose residue from 
an end of the polymer, forming glucose 1-phosphate, 
is catalyzed by glycogen phosphorylase or starch 
phosphorylase. Phosphoglucomutase then converts 
the glucose 1-phosphate to glucose 6-phosphate, 
which can enter glycolysis. 

■ Ingested polysaccharides and disaccharides are 
converted to monosaccharides by intestinal 
hydrolytic enzymes, and the monosaccharides 
then enter intestinal cells and are transported to 
the liver or other tissues. 

■ A variety of D-hexoses, including fructose, 
galactose, and mannose, can be funneled into 
glycolysis. Each is phosphorylated and converted 
to glucose 6-phosphate, fructose 6-phosphate, or 
fructose 1-phosphate. 

■ Conversion of galactose 1-phosphate to glucose 
1-phosphate involves two nucleotide derivatives: 
UDP-galactose and UDP-glucose. Genetic defects 
in any of the three enzymes that catalyze 
conversion of galactose to glucose 1-phosphate 
result in galactosemias of varying severity 

14.3 Fates of Pyruvate under Anaerobic 
Conditions: Fermentation 

Under aerobic conditions, the pyruvate formed in the 
final step of glycolysis is oxidized to acetate (acetyl- 
CoA), which enters the citric acid cycle and is oxidized 
to C0 2 and H 2 0. The NADH formed by dehydrogenation 
of glyceraldehyde 3-phosphate is ultimately reoxidized 
to NAD + by passage of its electrons to 0 2 in mitochon- 
drial respiration. Under hypoxic (low-oxygen) condi- 
tions, however — as in very active skeletal muscle, in 
submerged plant tissues, solid tumors, or in lactic acid 
bacteria — NADH generated by glycolysis cannot be re- 
oxidized by 0 2 . Failure to regenerate NAD + would leave 
the cell with no electron acceptor for the oxidation of 
glyceraldehyde 3-phosphate, and the energy-yielding 
reactions of glycolysis would stop. NAD + must therefore 
be regenerated in some other way. 

The earliest cells lived in an atmosphere almost de- 
void of oxygen and had to develop strategies for deriving 
energy from fuel molecules under anaerobic conditions. 
Most modern organisms have retained the ability to con- 
tinually regenerate NAD + during anaerobic glycolysis 
by transferring electrons from NADH to form a reduced 
end product such as lactate or ethanol. 

Pyruvate Is the Terminal Electron Acceptor in 
Lactic Acid Fermentation 

When animal tissues cannot be supplied with sufficient 
oxygen to support aerobic oxidation of the pyruvate and 
NADH produced in glycolysis, NAD + is regenerated 
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from NADH by the reduction of pyruvate to lactate. As 
mentioned earlier, some tissues and cell types (such as 
erythrocytes, which have no mitochondria and thus can- 
not oxidize pyruvate to C0 2 ) produce lactate from glu- 
cose even under aerobic conditions. The reduction of 
pyruvate in this pathway is catalyzed by lactate dehy- 
drogenase, which forms the l isomer of lactate at pH 7: 



(X O 

\ / 

c 

I 

I 

CH 3 
Pyruvate 



NADH + H + 

NAD 1 



Inciate 
dehydrogenase 



O, O" 

V 

110 C-H 

CH 3 
L-Lactate 

AG'°= -25.1 kJ/mol 



The overall equilibrium of the reaction strongly favors 
lactate formation, as shown by the large negative stan- 
dard free-energy change. 

In glycolysis, dehydrogenation of the two molecules 
of glyceraldehyde 3-phosphate derived from each mole- 
cule of glucose converts two molecules of NAD + to two 
of NADH. Because the reduction of two molecules of 
pyruvate to two of lactate regenerates two molecules of 
NAD + , there is no net change in NAD + or NADH: 

Glucose 



2 Pyruvate 




> 2 Lactate 



The lactate formed by active skeletal muscles (or by 
erythrocytes) can be recycled; it is carried in the blood 
to the liver, where it is converted to glucose during the 
recovery from strenuous muscular activity. When lac- 
tate is produced in large quantities during vigorous mus- 
cle contraction (during a sprint, for example), the 
acidification that results from ionization of lactic acid in 
muscle and blood limits the period of vigorous activity. 
The best-conditioned athletes can sprint at top speed 
for no more than a minute (Box 14-2). 

Although conversion of glucose to lactate includes 
two oxidation-reduction steps, there is no net change in 
the oxidation state of carbon; in glucose (C 6 H 12 0 6 ) and 
lactic acid (C 3 H 6 0 3 ), the H:C ratio is the same. Never- 
theless, some of the energy of the glucose molecule has 
been extracted by its conversion to lactate — enough to 
give a net yield of two molecules of ATP for every glu- 
cose molecule consumed. Fermentation is the general 
term for such processes, which extract energy (as ATP) 
but do not consume oxygen or change the concentra- 
tions of NAD + or NADH. Fermentations are carried out 
by a wide range of organisms, many of which occupy 
anaerobic niches, and they yield a variety of end prod- 
ucts, some of which find commercial uses. 



Ethanol Is the Reduced Product in Ethanol Fermentation 

Yeast and other microorganisms ferment glucose to 
ethanol and C0 2 , rather than to lactate. Glucose is con- 
verted to pyruvate by glycolysis, and the pyruvate is 
converted to ethanol and C0 2 in a two-step process: 
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In the first step, pyruvate is decarboxylated in an irre- 
versible reaction catalyzed by pyruvate decarboxy- 
lase. This reaction is a simple decarboxylation and does 
not involve the net oxidation of pyruvate. Pyruvate de- 
carboxylase requires Mg 2+ and has a tightly bound 
coenzyme, thiamine pyrophosphate, which is discussed 
below. In the second step, acetaldehyde is reduced to 
ethanol through the action of alcohol dehydrogenase, 
with the reducing power furnished by NADH derived 
from the dehydrogenation of glyceraldehyde 3-phos- 
phate. This reaction is a well-studied case of hydride 
transfer from NADH (Fig. 14-13). Ethanol and C0 2 are 
thus the end products of ethanol fermentation, and the 
overall equation is 



Glucose + 2ADP + 2Pi 



2 ethanol + 2C0 2 + 2ATP + 2H 2 0 



As in lactic acid fermentation, there is no net change 
in the ratio of hydrogen to carbon atoms when glucose 
(H:C ratio = 12/6 = 2) is fermented to two ethanol 
and two C0 2 (combined H:C ratio = 12/6 = 2). In all 



Acetaldehyde EL 

-C = 0 - Zn 2+ 



NADH 



ir 




NH.> 



Alcohol 

dehydrogenase 



Zn 1 * at the active site polarizes the 
carbonyl oxygen of acetaldehyde. allowing 
transfer of a hydride ion {pink) from NADH. 
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MECHANISM FIGURE 14-13 The alcohol dehydrogenase reaction. 
H Alcohol Dehydrogenase Mechanism 
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fermentations, the H:C ratio of the reactants and prod- 
ucts remains the same. 

Pyruvate decarboxylase is present in brewer's and 
baker's yeast (Saccharomyces cerevisiae) and in all 



other organisms that ferment glucose to ethanol, includ- 
ing some plants. The C0 2 produced by pyruvate decar- 
boxylation in brewer's yeast is responsible for the 
characteristic carbonation of champagne. The ancient 



BOX 14-2 



Athletes, Alligators, and Coelacanths: Glycolysis at Limitim 
Concentrations of Oxygen 



Most vertebrates are essentially aerobic organisms; they 
convert glucose to pyruvate by glycolysis, then use mo- 
lecular oxygen to oxidize the pyruvate completely to C0 2 
and H 2 0. Anaerobic catabolism of glucose to lactate oc- 
curs during short bursts of extreme muscular activity, for 
example in a 100 m sprint, during which oxygen cannot 
be carried to the muscles fast enough to oxidize pyruvate. 
Instead, the muscles use their stored glucose (glycogen) 
as fuel to generate ATP by fermentation, with lactate as 
the end product. In a sprint, lactate in the blood builds up 
to high concentrations. It is slowly converted back to glu- 
cose by gluconeogenesis in the liver in the subsequent 
rest or recovery period, during which oxygen is con- 
sumed at a gradually diminishing rate until the breathing 
rate returns to normal. The excess oxygen consumed in 
the recovery period represents a repayment of the oxy- 
gen debt. This is the amount of oxygen required to supply 
ATP for gluconeogenesis during recovery respiration, in 
order to regenerate the glycogen "borrowed" from liver 
and muscle to carry out intense muscular activity in the 
sprint. The cycle of reactions that includes glucose con- 
version to lactate in muscle and lactate conversion to glu- 
cose in liver is called the Cori cycle, for Carl and Gerty 
Cori, whose studies in the 1930s and 1940s clarified the 
pathway and its role (see Box 15^). 

The circulatory systems of most small vertebrates 
can carry oxygen to their muscles fast enough to avoid 
having to use muscle glycogen anaerobically. For exam- 
ple, migrating birds often fly great distances at high 
speeds without rest and without incurring an oxygen 
debt. Many running animals of moderate size also main- 
tain an essentially aerobic metabolism in their skeletal 
muscle. However, the circulatory systems of larger ani- 
mals, including humans, cannot completely sustain aer- 
obic metabolism in skeletal muscles over long periods of 
intense muscular activity. These animals generally are 
slow-moving under normal circumstances and engage in 
intense muscular activity only in the gravest emergen- 
cies, because such bursts of activity require long recovery 
periods to repay the oxygen debt. 

Alligators and crocodiles, for example, are normally 
sluggish animals. Yet when provoked they are capable 
of lightning-fast charges and dangerous lashings of 
their powerful tails. Such intense bursts of activity are 
short and must be followed by long periods of recovery. 
The fast emergency movements require lactic acid fer- 
mentation to generate ATP in skeletal muscles. The 



stores of muscle glycogen are rapidly expended in 
intense muscular activity, and lactate reaches very high 
concentrations in myocytes and extracellular fluid. 
Whereas a trained athlete can recover from a 100 m 
sprint in 30 min or less, an alligator may require many 
hours of rest and extra oxygen consumption to clear 
the excess lactate from its blood and regenerate muscle 
glycogen after a burst of activity. 

Other large animals, such as the elephant and rhi- 
noceros, have similar metabolic characteristics, as do 
diving mammals such as whales and seals. Dinosaurs and 
other huge, now-extinct animals probably had to depend 
on lactic acid fermentation to supply energy for muscu- 
lar activity, followed by very long recovery periods dur- 
ing which they were vulnerable to attack by smaller 
predators better able to use oxygen and thus better 
adapted to continuous, sustained muscular activity. 

Deep-sea explorations have revealed many species of 
marine life at great ocean depths, where the oxygen con- 
centration is near zero. For example, the primitive coela- 
canth, a large fish recovered from depths of 4,000 m or 
more off the coast of South Africa, has an essentially 
anaerobic metabolism in virtually all its tissues. It converts 
carbohydrates to lactate and other products, most of 
which must be excreted. Some marine vertebrates fer- 
ment glucose to ethanol and C0 2 in order to generate ATP. 
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art of brewing beer involves several enzymatic processes 
in addition to the reactions of ethanol fermentation 
(Box 14-3). In baking, C0 2 released by pyruvate decar- 
boxylase when yeast is mixed with a fermentable sugar 
causes dough to rise. The enzyme is absent in vertebrate 
tissues and in other organisms that carry out lactic acid 
fermentation. 

Alcohol dehydrogenase is present in many organisms 
that metabolize ethanol, including humans. In the liver it 
catalyzes the oxidation of ethanol, either ingested or 



produced by intestinal microorganisms, with the con- 
comitant reduction of NAD + to NADH. In this case, the 
reaction proceeds in the direction opposite to that in- 
volved in the production of ethanol by fermentation. 

Thiamine Pyrophosphate Carries 
"Active Atetaldehyde" Groups 

The pyruvate decarboxylase reaction provides our 
first encounter with thiamine pyrophosphate 



BOX H-3 Ethanol Fermentations: Brewing Beer and Producing Btofuels 



Beer brewing was a science learned early in human 
history, and later refined for larger-scale production. 
Brewers prepare beer by ethanol fermentation of the 
carbohydrates in cereal grains (seeds) such as barley, 
carried out by yeast glycolytic enzymes. The carbohy- 
drates, largely polysaccharides, must first be degraded 
to disaccharides and monosaccharides. In a process 
called malting, the barley seeds are allowed to germi- 
nate until they form the hydrolytic enzymes required to 
break down their polysaccharides, at which point germi- 
nation is stopped by controlled heating. The product is 
malt, which contains enzymes that catalyze the hydroly- 
sis of the f3 linkages of cellulose and other cell wall poly- 
saccharides of the barley husks, and enzymes such as 
a-amylase and maltase. 

The brewer next prepares the wort, the nutrient 
medium required for fermentation by yeast cells. The 
malt is mixed with water and then mashed or crushed. 
This allows the enzymes formed in the malting process 
to act on the cereal polysaccharides to form maltose, 
glucose, and other simple sugars, which are soluble in 
the aqueous medium. The remaining cell matter is then 
separated, and the liquid wort is boiled with hops to give 
flavor. The wort is cooled and then aerated. 

Now the yeast cells are added. In the aerobic wort 
the yeast grows and reproduces very rapidly, using en- 
ergy obtained from available sugars. No ethanol forms 
during this stage, because the yeast, amply supplied 
with oxygen, oxidizes the pyruvate formed by glycolysis 
to C0 2 and H 2 0 via the citric acid cycle. When all the 
dissolved oxygen in the vat of wort has been consumed, 
the yeast cells switch to anaerobic metabolism, and from 
this point they ferment the sugars into ethanol and C0 2 . 
The fermentation process is controlled in part by the 
concentration of the ethanol formed, by the pH, and by 
the amount of remaining sugar. After fermentation has 
been stopped, the cells are removed and the "raw" beer 
is ready for final processing. 

In the final steps of brewing, the amount of foam (or 
head) on the beer, which results from dissolved pro- 
teins, is adjusted. Normally this is controlled by prote- 
olytic enzymes that arise in the malting process. If these 



enzymes act on the proteins too long, the beer will have 
very little head and will be flat; if they do not act long 
enough, the beer will not be clear when it is cold. Some- 
times proteolytic enzymes from other sources are added 
to control the head. 

Much of the technology developed for large-scale 
production of alcoholic beverages is now finding appli- 
cation to a wholly different problem: the production of 
ethanol as a renewable fuel. With the continuing deple- 
tion of the known stores of fossil fuels and the rising cost 
of fuel for internal combustion engines, there is in- 
creased interest in the use of ethanol as a fuel substitute 
or extender. The principal advantage of ethanol as a fuel 
is that it can be produced from relatively inexpensive 
and renewable resources rich in sucrose, starch, or 
cellulose — starch from corn or wheat, sucrose from 
beets or cane, and cellulose from straw, forest industry 
waste, or municipal solid waste. Typically, the raw 
material (feedstock) is first converted chemically to 
monosaccharides, then fed to a hardy strain of yeast in 
an industrial-scale fermenter (Fig. 1). The fermentation 
can yield not only ethanol for fuel but also side products 
such as proteins that can be used as animal feed. 




FIGURE 1 Industrial-scale fermentations to produce biofuel and other 
products are typically carried out in tanks that hold thousands of liters 
of medium. 
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MECHANISM FIGURE 14-14 Thiamine pyrophosphate 
(TPP) and its role in pyruvate decarboxylation, (a) TPP is 

the coenzyme form of vitamin (thiamine). The reactive 
carbon atom in the thiazolium ring of TPP is shown in red. 
In the reaction catalyzed by pyruvate decarboxylase, two 
of the three carbons of pyruvate are carried transiently on 
TPP in the form of a hydroxyethyl, or "active acetalde- 
hyde/' group (b), which is subsequently released as ac- 
etaldehyde. (c) The thiazolium ring of TPP stabilizes 
carbanion intermediates by providing an electrophilic 
(electron-deficient) structure into which the carbanion 
electrons can be delocalized by resonance. Structures 
with this property often called "electron sinks/' play a 
role in many biochemical reactions — here, facilitating 
carbon-carbon bond cleavage. % Thiamine Pyrophos- 
phate Mechanism 



(TPP) (Fig. 14-14), a coenzyme derived from vitamin 
Bi. Lack of vitamin Bi in the human diet leads to the con- 
dition known as beriberi, characterized by an accumula- 
tion of body fluids (swelling), pain, paralysis, and 
ultimately death. ■ 

Thiamine pyrophosphate plays an important role in 
the cleavage of bonds adjacent to a carbonyl group, such 
as the decarboxylation of a-keto acids, and in chemical 
rearrangements in which an activated acetaldehyde 
group is transferred from one carbon atom to another 
(Table 14-1). The functional part of TPP, the thiazolium 
ring, has a relatively acidic proton at C-2. Loss of this 
proton produces a carbanion that is the active species in 



TPP-dependent reactions (Fig. 14-14). The carbanion 
readily adds to carbonyl groups, and the thiazolium ring 
is thereby positioned to act as an "electron sink" that 
greatly facilitates reactions such as the decarboxylation 
catalyzed by pyruvate decarboxylase. 

Fermentations Are Used to Produce Some Common 
Foods and Industrial Chemicals 

Our progenitors learned millennia ago to use fermenta- 
tion in the production and preservation of foods. Certain 
microorganisms present in raw food products ferment 
the carbohydrates and yield metabolic products that 
give the foods their characteristic forms, textures, and 
tastes. Yogurt, already known in Biblical times, is pro- 
duced when the bacterium Lactobacillus bulgaricus 
ferments the carbohydrate in milk, producing lactic 
acid; the resulting drop in pH causes the milk proteins to 
precipitate, producing the thick texture and sour taste 
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of unsweetened yogurt. Another bacterium, Propioni- 
bacterium freudenreichii, ferments milk to produce 
propionic acid and C0 2 ; the propionic acid precipitates 
milk proteins, and bubbles of C0 2 cause the holes char- 
acteristic of Swiss cheese. Many other food products are 
the result of fermentations: pickles, sauerkraut, sausage, 
soy sauce, and a variety of national favorites, such as 
kimchi (Korea), tempoyak (Indonesia), kefir (Russia), 
dahi (India), and pozol (Mexico). The drop in pH associ- 
ated with fermentation also helps to preserve foods, be- 
cause most of the microorganisms that cause food 
spoilage cannot grow at low pH. In agriculture, plant 
byproducts such as corn stalks are preserved for use as 
animal feed by packing them into a large container (a 
silo) with limited access to air; microbial fermentation 
produces acids that lower the pH. The silage that results 
from this fermentation process can be kept as animal 
feed for long periods without spoilage. 

In 1910 Chaim Weizmann (later to become the 
first president of Israel) discovered that the bac- 
terium Clostridium acetobutyricum ferments starch 
to butanol and acetone. This discovery opened the 
field of industrial fermentations, in which some read- 
ily available material rich in carbohydrate (corn 
starch or molasses, for example) is supplied to a pure 
culture of a specific microorganism, which ferments it 
into a product of greater commercial value. The 
ethanol used to make "gasohol" is produced by micro- 
bial fermentation, as are formic, acetic, propionic, bu- 
tyric, and succinic acids, and glycerol, methanol, 
isopropanol, butanol, and butanediol. These fermen- 
tations are generally carried out in huge closed vats in 
which temperature and access to air are controlled to 
favor the multiplication of the desired microorganism 
and to exclude contaminating organisms. The beauty 
of industrial fermentations is that complicated, multi- 
step chemical transformations are carried out in high 
yields and with few side products by chemical facto- 
ries that reproduce themselves — microbial cells. For 
some industrial fermentations, technology has been 



developed to immobilize the cells in an inert support, 
to pass the starting material continuously through the 
bed of immobilized cells, and to collect the desired 
product in the effluent — an engineer's dream! 

SUMMARY 14.3 Fates of Pyruvate under 
Anaerobic Conditions: 
Fermentation 

■ The NADH formed in glycolysis must be recycled to 
regenerate NAD + , which is required as an electron 
acceptor in the first step of the payoff phase. Under 
aerobic conditions, electrons pass from NADH to 
0 2 in mitochondrial respiration. 

■ Under anaerobic or hypoxic conditions, many 
organisms regenerate NAD + by transferring 
electrons from NADH to pyruvate, forming lactate. 
Other organisms, such as yeast, regenerate NAD + 
by reducing pyruvate to ethanol and C0 2 . In these 
anaerobic processes (fermentations), there is no 
net oxidation or reduction of the carbons of glucose. 

■ A variety of microorganisms can ferment sugar in 
fresh foods, resulting in changes in pH, taste, and 
texture, and preserving food from spoilage. Fermen- 
tations are used in industry to produce a wide variety 
of commercially valuable organic compounds from 
inexpensive starting materials. 

14.4 Gluconeogenesis 

The central role of glucose in metabolism arose early in 
evolution, and this sugar remains the nearly universal 
fuel and building block in modern organisms, from mi- 
crobes to humans. In mammals, some tissues depend 
almost completely on glucose for their metabolic energy. 
For the human brain and nervous system, as well as the 
erythrocytes, testes, renal medulla, and embryonic tis- 
sues, glucose from the blood is the sole or major fuel 
source. The brain alone requires about 120 g of glucose 
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each day — more than half of all the glucose stored as 
glycogen in muscle and liver. However, the supply of glu- 
cose from these stores is not always sufficient; between 
meals and during longer fasts, or after vigorous exercise, 
glycogen is depleted. For these times, organisms need a 
method for synthesizing glucose from noncarbohydrate 
precursors. This is accomplished by a pathway called 
gluconeogenesis ("new formation of sugar"), which 
converts pyruvate and related three- and four-carbon 
compounds to glucose. 

Gluconeogenesis occurs in all animals, plants, 
fungi, and microorganisms. The reactions are essen- 
tially the same in all tissues and all species. The impor- 
tant precursors of glucose in animals are three-carbon 
compounds such as lactate, pyruvate, and glycerol, as 
well as certain amino acids (Fig. 14-15). In mammals, 
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FIGURE 14-15 Carbohydrate synthesis from simple precursors. The 

pathway from phosphoenolpyruvate to glucose 6-phosphate is common 
to the biosynthetic conversion of many different precursors of carbohy- 
drates in animals and plants. The path from pyruvate to phospho- 
enolpyruvate leads through oxaloacetate, an intermediate of the citric 
acid cycle, which we discuss in Chapter 1 6. Any compound that can be 
converted to either pyruvate or oxaloacetate can therefore serve as start- 
ing material for gluconeogenesis. This includes alanine and aspartate, 
which are convertible to pyruvate and oxaloacetate, respectively, and 
other amino acids that can also yield three- or four-carbon fragments, 
the so-called glucogenic amino acids (Table 14-4; see also Fig. 18-15). 
Plants and photosynthetic bacteria are uniquely able to convert C0 2 to 
carbohydrates, using the glyoxylate cycle {p. 639). 



gluconeogenesis takes place mainly in the liver, and to 
a lesser extent in renal cortex and in the epithelial 
cells that line the inside of the small intestine. The glu- 
cose produced passes into the blood to supply other 
tissues. After vigorous exercise, lactate produced by 
anaerobic glycolysis in skeletal muscle returns to the 
liver and is converted to glucose, which moves back to 
muscle and is converted to glycogen — a circuit called 
the Cori cycle (Box 14-2; see also Fig. 23-20). In plant 
seedlings, stored fats and proteins are converted, via 
paths that include gluconeogenesis, to the disaccha- 
ride sucrose for transport throughout the developing 
plant. Glucose and its derivatives are precursors for 
the synthesis of plant cell walls, nucleotides and coen- 
zymes, and a variety of other essential metabolites. In 
many microorganisms, gluconeogenesis starts from 
simple organic compounds of two or three carbons, 
such as acetate, lactate, and propionate, in their 
growth medium. 

Although the reactions of gluconeogenesis are the 
same in all organisms, the metabolic context and the 
regulation of the pathway differ from one species to an- 
other and from tissue to tissue. In this section we focus 
on gluconeogenesis as it occurs in the mammalian liver. 
In Chapter 20 we show how photosynthetic organisms 
use this pathway to convert the primary products of 
photosynthesis into glucose, to be stored as sucrose or 
starch. 

Gluconeogenesis and glycolysis are not identical 
pathways running in opposite directions, although 
they do share several steps (Fig. 14-16); 7 of the 10 
enzymatic reactions of gluconeogenesis are the re- 
verse of glycolytic reactions. However, three reactions 
of glycolysis are essentially irreversible in vivo and 
cannot be used in gluconeogenesis: the conversion of 
glucose to glucose 6-phosphate by hexokinase, the 
phosphorylation of fructose 6-phosphate to fructose 
1,6-bisphosphate by phosphofructokinase-1, and the 
conversion of phosphoenolpyruvate to pyruvate by 
pyruvate kinase (Fig. 14-16). In cells, these three re- 
actions are characterized by a large negative free-en- 
ergy change, whereas other glycolytic reactions have a 
AG near 0 (Table 14-2). In gluconeogenesis, the three 
irreversible steps are bypassed by a separate set of en- 
zymes, catalyzing reactions that are sufficiently exer- 
gonic to be effectively irreversible in the direction of 
glucose synthesis. Thus, both glycolysis and gluconeo- 
genesis are irreversible processes in cells. In animals, 
both pathways occur largely in the cytosol, necessitat- 
ing their reciprocal and coordinated regulation. Sepa- 
rate regulation of the two pathways is brought about 
through controls exerted on the enzymatic steps 
unique to each. 

We begin by considering the three bypass reac- 
tions of gluconeogenesis. (Keep in mind that "bypass" 
refers throughout to the bypass of irreversible glyco- 
lytic reactions.) 
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TABLE 14-2 



| free-Energy Changes of Glycolytic Reactions in Erythrocytes 
Glycolytic reaction step 



A<?'° (kJ/mol) A<?(kJytool) 



(T) Glucose + ATP > glucose 6-phosphate + ADP 

(2) Glucose 6-phosphate fructose 6-phosphate 

©Fructose 6-phosphate + ATP > fructose 1,6-bisphosphate + ADP 

(4) Fructose 1,6-bisphosphate dihydroxyacetone phosphate + 
glyceraldehyde 3-phosphate 

(5) Dihydroxyacetone phosphate glyceraldehyde 3-phosphate 

(6) Glyceraldehyde 3-phosphate + Pi + NAD + 1,3-bisphosphoglycerate + NADH + H 4 
© 1,3-Bisphosphoglycerate + ADP 3-phosphoglycerate + ATP 

(8) 3-Phosphoglycerate 2-phosphoglycerate 

(9) 2-Phosphoglycerate phosphoenolpyruvate + H 2 0 

@ Phosphoenolpyruvate + ADP > pyruvate + ATP 
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Note: A6'° is the standard free-energy change, as defined in Chapter 13 {pp. 491-492). 
AG is the free-energy change calculated from the actual concentrations of glycolytic 
intermediates present under physiological conditions in erythrocytes, at pH 7. The gly- 
colytic reactions bypassed in gluconeogenesis are shown in red. Biochemical equations 
are not necessarily balanced for H or charge (p. 501). 



Conversion of Pyruvate to Phosphoenolpyruvate Requires 
Two Exergonic Reactions 

The first of the bypass reactions in gluconeogenesis is 
the conversion of pyruvate to phosphoenolpyruvate 
(PEP). This reaction cannot occur by simple reversal of 
the pyruvate kinase reaction of glycolysis (p. 538), 
which has a large, negative free-energy change and is 
therefore irreversible under the conditions prevailing in 
intact cells (Table 14-2, step (K))). Instead, the phospho- 
rylation of pyruvate is achieved by a roundabout se- 
quence of reactions that in eukaryotes requires 
enzymes in both the cytosol and mitochondria. As we 
shall see, the pathway shown in Figure 14-16 and de- 
scribed in detail here is one of two routes from pyruvate 
to PEP; it is the predominant path when pyruvate or 
alanine is the glucogenic precursor. A second pathway, 
described later, predominates when lactate is the gluco- 
genic precursor. 

Pyruvate is first transported from the cytosol into 
mitochondria or is generated from alanine within mito- 
chondria by transamination, in which the a-amino 
group is transferred from alanine (leaving pyruvate) to 
an a-keto carboxylic acid (transamination reactions 
are discussed in detail in Chapter 18). Then pyruvate 
carboxylase, a mitochondrial enzyme that requires the 



FIGURE 14-16 Opposing pathways of glycolysis and gluconeogene- 
sis in rat liver. The reactions of glycolysis are on the left side, in reel; 
the opposing pathway of gluconeogenesis is on the right, in blue. 
The major sites of regulation of gluconeogenesis shown here are 
discussed later in this chapter, and in detail in Chapter 15. Figure 
14-19 illustrates an alternative route for oxaloacetate produced in 
mitochondria. 
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coenzyme biotin, converts the pyruvate to oxaloacetate 
(Fig. 14-17): 

Pyruvate + HC0 3 " + ATP ► 

oxaloacetate + ADP 4- Pi (14-4) 

The carboxylation reaction involves biotin as a carrier of 
activated bicarbonate, as shown in Figure 14-18; the 
reaction mechanism is shown in Figure 16-16. (Note 
that HCO3 is formed by ionization of carbonic acid 
formed from C0 2 + H 2 0.) HCO3 is phosphorylated by 
ATP to form a mixed anhydride (a carboxyphosphate); 
then biotin displaces the phosphate in the formation of 
carboxybiotin. 

Pyruvate carboxylase is the first regulatory enzyme 
in the gluconeogenic pathway, requiring acetyl-CoA as a 
positive effector. (Acetyl-CoA is produced by fatty acid 
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(a) In mitochondria, pyruvate is converted to oxaloacetate in a biotin- 
requiring reaction catalyzed by pyruvate carboxylase, (b) In the cy- 
tosol, oxaloacetate is converted to phosphoenolpyruvate by PEP 
carboxykinase. The C0 2 incorporated in the pyruvate carboxylase re- 
action is lost here as C0 2 . The decarboxylation leads to a rearrange- 
ment of electrons that facilitates attack of the carbonyl oxygen of the 
pyruvate moiety on the y phosphate of GTP. 



oxidation (Chapter 17), and its accumulation signals the 
availability of fatty acids as fuel.) As we shall see in 
Chapter 16 (see Fig. 16-15), the pyruvate carboxylase 
reaction can replenish intermediates in another central 
metabolic pathway, the citric acid cycle. 



Pyruvate carboxylase 



'X 



HCO 



ATP 
ADI 



u>p p% 



Site 1 



o 

P: O-Q 



Long biotinyl-Lys 
tether moves 
C0 2 from site 1 
to site 2. 



C-C 'CH.. 

O 

Pyruvate 



O 




Site 2 



0 



C— C-CH 2 -C 

-o o- 

Oxaloacetate 



FIGURE 14-18 Role of biotin in the pyruvate carboxylase reaction. 

The cofactor biotin is covalently attached to the enzyme through an 
amide linkage to the e-amino group of a Lys residue, forming a 
biotinyl-enzyme. The reaction occurs in two phases, which occur at 
two different sites in the enzyme. At catalytic site 1 , bicarbonate ion is 
converted to C0 2 at the expense of ATP. Then C0 2 reacts with biotin, 
forming carboxybiotinyl-enzyme. The long arm composed of biotin 
and the Lys side chain to which it is attached then carry the C0 2 of 
carboxybiotinyl-enzyme to catalytic site 2 on the enzyme surface, 
where C0 2 is released and reacts with the pyruvate, forming oxaloac- 
etate and regenerating the biotinyl-enzyme The general role of flexible 
arms in carrying reaction intermediates between enzyme active sites is 
described in Figure 1 6-1 7, and the mechanistic details of the pyruvate 
carboxylase reaction are shown in Figure 16-16. Similar mechanisms 
occur in other biotin-dependent carboxylation reactions, such as those 
catalyzed by propionyl-CoA carboxylase (see Fig. 17-11) and acetyl- 
CoA carboxylase (see Fig. 21-1). 
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Because the mitochondrial membrane has no trans- 
porter for oxaloacetate, before export to the cytosol the 
oxaloacetate formed from pyruvate must be reduced to 
malate by mitochondrial malate dehydrogenase, at 
the expense of NADH: 

Oxaloacetate + NADH + H + L-malate + NAD + (14-5) 

The standard free-energy change for this reaction is 
quite high, but under physiological conditions (includ- 
ing a very low concentration of oxaloacetate) AG ~ 0 
and the reaction is readily reversible. Mitochondrial 
malate dehydrogenase functions in both gluconeogenesis 
and the citric acid cycle, but the overall flow of metabo- 
lites in the two processes is in opposite directions. 

Malate leaves the mitochondrion through a specific 
transporter in the inner mitochondrial membrane (see 
Fig. 19-30), and in the cytosol it is reoxidized to ox- 
aloacetate, with the production of cytosolic NADH: 



Malate + NAD + 



oxaloacetate + NADH + H^ 



(14-6) 



The oxaloacetate is then converted to PEP by phospho- 
enolpyruvate carboxykinase (Fig. 14-1 7) . This 
Mg 2+ -dependent reaction requires GTP as the phos- 
phoryl group donor: 



Oxaloacetate + GTP 



PEP + C0 2 + GDP (14-7) 



The reaction is reversible under intracellular conditions; 
the formation of one high-energy phosphate compound 
(PEP) is balanced by the hydrolysis of another (GTP). 

The overall equation for this set of bypass reactions, 
the sum of Equations 14-4 through 14-7, is 

Pyruvate + ATP + GTP + HC0 3 ~ > 

PEP + ADP + GDP + Pi + C0 2 (14-8) 
AG'° = 0.9kJ/mol 

Two high-energy phosphate equivalents (one from ATP 
and one from GTP) , each yielding about 50 kJ/mol under 
cellular conditions, must be expended to phosphorylate 
one molecule of pyruvate to PEP. In contrast, when PEP 
is converted to pyruvate during glycolysis, only one ATP 
is generated from ADP. Although the standard free- 
energy change (AG'°) of the two-step path from pyruvate 
to PEP is 0.9 kJ/mol, the actual free-energy change 
(AG), calculated from measured cellular concentrations 
of intermediates, is very strongly negative (-25 kJ/mol); 
this results from the ready consumption of PEP in other 
reactions such that its concentration remains relatively 
low. The reaction is thus effectively irreversible in the 
cell. 

Note that the C0 2 added to pyruvate in the pyru- 
vate carboxylase step is the same molecule that is lost 
in the PEP carboxykinase reaction (Fig. 14-1 7b). This 
carboxylation-decarboxylation sequence represents a 
way of "activating" pyruvate, in that the decarboxylation 
of oxaloacetate facilitates PEP formation. In Chapter 21 
we shall see how a similar carboxylation-decarboxylation 
sequence is used to activate acetyl-CoA for fatty acid 
biosynthesis (see Fig. 21-1). 



There is a logic to the route of these reactions 
through the mitochondrion. The [NADH]/[NAD + ] ra- 
tio in the cytosol is 8 x 10" 4 , about 10 5 times lower 
than in mitochondria. Because cytosolic NADH is con- 
sumed in gluconeogenesis (in the conversion of 1,3- 
bisphosphoglycerate to glyceraldehyde 3-phosphate; 
Fig. 14-16), glucose biosynthesis cannot proceed unless 
NADH is available. The transport of malate from the mi- 
tochondrion to the cytosol and its reconversion there to 
oxaloacetate effectively moves reducing equivalents to 
the cytosol, where they are scarce. This path from pyru- 
vate to PEP therefore provides an important balance be- 
tween NADH produced and consumed in the cytosol 
during gluconeogenesis. 

A second pyruvate PEP bypass predominates 
when lactate is the glucogenic precursor (Fig. 14-19). 
This pathway makes use of lactate produced by glyco- 
lysis in erythrocytes or anaerobic muscle, for example, 
and it is particularly important in large vertebrates after 
vigorous exercise (Box 14-2). The conversion of lactate 
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to pyruvate in the cytosol of hepatocytes yields NADH, 
and the export of reducing equivalents (as malate) from 
mitochondria is therefore unnecessary After the pyru- 
vate produced by the lactate dehydrogenase reaction is 
transported into the mitochondrion, it is converted to 
oxaloacetate by pyruvate carboxylase, as described 
above. This oxaloacetate, however, is converted directly 
to PEP by a mitochondrial isozyme of PEP carboxyki- 
nase, and the PEP is transported out of the mitochon- 
drion to continue on the gluconeogenic path. The 
mitochondrial and cytosolic isozymes of PEP carboxyki- 
nase are encoded by separate genes in the nuclear chro- 
mosomes, providing another example of two distinct 
enzymes catalyzing the same reaction but having differ- 
ent cellular locations or metabolic roles (recall the 
isozymes of hexokinase). 

Conversion of Fructose 1,6-Bisphosphate to Fructose 
6-Phosphate Is the Second Bypass 

The second glycolytic reaction that cannot participate in 
gluconeogenesis is the phosphorylation of fructose 6- 
phosphate by PFK-1 (Table 14-2, step ©). Because this 
reaction is highly exergonic and therefore irreversible 
in intact cells, the generation of fructose 6-phosphate 
from fructose 1,6-bisphosphate (Fig. 14-16) is catalyzed 
by a different enzyme, Mg 2+ -dependent fructose 1,6- 
bisphosphatase (FBPase-1), which promotes the es- 
sentially irreversible hydrolysis of the C-l phosphate 
(not phosphoryl group transfer to ADP): 

Fructose 1,6-bisphosphate + H 2 0 > 

fructose 6-phosphate + Pj 
AG'° = -16.3kJ/mol 

FBPase-1 is so named to distinguish it from another, 
similar enzyme (FBPase-2) with a regulatory role, which 
we discuss in Chapter 15. 



Conversion of Glucose 6-Phosphate to Glucose 
Is the Third Bypass 

The third bypass is the final reaction of gluconeogenesis, 
the dephosphorylation of glucose 6-phosphate to yield 
glucose (Fig. 14-16). Reversal of the hexokinase reac- 
tion (p. 532) would require phosphoryl group transfer 
from glucose 6-phosphate to ADP, forming ATP, an ener- 
getically unfavorable reaction (Table 14-2, step ®). The 
reaction catalyzed by glucose 6-phosphatase does not 
require synthesis of ATP; it is a simple hydrolysis of a 
phosphate ester: 



Glucose 6-phosphate + H 2 0 



-> glucose + Pi 
AG'° = -13.8kJ/mol 



This Mg + -activated enzyme is found on the lumenal side 
of the endoplasmic reticulum of hepatocytes, renal cells, 
and epithelial cells of the small intestine (see Fig. 15-28), 
but not in other tissues, which are therefore unable to 
supply glucose to the blood. If other tissues had glucose 
6-phosphatase, this enzyme's activity would hydrolyze 
the glucose 6-phosphate needed within those tissues for 
glycolysis. Glucose produced by gluconeogenesis in the 
liver or kidney or ingested in the diet is delivered to 
these other tissues, including brain and muscle, through 
the bloodstream. 

Gluconeogenesis Is Energetically Expensive, but Essential 

The sum of the biosynthetic reactions leading from 
pyruvate to free blood glucose (Table 14-3) is 

2 Pyruvate + 4ATP + 2GTP + 2NADH + 2H + + 4H 2 0 ► 

glucose + 4ADP + 2 GDP + 6Pi + 2NAD + (14-9) 

For each molecule of glucose formed from pyruvate, six 
high-energy phosphate groups are required, four from 
ATP and two from GTP. In addition, two molecules of 



TABLE 14-3 



Sequential Reactions in Gluconeogenesis Starting from Pyruvate 



Pyruvate + HC0 3 " 4- ATP > oxaloacetate + ADP + P t 

Oxaloacetate + GTP ^=± phosphoenolpyruvate + C0 2 + GDP 
Phosphoenolpyruvate + H 2 0 2-phosphoglycerate 

2- Phosphoglycerate 3-phosphoglycerate 

3- Phosphoglycerate + ATP^=± 1,3-bisphosphoglycerate + ADP 
1,3-Bisphosphoglycerate + NADH + H + glyceraldehyde 3-phosphate + NAD + + P A 
Glyceraldehyde 3-phosphate dihydroxyacetone phosphate 

Glyceraldehyde 3-phosphate + dihydroxyacetone phosphate fructose 1,6-bisphosphate 

Fructose 1,6-bisphosphate > fructose 6-phosphate + P t 

Fructose 6-phosphate glucose 6-phosphate 
Glucose 6-phosphate + H 2 0 > glucose + Pj 

Sum: 2 Pyruvate 4- 4ATP + 2GTP + 2NADH + 2H + + 4H 2 0 > glucose + 4ADP + 2 GDP + 6^ + 2NAD 4 



X2 
X2 
X2 
X2 
X2 
X2 



Note: The bypass reactions are in red; all other reactions are reversible steps of glycolysis. The figures at the right indicate that the reaction is to be counted twice, because two 
three-carbon precursors are required to make a molecule of glucose. The reactions required to replace the cytosolic NADH consumed in the glyceraldehyde 3-phosphate dehydroge- 
nase reaction (the conversion of lactate to pyruvate in the cytosol or the transport of reducing equivalents from mitochondria to the cytosol in the form of malate) are not considered in 
this summary. Biochemical equations are not necessarily balanced for H and charge (p. 501). 
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NADH are required for the reduction of two molecules 
of 1,3-bisphosphoglycerate. Clearly, Equation 14-9 is 
not simply the reverse of the equation for conversion of 
glucose to pyruvate by glycolysis, which would require 
only two molecules of ATP: 

Glucose + 2 ADP + 2P { + NAD + ► 

2 pyruvate + 2ATP + 2NADH + 2H + + 2H 2 0 

The synthesis of glucose from pyruvate is a relatively ex- 
pensive process. Much of this high energy cost is neces- 
sary to ensure the irreversibility of gluconeogenesis. 
Under intracellular conditions, the overall free-energy 
change of glycolysis is at least -63 kJ/mol. Under the 
same conditions the overall AG of gluconeogenesis is 
-16 kJ/mol. Thus both glycolysis and gluconeogenesis 
are essentially irreversible processes in cells. 

Citric Acid Cycle Intermediates and Some Amino Acids 
Are Glucogenic 

The biosynthetic pathway to glucose described above al- 
lows the net synthesis of glucose not only from pyruvate 
but also from the four-, five-, and six-carbon intermedi- 
ates of the citric acid cycle (Chapter 16). Citrate, isoci- 
trate, a-ketoglutarate, succinyl-CoA, succinate, fumarate, 
and malate — all are citric acid cycle intermediates that 
can undergo oxidation to oxaloacetate (see Fig. 16-7). 
Some or all of the carbon atoms of most amino acids de- 
rived from proteins are ultimately catabolized to pyru- 
vate or to intermediates of the citric acid cycle. Such 
amino acids can therefore undergo net conversion to 
glucose and are said to be glucogenic (Table 14-4). 
Alanine and glutamine, the principal molecules that 
transport amino groups from extrahepatic tissues to the 
liver (see Fig. 18-9), are particularly important gluco- 
genic amino acids in mammals. After removal of their 



TABLE 14-4 



Glucogenic Amino Acids, Grouped by 
Site of Entry 




Pyruvate 


Succinyl-CoA 


Alanine 


Isoleucine* 


Cysteine 


Methionine 


Glycine 


Threonine 


Serine 


Valine 


Threonine 


Fumarate 


Tryptophan* 


Phenylalanine* 


a-Ketoglutarate 


Tyrosine* 


Arginine 


Oxaloacetate 


Glutamate 


Asparagine 


Glutamine 


Aspartate 


Histidine 


Proline 





Note: All these amino acids are precursors of blood glucose or liver glycogen, because 
they can be converted to pyruvate or citric acid cycle intermediates. Of the 20 common 
amino acids, only leucine and lysine are unable to furnish carbon for net glucose 
synthesis. 

*These amino acids are also ketogenic (see Fig. 18-21). 



amino groups in liver mitochondria, the carbon skele- 
tons remaining (pyruvate and a-ketoglutarate, respec- 
tively) are readily funneled into gluconeogenesis. 

Mammals Cannot Convert Fatty Acids to Glucose 

No net conversion of fatty acids to glucose occurs in 
mammals. As we shall see in Chapter 17, the catabolism 
of most fatty acids yields only acetyl-CoA. Mammals 
cannot use acetyl-CoA as a precursor of glucose, be- 
cause the pyruvate dehydrogenase reaction is irre- 
versible and cells have no other pathway to convert 
acetyl-CoA to pyruvate. Plants, yeast, and many bacteria 
do have a pathway (the glyoxylate cycle; see Fig. 16-20) 
for converting acetyl-CoA to oxaloacetate, so these or- 
ganisms can use fatty acids as the starting material for 
gluconeogenesis. This is important during the germina- 
tion of seedlings, for example; before leaves develop and 
photosynthesis can provide energy and carbohydrates, 
the seedling relies on stored seed oils for energy produc- 
tion and cell wall biosynthesis. 

Although mammals cannot convert fatty acids to 
carbohydrate, they can use the small amount of glycerol 
produced from the breakdown of fats (tnacyiglycerols) 
for gluconeogenesis. Phosphorylation of glycerol by 
glycerol kinase, followed by oxidation of the central car- 
bon, yields dihydroxyacetone phosphate, an intermedi- 
ate in gluconeogenesis in liver. 

As we will see in Chapter 21, glycerol phosphate is 
an essential intermediate in triacylglycerol synthesis 
in adipocytes, but these cells lack glycerol kinase 
and so cannot simply phosphorylate glycerol. Instead, 
adipocytes carry out a truncated version of gluconeoge- 
nesis, known as glyceroneogenesis: the conversion of 
pyruvate to dihydroxyacetone phosphate via the early 
reactions of gluconeogenesis, followed by reduction of 
the dihydroxyacetone phosphate to glycerol phosphate 
(see Fig. 21-21). 

Glycolysis and Gluconeogenesis Are Reciprocally Regulated 

If glycolysis (the conversion of glucose to pyruvate) and 
gluconeogenesis (the conversion of pyruvate to glu- 
cose) were allowed to proceed simultaneously at high 
rates, the result would be the consumption of ATP and 
the production of heat. For example, PFK-1 and FBP- 
ase-1 catalyze opposing reactions: 



ATP + fructose 6-phosphate 



PEK-l 

ADP + fructose 1,6-bisphosphate 



Fructose 1,6-bisphosphate + H 2 0 



FBPiuw-l 

fructose 6-phosphate + Pi 

The sum of these two reactions is 

ATP + H 2 0 > ADP + Pi + heat 

These two enzymatic reactions, and several others in 
the two pathways, are regulated allosterically and by 
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covalent modification (phosphorylation). In Chapter 15 
we take up the mechanisms of this regulation in detail. 
For now, suffice it to say that the pathways are regulated 
so that when the flux of glucose through glycolysis goes 
up, the flux of pyruvate toward glucose goes down, and 
vice versa. 

SUMMARY 14.4 Gluconeogenesis 

■ Gluconeogenesis is a ubiquitous multistep process 
in which glucose is produced from lactate, pyruvate, 
or oxaloacetate, or any compound (including citric 
acid cycle intermediates) that can be converted 

to one of these intermediates. Seven of the steps 
in gluconeogenesis are catalyzed by the same 
enzymes used in glycolysis; these are the reversible 
reactions. 

■ Three irreversible steps in glycolysis are bypassed 
by reactions catalyzed by gluconeogenic enzymes: 
(1) conversion of pyruvate to PEP via oxaloacetate, 
catalyzed by pyruvate carboxylase and PEP 
carboxykinase; (2) dephosphorylation of fructose 
1,6-bisphosphate by FBPase-1; and 

(3) dephosphorylation of glucose 6-phosphate by 
glucose 6-phosphatase. 

■ Formation of one molecule of glucose from 
pyruvate requires 4 ATP, 2 GTP, and 2 NADH; it 
is expensive. 

■ In mammals, gluconeogenesis in the liver, kidney, 
and small intestine provides glucose for use by the 
brain, muscles, and erythrocytes. 

■ Pyruvate carboxylase is stimulated by acetyl-CoA, 
increasing the rate of gluconeogenesis when the 
cell has adequate supplies of other substrates 
(fatty acids) for energy production. 

■ Animals cannot convert acetyl-CoA derived from 
fatty acids into glucose; plants and microorganisms 
can. 

■ Glycolysis and gluconeogenesis are reciprocally 
regulated to prevent wasteful operation of both 
pathways at the same time. 



monophosphate pathway; Fig. 14-20). In this oxida- 
tive pathway, NADP + is the electron acceptor, yielding 
NADPH. Rapidly dividing cells, such as those of bone 
marrow, skin, and intestinal mucosa, and those of tu- 
mors, use the pentose ribose 5-phosphate to make RNA, 
DNA, and such coenzymes as ATP, NADH, FADH 2 , and 
coenzyme A. 

In other tissues, the essential product of the pen- 
tose phosphate pathway is not the pentoses but the 
electron donor NADPH, needed for reductive biosynthe- 
sis or to counter the damaging effects of oxygen radicals. 
Tissues that carry out extensive fatty acid synthesis 
(liver, adipose, lactating mammary gland) or very active 
synthesis of cholesterol and steroid hormones (liver, ad- 
renal glands, gonads) require the NADPH provided by 
this pathway. Erythrocytes and the cells of the lens and 
cornea are directly exposed to oxygen and thus to the 
damaging free radicals generated by oxygen. By main- 
taining a reducing atmosphere (a high ratio of NADPH 
to NADP + and a high ratio of reduced to oxidized glu- 
tathione), such cells can prevent or undo oxidative dam- 
age to proteins, lipids, and other sensitive molecules. In 
erythrocytes, the NADPH produced by the pentose 
phosphate pathway is so important in preventing oxida- 
tive damage that a genetic defect in glucose 6-phos- 
phate dehydrogenase, the first enzyme of the pathway, 
can have serious medical consequences (Box 14-4). ■ 
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1 4.5 Pentose Phosphate Pathway of 
Glucose Oxidation 

In most animal tissues, the major catabolic fate of 
glucose 6-phosphate is glycolytic breakdown to 
pyruvate, much of which is then oxidized via the citric 
acid cycle, ultimately leading to the formation of ATP. 
Glucose 6-phosphate does have other catabolic fates, 
however, which lead to specialized products needed 
by the cell. Of particular importance in some tissues 
is the oxidation of glucose 6-phosphate to pentose 
phosphates by the pentose phosphate pathway (also 
called the phosphogluconate pathway or the hexose 
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FIGURE 14-20 General scheme of the pentose phosphate pathway. 

NADPH formed in the oxidative phase is used to reduce glutathione, 
GSSG (see Box 14-4) and to support reductive biosynthesis. The other 
product of the oxidative phase is ribose 5-phosphate, which serves as 
a precursor for nucleotides, coenzymes, and nucleic acids. In cells that 
are not using ribose 5-phosphate for biosynthesis, the nonoxidative 
phase recycles six molecules of the pentose into five molecules of the 
hexose glucose 6-phosphate, allowing continued production of 
NADPH and converting glucose 6-phosphate (in six cycles) to C0 2 . 
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Why Pythagoras Wouldn't Eat Falafel: Glucose 6-Phosphate 
Dehydrogenase Deficiency 



Fava beans, an ingredient of falafel, have been an impor- 
tant food source in the Mediterranean and Middle East 
since antiquity. The Greek philosopher and mathemati- 
cian Pythagoras prohibited his followers from dining on 
fava beans, perhaps because they make many people 
sick with a condition called favism, which can be fatal. In 
favism, erythrocytes begin to lyse 24 to 48 hours after 
ingestion of the beans, releasing free hemoglobin into 
the blood. Jaundice and sometimes kidney failure can 
result. Similar symptoms can occur with ingestion of the 
antimalarial drug primaquine or of sulfa antibiotics, or 
following exposure to certain herbicides. These symp- 
toms have a genetic basis: glucose 6-phosphate dehy- 
drogenase (G6PD) deficiency which affects about 400 
million people worldwide. Most G6PD-deficient individ- 
uals are asymptomatic; only the combination of G6PD 
deficiency and certain environmental factors produces 
the clinical manifestations. 

Glucose 6-phosphate dehydrogenase catalyzes 
the first step in the pentose phosphate pathway (see 
Fig. 14-21), which produces NADPH. This reductant, 
essential in many biosynthetic pathways, also protects 
cells from oxidative damage by hydrogen peroxide 
(H 2 0 2 ) and superoxide free radicals, highly reactive ox- 
idants generated as metabolic byproducts and through 
the actions of drugs such as primaquine and natural 
products such as divicine — the toxic ingredient of fava 
beans. During normal detoxification, H 2 0 2 is converted 
to H 2 0 by reduced glutathione and glutathione peroxi- 
dase, and the oxidized glutathione is converted back to 
the reduced form by glutathione reductase and NADPH 
(Fig. 1). H 2 0 2 is also broken down to H 2 0 and 0 2 by cata- 
lase, which also requires NADPH. In G6PD-deficient indi- 
viduals, the NADPH production is diminished and 
detoxification of H 2 0 2 is inhibited. Cellular damage re- 
sults: lipid peroxidation leading to breakdown of erythro- 
cyte membranes and oxidation of proteins and DNA. 

The geographic distribution of G6PD deficiency is 
instructive. Frequencies as high as 25% occur in tropical 
Africa, parts of the Middle East, and Southeast Asia, ar- 
eas where malaria is most prevalent. In addition to such 
epidemiological observations, in vitro studies show that 
growth of one malaria parasite, Plasmodium falci- 
parum, is inhibited in G6PD-deficient erythrocytes. The 
parasite is very sensitive to oxidative damage and is 



killed by a level of oxidative stress that is tolerable to a 
G6PD-deficient human host. Because the advantage of 
resistance to malaria balances the disadvantage of low- 
ered resistance to oxidative damage, natural selection 
sustains the G6PD-deficient genotype in human popula- 
tions where malaria is prevalent. Only under over- 
whelming oxidative stress, caused by drugs, herbicides, 
or divicine, does G6PD deficiency cause serious medical 
problems. 

An antimalarial drug such as primaquine is believed 
to act by causing oxidative stress to the parasite. It is 
ironic that antimalarial drugs can cause human illness 
through the same biochemical mechanism that provides 
resistance to malaria. Divicine also acts as an antimalar- 
ial drug, and ingestion of fava beans may protect against 
malaria. By refusing to eat falafel, many Pythagoreans 
with normal G6PD activity may have unwittingly in- 
creased their risk of malaria! 
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FIGURE 1 Role of NADPH and glutathione in protecting cells against 
highly reactive oxygen derivatives. Reduced glutathione (GSH) protects 
the cell by destroying hydrogen peroxide and hydroxyl free radicals. 
Regeneration of GSH from its oxidized form (GSSG) requires the 
NADPH produced in the glucose 6-phosphate dehydrogenase reaction. 



The Oxidative Phase Produces Pentose 
Phosphates and NADPH 

The first reaction of the pentose phosphate pathway 
(Fig. 14-21) is the oxidation of glucose 6-phosphate by 
glucose 6-phosphate dehydrogenase (G6PD) to 



form 6-phosphoglucono-S-lactone, an intramolecular es- 
ter. NADP + is the electron acceptor, and the overall 
equilibrium lies far in the direction of NADPH formation. 
The lactone is hydrolyzed to the free acid 6-phosphoglu- 
conate by a specific lactonase, then 6-phosphoglu- 
conate undergoes oxidation and decarboxylation by 
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FIGURE 14-21 Oxidative reactions of the pentose phosphate pathway. 

The end products are ribose 5-phosphate, C0 2/ and NADPH. 



6-phosphogluconate dehydrogenase to form the ke- 
topentose ribulose 5-phosphate; the reaction generates 
a second molecule of NADPH. (This ribulose 5-phos- 
phate is important in the regulation of glycolysis and 
gluconeogenesis, as we shall see in Chapter 15.) Phos- 
phopentose isomerase converts ribulose 5-phosphate 
to its aldose isomer, ribose 5-phosphate. In some tissues, 
the pentose phosphate pathway ends at this point, and 
its overall equation is 

Glucose 6-phosphate + 2NADP + + H 2 0 ► 

ribose 5-phosphate + C0 2 + 2NADPH + 2H + 

The net result is the production of NADPH, a reductant 
for biosynthetic reactions, and ribose 5-phosphate, a 
precursor for nucleotide synthesis. 

The Nonoxidative Phase Recycles Pentose 
Phosphates to Glucose 6-Phosphate 

In tissues that require primarily NADPH, the pentose 
phosphates produced in the oxidative phase of the path- 
way are recycled into glucose 6-phosphate. In this 
nonoxidative phase, ribulose 5-phosphate is first 
epimerized to xylulose 5-phosphate: 
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Then, in a series of rearrangements of the carbon 
skeletons (Fig. 14-22), six five-carbon sugar phos- 
phates are converted to five six-carbon sugar phos- 
phates, completing the cycle and allowing continued 
oxidation of glucose 6-phosphate with production of 
NADPH. Continued recycling leads ultimately to the 
conversion of glucose 6-phosphate to six C0 2 . Two en- 
zymes unique to the pentose phosphate pathway act 
in these interconversions of sugars: transketolase and 
transaldolase. Transketolase catalyzes the transfer 
of a two-carbon fragment from a ketose donor to an al- 
dose acceptor (Fig. 14-23a). In its first appearance 
in the pentose phosphate pathway, transketolase 
transfers C-l and C-2 of xylulose 5-phosphate to ri- 
bose 5-phosphate, forming the seven-carbon product 
sedoheptulose 7-phosphate (Fig. 14-23b). The re- 
maining three-carbon fragment from xylulose is glyc- 
eraldehyde 3-phosphate. 

Next, transaldolase catalyzes a reaction similar to 
the aldolase reaction of glycolysis: a three-carbon frag- 
ment is removed from sedoheptulose 7-phosphate and 
condensed with glyceraldehyde 3-phosphate, forming 
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FIGURE 14-22 Nonoxidative reactions of the pentose phosphate 
pathway, (a) These reactions convert pentose phosphates to hexose 
phosphates, allowing the oxidative reactions (see Fig. 14-21) to con- 
tinue. Transketolase and transaldolase are specific to this pathway; the 
other enzymes also serve in the glycolytic or gluconeogenic pathways, 
(b) A schematic diagram showing the pathway from six pentoses (5C) 



6C 





4C 6C 

X 

5C 3C 
5C 3C 

X 

4C 6C 



6C 



7* 



6C 



(b) 



to five hexoses (6C). Note that this involves two sets of the interconver- 
sions shown in (a). Every reaction shown here is reversible; unidirec- 
tional arrows are used only to make clear the direction of the reactions 
during continuous oxidation of glucose 6-phosphate. In the light- 
independent reactions of photosynthesis, the direction of these reac- 
tions is reversed (see Fig. 20-10). 
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tolase is the transfer of a two-carbon group, carried temporarily on enzyme-bound TPP, from a ketose donor 
to an aldose acceptor, (b) Conversion of two pentose phosphates to a triose phosphate and a seven-carbon 
sugar phosphate, sedoheptulose 7-phosphate. 
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FIGURE H-24 The reaction catalyzed by 
transaldolase. 
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FIGURE 14-25 The second reaction cat- 
alyzed by transketolase. 



HO- 
H 



CH 2 OH 

C—O 

I 

-C — H 
-C — OH 
CH 2 OPO| 



Xylulose 
5-phosphate 



O x H 

V 
I 

H-C— OH 
+ I 
H— C— OH 

I 

CH 2 OPOf" 

Erythrose 
4-phosphate 



TPP 



O v H 

V 



t« aiisketolase 1 1 C — OH 

CH 2 OPOf 

Glyceraldehyde 
3 -phosphate 



CH 2 OH 

c=o 

I 

HO — C— H 

I 

H-C-OH 

I 

H— C-OH 
I 

CH 2 OPOf" 

Fmctose 
6-phosphate 



fructose 6-phosphate and the tetrose erythrose 

4- phosphate (Fig. 14-24). Now transketolase acts 
again, forming fructose 6-phosphate and glyceraldehyde 
3-phosphate from erythrose 4-phosphate and xylulose 

5- phosphate (Fig. 14-25). Two molecules of glycer- 
aldehyde 3-phosphate formed by two iterations of these 
reactions can be converted to a molecule of fructose 1,6- 
bisphosphate as in gluconeogenesis (Fig. 14-16), and fi- 
nally FBPase-1 and phosphohexose isomerase convert 
fructose 1,6-bisphosphate to glucose 6-phosphate. 
Overall, six pentose phosphates have been converted to 
five hexose phosphates (Fig. 14-22b)— the cycle is now 
complete! 

Transketolase requires the cofactor thiamine pyro- 
phosphate (TPP), which stabilizes a two-carbon carbanion 
in this reaction (Fig. 14-26a), just as it does in the 
pyruvate decarboxylase reaction (Fig. 14-14). Transal- 
dolase uses a Lys side chain to form a Schiff base with 
the carbonyl group of its substrate, a ketose, thereby 
stabilizing a carbanion (Fig. 14-26b) that is central to 
the reaction mechanism. 

The process described in Figure 14-21 is known as 
the oxidative pentose phosphate pathway. The first 
and third steps are oxidations with large, negative stan- 
dard free-energy changes and are essentially irreversible 
in the cell. The reactions of the nonoxidative part of the 
pentose phosphate pathway (Fig. 14-22) are readily re- 
versible and thus also provide a means of converting 
hexose phosphates to pentose phosphates. As we shall 
see in Chapter 20, a process that converts hexose phos- 
phates to pentose phosphates is crucial to the photosyn- 
thetic assimilation of C0 2 by plants. That pathway, the 
reductive pentose phosphate pathway, is essentially 
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FIGURE 14-26 Carbanion intermediates stabilized by covalent interac- 
tions with transketolase and transaldolase. (a) The ring of TPP stabilizes 
the carbanion in the dihydroxyethyl group carried by transketolase; see 
Fig 1 4-1 4 for the chemistry of TPP action, (b) In the transaldolase reac- 
tion, the protonated Schiff base formed between the e-amino group of a 
Lys side chain and the substrate stabilizes the C-3 carbanion formed 
after aldol cleavage. 

the reversal of the reactions shown in Figure 14-22 and 
employs many of the same enzymes. 

All the enzymes in the pentose phosphate pathway 
are located in the cytosol, like those of glycolysis and 
most of those of gluconeogenesis. In fact, these three 
pathways are connected through several shared interme- 
diates and enzymes. The glyceraldehyde 3-phosphate 
formed by the action of transketolase is readily converted 
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to dihydroxyacetone phosphate by the glycolytic enzyme 
triose phosphate isomerase, and these two trioses can be 
joined by the aldolase as in gluconeogenesis, forming 
fructose 1,6-bisphosphate. Alternatively, the triose phos- 
phates can be oxidized to pyruvate by the glycolytic reac- 
tions. The fate of the trioses is determined by the cell's 
relative needs for pentose phosphates, NADPH, and ATP. 

Wernicke-Korsakoff Syndrome Is Exacerbated by 
a Defect in Transketolase 

Wernicke-Korsakoff syndrome is a disorder 
caused by a severe deficiency of thiamine, a 
component of TPP. The syndrome is more common 
among people with alcoholism than in the general pop- 
ulation, because chronic, heavy alcohol consumption 
interferes with the intestinal absorption of thiamine. 
The syndrome can be exacerbated by a mutation in the 
gene for transketolase that results in an enzyme with a 
lowered affinity for TPP — an affinity one-tenth that of 
the normal enzyme. This defect makes individuals 
much more sensitive to a thiamine deficiency: even a 
moderate thiamine deficiency (tolerable in individuals 
with an unmutated transketolase) can drop the level of 
TPP below that needed to saturate the enzyme. The re- 
sult is a slowing down of the whole pentose phosphate 
pathway. In people with Wernicke-Korsakoff syndrome 
this results in a worsening of symptoms, which can in- 
clude severe memory loss, mental confusion, and par- 
tial paralysis. ■ 

Glucose 6-Phosphate Is Partitioned between Glycolysis 
and the Pentose Phosphate Pathway 

Whether glucose 6-phosphate enters glycolysis or the 
pentose phosphate pathway depends on the current 
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FIGURE 14-27 Role of NADPH in regulating the partitioning of glu- 
cose 6-phosphate between glycolysis and the pentose phosphate 
pathway. When NADPH is forming faster than it is being used for 
biosynthesis and glutathione reduction (see Fig. 14-20), [NADPH] 
rises and inhibits the first enzyme in the pentose phosphate pathway. 
As a result, more glucose 6-phosphate is available for glycolysis. 



needs of the cell and on the concentration of NADP + in 
the cytosol. Without this electron acceptor, the first re- 
action of the pentose phosphate pathway (catalyzed by 
G6PD) cannot proceed. When a cell is rapidly convert- 
ing NADPH to NADP + in biosynthetic reductions, the 
level of NADP + rises, allosterically stimulating G6PD 
and thereby increasing the flux of glucose 6-phosphate 
through the pentose phosphate pathway (Fig. 14-27). 
When the demand for NADPH slows, the level of NADP + 
drops, the pentose phosphate pathway slows, and glu- 
cose 6-phosphate is instead used to fuel glycolysis. 
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The oxidative pentose phosphate pathway 
(phosphogluconate pathway, or hexose 
monophosphate pathway) brings about oxidation 
and decarboxylation at C-l of glucose 6-phosphate, 
reducing NADP + to NADPH and producing pentose 
phosphates. 

NADPH provides reducing power for biosynthetic 
reactions, and ribose 5-phosphate is a precursor 
for nucleotide and nucleic acid synthesis. Rapidly 
growing tissues and tissues carrying out active 
biosynthesis of fatty acids, cholesterol, or steroid 
hormones send more glucose 6-phosphate through 
the pentose phosphate pathway than do tissues 
with less demand for pentose phosphates and 
reducing power. 

The first phase of the pentose phosphate pathway 
consists of two oxidations that convert glucose 
6-phosphate to ribulose 5-phosphate and reduce 
NADP + to NADPH. The second phase comprises 
nonoxidative steps that convert pentose phosphates 
to glucose 6-phosphate, which begins the cycle 
again. 

In the second phase, transketolase (with TPP as 
cofactor) and transaldolase catalyze the 
interconversion of three-, four-, five-, six-, and 
seven-carbon sugars, with the reversible conversion 
of six pentose phosphates to five hexose 
phosphates. In the carbon-assimilating reactions of 
photosynthesis, the same enzymes catalyze the 
reverse process, the reductive pentose phosphate 
pathway: conversion of five hexose phosphates to 
six pentose phosphates. 

A genetic defect in transketolase that lowers its 
affinity for TPP exacerbates the Wernicke-Korsakoff 
syndrome. 

Entry of glucose 6-phosphate either into glycolysis 
or into the pentose phosphate pathway is largely 
determined by the relative concentrations of 
NADP + and NADPH. 
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Problems — 

1. Equation for the Preparatory Phase of Glycolysis 

Write balanced biochemical equations for all the reactions in 
the catabolism of glucose to two molecules of glyceraldehyde 
3-phosphate (the preparatory phase of glycolysis) , including 
the standard free-energy change for each reaction. Then write 
the overall or net equation for the preparatory phase of glycol- 
ysis, with the net standard free-energy change. 

2. The Payoff Phase of Glycolysis in Skeletal Muscle In 

working skeletal muscle under anaerobic conditions, glycer- 



aldehyde 3-phosphate is converted to pyruvate (the payoff 
phase of glycolysis), and the pyruvate is reduced to lactate. 
Write balanced biochemical equations for all the reactions in 
this process, with the standard free-energy change for each re- 
action. Then write the overall or net equation for the payoff 
phase of glycolysis (with lactate as the end product), including 
the net standard free-energy change. 

3. GLUT Transporters Compare the localization of GLUT4 
with that of GLUT2 and GLUT3, and explain why these local- 
izations are important in the response of muscle, adipose tis- 
sue, brain, and liver to insulin. 

4. Ethanol Production in Yeast When grown anaerobically 
on glucose, yeast (S. cerevisiae) converts pyruvate to ac- 
etaldehyde, then reduces acetaldehyde to ethanol using elec- 
trons from NADH. Write the equation for the second reaction, 
and calculate its equilibrium constant at 25 °C, given the stan- 
dard reduction potentials in Table 13-7. 

5. Energetics of the Aldolase Reaction Aldolase cat- 
alyzes the glycolytic reaction 

Fructose 1,6-bisphosphate > 

glyceraldehyde 3-phosphate + dihydroxyacetone phosphate 

The standard free-energy change for this reaction in the direction 
written is +23.8 kJ/mol. The concentrations of the three interme- 
diates in the hepatocyte of a mammal are: fructose 1,6-bisphos- 
phate, 1.4 X 10" 5 m; glyceraldehyde 3-phosphate, 3 X 10" 6 m; and 
dihydroxyacetone phosphate, 1.6 X 10" 5 M. At body temperature 
(37 °C), what is the actual free-energy change for the reaction? 

6. Pathway of Atoms in Fermentation A "pulse-chase" 
experiment using 14 C-labeled carbon sources is carried out on 
a yeast extract maintained under strictly anaerobic conditions 
to produce ethanol. The experiment consists of incubating a 
small amount of 14 C-labeled substrate (the pulse) with the 
yeast extract just long enough for each intermediate in the fer- 
mentation pathway to become labeled. The label is then 
"chased" through the pathway by the addition of excess unla- 
beled glucose. The chase effectively prevents any further en- 
try of labeled glucose into the pathway. 

(a) If [l- 14 C]glucose (glucose labeled at C-l with 14 C) is 
used as a substrate, what is the location of 14 C in the product 
ethanol? Explain. 

(b) Where would 14 C have to be located in the starting 
glucose to ensure that all the 14 C activity is liberated as 14 C0 2 
during fermentation to ethanol? Explain. 

7. Heat from Fermentations Large-scale industrial fer- 
menters generally require constant, vigorous cooling. Why? 

8. Fermentation to Produce Soy Sauce Soy sauce is pre- 
pared by fermenting a salted mixture of soybeans and wheat 
with several microorganisms, including yeast, over a period of 
8 to 12 months. The resulting sauce (after solids are removed) 
is rich in lactate and ethanol. How are these two compounds 
produced? To prevent the soy sauce from having a strong vine- 
gary taste (vinegar is dilute acetic acid), oxygen must be kept 
out of the fermentation tank. Why? 
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9. Equivalence of Triose Phosphates 14 C-Labeled glycer- 
aldehyde 3-phosphate was added to a yeast extract. After a 
short time, fructose 1,6-bisphosphate labeled with 14 C at C-3 
and C-4 was isolated What was the location of the 14 C label 
in the starting glyceraldehyde 3-phosphate? Where did the 
second 14 C label in fructose 1,6-bisphosphate come from? 
Explain. 

10. Glycolysis Shortcut Suppose you discovered a mutant 
yeast whose glycolytic pathway was shorter because of the 
presence of a new enzyme catalyzing the reaction 

NAD + NADH + H + 



Glyceraldehyde 3-phosphate + H 2 0 




3-phosphoglycerate 

Would shortening the glycolytic pathway in this way ben- 
efit the cell? Explain. 

11. Role of Lactate Dehydrogenase During strenuous 
activity, the demand for ATP in muscle tissue is vastly in- 
creased. In rabbit leg muscle or turkey flight muscle, the ATP 
is produced almost exclusively by lactic acid fermentation. 
ATP is formed in the payoff phase of glycolysis by two reac- 
tions, promoted by phosphoglycerate kinase and pyruvate ki- 
nase. Suppose skeletal muscle were devoid of lactate 
dehydrogenase. Could it carry out strenuous physical activ- 
ity; that is, could it generate ATP at a high rate by glycolysis? 
Explain. 

12. Efficiency of ATP Production in Muscle The trans- 
formation of glucose to lactate in myocytes releases only about 
7% of the free energy released when glucose is completely ox- 
idized to C0 2 and H 2 0. Does this mean that anaerobic glycoly- 
sis in muscle is a wasteful use of glucose? Explain. 

13. Free-Energy Change for Triose Phosphate Oxida- 
tion The oxidation of glyceraldehyde 3-phosphate to 1,3- 
bisphosphoglycerate, catalyzed by glyceraldehyde 3-phosphate 
dehydrogenase, proceeds with an unfavorable equilibrium 
constant (K' eq = 0.08; AG'° = 6.3 kJ/mol), yet the flow through 
this point in the glycolytic pathway proceeds smoothly. How 
does the cell overcome the unfavorable equilibrium? 

14. Arsenate Poisoning Arsenate is structurally and chemi- 
cally similar to inorganic phosphate (P0, and many enzymes 
that require phosphate will also use arsenate. Organic com- 
pounds of arsenate are less stable than analogous phosphate 
compounds, however. For example, acyl arsenates decom- 
pose rapidly by hydrolysis: 

O O 

II II 
R— C— O— As— 0~ + H 2 0 > 

I 

O 

o o 

II II 

R— C— O + HO— As— Cr + H+ 



On the other hand, acyl phosphates, such as 1,3-bisphospho- 
glycerate, are more stable and undergo further enzyme- 
catalyzed transformation in cells. 

(a) Predict the effect on the net reaction catalyzed by 
glyceraldehyde 3-phosphate dehydrogenase if phosphate were 
replaced by arsenate. 

(b) What would be the consequence to an organism if ar- 
senate were substituted for phosphate? Arsenate is very toxic 
to most organisms. Explain why. 

15. Requirement for Phosphate in Ethanol Fermenta- 
tion In 1906 Harden and Young, in a series of classic studies 
on the fermentation of glucose to ethanol and C0 2 by extracts 
of brewer's yeast, made the following observations. (1) Inor- 
ganic phosphate was essential to fermentation; when the sup- 
ply of phosphate was exhausted, fermentation ceased before all 
the glucose was used. (2) During fermentation under these con- 
ditions, ethanol, C0 2 , and a hexose bisphosphate accumulated. 
(3) When arsenate was substituted for phosphate, no hexose 
bisphosphate accumulated, but the fermentation proceeded 
until all the glucose was converted to ethanol and C0 2 . 

(a) Why did fermentation cease when the supply of phos- 
phate was exhausted? 

(b) Why did ethanol and C0 2 accumulate? Was the con- 
version of pyruvate to ethanol and C0 2 essential? Why? Iden- 
tify the hexose bisphosphate that accumulated. Why did it 
accumulate? 

(c) Why did the substitution of arsenate for phosphate 
prevent the accumulation of the hexose bisphosphate yet 
allow fermentation to ethanol and C0 2 to go to completion? 
(See Problem 14.) 

16. Role of the Vitamin Niacin Adults engaged in strenu- 
ous physical activity require an intake of about 160 g of carbo- 
hydrate daily but only about 20 mg of niacin for optimal 
nutrition. Given the role of niacin in glycolysis, how do you 
explain the observation? 

17. Synthesis of Glycerol Phosphate The glycerol 3- 
phosphate required for the synthesis of glycerophospholipids 
can be synthesized from a glycolytic intermediate. Propose a 
reaction sequence for this conversion. 

I 18. Severity of Clinical Symptoms Due to Enzyme 
WLm Deficiency The clinical symptoms of two forms of 
galactosemia— deficiency of galactokinase or of UDP-glucose: 
galactose 1 -phosphate uridylyltransferase— show radically dif- 
ferent severity. Although both types produce gastric discom- 
fort after milk ingestion, deficiency of the transferase also 
leads to liver, kidney, spleen, and brain dysfunction and even- 
tual death. What products accumulate in the blood and tissues 
with each type of enzyme deficiency? Estimate the relative 
toxicities of these products from the above information. 

19. Muscle Wasting in Starvation One consequence of 
starvation is a reduction in muscle mass. What happens to the 
muscle proteins? 

20. Pathway of Atoms in Gluconeogenesis A liver extract 
capable of carrying out all the normal metabolic reactions of 
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the liver is briefly incubated in separate experiments with the 
following 14 C-labeled precursors. 

Or 

(a) [ 14 C] Bicarbonate, HO— 14 C 

x o 

(b) [l- 14 C]Pyruvate, CH 3 — C— 14 COO" 

O 

Trace the pathway of each precursor through gluconeogene- 
sis* Indicate the location of 14 C in all intermediates and in the 
product, glucose. 

21. Energy Cost of a Cycle of Glycolysis and Gluconeo- 
genesis What is the cost (in ATP equivalents) of transforming 
glucose to pyruvate via glycolysis and back again to glucose via 
gluconeogenesis? 

22. Relationship between Gluconeogenesis and Glyco- 
lysis Why is it important that gluconeogenesis is not the exact 
reversal of glycolysis? 

23. Energetics of the Pyruvate Kinase Reaction Explain 
in bioenergetic terms how the conversion of pyruvate to phos- 
phoenolpyruvate in gluconeogenesis overcomes the large, 
negative standard free- energy change of the pyruvate kinase 
reaction in glycolysis. 

24. Glucogenic Substrates A common procedure for de- 
termining the effectiveness of compounds as precursors of 
glucose in mammals is to starve the animal until the liver 
glycogen stores are depleted and then administer the com- 
pound in question. A substrate that leads to a net increase in 
liver glycogen is termed glucogenic, because it must first be 
converted to glucose 6-phosphate. Show by means of known 
enzymatic reactions which of the following substances are 
glucogenic. 

(a) Succinate, ~OOC— CH 2 — CH 2 — COO" 

(b) Glycerol, OH OH OH 

I I I 
CH 2 — C — CH 2 

H 

(c) Acetyl-CoA, 0 

CH 3 — C— S-CoA 

(d) Pyruvate, 0 

II 

CH 3 — C— COO" 

(e) Butyrate, CH 3 — CH 2 — CH 2 — COO" 

25. Ethanol Affects Blood Glucose Levels The 
liifl consumption of alcohol (ethanol), especially after peri- 
ods of strenuous activity or after not eating for several hours, 
results in a deficiency of glucose in the blood, a condition 
known as hypoglycemia. The first step in the metabolism of 
ethanol by the liver is oxidation to acetaldehyde, catalyzed by 
liver alcohol dehydrogenase: 

CH 3 CH 2 OH + NAD + > CH 3 CHO + NADH + H + 



Explain how this reaction inhibits the transformation of lactate 
to pyruvate. Why does this lead to hypoglycemia? 

26. Blood Lactate Levels during Vigorous Exer- 
cise The concentrations of lactate in blood plasma be- 
fore, during, and after a 400 m sprint are shown in the 
graph. 

Run 




q| L L J 1 

0 20 40 60 

Time (min) 

(a) What causes the rapid rise in lactate concentration? 

Cb) What causes the decline in lactate concentration after 
completion of the sprint? Why does the decline occur more 
slowly than the increase? 

(c) Why is the concentration of lactate not zero during the 
resting state? 

r27. Relationship between Fructose 1,6-Bisphos- 
phatase and Blood Lactate Levels A congenital de- 
fect in the liver enzyme fructose 1,6-bisphosphatase results in 
abnormally high levels of lactate in the blood plasma. Explain. 

28. Effect of Phloridzin on Carbohydrate Metabolism 

Phloridzin, a toxic glycoside from the bark of the pear tree, 
blocks the normal reabsorption of glucose from the kidney 
tubule, thus causing blood glucose to be almost completely ex- 
creted in the urine. In an experiment, rats fed phloridzin and 
sodium succinate excreted about 0.5 mol of glucose (made by 
gluconeogenesis) for every 1 mol of sodium succinate in- 
gested. How is the succinate transformed to glucose? Explain 
the stoichiometry. 



OH 




OH 

Phloridzin 



29. Excess O a Uptake during Gluconeogenesis Lactate 
absorbed by the liver is converted to glucose, with the input of 
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6 mol of ATP for every mole of glucose produced. The extent of 
this process in a rat liver preparation can be monitored by admin- 
istering [ 14 C]lactate and measuring the amount of [ 14 C]glucose 
produced. Because the stoichiometry of 0 2 consumption and 
ATP production is known (about 5 ATP per 0 2 ), we can predict 
the extra 0 2 consumption above the normal rate when a given 
amount of lactate is administered. However, when the extra 0 2 
used in the synthesis of glucose from lactate is actually measured, 
it is always higher than predicted by known stoichiometric rela- 
tionships. Suggest a possible explanation for this observation. 

30. Role of the Pentose Phosphate Pathway If the oxida- 
tion of glucose 6-phosphate via the pentose phosphate path- 
way were being used primarily to generate NADPH for 
biosynthesis, the other product, ribose 5-phosphate, would ac- 
cumulate. What problems might this cause? 

Data Analysis Problem 

31. Engineering a Fermentation System Fermentation of 
plant matter to produce ethanol for fuel is one potential 
method for reducing the use of fossil fuels and thus the C0 2 
emissions that lead to global warming. Many microorganisms 
can break down cellulose then ferment the glucose to ethanol. 
However, many potential cellulose sources, including agricul- 
tural residues and switchgrass, also contain substantial 
amounts of arabinose, which is not as easily fermented. 



HO-C-H 

I 

H-C— OH 

I 

H-C-OH 

I 

CH 2 OH 
D-Arabinose 

Escherichia coli is capable of fermenting arabinose to 
ethanol, but it is not naturally tolerant of high ethanol levels, 
thus limiting its utility for commercial ethanol production. An- 
other bacterium, Zymomonas mobilis, is naturally tolerant of 
high levels of ethanol but cannot ferment arabinose. Deanda, 
Zhang, Eddy and Picataggio (1996) described their efforts to 
combine the most useful features of these two organisms by 
introducing the E. coli genes for the arabinose-metabolizing 
enzymes into Z. mobilis. 

(a) Why is this a simpler strategy than the reverse: engi- 
neering E. coli to be more ethanol-tolerant? 

Deanda and colleagues inserted five E. coli genes into the 
Z. mobilis genome: araA, coding for L-arabinose isomerase, 



which interconverts L-arabinose and L-ribulose; araB, L-ribu- 
lokinase, which uses ATP to phosphorylate L-ribulose at C-5; 
araD, L-ribulose 5-phosphate eprmerase, which interconverts 
L-ribulose 5-phosphate and L-xylulose 5-phosphate; taffl, 
transaldolase; and tktA, transketolase. 

(b) For each of the three ara enzymes, briefly describe 
the chemical transformation it catalyzes and, where possible, 
name an enzyme discussed in this chapter that carries out an 
analogous reaction. 

The five E. coli genes inserted in Z. mobilis allowed the 
entry of arabinose into the nonoxidative phase of the pentose 
phosphate pathway (Fig. 14-22), where it was converted to 
glucose 6-phosphate and fermented to ethanol. 

(c) The three ara enzymes eventually converted arabi- 
nose into which sugar? 

(d) The product from part (c) feeds into the pathway 
shown in Figure 14-22. Combining the five E. coli enzymes 
listed above with the enzymes of this pathway, describe the 
overall pathway for the fermentation of 6 molecules of arabi- 
nose to ethanol. 

(e) What is the stoichiometry of the fermentation of 6 mol- 
ecules of arabinose to ethanol and C0 2 ? How many ATP mole- 
cules would you expect this reaction to generate? 

(f) Z. mobilis uses a slightly different pathway for ethanol 
fermentation from the one described in this chapter. As a re- 
sult, the expected ATP yield is only 1 ATP per molecule of ara- 
binose. Although this is less beneficial for the bacterium, it is 
better for ethanol production. Why? 

Another sugar commonly found in plant matter is xylose. 



H-C— OH 
I 

HO— C— H 

I 

H-C— OH 

I 

CH 2 OH 
D-Xylose 

(g) What additional enzymes would you need to introduce 
into the modified Z. mobilis strain described above to enable 
it to use xylose as well as arabinose to produce ethanol? You 
don't need to name the enzymes (they may not even exist in 
the real world!); just give the reactions they would need to 
catalyze. 

Reference 

Deanda, K., Zhang, M., Eddy, C, & Picataggio, S. (1996) Devel- 
opment of an arabinose-fermenting Zymomonas mobilis strain by 
metabolic pathway engineering Appl Environ Microbiol 62, 
4465-4470 



Formation of liver glycogen from lactic acid is thus seen to establish 
an important connection between the metabolism of the muscle and 
that of the liver. Muscle glycogen becomes available as blood sugar 
through the intervention of the liver, and blood sugar in turn is con- 
verted into muscle glycogen. There exists therefore a complete cycle 
of the glucose molecule in the body . . . Epinephrine was found to 




accelerate this cycle in the direction of muscle glycogen to liver 
glycogen . . . Insulin, on the other hand, was found to accelerate the 
cycle in the direction of blood glucose to muscle glycogen. 

— C F. Cori and G. T. Cori, 
article in Journal of Biological Chemistry, 1929 
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Metabolic regulation, a central theme in biochem- 
istry, is one of the most remarkable features of 
.living organisms. Of the thousands of enzyme- 
catalyzed reactions that can take place in a cell, there is 
probably not one that escapes some form of regulation. 
This need to regulate every aspect of cellular metabo- 
lism becomes clear as one examines the complexity of 
metabolic reaction sequences. Although it is convenient 
for the student of biochemistry to divide metabolic 
processes into "pathways" that play discrete roles in the 
cell's economy, no such separation exists in the living 
cell. Rather, every pathway we discuss in this book is 
inextricably intertwined with all the other cellular 
pathways in a multidimensional network of reactions 
(Fig. 15-1). For example, in Chapter 14 we discussed 
four possible fates for glucose 6-phosphate in a hepato- 
cyte: breakdown by glycolysis for the production of ATP, 
breakdown in the pentose phosphate pathway for the 
production of NADPH and pentose phosphates, use in 
the synthesis of complex polysaccharides of the extra- 
cellular matrix, or hydrolysis to glucose and phosphate 
to replenish blood glucose. In fact, glucose 6-phosphate 
has other possible fates in hepatocytes, too; it may, for 
example, be used to synthesize other sugars, such as 
glucosamine, galactose, galactosamine, fucose, and neu- 
raminic acid, for use in protein glycosylation, or it may 



be partially degraded to provide acetyl-CoA for fatty 
acid and sterol synthesis. And the bacterium Es- 
cherichia coli can use glucose to produce the carbon 
skeleton of every one of its several thousand types of 
molecules. When any cell uses glucose 6-phosphate for 
one purpose, that "decision" affects all the other path- 
ways for which glucose 6-phosphate is a precursor or in- 
termediate: any change in the allocation of glucose 
6-phosphate to one pathway affects, directly or indi- 
rectly, the flow of metabolites through all the others. 

Such changes in allocation are common in the life of 
a cell. Louis Pasteur was the first to describe the more 
than 10-fold increase in glucose consumption by a yeast 
culture when it was shifted from aerobic to anaerobic 
conditions. This "Pasteur effect" occurs without a signif- 
icant change in the concentrations of ATP or most of the 
hundreds of metabolic intermediates and products de- 
rived from glucose. A similar effect occurs in the cells of 
skeletal muscle when a sprinter leaves the starting 
blocks. The ability of a cell to carry out all these inter- 
locking metabolic processes simultaneously — obtaining 
every product in the amount needed and at the right 
time, in the face of major perturbations from outside, 
and without generating leftovers — is an astounding 
accomplishment. 

In this chapter we use the metabolism of glucose to 
illustrate some general principles of metabolic regula- 
tion. First we look at the general roles of regulation in 
achieving metabolic homeostasis and introduce meta- 
bolic control analysis, a system for analyzing complex 
metabolic interactions quantitatively. We then describe 
the specific regulatory properties of the individual en- 
zymes of glucose metabolism; for glycolysis and gluco- 
neogenesis, we described the catalytic activities of the 
enzymes in Chapter 14. Here we also discuss both the 
catalytic and regulatory properties of the enzymes of 
glycogen synthesis and breakdown, one of the best- 
studied cases of metabolic regulation. Note that in 
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METABOLIC PATHWAYS 




FIGURE 15-1 Metabolism as a three-dimensional meshwork. A typical 
eukaryotic cell has the capacity to make about 30,000 different pro- 
teins, which catalyze thousands of different reactions involving many 
hundreds of metabolites, most shared by more than one "pathway." 
This overview image of metabolic pathways is from the online KEGG 



(Kyoto Encyclopedia of Genes and Genomes) PATHWAY database 
{www,genome ad jp/kegg/pathway/map/map01 100.html). Each area 
can be further expanded for increasingly detailed information, to the 
level of specific enzymes and intermediates. 



selecting carbohydrate metabolism to illustrate the prin- 
ciples of metabolic regulation, we have artificially sepa- 
rated the metabolism of fats and carbohydrates. In fact, 
these two activities are very tightly integrated, as we 
shall see in Chapter 23. 

15.1 Regulation of Metabolic Pathways 

The pathways of glucose metabolism provide, in the 
catabolic direction, the energy essential to oppose the 
forces of entropy and, in the anabolic direction, biosyn- 
thetic precursors and a storage form of metabolic en- 
ergy. These reactions are so important to survival that 
very complex regulatory mechanisms have evolved to 



ensure that metabolites move through each pathway in 
the correct direction and at the correct rate to match 
exactly the cell's or the organism's changing circum- 
stances. By a variety of mechanisms operating on differ- 
ent time scales, adjustments are made in the rate of 
metabolite flow through an entire pathway when exter- 
nal circumstances change. 

Circumstances do change, sometimes dramatically. 
For example, the demand for ATP in insect flight muscle 
increases 100-fold in a few seconds when the insect 
takes flight. In humans, the availability of oxygen may 
decrease due to hypoxia (diminished delivery of oxygen 
to tissues) or ischemia (diminished flow of blood to tis- 
sues). The relative proportions of carbohydrate, fat, and 
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protein in the diet vary from meal to meal, and the sup- 
ply of fuels obtained in the diet is intermittent, requiring 
metabolic adjustments between meals and during peri- 
ods of starvation. Wound healing requires huge amounts 
of energy and biosynthetic precursors. 

Cells and Organisms Maintain a Dynamic Steady State 

Fuels such as glucose enter a cell, and waste products 
such as C0 2 leave, but the mass and the gross composi- 
tion of a typical cell, organ, or adult animal do not 
change appreciably over time; cells and organisms exist 
in a dynamic steady state. For each metabolic reaction 
in a pathway, the substrate is provided by the preceding 
reaction at the same rate at which it is converted to 
product. Thus, although the rate (v~) of metabolite flow, 
or flux, through this step of the pathway may be high 
and variable, the concentration of substrate, S, remains 
constant. So, for the two-step reaction 

Vl v 2 

A > S >P 

when Vi = v 2 , [S] is constant. For example, changes in 
Vi for the entry of glucose from various sources into the 
blood are balanced by changes in v 2 for the uptake of 
glucose from the blood into various tissues, so the con- 
centration of glucose in the blood ([S]) is held nearly 
constant at 5 itim. This is homeostasis at the molecular 
level. The failure of homeostatic mechanisms is often at 
the root of human disease. In diabetes mellitus, for ex- 
ample, the regulation of blood glucose concentration is 
defective as a result of the lack of or insensitivity to in- 
sulin, with profound medical consequences. 

When the external perturbation is not merely tran- 
sient, or when one kind of cell develops into another, the 
adjustments in cell composition and metabolism can be 
more dramatic and may require significant and lasting 
changes in the allocation of energy and synthetic pre- 
cursors to bring about a new dynamic steady state. Con- 
sider, for example, the differentiation of stem cells in 
the bone marrow into erythrocytes. The precursor cell 
contains a nucleus, mitochondria, and little or no hemo- 
globin, whereas the fully differentiated erythrocyte con- 
tains prodigious amounts of hemoglobin but has neither 
nucleus nor mitochondria; the cell's composition has 
permanently changed in response to external develop- 
mental signals, with accompanying changes in metabo- 
lism. This cellular differentiation requires precise 
regulation of the levels of cellular proteins. 

In the course of evolution, organisms have acquired a 
remarkable collection of regulatory mechanisms for main- 
taining homeostasis at the molecular, cellular, and organis- 
mal levels, as reflected in the proportion of genes that 
encode regulatory machinery. In humans, about 4,000 
genes (—12% of all genes) encode regulatory proteins, in- 
cluding a variety of receptors, regulators of gene expres- 
sion, and more than 500 different protein kinases! In many 
cases, the regulatory mechanisms overlap: one enzyme is 
subject to regulation by several different mechanisms. 



Both the Amount and the Catalytic Activity 
of an Enzyme Can Be Regulated 

The flux through an enzyme-catalyzed reaction can be 
modulated by changes in the number of enzyme mole- 
cules or by changes in the catalytic activity of each 
enzyme molecule already present. Such changes occur 
on time scales from milliseconds to many hours, in re- 
sponse to signals from within or outside the cell. Very 
rapid allosteric changes in enzyme activity are generally 
triggered locally, by changes in the local concentration 
of a small molecule— a substrate of the pathway in 
which that reaction is a step (say, glucose for glycoly- 
sis), a product of the pathway (ATP from glycolysis), or 
a key metabolite or cofactor (such as NADH) that indi- 
cates the cell's metabolic state. Second messengers 
(such as cyclic AMP and Ca 2+ ) generated intracellular^ 
in response to extracellular signals (hormones, cy- 
tokines, and so forth) also mediate allosteric regulation, 
on a slightly slower time scale set by the rate of the 
signal-transduction mechanism (see Chapter 12). 

Extracellular signals (Fig, 15-2,®) maybe hormonal 
(insulin or epinephrine, for example) or neuronal (acetyl- 
choline), or may be growth factors or cytokines. The num- 
ber of molecules of a given enzyme in a cell is a function of 
the relative rates of synthesis and degradation of that en- 
zyme. The rate of synthesis can be adjusted by the activa- 
tion (in response to some outside signal) of a transcription 
factor (Fig. 15-2, (2); described in more detail in Chapter 
28). Transcription factors are nuclear proteins that, 
when activated, bind specific DNA regions (response ele- 
ments) near a gene's promoter (its transcriptional starting 
point) and activate or repress the transcription of that gene, 
leading to increased or decreased synthesis of the encoded 
protein. Activation of a transcription factor is sometimes 
the result of its binding of a specific ligand and sometimes 
the result of its phosphorylation or dephosphorylation. 
Each gene is controlled by one or more response elements 
that are recognized by specific transcription factors. Some 
genes have several response elements and are therefore 
controlled by several different transcription factors, re- 
sponding to several different signals. Groups of genes en- 
coding proteins that act together, such as the enzymes of 
glycolysis or gluconeogenesis, often share common re- 
sponse element sequences, so that a single signal, acting 
through a particular transcription factor, turns all of these 
genes on and off together. The regulation of carbohydrate 
metabolism by specific transcription factors is described in 
Section 15.3. 

The stability of messenger RNAs— their resistance to 
degradation by cellular ribonucleases (Fig. 15-2, (3)) — 
varies, and the amount of a given mRNA in the cell is a 
function of its rates of synthesis and degradation (Chap- 
ter 26). The rate at which an mRNA is translated into a 
protein by ribosomes (Fig. 15-2, ®) is also regulated, 
and depends on several factors described in detail in 
Chapter 27. Note that an n-Md increase in an mRNA does 
not always mean an n-fold increase in its protein product. 
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FIGURE 15-2 Factors affecting the activity of enzymes. The total activity 
of an enzyme can be changed by altering the number of its molecules 
in the cell, or its effective activity in a subcellular compartment (© 



through ©), or by modulating the activity of existing molecules ((7) 
through ©), as detailed in the text. An enzyme may be influenced by 
a combination of such factors. 



Once synthesized, protein molecules have a finite 
lifetime, which may range from minutes to many 
days (Table 15-1). The rate of protein degradation 
(Fig. 15-2, (5)) differs from one enzyme to another and 
depends on the conditions in the cell. Some proteins are 
tagged by the covalent attachment of ubiquitin for 
degradation in proteasomes, as discussed in Chapter 28 
(see, for example, the case of cyclin, in Fig. 12-46). 
Rapid turnover (synthesis followed by degradation) is 
energetically expensive, but proteins with a short half- 
life can reach new steady state levels much faster than 
those with a long half-life, and the benefit of this quick 
responsiveness must balance or outweigh the cost to 
the cell. 



TABLE 15-1 



Tissue 



Average Half-life of Proteins in 
Mammalian Tissues 

Half-life (days) 



Liver 

Kidney 

Heart 

Brain 

Muscle 



0.9 
1.7 
4.1 
4.6 
10.7 



Yet another way to alter the effective activity of an 
enzyme is to sequester the enzyme and its substrate in 
different compartments (Fig. 15-2, ©). In muscle, for 
example, hexokinase cannot act on glucose until the 
sugar enters the myocyte from the blood, and the rate at 
which it enters depends on the activity of glucose trans- 
porters (see Table 11-3) in the plasma membrane. 
Within cells, membrane-bounded compartments segre- 
gate certain enzymes and enzyme systems, and the 
transport of substrate across these intracellular mem- 
branes may be the limiting factor in enzyme action. 

By these several mechanisms for regulating enzyme 
level, cells can dramatically change their complement of 
enzymes in response to changes in metabolic circum- 
stances. In vertebrates, liver is the most adaptable tis- 
sue; a change from a high-carbohydrate to high-lipid 
diet, for example, affects the transcription of hundreds 
of genes and thus the levels of hundreds of proteins. 
These global changes in gene expression can be quanti- 
fied by the use of DNA microarrays (see Fig. 9-22) that 
display the entire complement of mRNAs present in a 
given cell type or organ (the transcriptome) or by two- 
dimensional gel electrophoresis (see Fig. 3-21) that 
displays the protein complement of a cell type or organ 
(its proteome). Both techniques offer great insights 
into metabolic regulation. The effect of changes in the 
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proteome is often a change in the total ensemble of low 
molecular weight metabolites, the metabolome. 

Once the regulatory mechanisms that involve pro- 
tein synthesis and degradation have produced a certain 
number of molecules of each enzyme in a cell, the 
activity of those enzymes can be further regulated in 
several other ways: by the concentration of substrate, 
the presence of allosteric effectors, covalent modifica- 
tions, or binding of regulatory proteins — all of which can 
change the activity of an individual enzyme molecule 
(Fig. 15-2,® to ©). 

All enzymes are sensitive to the concentration of 
their substrate (s) (Fig. 15-2, ©). Recall that in the sim- 
plest case (an enzyme that follows Michaelis-Menten ki- 
netics), the initial rate of the reaction is half -maximal 
when the substrate is present at a concentration equal 
to K m (that is, when the enzyme is half-saturated with 
substrate). Activity drops off at lower [S], and when [S] 
<< K mi the reaction rate is linearly dependent on [S]. 
This is important because intracellular concentrations 
of substrate are often in the same range as, or lower 
than, K m . The activity of hexokinase, for example, 
changes with [glucose], and intracellular [glucose] varies 
with the concentration of glucose in the blood. As we 
will see, the different forms (isozymes) of hexokinase 
have different K m values and are therefore differently 
affected by changes in intracellular [glucose], in ways 
that make sense physiologically. 

mm WORKED EXAMPLE 15-1 Activity of a Glucose 

Transporter 

If K t (the equivalent of K^) for the glucose transporter 
in liver (GLUT2) is 40 mM, calculate the effect on the 
rate of glucose flux into a hepatocyte of increasing the 
blood glucose concentration from 3 mM to 10 mM. 

Solution: We use Equation 11-1 (p. 393) to find the ini- 
tial velocity (flux) of glucose uptake. 



V 0 



^maxt^out 



K t + [S] out 

At 3 mM glucose 

V 0 = V max (3 mM)/(40 mM + 3 mM) 

= V max (3 mM/43 mM) - 0.07 V max 

At 10 mM glucose 

V 0 = V max (10 mM)/(40 mM 4- 10 mM) 

= V max (10 mM/50 mM) = 0.20 V max 

So a rise in blood glucose from 3 mM to 10 mM increases 
the rate of glucose influx into a hepatocyte by a factor of 
0.20/0.07 « 3. 

Enzyme activity can be either increased or decreased 
by an allosteric effector (Fig. 15-2, (g); see Fig. 6-34). 
Allosteric effectors typically convert hyperbolic kinetics 



TABLE 15-2 


Relationship between Hill Coefficient 
and the Effect of Substrate 
Concentration on Reaction Rate for 
Allosteric Enzymes 


Hill coefficient 
(»h) 


Required change in [S] 
to increase F 0 from 
10% to 90% Kn» Y 


0.5 


X6,600 


1.0 


/\ ox 


2.0 


X9 


3.0 


X4.3 


4.0 


X3 



to sigmoid kinetics, or vice versa (see Fig. 15-14b, for 
example). In the steepest part of the sigmoid curve, a 
small change in the concentration of substrate, or of 
allosteric effector, can have a large impact on reaction 
rate. Recall from Chapter 5 (p. 164) that the cooperativ- 
ity of an allosteric enzyme can be expressed as a Hill 
coefficient, with higher coefficients meaning greater 
cooperativity. For an allosteric enzyme with a Hill 
coefficient of 4, activity increases from 10% V max to 90% 
V m&x with only a 3-fold increase in [S], compared with 
the 81 -fold rise in [S] needed by an enzyme with no 
cooperative effects (Hill coefficient of 1; Table 15-2). 

Covalent modifications of enzymes or other proteins 
(Fig. 15-2, (9)) occur within seconds or minutes of a reg- 
ulatory signal, typically an extracellular signal. By far the 
most common modifications are phosphorylation and de- 
phosphorylation (Fig. 15-3); up to half the proteins in a 
eukaryotic cell are phosphorylated under some circum- 
stances. Phosphorylation by a specific protein kinase 
may alter the electrostatic features of an enzyme's active 
site, cause movement of an inhibitory region of the en- 
zyme protein out of the active site, alter the enzyme's in- 
teraction with other proteins, or force conformational 
changes that translate into changes in F max or K m . For 
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FIGURE 15-3 Protein phosphorylation and dephosphorylation. Pro- 
tein kinases transfer a phosphoryl group from ATP to a Ser, Thr, orTyr 
residue in an enzyme or other protein substrate. Protein phosphatases 
remove the phosphoryl group as Pj. 
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covalent modification to be useful in regulation, the cell 
must be able to restore the altered enzyme to its original 
activity state. A family of phosphoprotein phosphatases, 
at least some of which are themselves under regulation, 
catalyzes the dephosphorylation of proteins. 

Finally, many enzymes are regulated by association 
with and dissociation from another, regulatory protein 
(Fig. 15-2, ®). For example, the cyclic AMP-dependent 
protein kinase (PKA; see Fig. 12-6) is inactive until cAMP 
binding separates catalytic from regulatory subunits. 

These several mechanisms for altering the flux 
through a step in a metabolic pathway are not mutually 
exclusive. It is very common for a single enzyme to be 
regulated at the level of transcription and by both al- 
losteric and covalent mechanisms. The combination 
provides fast, smooth, effective regulation in response 
to a very wide array of perturbations and signals. 

In the discussions that follow, it is useful to think of 
changes in enzymatic activity as serving two distinct 
though complementary roles. We use the term metabolic 
regulation to refer to processes that serve to maintain 
homeostasis at the molecular level— to hold some cellular 
parameter (concentration of a metabolite, for example) at 
a steady level over time, even as the flow of metabolites 
through the pathway changes. The term metabolic con- 
trol refers to a process that leads to a change in the out- 
put of a metabolic pathway over time, in response to some 
outside signal or change in circumstances. The distinc- 
tion, although useful, is not always easy to make. 

Reactions Far from Equilibrium in Cells Are 
Common Points of Regulation 

For some steps in a metabolic pathway the reaction is 
close to equilibrium, with the cell in its dynamic steady 
state (Fig. 15-4). The net flow of metabolites through 
these steps is the small difference between the rates of 
the forward and reverse reactions, rates that are very 
similar when a reaction is near equilibrium. Small changes 
in substrate or product concentration can produce large 



(D © ® 

V= 10.01 V = 200 V= 500 
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net rate: 10 10 10 

FIGURE 15-4 Near-equilibrium and nonequilibrium steps in a meta- 
bolic pathway. Steps (2) and © of this pathway are near equilibrium in 
the cell; for each step, the rate (V) of the forward reaction is only 
slightly greater than the reverse rate, so the net forward rate (1 0) is rel- 
atively low and the free-energy change, AC, is close to zero. An in- 
crease in [C] or [D] can reverse the direction of these steps. Step © is 
maintained in the cell far from equilibrium; its forward rate greatly ex- 
ceeds its reverse rate. The net rate of step © (10) is much larger than 
the reverse rate (0.01 ) and is identical to the net rates of steps® and© 
when the pathway is operating in the steady state. Step © has a large, 
negative AC. 



changes in the net rate, and can even change the direc- 
tion of the net flow. We can identify these near-equilib- 
rium reactions in a cell by comparing the mass action 
ratio, Q, with the equilibrium constant for the reaction, 
K' eq . Recall that for the reaction A + B — > C + D, Q = 
[C][D]/[A][B]. When Q and K f eq are within 1 to 2 orders of 
magnitude of each other, the reaction is near equilibrium. 
This is the case for 6 of the 10 steps in the glycolytic path- 
way (Table 15-3). 

Other reactions are far from equilibrium in the 
cell. For example, K' m for the phosphofructokinase-1 
(PFK-1) reaction is about 1,000, but Q ([fructose 1,6- 
bisphosphate][ADP]/[fructose 6-phosphate][ATP]) in a 
hepatocyte in the steady state is about 0.1 (Table 15-3). 
It is because the reaction is so far from equilibrium that 
the process is exergonic under cellular conditions and 
tends to go in the forward direction. The reaction is held 
far from equilibrium because, under prevailing cellular 
conditions of substrate, product, and effector concen- 
trations, the rate of conversion of fructose 6-phosphate 
to fructose 1,6-bisphosphate is limited by the activity of 
PFK-1, which is itself limited by the number of PFK-1 
molecules present and by the actions of allosteric effec- 
tors. Thus the net forward rate of the enzyme-catalyzed 
reaction is equal to the net flow of glycolytic intermedi- 
ates through other steps in the pathway, and the reverse 
flow through PFK-1 remains near zero. 

The cell cannot allow reactions with large equilib- 
rium constants to reach equilibrium. If [fructose 6-phos- 
phate], [ATP], and [ADP] in the cell were held at typical 
levels (lowmillimolar concentrations) and the PFK-1 re- 
action were allowed to reach equilibrium by an increase 
in [fructose 1,6-bisphosphate], the concentration of 
fructose 1,6-bisphosphate would rise into the molar 
range, wreaking osmotic havoc on the cell. Consider an- 
other case: if the reaction ATP — > ADP + P A were al- 
lowed to approach equilibrium in the cell, the actual 
free-energy change (AG') for that reaction (AG P ; see 
Worked Example 13-2, p. 503) would approach zero, 
and ATP would lose the high phosphoryl group transfer 
potential that makes it valuable to the cell. It is therefore 
essential that enzymes catalyzing ATP breakdown and 
other highly exergonic reactions in a cell be sensitive to 
regulation, so that when metabolic changes are forced 
by external circumstances, the flow through these en- 
zymes will be adjusted to ensure that [ATP] remains far 
above its equilibrium level. When such metabolic 
changes occur, the activities of enzymes in all intercon- 
nected pathways adjust to keep these critical steps away 
from equilibrium. Thus, not surprisingly, many enzymes 
(such as PFK-1) that catalyze highly exergonic reac- 
tions are subject to a variety of subtle regulatory mech- 
anisms. The multiplicity of these adjustments is so great 
that we cannot predict by examining the properties of 
any one enzyme in a pathway whether that enzyme has 
a strong influence on net flow through the entire path- 
way. This complex problem can be approached by meta- 
bolic control analysis, as described in Section 15.2. 
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TABLE 15-3 



Carboh; 



ilium Constants, Mass Action Coefficients, and Free-Energy Changes for Enzymes of 








Mass action ratio, Q 




Reaction 
near 
equilibrium 




AG' 
(KJ/moi ) 


Enzyme 


^eq 


liver 


Heart 




in vivo?* 


(kJAnol) 


in heart 


Hexokinase 


1 x 10 3 


2 X 10" 2 


8 X 10" 


-2 


No 


-17 


-27 


PFK-1 


1.0 x 10 3 


9 X 10 -2 


3 x 10" 


-2 


No 


-14 


-23 


Aldolase 


1.0 x 10 -4 


1.2 x 10" 6 


9 x 10" 


-6 


Yes 


+24 


-6.0 


Triose phosphate isomerase 


4 x 10~ 2 




2.4 X 10* 


-1 


Yes 


+7.5 


+3.8 


Glyceraldehyde 3-phosphate 
dehydrogenase + 
phosphoglycerate kinase 


2 X 10 3 


6 X 10 2 


9.0 




Yes 


-13 


+3.5 


Phosphoglycerate mutase 


1 X 10 _1 


1 x 10" 1 


1.2 X 10" 


-1 


Yes 


+4.4 


+0.6 




Q 
O 


2.9 


1.4 




Yes 


-3.2 


-0.5 


Pyruvate kinase 


2 X 10 4 


7 X lO^ 1 


40 




No 


-31 


-17 


Phosphoglucose isomerase 


4 X 10 _1 


3.1 X 10" 1 


2.4 X 10" 


-1 


Yes 


+2.2 


-1.4 


Pyruvate carboxylase 
+ PEP carboxykinase 


7 


1 X 10" 3 






No 


-5.0 


-23 


Glucose 6-phosphatase 


8.5 X 10 2 


1.2 x 10 2 






Yes 


-17 


-5.0 



Source: K' eq and Q from Newsholme, E.A. & Start, C. (1973) Regulation in Metabolism, Wiley Press, New York, pp. 97, 263. AG' and AG'° were calculated from these data. 
*For simplicity, any reaction for which the absolute value of the calculated AG' is less than 6 is considered near equilibrium. 



Adenine Nucleotides Play Special Roles in 
Metabolic Regulation 

After the protection of its DNA from damage, perhaps 
nothing is more important to a cell than maintaining a 
constant supply and concentration of ATP. Many ATP- 
using enzymes have K m values between 0.1 and 1 mM, 
and the ATP concentration in a typical cell is about 5 mM. 
If [ATP] were to drop significantly, these enzymes would 
be less than fully saturated by their substrate (ATP), and 
the rates of hundreds of reactions that involve ATP 
would decrease (Fig. 15-5); the cell would probably not 
survive this kinetic effect on so many reactions. 

There is also an important thermodynamic effect 
of lowered [ATP]. Because ATP is converted to ADP 
or AMP when "spent" to accomplish cellular work, 
the [ATP]/[ADP] ratio profoundly affects all reactions 
that employ these cofactors. (The same is true for 
other important cofactors, such as NADH/NAD + and 
NADPH/NADP + .) For example, consider the reaction 
catalyzed by hexokinase: 

ATP + glucose > ADP + glucose 6-phosphate 

[ADP] ep [glucose 6-phosphate] eq 

jz> — o x 10 3 

eq [ATP] eq [glucose] eq 

Note that this expression holds true only when reac- 
tants and products are at their equilibrium concentra- 
tions, where AG' = 0. At any other set of concentrations, 
AG' is not zero. Recall (from Chapter 13) that the ratio 
of products to substrates (the mass action ratio, Q) 



determines the magnitude and sign of AG' and therefore 
the driving force, AG', of the reaction: 

[ADP] [glucose 6-phosphate] 

AG' = AG' 0 + AT In rArrpir . — 

[ATP] [glucose] 

Because an alteration of this driving force profoundly 
influences every reaction that involves ATP, organ- 
isms have evolved under strong pressure to develop 
regulatory mechanisms responsive to the [ATP]/[ADP] 
ratio. 

AMP concentration is an even more sensitive indi- 
cator of a cell's energetic state than is [ATP]. Normally 



> 

T3 




5 10 15 20 25 30 35 
ATP concentration [mM] 

FIGURE 15-5 Effect of ATP concentration on the initial velocity of a 
typical ATP-dependent enzyme. These experimental data yield a K m for 
ATP of 5 mM. The concentration of ATP in animal tissues is ~5 mM. 
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TABLE 15-4 


Relative Changes in [A 


TP] and [AMP] When ATP Is Cc 


msumed 




Concentration before 


Concentration after 




Adenine 


ATP depletion 


ATP depletion 




nucleotide 


(itlM) 


(itlM) 


Relative change 


ATP 


5.0 


4.5 


10% 


ADP 


1.0 


1.0 


0 


AMP 


0.1 


0.6 


600% 



cells have a far higher concentration of ATP (5 to 
10 itiM) than of AMP (<0.1 mM). When some process 
(say, muscle contraction) consumes ATP, AMP is pro- 
duced in two steps. First, hydrolysis of ATP produces 
ADP, then the reaction catalyzed by adenylate kinase 
produces AMP: 



2ADP 



AMP + ATP 



If ATP is consumed such that its concentration drops 
10%, the relative increase in [AMP] is much greater 
than that of [ADP] (Table 15-4). It is not surprising, 
therefore, that many regulatory processes are keyed to 
changes in [AMP]. Probably the most important media- 
tor of regulation by AMP is AMP-activated protein 
kinase (AMPK), which responds to an increase in 
[AMP] by phosphorylating key proteins and thus regu- 
lating their activities. The rise in [AMP] may be caused 



by a reduced nutrient supply or by increased exercise. 
The action of AMPK (not to be confused with the cyclic 
AMP-dependent protein kinase; see Section 15.5) in- 
creases glucose transport and activates glycolysis and 
fatty acid oxidation, while suppressing energy-requiring 
processes such as the synthesis of fatty acids, choles- 
terol, and protein (Fig. 15-6). We discuss AMPK fur- 
ther, and the detailed mechanisms by which it effects 
these changes, in Chapter 23. 

In addition to ATP, hundreds of metabolic interme- 
diates also must be present at appropriate concentra- 
tions in the cell. To take just one example: the glycolytic 
intermediates dihydroxyacetone phosphate and 3- 
phosphoglycerate are precursors of triacylglycerols and 
serine, respectively. When these products are needed, 
the rate of glycolysis must be adjusted to provide them 
without reducing the glycolytic production of ATP. The 
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FIGURE 15-6 Role of AMP-activated protein kinase (AMPK) in carbo- 
hydrate and fat metabolism. AMPK is activated by elevated [AMP] or 
decreased [ATP], by exercise, by the sympathetic nervous system 
(SNS), or by peptide hormones produced in adipose tissue (leptin and 
adiponectin, described in more detail in Chapter 23). When activated, 
AMPK phosphorylates target proteins and shifts metabolism in a 




Fatty acid synthesis 
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variety of tissues away from energy-consuming processes such as the 
synthesis of glycogen, fatty acids, and cholesterol; shifts metabolism in 
extrahepatic tissues to the use of fatty acids as a fuel; and triggers 
gluconeogenesis in the liver to provide glucose for the brain. In the 
hypothalamus, AMPK stimulates feeding behavior to provide more 
dietary fuel. 
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same is true for maintaining the levels of other impor- 
tant cofactors, such as NADH and NADPH: changes in 
their mass action ratios (that is, in the ratio of reduced 
to oxidized cofactor) have global effects on metabolism. 

Of course, priorities at the organismal level have 
also driven the evolution of regulatory mechanisms. In 
mammals, the brain has virtually no stored source of 
energy, depending instead on a constant supply of glu- 
cose from the blood. If blood glucose drops from its nor- 
mal concentration of 4 to 5 mM to half that level, mental 
confusion results, and a fivefold reduction in blood glu- 
cose can lead to coma and death. To buffer against 
changes in blood glucose concentration, release of the 
hormones insulin and glucagon, elicited by high or low 
blood glucose, respectively, triggers metabolic changes 
that tend to return the blood glucose concentration to 
normal. 

Other selective pressures must also have operated 
throughout evolution, selecting for regulatory mecha- 
nisms that accomplish the following: 

1. Maximize the efficiency of fuel utilization by 
preventing the simultaneous operation of pathways 
in opposite directions (such as glycolysis and 
gluconeogenesis). 

2. Partition metabolites appropriately between 
alternative pathways (such as glycolysis and the 
pentose phosphate pathway). 

3. Draw on the fuel best suited for the immediate 
needs of the organism (glucose, fatty acids, 
glycogen, or amino acids). 

4. Slow down biosynthetic pathways when their 
products accumulate. 

The remaining chapters of this book present many ex- 
amples of each kind of regulatory mechanism. 

SUMMARY 15.1 Regulation of 

Metabolic Pathways 

■ In a metabolically active cell in a steady state, 
intermediates are formed and consumed at equal 
rates. When a transient perturbation alters the rate 
of formation or consumption of a metabolite, 
compensating changes in enzyme activities return 
the system to the steady state. 

■ Cells regulate their metabolism by a variety of 
mechanisms over a time scale ranging from less 
than a millisecond to days, either by changing the 
activity of existing enzyme molecules or by 
changing the number of molecules of a specific 
enzyme. 

■ Various signals activate or inactivate transcription 
factors, which act in the nucleus to regulate gene 
expression. Changes in the transcriptome lead to 
changes in the proteome, and ultimately in the 
metabolome of a cell or tissue. 



In multistep processes such as glycolysis, certain 
reactions are essentially at equilibrium in the 
steady state; the rates of these reactions rise and 
fall with substrate concentration. Other reactions 
are far from equilibrium; these steps are typically 
the points of regulation of the overall pathway. 

m Regulatory mechanisms maintain nearly constant 
levels of key metabolites such as ATP and NADH in 
cells and glucose in the blood, while matching the 
use or production of glucose to the organism's 
changing needs. 

■ The levels of ATP and AMP are a sensitive 

reflection of a cell's energy status, and when the 
[ATP]/[AMP] ratio decreases, the AMP-activated 
protein kinase (AMPK) triggers a variety of cellular 
responses to raise [ATP] and lower [AMP]. 

15.2 Analysis of Metabolic Control 




Detailed studies of metabolic 
regulation were not feasible 
until the basic chemical steps 
in a pathway had been clarified 
and the responsible enzymes 
characterized. Beginning with 
Eduard Buchner's discovery 
(c. 1900) that an extract of 
broken yeast cells could con- 
vert glucose to ethanol and 
C0 2 , a major thrust of bio- 
chemical research was to de- 



Eduard Buchner, ^UCe t ft e steps by which this 

1860-1917 transformation occurred and 

to purify and characterize the enzymes that catalyzed 
each step. By the middle of the twentieth century, all 10 
enzymes of the glycolytic pathway had been purified 
and characterized. In the next 50 years much was 
learned about the regulation of these enzymes by intra- 
cellular and extracellular signals, through the kinds of 
allosteric and covalent mechanisms described in this 
chapter. The conventional wisdom was that in a linear 
pathway such as glycolysis, catalysis by one enzyme 
must be the slowest and must therefore determine the 
rate of metabolite flow, or flux, through the whole path- 
way. For glycolysis, PFK-1 was considered the rate-lim- 
iting enzyme, because it was known to be closely 
regulated by fructose 2,6-bisphosphate and other al- 
losteric effectors. 

With the advent of genetic engineering technology, 
it became possible to test this "single rate-determining 
step" hypothesis by increasing the concentration of the 
enzyme that catalyzes the "rate-limiting step" in a path- 
way and determining whether flux through the pathway 
increases proportionally. Most often it does not; the sim- 
ple solution (a single rate-determining step) is wrong. It 
has now become clear that in most pathways the control 
of flux is distributed among several enzymes, and the 
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extent to which each contributes to the control varies 
with metabolic circumstances — the supply of the start- 
ing material (say, glucose), the supply of oxygen, the 
need for other products derived from intermediates of 
the pathway (say, glucose 6-phosphate for the pentose 
phosphate pathway in cells synthesizing large amounts 
of nucleotides), the effects of metabolites with regula- 
tory roles, and the hormonal status of the organism 
(such as the levels of insulin and glucagon), among 
other factors. 

Why are we interested in what limits the flux 
through a pathway? To understand the action of hor- 
mones or drugs, or the pathology that results from a fail- 
ure of metabolic regulation, we must know where 
control is exercised. If researchers wish to develop a 
drug that stimulates or inhibits a pathway, the logical 
target is the enzyme that has the greatest impact on the 
flux through that pathway And the bioengineering of a 
microorganism to overproduce a product of commercial 
value (p. 312) requires a knowledge of what limits the 
flux of metabolites toward that product. 

The Contribution of Each Enzyme to Flux through a 
Pathway Is Experimentally Measurable 

There are several ways to determine experimentally 
how a change in the activity of one enzyme in a pathway 
affects metabolite flux through that pathway. Consider 
the experimental results shown in Figure 15-7. When a 
sample of rat liver was homogenized to release all solu- 
ble enzymes, the extract carried out the glycolytic con- 
version of glucose to fructose 1,6-bisphosphate at a 
measurable rate. (This experiment, for simplicity, fo- 
cused on just the first part of the glycolytic pathway.) 
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FIGURE 15-7 Dependence of glycolytic flux in a rat liver homogenate 
on added enzymes. Purified enzymes in the amounts shown on the x 
axis were added to an extract of liver carrying out glycolysis in vitro. 
The increase in flux through the pathway is shown on the y axis. 



When increasing amounts of purified hexokinase IV 
(glucokinase) were added to the extract, the rate of gly- 
colysis progressively increased. The addition of purified 
PFK-1 to the extract also increased the rate of glycoly- 
sis, but not as dramatically as did hexokinase. Purified 
phosphohexose isomerase was without effect. These re- 
sults suggest that hexokinase and PFK-1 both con- 
tribute to setting the flux through the pathway 
(hexokinase more than PFK-1), and that phosphohex- 
ose isomerase does not. 

Similar experiments can be done on intact cells or 
organisms, using specific inhibitors or activators to 
change the activity of one enzyme while observing the 
effect on flux through the pathway. The amount of an 
enzyme can also be altered genetically; bioengineering 
can produce a cell that makes extra copies of the enzyme 
under investigation or has a version of the enzyme that is 
less active than the normal enzyme. Increasing the 
concentration of an enzyme genetically sometimes has 
significant effects on flux; sometimes it has no effect. 

Three critical parameters, which together describe 
the responsiveness of a pathway to changes in metabolic 
circumstances, lie at the center of metabolic control 
analysis. We turn now to a qualitative description of 
these parameters and their meaning in the context of a 
living cell. Box 15-1 provides a more rigorous quantita- 
tive discussion. 

The Control Coefficient Quantifies the Effect of a Change in 
Enzyme Activity on Metabolite Flux through a Pathway 

Quantitative data on metabolic flux, obtained as de- 
scribed in Figure 15-7, can be used to calculate a flux 
control coefficient, C, for each enzyme in a pathway. 
This coefficient expresses the relative contribution of 
each enzyme to setting the rate at which metabolites 
flow through the pathway — that is, the flux, J. C can 
have any value from 0.0 (for an enzyme with no impact 
on the flux) to 1.0 (for an enzyme that wholly determines 
the flux). An enzyme can also have a negative flux con- 
trol coefficient. In a branched pathway, an enzyme in one 
branch, by drawing intermediates away from the other 
branch, can have a negative impact on the flux through 
that other branch (Fig. 15-8). C is not a constant, and it 
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FIGURE 15-8 Flux control coefficient, C, in a branched metabolic 
pathway. In this simple pathway, the intermediate B has two alternative 
fates. To the extent that reaction B E draws B away from the pathway 
A — > D, it controls that pathway, which will result in a negative flux 
control coefficient for the enzyme that catalyzes step B -> E. Note that 
the sum of all four coefficients equals 1.0, as it must for any defined 
system of enzymes. 
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BOX 15-1 



METHODS 



Metabolic Control Analysis: Quantitative Aspects 



The factors that influence the flow of intermediates 
(flux) through a pathway may be determined quantita- 
tively by experiment and expressed in terms useful for 
predicting the change in flux when some factor in- 
volved in the pathway changes. Consider the simple re- 
action sequence in Figure 1, in which a substrate X 
(say, glucose) is converted in several steps to a prod- 
uct Z (perhaps pyruvate, formed glycolytically). An en- 
zyme late in the pathway is a dehydrogenase (ydh) 
that acts on substrate Y. Because the action of a dehy- 
drogenase is easily measured (see Fig. 13-24), we can 
use the flux (J) through this step (J ydh ) to measure 
the flux through the whole path. We manipulate exper- 
imentally the level of an early enzyme in the pathway 
(xase, which acts on the substrate X) and measure the 
flux through the path (J ydh ) for several levels of the 
enzyme xase. 

The relationship between the flux through the 
pathway from X to Z in the intact cell and the concen- 
tration of each enzyme in the path should be hyper- 
bolic, with virtually no flux at infinitely low enzyme 
activity and near-maximum flux at very high enzyme 
activity In a plot of </ ydh against the concentration of 
xase, E xase , the change in flux with a small change in 
enzyme level is BJ ydh /BE xasei which is simply the slope 
of the tangent to the curve at any concentration of en- 
zyme, E xase , and which tends toward zero at saturating 
^xase- At low E XSLSei the slope is steep; the flux in- 
creases with each incremental increase in enzyme ac- 
tivity. At very high E xasei the slope is much smaller; 
the system is less responsive to added xase because it 
is already present in excess over the other enzymes in 
the pathway. 

To show quantitatively the dependence of flux 
through the pathway, dJ ydh , on aS xase , we could use 
the ratio dJ ydh /dE xase . However, its usefulness is 
limited because its value depends on the units used to 
express flux and enzyme activity. By expressing the 
fractional changes in flux and enzyme activity, 
dJ ydh /J ydh , and dE xase /E X8iSe , we obtain a unitless ex- 
pression for the flux control coefficient, C, in this 
case C X g|e* 



J, 



y ydh 
ydh 



This can be rearranged to 



^ xase 



ydh 



(1) 



BE* 



J, 



which is mathematically identical to 



& In J ydh 
& In E xa ± 0 



xase ^ multistep v ^ 7 ydh multistep 

A. * Oi * Y * Bg * Z 

FIGURE 1 Flux through a hypothetical multienzyme pathway. 

This equation suggests a simple graphical means for 
determining the flux control coefficient: C xa te is the 
slope of the tangent to the plot of In J ydh versus In 
^xase? which can be obtained by replotting the experi- 
mental data in Figure 2a to obtain Figure 2b. Notice 
that CiiH is not a constant; it depends on the starting 
^xase from which the change in enzyme level takes 
place. For the cases shown in Figure 2, C^fg is about 
1.0 at the lowest E'xase, but only about 0.2 at highi? xase . 
A value near 1.0 for CxIle means that the enzyme's con- 
centration wholly determines the flux through the 
pathway; a value near 0.0 means that the enzyme's con- 
centration does not limit the flux through the path. Un- 
less the flux control coefficient is greater than about 
0.5, changes in the activity of the enzyme will not have 
a strong effect on the flux. 

The elasticity, e, of an enzyme is a measure of how 
that enzyme's catalytic activity changes when the con- 
centration of a metabolite — substrate, product, or effec- 
tor — changes. It is obtained from an experimental plot 
of the rate of the reaction catalyzed by the enzyme ver- 
sus the concentration of the metabolite, at metabolite 
concentrations that prevail in the cell. By arguments 
analogous to those used to derive C, we can show e to be 
the slope of the tangent to a plot of In V versus In [sub- 
strate, or product, or effector]: 



^xase 

_ 3 In |Y xase | 
alnS 

For an enzyme with typical Michaelis-Menten kine- 
tics, the value of s ranges from about 1 at substrate 
concentrations far below K m to near 0 as F max is ap- 
proached. Allosteric enzymes can have elasticities 
greater than 1.0, but not larger than their Hill coeffi- 
cient (p. 164). 

Finally, the effect of controllers outside the pathway 
itself (that is, not metabolites) can be measured and ex- 
pressed as the response coefficient, R. The change in 
flux through the pathway is measured for changes in the 
concentration of the controlling parameter P, and R is 
defined in a form analogous to that of Equation 1, yield- 
ing the expression 



ydh 



dP 



'ydh 



(continued on next page} 
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BOX 15-1 



METHOD 



Metabolic Control Analysis: Quantitative Aspects 
[continued from previous page) 
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In e 



Concentration of enzyme, E x 



In E v 



FIGURE 2 The flux control coefficient, (a) Typical variation of the pathway flux, / ydh , measured at the step catalyzed by the 
enzyme ydh, as a function of the amount of the enzyme xase, £ xase , which catalyzes an earlier step in the pathway. The flux 
control coefficient at (ej) is the slope of the product of the tangent to the curve, d/ ydh /d£ xase , and the ratio (scaling factor) e/j, 
(b) On a double-logarithmic plot of the same curve, the flux control coefficient is the slope of the tangent to the curve. 



Using the same logic and graphical methods as de- 
scribed above for determining C, we can obtain as the 
slope of the tangent to the plot of In J versus In P. 

The three coefficients we have described are re- 
lated in this simple way: 



Thus the responsiveness of each enzyme in a pathway to 
a change in an outside controlling factor is a simple func- 
tion of two things: the control coefficient, a variable that 
expresses the extent to which that enzyme influences 
the flux under a given set of conditions, and the elastic- 
ity, an intrinsic property of the enzyme that reflects its 
sensitivity to substrate and effector concentrations. 



is not intrinsic to a single enzyme; it is a function of the 
whole system of enzymes, and its value depends on the 
concentrations of substrates and effectors. 

When real data from the experiment on glycolysis in 
a rat liver extract (Fig. 15-7) were subjected to this kind 
of analysis, investigators found flux control coefficients 
(for enzymes at the concentrations found in the extract) 
of 0.79 for hexokinase, 0.21 for PFK-1, and 0.0 for phos- 
phohexose isomerase. It is not just fortuitous that these 
values add up to 1 .0; we can show that for any complete 
pathway, the sum of the flux control coefficients must 
equal unity. 

The Elasticity Coefficient Is Related to an Enzyme's 
Responsiveness to Changes in Metabolite or 
Regulator Concentrations 

A second parameter, the elasticity coefficient, e, ex- 
presses quantitatively the responsiveness of a single 
enzyme to changes in the concentration of a metabolite 
or regulator; it is a function of the enzyme's intrinsic ki- 
netic properties. For example, an enzyme with typical 



Michaelis-Menten kinetics shows a hyperbolic response 
to increasing substrate concentration (Fig, 15-9). At 




K m [S] 



FIGURE 15-9 Elasticity coefficient, e, of an enzyme with typical 
Michaelis-Menten kinetics. At substrate concentrations far below the 
K mt each increase in [S] produces a correspondingly large increase in 
the reaction velocity, v, For this region of the curve, the enzyme has an 
s of about 1 .0. At [S] ^> K m increasing [S] has little effect on v; e here 
is close to 0,0. 
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low concentrations of substrate (say, 0.1 if m ) each in- 
crement in substrate concentration results in a compa- 
rable increase in enzymatic activity, yielding an e near 
1.0. At relatively high substrate concentrations (say, 10 
K m X increasing the substrate concentration has little 
effect on the reaction rate, because the enzyme is al- 
ready saturated with substrate. The elasticity in this 
case approaches zero. For allosteric enzymes that show 
positive cooperativity, s may exceed 1.0, but it cannot 
exceed the Hill coefficient, which is typically between 
1.0 and 4.0. 

The Response Coefficient Expresses the Effect of an 
Outside Controller on Flux through a Pathway 

We can also derive a quantitative expression for the rel- 
ative impact of an outside factor (such as a hormone or 
growth factor), which is neither a metabolite nor an en- 
zyme in the pathway, on the flux through the pathway. 
The experiment would measure the flux through the 
pathway (glycolysis, in this case) at various levels of the 
parameter P (the insulin concentration, for example) to 
obtain the response coefficient, R, which expresses 
the change in pathway flux when P ([insulin]) changes. 

The three coefficients C, £, and R are related in a 
simple way: the responsiveness (ft) of a pathway to an 
outside factor that affects a certain enzyme is a function 
of (1) how sensitive the pathway is to changes in the ac- 
tivity of that enzyme (the control coefficient, C) and (2) 
how sensitive that specific enzyme is to changes in the 
outside controlling factor (the elasticity, 

R = C-s 

Each enzyme in the pathway can be examined in this 
way, and the effects of any of several outside factors on 
flux through the pathway can be separately determined. 
Thus, in principle, we can predict how the flux of sub- 
strate through a series of enzymatic steps will change 
when there is a change in one or more controlling fac- 
tors external to the pathway. Box 15-1 shows how these 
qualitative concepts are treated quantitatively 

Metabolic Control Analysis Has Been Applied to 
Carbohydrate Metabolism, with Surprising Results 

Metabolic control analysis provides a framework within 
which we can think quantitatively about regulation, in- 
terpret the significance of the regulatory properties of 
each enzyme in a pathway, identify the steps that most 
affect the flux through the pathway, and distinguish be- 
tween regulatory mechanisms that act to maintain 
metabolite concentrations and control mechanisms that 
actually alter the flux through the pathway. Analysis of 
the glycolytic pathway in yeast, for example, has re- 
vealed an unexpectedly low flux control coefficient for 
PFK-1, which, as we have noted, has been viewed as the 
main point of flux control — the "rate-determining 
step" — in glycolysis. Experimentally raising the level of 



PFK-1 fivefold led to a change in flux through glycolysis 
of less than 10%, suggesting that the real role of PFK-1 
regulation is not to control flux through glycolysis but to 
mediate metabolite homeostasis — to prevent large 
changes in metabolite concentrations when the flux 
through glycolysis increases in response to elevated 
blood glucose or insulin. Recall that the study of glyco- 
lysis in a liver extract (Fig. 15-7) also yielded a flux con- 
trol coefficient that contradicted the conventional 
wisdom; it showed that hexokinase, not PFK-1, is most 
influential in setting the flux through glycolysis. We 
must note here that a liver extract is far from equivalent 
to a hepatocyte; the ideal way to study flux control is by 
manipulating one enzyme at a time in the living cell. This 
is already feasible in many cases. 

Investigators have used nuclear magnetic resonance 
(NMR) as a noninvasive means to determine the concen- 
tration of glycogen and metabolites in the five-step path- 
way from glucose in the blood to glycogen in myocytes 
(Fig. 15-10) in rat and human muscle. They found that 
the flux control coefficient for glycogen synthase was 
smaller than that for the steps catalyzed by the glucose 
transporter and hexokinase. (We discuss glycogen syn- 
thase and other enzymes of glycogen metabolism in 
Sections 15.4 and 15.5.) This finding contradicts the con- 
ventional wisdom that glycogen synthase is the locus of 
flux control and suggests that the importance of the phos- 
phorylation/dephosphorylation of glycogen synthase is 
related instead to the maintenance of metabolite home- 
ostasis — that is, regulation, not control Two metabolites 
in this pathway, glucose and glucose 6-phosphate, are key 
intermediates in other pathways, including glycolysis, the 
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FIGURE 15-10 Control of glycogen synthesis from blood glucose in 
muscle. Insulin affects three of the five steps in this pathway but it is 
the effects on transport and hexokinase activity, not the change in 
glycogen synthase activity that increase the flux toward glycogen. 
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pentose phosphate pathway, and the synthesis of glu- 
cosamine. Metabolic control analysis suggests that when 
the blood glucose level rises, insulin acts in muscle to (1) 
increase glucose transport into cells by conveying GLUT4 
to the plasma membrane, (2) induce the synthesis of hex- 
okinase, and (3) activate glycogen synthase by covalent 
alteration (see Fig. 15-39). The first two effects of insulin 
increase glucose flux through the pathway (control) , and 
the third serves to adapt the activity of glycogen synthase 
so that metabolite levels (glucose 6-phosphate, for exam- 
ple) will not change dramatically with the increased flux 
(regulation). 

Metabolic Control Analysis Suggests a General Method for 
Increasing Flux through a Pathway 

How could an investigator engineer a cell to increase the 
flux through one pathway without altering the concentra- 
tions of other metabolites or the fluxes through other 
pathways? More than three decades ago Henrik Kacser 
predicted, on the basis of metabolic control analysis, that 
this could be accomplished by increasing the concentra- 
tions of every enzyme in a pathway. The prediction has 
been confirmed in several experimental tests, and it also 
fits with the way cells normally control fluxes through a 
pathway. For example, rats fed a high-protein diet dispose 
of excess amino groups by converting them to urea in the 
urea cycle (Chapter 18). After such a dietary shift, the 
urea output increases fourfold, and the amount of all 
eight enzymes in the urea cycle increases two- to three- 
fold. Similarly, when increased fatty acid oxidation is 
triggered by activation of peroxisome proliferator- 
activated receptor y (PPARy, a ligand-activated tran- 
scription factor; see Fig. 21-22), synthesis of the whole 
set of fatty acid oxidative enzymes is increased. With the 
growing use of DNA microarrays to study the expression 
of whole sets of genes in response to various perturba- 
tions, we should soon learn whether this is the general 
mechanism by which cells make long-term adjustments in 
flux through specific pathways. 

SUMMARY 15.2 Analysis of 

Metabolic Control 

■ Metabolic control analysis shows that control of 
the rate of metabolite flux through a pathway is 
distributed among several of the enzymes in that 
path. 

■ The flux control coefficient, C, is an experimentally 
determined measure of the effect of an enzyme's 
concentration on flux through a multienzyme 
pathway It is characteristic of the whole system, 
not intrinsic to the enzyme. 

■ The elasticity coefficient, s, of an enzyme is an 
experimentally determined measure of its 
responsiveness to changes in the concentration 
of a metabolite or regulator molecule. 



■ The response coefficient, R, is a measure of the 
experimentally determined change in flux through 
a pathway in response to a regulatory hormone 

or second messenger. It is a function of C and s\ 
R = C*s. 

m Some regulated enzymes control the flux through 
a pathway, while others rebalance the level of 
metabolites in response to the change in flux. The 
first activity is control; the second, rebalancing 
activity is regulation. 

■ Metabolic control analysis predicts, and experiments 
have confirmed, that flux toward a specific product 
is most effectively increased by raising the 
concentration of all enzymes in the pathway. 

15.3 Coordinated Regulation of Glycolysis 
and Gluconeogenesis 

In mammals, gluconeogenesis occurs primarily in the 
liver, where its role is to provide glucose for export to 
other tissues when glycogen stores are exhausted and 
when no dietary glucose is available. As we discussed in 
Chapter 14, gluconeogenesis employs several of the en- 
zymes that act in glycolysis, but it is not simply the re- 
versal of glycolysis. Seven of the glycolytic reactions are 
freely reversible, and the enzymes that catalyze these re- 
actions also function in gluconeogenesis (Fig. 15-11). 
Three reactions of glycolysis are so exergonic as to be 
essentially irreversible: those catalyzed by hexokinase, 
PFK-1, and pyruvate kinase. All three reactions have a 
large, negative AG' (Table 15-3 shows the values in 
heart muscle). Gluconeogenesis uses detours around 
each of these irreversible steps; for example, the con- 
version of fructose 1,6-bisphosphate to fructose 6-phos- 
phate is catalyzed by fructose 1,6-bisphosphatase 
(FBPase-1). Each of these bypass reactions also has a 
large, negative AG'. 

At each of the three points where glycolytic reactions 
are bypassed by alternative, gluconeogenic reactions, 
simultaneous operation of both pathways would consume 
ATP without accomplishing any chemical or biological 
work. For example, PFK-1 and FBPase-1 catalyze oppos- 
ing reactions: 

ATP + fructose 6-phosphate > 

ADP + fructose 1,6-bisphosphate 

_ _ _ FRPase-i 

Fructose 1,6-bisphosphate + H 2 0 > 

fructose 6-phosphate + Pi 

The sum of these two reactions is 

ATP + H 2 0 ► ADP + Pi + heat 

that is, hydrolysis of ATP without any useful metabolic 
work being done. Clearly, if these two reactions were al- 
lowed to proceed simultaneously at a high rate in the 
same cell, a large amount of chemical energy would be 
dissipated as heat. This uneconomical process has been 
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FIGURE 15-11 Glycolysis and gluconeogenesis. Opposing path- 
ways of glycolysis (pink) and gluconeogenesis (blue) in rat liver. 
Three steps are catalyzed by different enzymes in gluconeogenesis 



(the "bypass reactions") and glycolysis; seven steps are catalyzed by 
the same enzymes in the two pathways. Cofactors have been omit- 
ted for simplicity. 



called a futile cycle. However, as we shall see later, 
such cycles may provide advantages for controlling 
pathways, and the term substrate cycle is a better de- 
scription. Similar substrate cycles also occur with the 
other two sets of bypass reactions of gluconeogenesis 
(Fig. 15-11). 

We look now in some detail at the mechanisms that 
regulate glycolysis and gluconeogenesis at the three 
points where these pathways diverge. 

Hexokinase Isozymes of Muscle and Liver Are Affected 
Differently by Their Product, Glucose 6-Phosphate 

Hexokinase, which catalyzes the entry of glucose into 
the glycolytic pathway, is a regulatory enzyme. Humans 



have four isozymes (designated I to IV) , encoded by four 
different genes. Isozymes are different proteins that cat- 
alyze the same reaction (Box 15-2). The predominant 
hexokinase isozyme of myocytes (hexokinase II) has a 
high affinity for glucose — it is half-saturated at about 
0.1 mM. Because glucose entering myocytes from the 
blood (where the glucose concentration is 4 to 5 mM) 
produces an intracellular glucose concentration high 
enough to saturate hexokinase II, the enzyme normally 
acts at or near its maximal rate. Muscle hexokinase I 
and hexokinase II are allosterically inhibited by their 
product, glucose 6-phosphate, so whenever the cellular 
concentration of glucose 6-phosphate rises above its nor- 
mal level, these isozymes are temporarily and reversibly 
inhibited, bringing the rate of glucose 6-phosphate 



584 | Principles of Metabolic Regulation 



Isozymes: Different Proteins That Catalyze the Same Reaction 



BOX 15-2 

The four forms of hexokinase found in mammalian tissues 
are but one example of a common biological situation: the 
same reaction catalyzed by two or more different molec- 
ular forms of an enzyme. These multiple forms, called 
isozymes or isoenzymes, may occur in the same species, 
in the same tissue, or even in the same cell. The differ- 
ent forms (isoforms) of the enzyme generally differ in 
kinetic or regulatory properties, in the cofactor they use 
(NADH or NADPH for dehydrogenase isozymes, for ex- 
ample), or in their subcellular distribution (soluble or 
membrane-bound). Isozymes may have similar, but not 
identical, amino acid sequences, and in many cases they 
clearly share a common evolutionary origin. 

One of the first enzymes found to have isozymes 
was lactate dehydrogenase (LDH; p. 547), which in ver- 
tebrate tissues exists as at least five different isozymes 
separable by electrophoresis. All LDH isozymes contain 
four polypeptide chains (each of M r 33,500), each type 
containing a different ratio of two kinds of polypeptides. 
The M (for muscle) chain and the H (for heart) chain 
are encoded by two different genes. 

In skeletal muscle the predominant isozyme con- 
tains four M chains, and in heart the predominant 
isozyme contains four H chains. Other tissues have some 
combination of the five possible types of LDH isozymes: 



TVpe 


Composition 
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LDHi 


HHHH 


Heart and erythrocyte 


LDH 2 


HHHM 


Heart and erythrocyte 


LDH 3 


HHMM 


Brain and kidney 


LDH 4 


HMMM 


Skeletal muscle and liver 


LDH 5 


MMMM 


Skeletal muscle and liver 



B9 The differences in the isozyme content of tissues 
■ufl can be used to assess the timing and extent of 
heart damage due to myocardial infarction (heart at- 
tack). Damage to heart tissue results in the release of 
heart LDH into the blood. Shortly after a heart attack, 



the blood level of total LDH increases, and there is more 
LDH 2 than LDHi. After 12 hours the amounts of LDH X 
and LDH 2 are very similar, and after 24 hours there is 
more LDH : than LDH 2 . This switch in the [LDHi]/[LDH 2 ] 
ratio, combined with increased concentrations in the 
blood of another heart enzyme, creatine kinase, is very 
strong evidence of a recent myocardial infarction. ■ 

The different LDH isozymes have significantly dif- 
ferent values of V mSiX and K m , particularly for pyruvate. 
The properties of LDH 4 favor rapid reduction of very low 
concentrations of pyruvate to lactate in skeletal muscle, 
whereas those of isozyme LDHi favor rapid oxidation of 
lactate to pyruvate in the heart. 

In general, the distribution of different isozymes of a 
given enzyme reflects at least four factors: 

1. Different metabolic patterns in different organs. 
For glycogen phosphorylase, the isozymes in skele- 
tal muscle and liver have different regulatory prop- 
erties, reflecting the different roles of glycogen 
breakdown in these two tissues. 

2. Different locations and metabolic roles for 
isozymes in the same cell The isocitrate dehydro- 
genase isozymes of the cytosol and the mitochon- 
drion are an example (Chapter 16). 

3. Different stages of development in embryonic or 
fetal tissues and in adult tissues. For example, 
the fetal liver has a characteristic isozyme distribu- 
tion of LDH, which changes as the organ develops 
into its adult form. Some enzymes of glucose catab- 
olism in malignant (cancer) cells occur as their 
fetal, not adult, isozymes. 

4. Different responses of isozymes to allosteric mod- 
ulators. This difference is useful in fine-tuning meta- 
bolic rates. Hexokinase IV (glucokinase) of liver and 
the hexokinase isozymes of other tissues differ in 
their sensitivity to inhibition by glucose 6-phosphate. 



formation into balance with the rate of its utilization and 
reestablishing the steady state. 

The different hexokinase isozymes of liver and 
muscle reflect the different roles of these organs in car- 
bohydrate metabolism: muscle consumes glucose, us- 
ing it for energy production, whereas liver maintains 
blood glucose homeostasis by consuming or producing 
glucose, depending on the prevailing blood glucose con- 
centration. The predominant hexokinase isozyme of 
liver is hexokinase IV (glucokinase), which differs in 
three important respects from hexokinases I— III of mus- 
cle. First, the glucose concentration at which hexoki- 
nase IV is half-saturated (about 10 mM) is higher than 



the usual concentration of glucose in the blood. Be- 
cause an efficient glucose transporter in hepatocytes 
(GLUT2) rapidly equilibrates the glucose concentra- 
tions in cytosol and blood (see Fig. 11-30 for the kinet- 
ics of the same transporter, GLUT2, in erythrocytes), 
the high K m of hexokinase IV allows its direct regulation 
by the level of blood glucose (Fig. 15-12). When blood 
glucose is high, as it is after a meal rich in carbohy- 
drates, excess glucose is transported into hepatocytes, 
where hexokinase IV converts it to glucose 6-phos- 
phate. Because hexokinase IV is not saturated at 10 mM 
glucose, its activity continues to increase as the glucose 
concentration rises to 10 mM or more. Under conditions 
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FIGURE 15-12 Comparison of the kinetic properties of hexokinase IV 
(glucokinase) and hexokinase I. Note the sigmoidicity for hexokinase 
IV and the much lower K m for hexokinase I. When blood glucose rises 
above 5 itim, hexokinase IV activity increases, but hexokinase I is al- 
ready operating near V max and cannot respond to an increase in glu- 
cose concentration, Hexokinases I, II, and Ml have similar kinetic 
properties. 

of low blood glucose, the glucose concentration in a he- 
patocyte is low relative to the K m of hexokinase IV, and 
the glucose generated by gluconeogenesis leaves the 
cell before being trapped by phosphorylation. 

Second, hexokinase IV is not inhibited by glucose 
6-phosphate, and it can therefore continue to operate 
when the accumulation of glucose 6-phosphate completely 
inhibits hexokinases I— III. Finally hexokinase IV is subject 
to inhibition by the reversible binding of a regulatory pro- 
tein specific to liver (Fig. 15-13). The binding is much 
tighter in the presence of the allosteric effector fructose 6- 
phosphate. Glucose competes with fructose 6-phosphate 
for binding and causes dissociation of the regulatory protein 
from the hexokinase, relieving the inhibition. Immediately 



after a carbohydrate-rich meal, when blood glucose is high, 
glucose enters the hepatocyte via GLUT2 and activates 
hexokinase IV by this mechanism. During a fast, when 
blood glucose drops below 5 mM, fructose 6-phosphate trig- 
gers the inhibition of hexokinase IV by the regulatory pro- 
tein, so the liver does not compete with other organs for the 
scarce glucose. The mechanism of inhibition by the regula- 
tory protein is interesting: the protein anchors hexokinase 
IV inside the nucleus, where it is segregated from the other 
enzymes of glycolysis in the cytosol (Fig. 15-13). When the 
glucose concentration in the cytosol rises, it equilibrates 
with glucose in the nucleus by transport through the nu- 
clear pores. Glucose causes dissociation of the regulatory 
protein, and hexokinase IV enters the cytosol and begins to 
phosphorylate glucose. 

Hexokinase IV (Glucokinase) and Glucose 6-Phosphatase 
Are Transcriptionally Regulated 

Hexokinase IV is also regulated at the level of protein 
synthesis. Circumstances that call for greater energy 
production (low [ATP], high [AMP], vigorous muscle con- 
traction) or for greater glucose consumption (high blood 
glucose, for example) cause increased transcription of 
the hexokinase IV gene. Glucose 6-phosphatase, the glu- 
coneogenic enzyme that bypasses the hexokinase step of 
glycolysis, is transcriptionally regulated by factors that 
call for increased production of glucose (low blood glu- 
cose, glucagon signaling). The transcriptional regulation 
of these two enzymes (along with other enzymes of gly- 
colysis and gluconeogenesis) is described below. 

Phosphofructokinase-1 and Fructose 1,6-bisphosphatase 
Are Reciprocally Regulated 

As we have noted, glucose 6-phosphate can flow either 
into glycolysis or through any of several other pathways, 
including glycogen synthesis and the pentose phosphate 
pathway The metabolically irreversible reaction cat- 
alyzed by PFK-1 is the step that commits glucose to gly- 
colysis. In addition to its substrate-binding sites, this 
complex enzyme has several regulatory sites at which 
allosteric activators or inhibitors bind. 
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FIGURE 15-13 Regulation of hexokinase 
IV (glucokinase) by sequestration in the 
nucleus. The protein inhibitor of hexoki- 
nase IV is a nuclear binding protein that 
draws hexokinase IV into the nucleus when 
the fructose 6-phosphate concentration in 
liver is high and releases it to the cytosol 
when the glucose concentration is high. 
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ATP is not only a substrate for PFK-1 but also an 
end product of the glycolytic pathway. When high 
cellular [ATP] signals that ATP is being produced 
faster than it is being consumed, ATP inhibits PFK-1 
by binding to an allosteric site and lowering the affin- 
ity of the enzyme for its substrate fructose 6-phos- 
phate (Fig. 15-14). ADP and AMP, which increase 
in concentration as consumption of ATP outpaces 



FIGURE 15-14 Phosphofructokinase-1 (PFK-1) and its regulation. 

(a) Ribbon diagram of E. coil PFK-1, showing two of its four identical 
subunits (PDB ID 1 PFK). Each subunit has its own catalytic site, where 
the products ADP and fructose 1,6-bisphosphate (blue and yellow ball- 
and-stick models, respectively) are almost in contact, and its own bind- 
ing sites for the allosteric regulator ADP (blue), located at the interface 
between subunits. (b) Allosteric regulation of muscle PFK-1 by ATP, 
shown by a substrate-activity curve. At low [ATP], the K 0i5 for fructose 
6-phosphate is relatively low, enabling the enzyme to function at a 
high rate at relatively low [fructose 6-phosphate]. (Recall from Chapter 
6 that K 0 5 is the K m term for regulatory enzymes.) When [ATP] is high, 
K 05 for fructose 6-phosphate is greatly increased, as indicated by the 
sigmoid relationship between substrate concentration and enzyme ac- 
tivity, (c) Summary of the regulators affecting PFK-1 activity. 

production, act allosterically to relieve this inhibition 
by ATP. These effects combine to produce higher en- 
zyme activity when ADP or AMP accumulates and lower 
activity when ATP accumulates. 

Citrate (the ionized form of citric acid), a key in- 
termediate in the aerobic oxidation of pyruvate, fatty 
acids, and amino acids, is also an allosteric regulator of 
PFK-1; high citrate concentration increases the in- 
hibitory effect of ATP, further reducing the flow of glu- 
cose through glycolysis. In this case, as in several 
others encountered later, citrate serves as an intracel- 
lular signal that the cell is meeting its current needs for 
energy-yielding metabolism by the oxidation of fats 
and proteins. 

The corresponding step in gluconeogenesis is the 
conversion of fructose 1,6-bisphosphate to fructose 6- 
phosphate (Fig. 15-15). The enzyme that catalyzes 
this reaction, FBPase-1, is strongly inhibited (allosteri- 
cally) by AMP; when the cell's supply of ATP is low (cor- 
responding to high [AMP]), the ATP-requiring synthesis 
of glucose slows. 

Thus these opposing steps in the glycolytic and gluco- 
neogenic pathways — PFK-1 and FBPase-1 — are regulated 
in a coordinated and reciprocal manner. In general, when 
sufficient concentrations of acetyl-CoA or citrate (the 
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FIGURE 15-15 Regulation of fructose 1,6-bisphosphatase (FBPase-1) 
and phosphofructokinase-1 (PFK-1). The important role of fructose 
2,6-bisphosphate in the regulation of this substrate cycle is detailed in 
subsequent figures. 
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product of acetyl-CoA condensation with oxaloacetate) 
are present, or when a high proportion of the cell's adeny- 
late is in the form of ATP, gluconeogenesis is favored. 
When the level of AMP increases, it promotes glycolysis by 
stimulating PFK-1 (and, as we shall see in Section 15.5, 
promotes glycogen degradation by activating glycogen 
phosphorylase). 

Fructose 2,6-Bisphosphate Is a Potent Allosteric Regulator 
of PFK-1 andFBPase-1 

The special role of liver in maintaining a constant blood 
glucose level requires additional regulatory mechanisms 
to coordinate glucose production and consumption. 
When the blood glucose level decreases, the hormone 
glucagon signals the liver to produce and release more 
glucose and to stop consuming it for its own needs. One 
source of glucose is glycogen stored in the liver; another 
source is gluconeogenesis, using pyruvate, lactate, glyc- 
erol, or certain amino acids as starting material. When 
blood glucose is high, insulin signals the liver to use glu- 
cose as a fuel and as a precursor for the synthesis and 
storage of glycogen and triacylglycerol. 



The rapid hormonal regulation of glycolysis and glu- 
coneogenesis is mediated by fructose 2,6-bisphos- 
phate, an allosteric effector for the enzymes PFK-1 and 
FBPase-1: 
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When fructose 2,6-bisphosphate binds to its allosteric 
site on PFK-1, it increases the enzyme's affinity for 
its substrate fructose 6-phosphate and reduces its 
affinity for the allosteric inhibitors ATP and citrate 
(Fig. 15-16). At the physiological concentrations of 
its substrates, ATP and fructose 6-phosphate, and of its 
other positive and negative effectors (ATP, AMP, 
citrate), PFK-1 is virtually inactive in the absence of 
fructose 2,6-bisphosphate. Fructose 2,6-bisphosphate 
has the opposite effect on FBPase-1: it reduces its 
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FIGURE 15-16 Role of fructose 2,6-bisphosphate in regulation of gly- 
colysis and gluconeogenesis. Fructose 2,6-bisphosphate (F26BP) has 
opposite effects on the enzymatic activities of phosphofructokinase-1 
(PFK-1 , a glycolytic enzyme) and fructose 1 ,6-bisphosphatase (FBPase- 
1, a gluconeogenic enzyme), (a) PFK-1 activity in the absence of 
F26BP (blue curve) is half-maximal when the concentration of fructose 
6-phosphate is 2 mM (that is, K 0 s ~ 2 mM). When 0.13 fxM F26BP is 
present (red curve), the K 0 5 for fructose 6-phosphate is only 0.08 mM. 



Thus F26BP activates PFK-1 by increasing its apparent affinity for fruc- 
tose 6-phosphate (see Fig. 15-1 4b). (b) FBRase-1 activity is inhibited 
by as little as 1 ^m F26BP and is strongly inhibited by 25 /am. In the ab- 
sence of this inhibitor (blue curve) the K 05 for fructose 1,6-bisphos- 
phate is 5 fiM, but in the presence of 25 fiM F26BP (red curve) the K 0 5 
is >70 /im. Fructose 2,6-bisphosphate also makes FBPase-1 more sen- 
sitive to inhibition by another allosteric regulator, AMP. (c) Summary of 
regulation by F26BP. 
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FIGURE 15-17 Regulation of fructose 2,6-bisphosphate level, (a) The 

cellular concentration of the regulator fructose 2,6-bisphosphate (F26BP) 
is determined by the rates of its synthesis by phosphofructokinase-2 



(PFK-2) and its breakdown by fructose 2,6-bisphosphatase (FBPase-2). 
(b) Both enzyme activities are part of the same polypeptide chain, and 
they are reciprocally regulated by insulin and glucagon. 



affinity for its substrate (Fig. 15-16c), thereby slowing 
gluconeogenesis. 

The cellular concentration of the allosteric regu- 
lator fructose 2,6-bisphosphate is set by the relative 
rates of its formation and breakdown (Fig. 
15-17a). It is formed by phosphorylation of fruc- 
tose 6-phosphate, catalyzed by phosphofructoki- 
nase-2 (PFK-2), and is broken down by fructose 
2,6-bisphosphatase (FBPase-2). (Note that these 
enzymes are distinct from PFK-1 and FBPase-1, 
which catalyze the formation and breakdown, re- 
spectively, of fructose 1,6-bisphosphate.) PFK-2 and 
FBPase-2 are two separate enzymatic activities of a 
single, bifunctional protein. The balance of these 
two activities in the liver, which determines the cel- 
lular level of fructose 2,6-bisphosphate, is regulated 
by glucagon and insulin (Fig. 15-1 7b). 

As we saw in Chapter 12 (p. 431), glucagon stim- 
ulates the adenylyl cyclase of liver to synthesize 3', 5'- 
cyclic AMP (cAMP) from ATP. Cyclic AMP then 
activates cAMP-dependent protein kinase, which 
transfers a phosphoryl group from ATP to the bifunc- 
tional protein PFK-2/FBPase-2. Phosphorylation of 
this protein enhances its FBPase-2 activity and in- 
hibits its PFK-2 activity. Glucagon thereby lowers the 
cellular level of fructose 2,6-bisphosphate, inhibiting 
glycolysis and stimulating gluconeogenesis. The re- 
sulting production of more glucose enables the liver 
to replenish blood glucose in response to glucagon. 
Insulin has the opposite effect, stimulating the activ- 
ity of a phosphoprotein phosphatase that catalyzes 
removal of the phosphoryl group from the bifunc- 
tional protein PFK-2/FBPase-2, activating its PFK-2 
activity, increasing the level of fructose 2,6-bisphos- 
phate, stimulating glycolysis, and inhibiting gluconeo- 
genesis. 



Xylulose 5-Phosphate Is a Key Regulator of 
Carbohydrate and Fat Metabolism 

Another regulatory mechanism also acts by controlling 
the level of fructose 2,6-bisphosphate. In the mammalian 
liver, xylulose 5-phosphate (see p. 560), a product of the 
pentose phosphate pathway (hexose monophosphate 
pathway), mediates the increase in glycolysis that fol- 
lows ingestion of a high-carbohydrate meal. The xylu- 
lose 5-phosphate concentration rises as glucose entering 
the liver is converted to glucose 6-phosphate and en- 
ters both the glycolytic and pentose phosphate path- 
ways. Xylulose 5-phosphate activates phosphoprotein 
phosphatase 2A (PP2A; Fig. 15-18), which dephos- 
phorylates the bifunctional PFK-2/FBPase-2 enzyme 
(Fig. 15-17). Dephosphorylation activates PFK-2 and 
inhibits FBPase-2, and the resulting rise in fructose 2,6- 
bisphosphate concentration stimulates glycolysis and 
inhibits gluconeogenesis. The increased glycolysis 
boosts the production of acetyl-CoA, while the in- 
creased flow of hexose through the pentose phosphate 
pathway generates NADPH. Acetyl-CoA and NADPH 
are the starting materials for fatty acid synthesis, which 
has long been known to increase dramatically in re- 
sponse to intake of a high-carbohydrate meal. Xylulose 
5-phosphate also increases the synthesis of all the en- 
zymes required for fatty acid synthesis, meeting the 
prediction from metabolic control analysis. We return to 
this effect in our discussion of the integration of carbo- 
hydrate and lipid metabolism in Chapter 23. 

The Glycolytic Enzyme Pyruvate Kinase Is 
Allosterically Inhibited by ATP 

At least three isozymes of pyruvate kinase are found in 
vertebrates, differing in their tissue distribution and 
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their response to modulators. High concentrations of 
ATP, acetyl-CoA, and long-chain fatty acids (signs 
of abundant energy supply) allosterically inhibit all 
isozymes of pyruvate kinase (Fig. 15-19). The liver 
isozyme (L form), but not the muscle isozyme (M 
form), is subject to further regulation by phosphoryla- 
tion. When low blood glucose causes glucagon release, 
cAMP-dependent protein kinase phosphorylates the L 
isozyme of pyruvate kinase, inactivating it. This slows 
the use of glucose as a fuel in liver, sparing it for export 
to the brain and other organs. In muscle, the effect of 
increased [cAMP] is quite different. In response to epi- 
nephrine, cAMP activates glycogen breakdown and 
glycolysis and provides the fuel needed for the fight-or- 
flight response. 
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FIGURE 15-18 Structure and action of phosphoprotein phosphatase 
2A (PP2A). (a) The catalytic subunit has two Mn 2+ ions in its active 
site, positioned close to the substrate recognition surface formed by the 
interface between the catalytic subunit and the regulatory subunit 
(PDB ID 2NPP). Microcystin-LR, shown here in red, is a specific in- 
hibitor of PP2A. The catalytic and regulatory subunits rest in a scaffold 
(the A subunit) that positions them relative to each other and shapes 
the substrate recognition site, (b) PP2A recognizes several target pro- 
teins, its specificity provided by the regulatory subunit. Each of several 
regulatory subunits fits the scaffold containing the catalytic subunit, 
and each regulatory subunit creates its unique substrate-binding site. 
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FIGURE 15-19 Regulation of pyruvate kinase. The enzyme is allosteri- 
cally inhibited by ATP, acetyl-CoA, and long-chain fatty acids (all signs 
of an abundant energy supply), and the accumulation of fructose 1 ,6- 
bisphosphate triggers its activation. Accumulation of alanine, which 
can be synthesized from pyruvate in one step, allosterically inhibits 
pyruvate kinase, slowing the production of pyruvate by glycolysis. The 
liver isozyme (L form) is also regulated hormonally. Glucagon activates 



cAMP-dependent protein kinase (PKA; see Fig. 15-35), which phos- 
phorylates the pyruvate kinase L isozyme, inactivating it. When the 
glucagon level drops, a protein phosphatase (PP) dephosphorylates 
pyruvate kinase, activating it. This mechanism prevents the liver from 
consuming glucose by glycolysis when blood glucose is low; instead, 
the liver exports glucose. The muscle isozyme (M form) is not affected 
by this phosphorylation mechanism, 
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The Gluconeogenic Conversion of Pyruvate to Phosphoenol 
Pyruvate Is Under Multiple Types of Regulation 

In the pathway leading from pyruvate to glucose, the 
first control point determines the fate of pyruvate in the 
mitochondrion: its conversion either to acetyl-CoA (by 
the pyruvate dehydrogenase complex) to fuel the citric 
acid cycle (Chapter 16) or to oxaloacetate (by pyruvate 
carboxylase) to start the process of gluconeogenesis 
(Fig. 15-20). When fatty acids are readily available as 
fuels, their breakdown in liver mitochondria yields 
acetyl-CoA, a signal that further oxidation of glucose for 
fuel is not necessary. Acetyl-CoA is a positive allosteric 
modulator of pyruvate carboxylase and a negative mod- 
ulator of pyruvate dehydrogenase, through stimulation 
of a protein kinase that inactivates the dehydrogenase. 
When the cell's energy needs are being met, oxidative 
phosphorylation slows, NADH rises relative to NAD 4 " 
and inhibits the citric acid cycle, and acetyl-CoA accu- 
mulates. The increased concentration of acetyl-CoA in- 
hibits the pyruvate dehydrogenase complex, slowing the 
formation of acetyl-CoA from pyruvate, and stimulates 
gluconeogenesis by activating pyruvate carboxylase, 
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FIGURE 15-20 Two alternative fates for pyruvate. Pyruvate can be 
converted to glucose and glycogen via gluconeogenesis or oxidized to 
acetyl-CoA for energy production. The first enzyme in each path is reg- 
ulated allosterically; acetyl-CoA, produced either by fatty acid oxida- 
tion or by the pyruvate dehydrogenase complex, stimulates pyruvate 
carboxylase and inhibits pyruvate dehydrogenase. 



allowing conversion of excess pyruvate to oxaloacetate 
(and, eventually, glucose). 

Oxaloacetate formed in this way is converted to 
phosphoenolpyruvate (PEP) in the reaction catalyzed 
by PEP carboxykinase (Fig. 15-11). In mammals, the 
regulation of this key enzyme occurs primarily at the 
level of its synthesis and breakdown, in response to di- 
etary and hormonal signals. Fasting or high glucagon 
levels act through cAMP to increase the rate of tran- 
scription and to stabilize the mRNA. Insulin, or high 
blood glucose, has the opposite effects. We discuss this 
transcriptional regulation in more detail below. Gener- 
ally triggered by a signal from outside the cell (diet, hor- 
mones), these changes take place on a time scale of 
minutes to hours. 

Transcriptional Regulation of Glycolysis and 
Gluconeogenesis Changes the Number of 
Enzyme Molecules 

Most of the regulatory actions discussed thus far are 
mediated by fast, quickly reversible mechanisms: al- 
losteric effects, covalent alteration (phosphorylation) 
of the enzyme, or binding of a regulatory protein. An- 
other set of regulatory processes involves changes in 
the number of molecules of an enzyme in the cell, 
through changes in the balance of enzyme synthesis 
and breakdown, and our discussion now turns to regu- 
lation of transcription through signal-activated tran- 
scription factors. 

In Chapter 12 we encountered nuclear receptors 
and transcription factors in the context of insulin signal- 
ing. Insulin acts through its receptor in the plasma mem- 
brane to turn on at least two distinct signaling pathways, 
each involving activation of a protein kinase. The MAP 
kinase ERK, for example, phosphorylates the transcrip- 
tion factors SRF and Elkl (see Fig. 12-15), which then 
stimulate the synthesis of enzymes needed for cell 
growth and division. Protein kinase B (PKB; also called 
Akt) phosphorylates another set of transcription factors 
(PDX1, for example), and these stimulate the synthesis 
of enzymes that metabolize carbohydrates and the fats 
formed and stored following excess carbohydrate intake 
in the diet. In pancreatic /3 cells, PDX1 also stimulates 
the synthesis of insulin itself. 

More than 150 genes are transcriptionally regulated 
by insulin; humans have at least seven general types of 
insulin response elements, each recognized by a subset 
of transcription factors activated by insulin under vari- 
ous conditions. Insulin stimulates the transcription of the 
genes that encode hexokinases II and IV, PFK-1, pyru- 
vate kinase, and PFK-2/FBPase-2 (all involved in glycol- 
ysis and its regulation); several enzymes of fatty acid 
synthesis; and glucose 6-phosphate dehydrogenase and 
6-phosphogluconate dehydrogenase, enzymes of the 
pentose phosphate pathway that generate the NADPH 
required for fatty acid synthesis. Insulin also slows the 
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TABLE 15-5 



Some of the Genes Regulated by Insulin 

Change in gene expression Pathway 



Increased expression 

Hexokinase II 
Hexokinase IV 

Phosphofructokinase-1 (PFK-1) 
Pyruvate kinase 
PFK-2/FBPase-2 

Glucose 6-phosphate dehydrogenase 

6-Phosphogluconate dehydrogenase 

Pyruvate dehydrogenase 

Acetyl-CoA carboxylase 

Malic enzyme 

ATP-citrate lyase 

Fatty acid synthase complex 

Stearoyl-CoA dehydrogenase 

Acyl-CoA-glycerol transferases 

Decreased expression 

PEP carboxykinase 

Glucose 6-phosphatase (catalytic subunit) 



Glycolysis 
Glycolysis 
Glycolysis 
Glycolysis 

Regulation of glycolysis/gluconeogenesis 

Pentose phosphate pathway (NADPH) 

Pentose phosphate pathway (NADPH) 

Fatty acid synthesis 

Fatty acid synthesis 

Fatty acid synthesis (NADPH) 

Fatty acid synthesis (provides acetyl-CoA) 

Fatty acid synthesis 

Fatty acid desaturation 

Triacylglycerol synthesis 

Gluconeogenesis 
Glucose release to blood 



expression of the genes for two enzymes of gluconeoge- 
nesis: PEP carboxykinase and glucose 6-phosphatase 
(Table 15-5). 

One transcription factor important to carbohydrate 
metabolism is ChREBP (carbohydrate response ele- 
ment binding protein; Fig. 15-21) , which is expressed 
primarily in liver, adipose tissue, and kidney. It serves to 
coordinate the synthesis of enzymes needed for carbohy- 
drate and fat synthesis. ChREBP in its inactive state is 
phosphorylated, and is located in the cytosoL When the 
phosphoprotein phosphatase PP2A (Fig. 15-18) removes 
a phosphoryl group from ChREBP, the transcription fac- 
tor can enter the nucleus. Here, nuclear PP2A removes 
another phosphoryl group, and ChREBP now joins with a 
partner protein, Mix, and turns on the synthesis of several 
enzymes: pyruvate kinase, fatty acid synthase, and 
acetyl-CoA carboxylase, the first enzyme in the path to 
fatty acid synthesis (Fig. 15-21). 



FIGURE 15-21 Mechanism of gene regulation by the transcription 
factor ChREBP. When ChREBP in the cytosol of a hepatocyte is phos- 
phorylated on a Ser and a Thr residue, it cannot enter the nucleus. De- 
phosphorylation of (P^-Ser by protein phosphatase PP2A allows 
ChREBP to enter the nucleus, where a second dephosphorylation, of 
(P)-Thr, activates ChREBP so that it can associate with its partner 
protein, Mix. ChREBP-Mlx now binds to the carbohydrate response 
element (ChoRE) in the promoter and stimulates transcription. PP2A is 
allosterically activated by xylulose 5-phosphate, an intermediate in the 
pentose phosphate pathway. 
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Controlling the activity of PP2A — and thus, ultimately, 
the synthesis of this group of metabolic enzymes — is xylu- 
lose 5-phosphate, an intermediate not of glycolysis or glu- 
coneogenesis but of the pentose phosphate pathway. 
When blood glucose concentration is high, glucose enters 
the liver and is phosphoiylated by hexokinase IV. The glu- 
cose 6-phosphate thus formed can enter either the gly- 
colytic pathway or the pentose phosphate pathway. If the 
latter, two initial oxidations produce xylulose 5-phosphate, 
which serves as a signal that the glucose-utilizing pathways 
are well-supplied with substrate. It accomplishes this by al- 
losterically activating PP2A, which then dephosphorylates 
ChREBP, allowing the transcription factor to turn on the ex- 
pression of genes for enzymes of glycolysis and fat synthe- 
sis (Fig. 15-21). Glycolysis yields pyruvate, and conversion 
of pyruvate to acetyl-CoA provides the starting material 
for fatty acid synthesis: acetyl-CoA carboxylase converts 
acetyl-CoA to malonyl-CoA, the first committed intermedi- 
ate in the path to fatty acids. The fatty acid synthase com- 
plex produces fatty acids for export to adipose tissue and 
storage as triacylglycerols (Chapter 21). In this way, excess 
dietary carbohydrate is stored as fat. 

Another transcription factor in the liver, SREBP- 
lc, a member of the family of sterol response ele- 
ment binding proteins (see Fig. 21-43), turns on the 
synthesis of pyruvate kinase, hexokinase IV, lipoprotein 
lipase, acetyl-CoA carboxylase, and the fatty acid syn- 
thase complex that will convert acetyl-CoA (produced 
from pyruvate) into fatty acids for storage in adipocytes. 
The synthesis of SREBC-lc is stimulated by insulin and 
depressed by glucagon. SREBP-lc also suppresses the 
expression of several gluconeogenic enzymes: glucose 
6-phosphatase, PEP carboxykinase, and FBPase-1. 

The transcription factor CREB (cyclic AMP re- 
sponse element binding protein) turns on the syn- 
thesis of glucose 6-phosphatase and PEP carboxykinase 
in response to the increase in [cAMP] triggered by 
glucagon. In contrast, insulin-stimulated inactivation 
of other transcription factors turns off several gluco- 
neogenic enzymes in the liver: PEP carboxykinase, fruc- 
tose 1 ,6-bisphosphatase, the glucose 6-phosphate 
transporter of the endoplasmic reticulum, and glucose 
6-phosphatase. For example, FOXOl (forkhead box 
other) stimulates the synthesis of gluconeogenic en- 
zymes and suppresses the synthesis of the enzymes of 
glycolysis, the pentose phosphate pathway, and triacyl- 
glycerol synthesis (Fig. 15-22). In its unphosphory- 
lated form, FOXOl acts as a nuclear transcription factor. 
In response to insulin, FOXOl leaves the nucleus and in 
the cytosol is phosphorylated by PKB, then tagged with 
ubiquitin and degraded by the proteasome. Glucagon 
prevents this phosphorylation by PKB, and FOXOl re- 
mains active in the nucleus. 

Complicated though the processes outlined above 
may seem, regulation of the genes encoding enzymes of 
carbohydrate and fat metabolism is proving far more 
complex and more subtle than we have shown here. 
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FIGURE 15-22 Mechanism of gene regulation by the transcription 
factor FOXOL Insulin activates the signaling cascade shown in Figure 
12-16, leading to activation of protein kinase B (PKB). FOXOl in the 
cytosol is phosphorylated by PKB, and the phosphorylated transcrip- 
tion factor is tagged by the attachment of ubiquitin for degradation by 
proteasomes. FOXOl that remains unphosphorylated or is dephospho- 
rylated can enter the nucleus, bind to a response element, and trigger 
transcription of the associated genes. Insulin therefore has the effect of 
turning off the expression of these genes, which include PEP carboxy- 
kinase and glucose 6-phosphatase 

Multiple transcription factors can act on the same gene 
promoter; multiple protein kinases and phosphatases 
can activate or inactivate these transcription factors; 
and a variety of protein accessory factors modulate the 
action of the transcription factors. This complexity is 
apparent, for example, in the gene encoding PEP car- 
boxykinase, a very well-studied case of transcriptional 
control. Its promoter region (Fig. 15-23) has 15 or 
more response elements that are recognized by at least 
a dozen known transcription factors, with more likely to 
be discovered. The transcription factors act in combina- 
tion on this promoter region, and on hundreds of other 
gene promoters, to fine-tune the levels of hundreds of 
metabolic enzymes, coordinating their activity in the 
metabolism of carbohydrates and fats. The critical im- 
portance of transcription factors in metabolic regulation 
is made clear by observing the effects of mutations in 
their genes. For example, at least five different types of 
maturity-onset diabetes of the young (MODY) are asso- 
ciated with mutations in specific transcription factors 
(Box 15-3). 
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FIGURE 15-23 The PEP carboxykinase promoter region, showing the 
complexity of regulatory input to this gene. This diagram shows the tran- 
scription factors (smaller icons, bound to the DNA) known to regulate the 
transcription of the PEP carboxykinase gene. The extent to which this 
gene is expressed depends on the combined input affecting all of these 



factors, which can reflect the availability of nutrients, blood glucose level, 
and other factors that go into making up the cell's need for this enzyme at 
this particular time. PI, P2, P3I, P31I, and P4 are protein binding sites 
identified by DNase I footprinting (see Box 26-1 ). The TATA box is the as- 
sembly point for the RNA polymerase II (Pol II) transcription complex. 



BOX 15-3 Genetic Mutations That Lead to Rare Forms of Diabetes 



The term "diabetes" describes a variety of medical con- 
ditions that have in common an excessive production of 
urine. In Box 11-2 we described diabetes insipidus, in 
which defective water reabsorption in the kidney results 
from a mutation in the gene for aquaporin. "Diabetes 
mellitus" refers specifically to disease in which the abil- 
ity to metabolize glucose is defective, due either to the 
failure of the pancreas to produce insulin or to tissue re- 
sistance to the actions of insulin. 

There are two common types of diabetes mellitus. 
Type 1, also called insulin-dependent diabetes mellitus 
(IDDM), is caused by autoimmune attack on the insulin- 
producing (5 cells of the pancreas. Individuals with 
IDDM must take insulin by injection or inhalation to 
compensate for their missing jS cells. IDDM develops in 
childhood or in the teen years; an older name for the 
disease is juvenile diabetes. Type 2, also called non- 
insulin-dependent diabetes mellitus (NIDDM), typically 
develops in adults over 40 years old. It is far more com- 
mon than IDDM, and its occurrence in the population is 
strongly correlated with obesity. The current epidemic 



of obesity in the more developed countries brings with it 
the promise of an epidemic of NIDDM, providing a 
strong incentive to understand the relationship between 
obesity and the onset of NIDDM at the genetic and bio- 
chemical levels. After completing our look at the metab- 
olism of fats and proteins in later chapters, we will 
return (in Chapter 23) to the discussion of diabetes, 
which has a broad effect on metabolism: of carbohy- 
drates, fats, and proteins. 

Here we consider another type of diabetes in which 
carbohydrate and fat metabolism is deranged: mature 
onset diabetes of the young (MODY), in which genetic 
mutation affects a transcription factor important in car- 
rying the insulin signal into the nucleus, or affects an en- 
zyme that responds to insulin. In MODY2, a mutation in 
the hexokinase IV (glucokinase) gene affects the liver 
and pancreas, tissues in which this is the main isoform of 
hexokinase. The glucokinase of pancreatic /3 cells func- 
tions as a glucose sensor. Normally, when blood glucose 

(continued on next page} 
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BOX 15-3 




Genetic Mutations That lead to Rare Forms of Diabetes 
{continued from previous page) 




rises, so does the glucose level in £ cells, and because 
glucokinase has a relatively high K m for glucose, its ac- 
tivity increases with rising blood glucose levels. Metabo- 
lism of the glucose 6-phosphate formed in this reaction 
raises the ATP level in (3 cells, and this triggers insulin 
release by the mechanism shown in Figure 23-28. In 
healthy individuals, blood glucose concentrations of -5 
mM trigger this insulin release. But individuals with inac- 
tivating mutations in both copies of the glucokinase 
gene have very high thresholds for insulin release, and 
consequently, from birth, they have severe hyper- 
glycemia — permanent neonatal diabetes. In individuals 
with one mutated and one normal copy of the glucoki- 
nase gene, the glucose threshold for insulin release rises 
to about 7 mM. As a result these individuals have blood 
glucose levels only slightly above normal: they generally 
have only mild hyperglycemia and no symptoms. This 



condition (MODY2) is generally discovered by accident 
during routine blood glucose analysis. 

There are at least five other types of MODY, each 
the result of an inactivating mutation in one or another 
of the transcription factors essential to the normal de- 
velopment and function of pancreatic /3 cells. Individu- 
als with these mutations have varying degrees of 
reduced insulin production and the associated defects 
in blood glucose homeostasis. In MODY1 and MODY3, 
the defects are severe enough to produce the long-term 
complications associated with IDDM and NIDDM — 
cardiovascular problems, kidney failure, and blindness. 
MODY4, 5, and 6 are less severe forms of the disease. Al- 
together, MODY disorders represent a small percentage 
of NIDDM cases. Also very rare are individuals with mu- 
tations in the insulin gene itself; they have defects in 
insulin signaling of varying severity. 



SUMMARY 15.3 Coordinated Regulation 
of Glycolysis and 
Gluconeogenesis 

■ Gluconeogenesis and glycolysis share seven 
enzymes, catalyzing the freely reversible reactions 
of the pathways. For the other three steps, the 
forward and reverse reactions are catalyzed by 
different enzymes, and these are the points of 
regulation of the two pathways. 

■ Hexokinase IV (glucokinase) has kinetic properties 
related to its special role in the liver: releasing 
glucose to the blood when blood glucose is low, 
and taking up and metabolizing glucose when blood 
glucose is high. 

■ PFK-1 is allosterically inhibited by ATP and citrate. 
In most mammalian tissues, including liver, fructose 
2,6-bisphosphate is an allosteric activator of this 
enzyme. 

Pyruvate kinase is allosterically inhibited by 
ATP, and the liver isozyme also is inhibited by 
cAMP-dependent phosphorylation. 

■ Gluconeogenesis is regulated at the level of 
pyruvate carboxylase (which is activated by 
acetyl-CoA) and FBPase-1 (which is inhibited by 
fructose 2,6-bisphosphate and AMP). 

■ To limit substrate cycling between glycolysis and 
gluconeogenesis, the two pathways are under 
reciprocal allosteric control, mainly achieved by 
the opposing effects of fructose 2,6-bisphosphate 
on PFK-1 and FBPase-1. 

■ Glucagon or epinephrine decreases [fructose 
2,6-bisphosphate], by raising [cAMP] and bringing 



about phosphorylation of the bifunctional enzyme 
PFK-2/FBPase-2. Insulin increases [fructose 
2,6-bisphosphate] by activating a phosphoprotein 
phosphatase that dephosphorylates and thus 
activates PFK-2. 

■ Xylulose 5-phosphate, an intermediate of the pentose 
phosphate pathway, activates phosphoprotein 
phosphatase PP2A, which dephosphorylates several 
target proteins, including PFK-2/FBPase-2, tilting the 
balance toward glucose uptake, glycogen synthesis, 
and lipid synthesis in the liver. 

■ Transcription factors including ChREBP, CREB, 
SREBP, and FOXOl act in the nucleus to regulate 
the expression of specific genes coding for enzymes 
of the glycolytic and gluconeogenic pathways. 
Insulin and glucagon act antagonistically in 
activating these transcription factors, thus turning 
on and off large numbers of genes. 

1 5.4 The Metabolism of Glycogen in Animals 

Our discussion of metabolic regulation, using carbohy- 
drate metabolism as the primary example, now turns to 
the synthesis and breakdown of glycogen. In this section 
we focus on the metabolic pathways; in Section 15.5 we 
turn to the regulatory mechanisms. 

In organisms from bacteria to plants to vertebrates, 
excess glucose is converted to polymeric forms for stor- 
age — glycogen in vertebrates and many microorganisms, 
starch in plants. In vertebrates, glycogen is found primarily 
in the liver and skeletal muscle; it may represent up to 10% 
of the weight of liver and 1% to 2% of the weight of muscle. 
If this much glucose were dissolved in the cytosol of a he- 
patocyte, its concentration would be about 0.4 m, enough 
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FIGURE 15-24 Glycogen granules in a hepatocyte. Glycogen, a stor- 
age form of carbohydrate, appears as electron-dense particles, often in 
aggregates or rosettes. In hepatocytes glycogen is closely associated 
with tubules of the smooth endoplasmic reticulum. Many mitochondria 
are also evident in this micrograph. 

to dominate the osmotic properties of the cell. When 
stored as a large polymer (glycogen), however, the same 
mass of glucose has a concentration of only 0.01 fjM. Glyco- 
gen is stored in large cytosolic granules. The elementary 
particle of glycogen, the ^-particle, is about 21 nm in diam- 
eter and consists of up to 55,000 glucose residues with 
about 2,000 nonreducing ends. Twenty to 40 of these par- 
ticles cluster together to form a-rosettes, easily seen with 
the microscope in tissue samples from well-fed animals 
(Fig. 15-24) but essentially absent after a 24-hour fast. 

The glycogen in muscle is there to provide a quick 
source of energy for either aerobic or anaerobic metab- 
olism. Muscle glycogen can be exhausted in less than an 
hour during vigorous activity. Liver glycogen serves as a 
reservoir of glucose for other tissues when dietary glu- 
cose is not available (between meals or during a fast); 



this is especially important for the neurons of the brain, 
which cannot use fatty acids as fuel. Liver glycogen can 
be depleted in 12 to 24 hours. In humans, the total 
amount of energy stored as glycogen is far less than the 
amount stored as fat (triacylglycerol) (see Table 23-5), 
but fats cannot be converted to glucose in mammals and 
cannot be catabolized anaerobically. 

Glycogen granules are complex aggregates of glyco- 
gen and the enzymes that synthesize it and degrade it, as 
well as the machinery for regulating these enzymes. The 
general mechanisms for storing and mobilizing glycogen 
are the same in muscle and liver, but the enzymes differ 
in subtle yet important ways that reflect the different 
roles of glycogen in the two tissues. Glycogen is also ob- 
tained in the diet and broken down in the gut, and this in- 
volves a separate set of hydrolytic enzymes that convert 
glycogen to free glucose. (Dietary starch is hydrolyzed in 
a similar way.) We begin our discussion with the break- 
down of glycogen to glucose 1 -phosphate (glycogenoly- 
sis), then turn to synthesis of glycogen (glycogenesis). 

Glycogen Breakdown Is Catalyzed by 
Glycogen Phosphorylase 

In skeletal muscle and liver, the glucose units of the outer 
branches of glycogen enter the glycolytic pathway 
through the action of three enzymes: glycogen phospho- 
rylase, glycogen debranching enzyme, and phosphoglu- 
comutase. Glycogen phosphorylase catalyzes the reaction 
in which an (a 1— >4) glycosidic linkage between two glu- 
cose residues at a nonreducing end of glycogen under- 
goes attack by inorganic phosphate (P0, removing the 
terminal glucose residue as a-D-glucose 1-phosphate 
(Fig. 15-25). This phosphorolysis reaction is different 
from the hydrolysis of glycosidic bonds by amylase dur- 
ing intestinal degradation of dietary glycogen and starch. 
In phosphorolysis, some of the energy of the glycosidic 
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FIGURE 15-25 Removal of a glucose residue 
from the nonreducing end of a glycogen 
chain by glycogen phosphorylase. This 
process is repetitive; the enzyme removes 
successive glucose residues until it reaches 
the fourth glucose unit from a branch point 
(see Fig. 15-26). 



[596] Principles of Metabolic Regulation 



bond is preserved in the formation of the phosphate ester, 
glucose 1-phosphate (see Section 14.2). 

Pyridoxal phosphate is an essential cofactor in the 
glycogen phosphorylase reaction; its phosphate group 
acts as a general acid catalyst, promoting attack by P { on 
the glycosidic bond. (This is an unusual role for pyri- 
doxal phosphate; its more typical role is as a cofactor in 
amino acid metabolism; see Fig. 18-6.) 

Glycogen phosphorylase acts repetitively on the 
nonreducing ends of glycogen branches until it reaches 
a point four glucose residues away from an (a 1— >6) 
branch point (see Fig. 7-14), where its action stops. 
Further degradation by glycogen phosphorylase can oc- 
cur only after the debranching enzyme, formally 
known as oligo (al— >6) to (al^4) glucan-trans- 
ferase, catalyzes two successive reactions that transfer 
branches (Fig. 15-20). Once these branches are trans- 
ferred and the glucosyl residue at C-6 is hydrolyzed, 
glycogen phosphorylase activity can continue. 

Glucose 1-Phosphate Can Enter Glycolysis or, in Liver, 
Replenish Blood Glucose 

Glucose 1-phosphate, the end product of the glycogen 
phosphorylase reaction, is converted to glucose 6-phos- 
phate by phosphoglucomutase, which catalyzes the 
reversible reaction 



Nonreducing 
ends 



Glucose 1-phosphate 



glucose 6-phosphate 



Initially phosphorylated at a Ser residue, the enzyme do- 
nates a phosphoryl group to C-6 of the substrate, then 
accepts a phosphoryl group from C-l (Fig. 15-27). 

The glucose 6-phosphate formed from glycogen in 
skeletal muscle can enter glycolysis and serve as an en- 
ergy source to support muscle contraction. In liver, 
glycogen breakdown serves a different purpose: to re- 
lease glucose into the blood when the blood glucose 
level drops, as it does between meals. This requires the 
enzyme glucose 6-phosphatase, present in liver and kid- 
ney but not in other tissues. The enzyme is an integral 
membrane protein of the endoplasmic reticulum, pre- 
dicted to contain nine transmembrane helices, with its 
active site on the lumenal side of the ER. Glucose 6- 
phosphate formed in the cytosol is transported into the 
ER lumen by a specific transporter (Tl) (Fig. 15-28) 
and hydrolyzed at the lumenal surface by the glucose 6- 
phosphatase. The resulting Pi and glucose are thought 
to be carried back into the cytosol by two different 
transporters (T2 and T3), and the glucose leaves the he- 
patocyte via the plasma membrane transporter, GLUT2. 
Notice that by having the active site of glucose 6-phos- 
phatase inside the ER lumen, the cell separates this re- 
action from the process of glycolysis, which takes place 
in the cytosol and would be aborted by the action of 
glucose 6-phosphatase. Genetic defects in either glu- 
cose 6-phosphatase or Tl lead to serious derangement 
of glycogen metabolism, resulting in type la glycogen 
storage disease (Box 15-4). 
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FIGURE 15-26 Glycogen breakdown near an (a1^6) branch point. 

Following sequential removal of terminal glucose residues by glycogen 
phosphorylase (see Fig. 15-25), glucose residues near a branch are re- 
moved in a two-step process that requires a bifunctional debranching 
enzyme. First, the transferase activity of the enzyme shifts a block of 
three glucose residues from the branch to a nearby nonreducing end, 
to which they are reattached in (a1->4) linkage. The single glucose 
residue remaining at the branch point, in (a1-»6) linkage, is then re- 
leased as free glucose by the debranching enzyme's (a1->6) glucosi- 
dase activity. The glucose residues are shown in shorthand form, which 
omits the — H, — OH, and — CH 2 OH groups from the pyranose rings. 

Because muscle and adipose tissue lack glucose 6- 
phosphatase, they cannot convert the glucose 6-phos- 
phate formed by glycogen breakdown to glucose, and 
these tissues therefore do not contribute glucose to the 
blood. 

The Sugar Nucleotide UDP-Glucose Donates Glucose 
for Glycogen Synthesis 

Many of the reactions in which hexoses are transformed 
or polymerized involve sugar nucleotides, compounds 
in which the anomeric carbon of a sugar is activated by 
attachment to a nucleotide through a phosphate ester 
linkage. Sugar nucleotides are the substrates for polymer- 
ization of monosaccharides into disaccharides, glycogen, 
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FIGURE 15-27 Reaction catalyzed by phosphoglucomutase. 

The reaction begins with the enzyme phosphorylated on a Ser 
residue. In step (T), the enzyme donates its phosphoryl group 
(green) to glucose 1-phosphate, producing glucose 1,6-bisphos- 
phate. In step @, the phosphoryl group at C-1 of glucose 1,6- 
bisphosphate (red) is transferred back to the enzyme, re-forming 
the phosphoenzyme and producing glucose 6-phosphate. 
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FIGURE 15-28 Hydrolysis of glu- 
cose 6-phosphate by glucose 6- 
phosphatase of the ER.The catalytic 
site of glucose 6-phosphatase faces 
the lumen of the ER. A glucose 6- 
phosphate (G6P) transporter (T1) 
carries the substrate from the cy- 
tosol to the lumen, and the prod- 
ucts glucose and P ( pass to the 
cytosol on specific transporters (T2 
and T3). Glucose leaves the cell 
via the GLUT2 transporter in the 
plasma membrane. 




Luis Leloir, 1906-1987 



starch, cellulose, and more com- 
plex extracellular polysaccharides. 
They are also key intermediates in 
the production of the aminohex- 
oses and deoxyhexoses found in 
some of these polysaccharides, 
and in the synthesis of vitamin C 
(L-ascorbic acid) . The role of 
sugar nucleotides in the biosyn- 
thesis of glycogen and many other 
carbohydrate derivatives was dis- 
covered in 1953 by the Argentine 
biochemist Luis Leloir. 
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BOX 15-4 Carl and Gerty Cork Pioneers in 

Much of what is written in present-day biochemistry text- 
books about the metabolism of glycogen was discovered 
between about 1925 and 1950 by the remarkable husband 
and wife team of Carl F. Cori and Gerty T. Cori. Both 
trained in medicine in Europe at the end of World War I 
(she completed premedical studies and medical school in 
one year!). They left Europe together in 1922 to establish 
research laboratories in the United States, first for nine 
years in Buffalo, New York, at what is now the Roswell 
Park Memorial Institute, then from 1931 until the end of 
their lives at Washington University in St. Louis. 

In their early physiological studies of the origin 
and fate of glycogen in animal muscle, the Coris 




The Coris in Gerty Cori's laboratory, around 1947, 



Glycogen Metabolism and Disease 




demonstrated the conversion of glycogen to lactate in 
tissues, movement of lactate in the blood to the liver, 
and, in the liver, reconversion of lactate to glycogen — 
a pathway that came to be known as the Cori cycle (see 
Fig. 23-20). Pursuing these observations at the bio- 
chemical level, they showed that glycogen was mobi- 
lized in a phosphorolysis reaction catalyzed by the 
enzyme they discovered, glycogen phosphorylase. 
They identified the product of this reaction (the "Cori 
ester") as glucose 1 -phosphate and showed that it 
could be reincorporated into glycogen in the reverse 
reaction. Although this did not prove to be the reaction 
by which glycogen is synthesized in cells, it was the 
first in vitro demonstration of the synthesis of a macro- 
molecule from simple monomeric subunits, and it in- 
spired others to search for polymerizing enzymes. 
Arthur Kornberg, discoverer of the first DNA poly- 
merase, said of his experience in the Coris' lab, "Glyco- 
gen phosphorylase, not base pairing, was what led me 
to DNA polymerase." 

I Gerty Cori became interested in human genetic 
HH diseases in which too much glycogen is stored 
in the liver. She was able to identify the biochemical 
defect in several of these diseases and to show that 
the diseases could be diagnosed by assays of the en- 
zymes of glycogen metabolism in small samples of tis- 
sue obtained by biopsy. Table 1 summarizes what we 
now know about 13 genetic diseases of this sort. ■ 

Carl and Gerty Cori shared the Nobel Prize in Phys- 
iology or Medicine in 1947 with Bernardo Houssay of Ar- 
gentina, who was cited for his studies of hormonal 
regulation of carbohydrate metabolism. The Cori labora- 
tories in St. Louis became an international center of bio- 
chemical research in the 1940s and 1950s, and at least 
six scientists who trained with the Coris became Nobel 
laureates: Arthur Kornberg (for DNA synthesis, 1959), 
Severo Ochoa (for RNA synthesis, 1959), Luis Leloir 
(for the role of sugar nucleotides in polysaccharide syn- 
thesis, 1970), Earl Sutherland (for the discovery of 



The suitability of sugar nucleotides for biosynthetic 
reactions stems from several properties: 

1. Their formation is metabolically irreversible, 
contributing to the irreversibility of the synthetic 
pathways in which they are intermediates. The 
condensation of a nucleoside triphosphate with a 
hexose 1 -phosphate to form a sugar nucleotide 
has a small positive free-energy change, but the 
reaction releases PP i; which is rapidly hydrolyzed 
by inorganic pyrophosphatase (Fig. 15-29), 
in a reaction that is strongly exergonic 
(AG'° = -19.2 kJ/mol). This keeps the cellular 
concentration of PP; low, ensuring that the 



actual free-energy change in the cell is favorable. 
In effect, rapid removal of the product, driven by 
the large, negative free-energy change of PPi 
hydrolysis, pulls the synthetic reaction forward, 
a common strategy in biological polymerization 
reactions. 

2. Although the chemical transformations of sugar 
nucleotides do not involve the atoms of the 
nucleotide itself, the nucleotide moiety has many 
groups that can undergo noncovalent interactions 
with enzymes; the additional free energy of binding 
can contribute significantly to catalytic activity 
(Chapter 6; see also p. 297). 
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cAMP in the regulation of carbohydrate metabolism, 1974), and Edwin Krebs (for the discovery of phospho- 
1971), Christian de Duve (for subcellular fractionation, rylase kinase, 1991). 



mmJB, ; Glycogen Storage Diseases of Humans 


Type (name) 


Enzyme affected 


Primary organ 
affected 


Symptoms 


TypeO 


Glycogen synthase 


Liver 


Low blood glucose, high 
ketone bodies, early death 


Type la (von Gierke's) 


Glucose 6-phosphatase 


Liver 


Enlarged liver, kidney failure 


Type lb 


Microsomal glucose 
6-phosphate translocase 


Liver 


As in la; also high 
susceptibility to bacterial 
infections 


Type Ic 


Microsomal P* 
transporter 


Liver 


As in la 


Type II (Pompe's) 


Lysosomal glucosidase 


Skeletal and 
cardiac muscle 


Infantile form: death by age 2; 
juvenile form: muscle defects 
(myopathy) ; adult form: as 
in muscular dystrophy 


Type Ilia (Cori's or Forbes's) 


Debranching enzyme 


Liver, skeletal 
and cardiac 
muscle 


Enlarged liver in infants; 
myopathy 


Type Illb 


Liver debranching 
enzyme (muscle 
enzyme normal) 


Liver 


Enlarged liver in infants 


Type IV (Andersen's) 


Branching enzyme 


Liver, skeletal 
muscle 


Enlarged liver and spleen, 
myoglobin in urine 


Type V (McArdle's) 


Muscle phosphorylase 


Skeletal muscle 


Exercise-induced cramps and 
pain; myoglobin in urine 


Type VI (Hers's) 


Liver phosphorylase 


Liver 


Enlarged liver 


Type VII (Tarui's) 


Muscle PFK-1 


Muscle, 
erythrocytes 


As in type V; also hemolytic 
anemia 


IVpe VIb, VIII, or IX 


Phosphorylase kinase 


Liver, leukocytes, 
muscle 


Enlarged liver 


Type XI (Fanconi-Bickel) 


Glucose transporter 
(GLUT2) 


Liver 


Failure to thrive, enlarged 
liver, rickets, kidney 
dysfunction 



3. Like phosphate, the nucleotidyl group (UMP or 
AMP, for example) is an excellent leaving group, 
facilitating nucleophilic attack by activating the 
sugar carbon to which it is attached. 

4. By "tagging" some hexoses with nucleotidyl groups, 
cells can set them aside in a pool for one purpose 
(glycogen synthesis, for example), separate from 
hexose phosphates destined for another purpose 
(such as glycolysis). 

Glycogen synthesis takes place in virtually all ani- 
mal tissues but is especially prominent in the liver and 
skeletal muscles. The starting point for synthesis of 
glycogen is glucose 6-phosphate. As we have seen, this 



can be derived from free glucose in a reaction cat- 
alyzed by the isozymes hexokinase I and hexokinase II 
in muscle and hexokinase IV (glucokinase) in liver: 

D-Glucose + ATP ► D-glucose 6-phosphate + ADP 

However, some ingested glucose takes a more round- 
about path to glycogen. It is first taken up by erythro- 
cytes and converted to lactate glycolytically; the lactate 
is then taken up by the liver and converted to glucose 
6-phosphate by gluconeogenesis. 

To initiate glycogen synthesis, the glucose 6-phos- 
phate is converted to glucose 1 -phosphate in the phos- 
phoglucomutase reaction: 

Glucose 6-phosphate glucose 1-phosphate 
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FIGURE 15-29 Formation of a sugar nucleotide. A con- 
densation reaction occurs between a nucleoside triphos- 
phate (NTP) and a sugar phosphate. The negatively 
charged oxygen on the sugar phosphate serves as a nucle- 
ophile, attacking the a phosphate of the nucleoside 
triphosphate and displacing pyrophosphate. The reaction 
is pulled in the forward direction by the hydrolysis of PP : 
by inorganic pyrophosphatase. 
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The product of this reaction is converted to UDP-glucose 
by the action of UDP-glucose pyrophosphorylase, in a 

key step of glycogen biosynthesis: 

Glucose 1-phosphate + UTP ► UDP-glucose + PPi 

Notice that this enzyme is named for the reverse reaction; 
in the cell, the reaction proceeds in the direction of UDP- 



glucose formation, because pyrophosphate is rapidly hy- 
drolyzed by inorganic pyrophosphatase (Fig. 15-29). 

UDP-glucose is the immediate donor of glucose 
residues in the reaction catalyzed by glycogen syn- 
thase, which promotes the transfer of the glucose 
residue from UDP-glucose to a nonreducing end of a 
branched glycogen molecule (Fig. 15-30). The overall 
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FIGURE 15-30 Glycogen synthesis. A glycogen chain is elongated by 
glycogen synthase. The enzyme transfers the glucose residue of UDP- 
glucose to the nonreducing end of a glycogen branch (see Fig. 7-14) to 
make a new (a1— >4) linkage. 
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FIGURE 15-31 Branch synthesis in glycogen. The glycogen-branching enzyme (also called amylo (1— >4) 
to (l->6) transglycosylase, or glycosyl-(4^6)-transferase) forms a new branch point during glycogen 
synthesis. 



equilibrium of the path from glucose 6-phosphate to 
glycogen lengthened by one glucose unit greatly favors 
synthesis of glycogen. 

Glycogen synthase cannot make the (a 1— >6) bonds 
found at the branch points of glycogen; these are 
formed by the glycogen-branching enzyme, also called 
amylo (1— >4) to (1— »6) transglycosylase, or glyco- 
syl- (4— >6) -transferase. The glycogen-branching en- 
zyme catalyzes transfer of a terminal fragment of 6 or 7 
glucose residues from the nonreducing end of a glyco- 
gen branch having at least 11 residues to the C-6 hy- 
droxyl group of a glucose residue at a more interior 
position of the same or another glycogen chain, thus 
creating a new branch (Fig. 15-31). Further glucose 
residues may be added to the new branch by glycogen 
synthase. The biological effect of branching is to make 
the glycogen molecule more soluble and to increase the 
number of nonreducing ends. This increases the num- 
ber of sites accessible to glycogen phosphorylase and 
glycogen synthase, both of which act only at nonreduc- 
ing ends. 

Glycogenin Primes the Initial Sugar Residues 
in Glycogen 

Glycogen synthase cannot initiate a new glycogen 
chain de novo. It requires a primer, usually a pre- 
formed (a 1^4) polyglucose chain or branch having at 
least eight glucose residues. So, how is a new glycogen 
molecule initiated? The intriguing protein glycogenin 
(Fig. 15-32) is both the primer on which new chains 
are assembled and the enzyme that catalyzes their as- 
sembly. The first step in the synthesis of a new glyco- 
gen molecule is the transfer of a glucose residue from 
UDP-glucose to the hydroxyl group of Tyr 194 of glyco- 
genin, catalyzed by the protein's intrinsic glucosyl- 
transferase activity (Fig. 15-33). The nascent chain 
is extended by the sequential addition of seven more 



glucose residues, each derived from UDP-glucose; the 
reactions are catalyzed by the chain-extending activity 
of glycogenin. At this point, glycogen synthase takes 
over, further extending the glycogen chain. Glycogenin 
remains buried within the /3-particle, covalently at- 
tached to the single reducing end of the glycogen mol- 
ecule (Fig. 15-33b). 




FIGURE 15-32 Glycogenin structure. (PDB ID 1LL2) Muscle glyco- 
genin {M r 37,000) forms dimers in solution. Humans have a second 
isoform in liver, glycogenin-2.The substrate, UDP-glucose (shown as a 
red ball-and-stick structure), is bound to a Rossmann fold near the 
amino terminus and is some distance from the Tyr 194 residues 
(turquoise)— 1 5 A from the Tyr in the same monomer, 1 2 A from the Tyr 
in the dimeric partner. Each UDP-glucose is bound through its phos- 
phates to a Mn 2+ ion (green) that is essential to catalysis. Mn 2+ is be- 
lieved to function as an electron-pair acceptor (Lewis acid) to stabilize 
the leaving group, UDP. The glycosidic bond in the product has the 
same configuration about the C-1 of glucose as the substrate UDP-glu- 
cose, suggesting that the transfer of glucose from UDP to Tyr 194 occurs 
in two steps. The first step is probably a nucleophilic attack by Asp 162 
(orange), forming a temporary intermediate with inverted configura- 
tion. A second nucleophilic attack by Tyr 194 then restores the starting 
configuration. 



602j Principles of Metabolic Regulation 



(a) 



(b) 



CH 2 OH 




TyT m 



Glycogenin 



H HO 

UDP-glucose 



O— P— O— P— o- 

II I 

o o 



Ribose 



UDP-glucose 



Uracil 



ilJucHiiyJtransfena 
:u:Uvrty 



UDP 



H HO 
UDP-glucose 

-o—p— o— p— o- 



0 O— Ribose f-| Uracil 



:ti;Uvny 



UDP 



Repeats six times 



Each chain has 
12 to 14 glucose 
residues 





G glycogenin 
primer 
second tier 



third tier 

fourth tier 

outer tier 
(unbranched) 



FIGURE 15-33 Glycogenin and the structure of the glycogen particle. 

(a) Glycogenin catalyzes two distinct reactions. Initial attack by the hy- 
droxyl group of Tyr 194 on C-1 of the glucosyl moiety of UDP-glucose 
results in a glucosylated Tyr residue. The C-1 of another UDP-glucose 
molecule is now attacked by the C-4 hydroxyl group of the terminal 
glucose, and this sequence repeats to form a nascent glycogen mole- 
cule of eight glucose residues attached by (a1 -»4) glycosidic linkages. 

(b) Structure of the glycogen particle. Starting at a central glycogenin 
molecule, glycogen chains (12 to 14 residues) extend in tiers. Inner 
chains have two (a1 ->6) branches each. Chains in the outer tier are un- 
branched. There are 12 tiers in a mature glycogen particle (only 5 are 
shown here), consisting of about 55,000 glucose residues in a mole- 
cule of about 21 nm diameter and M r ~1 x 1 0 7 . 



SUMMARY 15.4 The Metabolism of 
Glycogen in Animals 

■ Glycogen is stored in muscle and liver as large 
particles. Contained within the particles are the 
enzymes that metabolize glycogen, as well as 
regulatory enzymes. 

■ Glycogen phosphorylase catalyzes phosphorolytic 
cleavage at the nonreducing ends of glycogen 
chains, producing glucose 1-phosphate. The 
debranching enzyme transfers branches onto main 
chains and releases the residue at the (al— >6) 
branch as free glucose. 

■ Phosphoglucomutase interconverts glucose 
1-phosphate and glucose 6-phosphate. Glucose 
6-phosphate can enter glycolysis or, in liver, 
can be converted to free glucose by glucose 
6-phosphatase in the endoplasmic reticulum, 
then released to replenish blood glucose. 



■ The sugar nucleotide UDP-glucose donates glucose 
residues to the nonreducing end of glycogen in 
the reaction catalyzed by glycogen synthase. A 
separate branching enzyme produces the (al—>6) 
linkages at branch points. 

■ New glycogen particles begin with the autocatalytic 
formation of a glycosidic bond between the glucose 
of UDP-glucose and a Tyr residue in the protein 
glycogenin, followed by addition of several glucose 
residues to form a primer that can be acted on by 
glycogen synthase. 

15.5 Coordinated Regulation of Glycogen 
Synthesis and Breakdown 

As we have seen, the mobilization of stored glycogen is 
brought about by glycogen phosphorylase, which de- 
grades glycogen to glucose 1-phosphate (Fig. 15-25). 
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Glycogen phosphorylase provides an especially instruc- 
tive case of enzyme regulation. It was one of the first 
known examples of an allosterically regulated enzyme 
and the first enzyme shown to be controlled by reversible 
phosphorylation. It was also one of the first allosteric en- 
zymes for which the detailed three-dimensional struc- 
tures of the active and inactive forms were revealed by 
x-ray crystallographic studies. Glycogen phosphorylase 
is also another illustration of how isozymes play their 
tissue-specific roles. 

Glycogen Phosphorylase Is Regulated 
Allosterically and Hormonally 

In the late 1930s, Carl and 
Gerty Cori (Box 15-4) discov- 
ered that the glycogen phos- 
phorylase of skeletal muscle 
exists in two interconvertible 
forms: glycogen phosphory- 
lase a, which is catalytically 
active, and glycogen phos- 
phorylase &, which is less ac- 
tive (Fig. 15-34). Subsequent 
studies by Earl Sutherland 
showed that phosphorylase b 
predominates in resting mus- 
cle, but during vigorous mus- 
cular activity epinephrine triggers phosphorylation of a 
specific Ser residue in phosphorylase 6, converting it to 
its more active form, phosphorylase a. (Note that glyco- 
gen phosphorylase is often referred to simply as 
phosphorylase — so honored because it was the first 
phosphorylase to be discovered; the shortened name 
has persisted in common usage and in the literature.) 

The enzyme (phosphorylase b kinase) responsi- 
ble for activating phosphorylase by transferring a phos- 
phoryl group to its Ser residue is itself activated by 
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epinephrine or glucagon through a series of steps shown 
in Figure 15-35. Sutherland discovered the second 
messenger cAMP, which increases in concentration in 
response to stimulation by epinephrine (in muscle) or 
glucagon (in liver). Elevated [cAMP] initiates an en- 
zyme cascade, in which a catalyst activates a catalyst, 
which activates a catalyst (see Section 12.1). Such cas- 
cades allow for large amplification of the initial signal 
(see pink boxes in Fig. 15-35). The rise in [cAMP] acti- 
vates cAMP-dependent protein kinase, also called pro- 
tein kinase A (PKA). PKA then phosphorylates and 
activates phosphorylase b kinase, which catalyzes the 
phosphorylation of Ser residues in each of the two iden- 
tical subunits of glycogen phosphorylase, activating it 
and thus stimulating glycogen breakdown. In muscle, 
this provides fuel for glycolysis to sustain muscle con- 
traction for the fight-or-flight response signaled by epi- 
nephrine. In liver, glycogen breakdown counters the low 
blood glucose signaled by glucagon, releasing glucose. 
These different roles are reflected in subtle differences 
in the regulatory mechanisms in muscle and liver. The 
glycogen phosphorylases of liver and muscle are 
isozymes, encoded by different genes and differing in 
their regulatory properties. 

In muscle, superimposed on the regulation of phos- 
phorylase by covalent modification are two allosteric 
control mechanisms (Fig. 15-35). Ca 2+ , the signal for 
muscle contraction, binds to and activates phosphory- 
lase b kinase, promoting conversion of phosphorylase b 
to the active a form. Ca 2+ binds to phosphorylase b 
kinase through its 5 subunit, which is calmodulin (see 
Fig. 12-11). AMP, which accumulates in vigorously con- 
tracting muscle as a result of ATP breakdown, binds to 
and activates phosphorylase, speeding the release of 
glucose 1-phosphate from glycogen. When ATP levels 
are adequate, ATP blocks the allosteric site to which 
AMP binds, inactivating phosphorylase. 

When the muscle returns to rest, a second enzyme, 
phosphorylase a phosphatase, also called phospho- 
protein phosphatase 1 (PP1), removes the phospho- 
ryl groups from phosphorylase a, converting it to the 
less active form, phosphorylase b. 

Like the enzyme of muscle, the glycogen phos- 
phorylase of liver is regulated hormonally (by phos- 
phorylation/dephosphorylation) and allosterically. The 
dephosphorylated form is essentially inactive. When the 
blood glucose level is too low, glucagon (acting through 
the cascade mechanism shown in Fig. 15-35) activates 



FIGURE 15-34 Regulation of muscle glycogen phosphorylase by cova- 
lent modification. In the more active form of the enzyme, phosphory- 
lase a, Ser 14 residues, one on each subunit, are phosphorylated. 
Phosphorylase a is converted to the less active form, phosphorylase b f 
by enzymatic loss of these phosphoryl groups, catalyzed by phospho- 
rylase a phosphatase (also known as phosphoprotein phosphatase 1, 
PP1). Phosphorylase b can be reconverted (reactivated) to phosphory- 
lase a by the action of phosphorylase b kinase, (See also Fig. 6-36 on 
glycogen phosphorylase regulation.) 
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FIGURE 15-35 Cascade mechanism of epinephrine and 
glucagon action. By binding to specific surface receptors, 
either epinephrine acting on a myocyte (left) or glucagon 
acting on a hepatocyte (right) activates a GTP-binding pro- 
tein G sa (see Fig. 12-4). Active G sa triggers a rise in [cAMP], 
activating PKA. This sets off a cascade of phosphorylations; 
PKA activates phosphorylase b kinase, which then activates 
glycogen phosphorylase. Such cascades effect a large am- 
plification of the initial signal; the figures in pink boxes are 
probably low estimates of the actual increase in number of 
molecules at each stage of the cascade. The resulting break- 
down of glycogen provides glucose, which in the myocyte 
can supply ATP (via glycolysis) for muscle contraction and 
in the hepatocyte is released into the blood to counter the 
low blood glucose. 
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phosphorylase b kinase, which in turn converts phos- 
phorylase b to its active a form, initiating the release of 
glucose into the blood. When blood glucose levels return 
to normal, glucose enters hepatocytes and binds to an 
inhibitory allosteric site on phosphorylase a. This bind- 
ing also produces a conformational change that exposes 



the phosphorylated Ser residues to PP1, which cat- 
alyzes their dephosphorylation and inactivates the phos- 
phorylase (Fig. 15-36). The allosteric site for glucose 
allows liver glycogen phosphorylase to act as its own 
glucose sensor and to respond appropriately to changes 
in blood glucose. 
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FIGURE 15-36 Glycogen phosphorylase of liver as a glucose sensor. 

Gl ucose binding to an allosteric site of the phosphorylase a isozyme of 
liver induces a conformational change that exposes its phosphorylated 
Ser residues to the action of phosphorylase a phosphatase (PP1). This 



phosphatase converts phosphorylase a to phosphorylase b, sharply re- 
ducing the activity of phosphorylase and slowing glycogen breakdown 
in response to high blood glucose. Insulin also acts indirectly to stim- 
ulate PP1 and slow glycogen breakdown. 
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Glycogen Synthase Is Also Regulated by Phosphorylation 

and Dephosphorylation 

Like glycogen phosphorylase, glycogen synthase can 
exist in phosphorylated and dephosphorylated forms 
(Fig. 15-37). Its active form, glycogen synthase a, is 
unphosphorylated. Phosphorylation of the hydroxyl 
side chains of several Ser residues of both subunits con- 
verts glycogen synthase a to glycogen synthase b, 
which is inactive unless its allosteric activator, glucose 
6-phosphate, is present. Glycogen synthase is remarkable 
for its ability to be phosphorylated on various residues by 
at least 11 different protein kinases. The most important 
regulatory kinase is glycogen synthase kinase 3 (GSK3), 



FIGURE 15-37 Effects of GSK3 on glycogen synthase activity. Glycogen 
synthase a, the active form, has three Ser residues near its carboxyl ter- 
minus, which are phosphorylated by glycogen synthase kinase 3 (GSK3) 
This converts glycogen synthase to the inactive (b) form. GSK3 action re- 
quires prior phosphorylation (priming) by casein kinase (CKII). Insulin 
triggers activation of glycogen synthase b by blocking the activity of 
GSK3 (see the pathway for this action in Fig. 12-16) and activating a 
phosphoprotein phosphatase (PP1 in muscle, another phosphatase in 
liver). In muscle, epinephrine activates PKA, which phosphorylates the 
glycogen-targeting protein Gm (see Fig. 1 5^0) on a site that causes dis- 
sociation of PP1 from glycogen. Glucose 6-phosphate favors dephos- 
phorylation of glycogen synthase by binding to it and promoting a 
conformation that is a good substrate for PP1 . Glucose also promotes 
dephosphorylation; the binding of glucose to glycogen phosphorylase a 
forces a conformational change that favors dephosphorylation to glyco- 
gen phosphorylase b, thus relieving its inhibition of PP1 (see Fig 1 5-39) 



which adds phosphoryl groups to three Ser residues near 
the carboxyl terminus of glycogen synthase, strongly inac- 
tivating it. The action of GSK3 is hierarchical; it cannot 
phosphorylate glycogen synthase until another protein ki- 
nase, casein kinase II (CKII), has first phosphorylated 
the glycogen synthase on a nearby residue, an event called 
priming (Fig. 15-38a). 

In liver, conversion of glycogen synthase b to the ac- 
tive form is promoted by PP1, which is bound to the 
glycogen particle. PP1 removes the phosphoryl groups 
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FIGURE 15-38 Priming of GSK3 phosphorylation of glycogen synthase. 

(a) Glycogen synthase kinase 3 first associates with its substrate (glyco- 
gen synthase) by interaction between three positively charged residues 
(Arg 96 , Arg 180 , Lys 205 ) and a phosphoserine residue at position +4 in the 
substrate. (For orientation, the Ser orThr residue to be phosphorylated in 
the substrate is assigned the index 0, Residues on the amino-terminal 
side of this residue are numbered -1, -2, and so forth; residues on the 
carboxyl-terminal side are numbered +1, +2, and so forth.) This associ- 
ation aligns the active site of the enzyme with a Ser residue at position 0, 
which it phosphorylates. This creates a new priming site, and the en- 




Pseudosubstrate 



zyme moves down the protein to phosphorylate the Ser residue at posi- 
tion -4, and then the Ser at -8. (b) GSK3 has a Ser residue near its 
amino terminus that can be phosphorylated by PKA or PKB (see Fig. 
1 5-39), This produces a "pseudosubstrate" region in GSK3 that folds into 
the priming site and makes the active site inaccessible to another protein 
substrate, inhibiting GSK3 until the priming phosphoryl group of its 
pseudosubstrate region is removed by PP1 . Other proteins that are sub- 
strates for GSK3 also have a priming site at position +4, which must be 
phosphorylated by another protein kinase before GSK3 can act on them. 
(See also Figs 6-37 and 12-22b on glycogen synthase regulation.) 
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from the three Ser residues phosphorylated by GSK3. 
Glucose 6-phosphate binds to an allosteric site on glyco- 
gen synthase b, making the enzyme a better substrate 
for dephosphorylation by PP1 and causing its activation. 
By analogy with glycogen phosphorylase, which acts as 
a glucose sensor, glycogen synthase can be regarded as 
a glucose 6-phosphate sensor. In muscle, a different 
phosphatase may have the role played by PP1 in liver, 
activating glycogen synthase by dephosphorylating it. 

Glycogen Synthase Kinase 3 Mediates Some of the 
Actions of insulin 

As we saw in Chapter 12, one way in which insulin trig- 
gers intracellular changes is by activating a protein ki- 
nase (PKB) that in turn phosphorylates and inactivates 
GSK3 (Fig. 15-39; see also Fig. 12-16). Phosphoryla- 
tion of a Ser residue near the amino terminus of GSK3 
converts that region of the protein to a pseudosubstrate, 
which folds into the site at which the priming phospho- 
rylated Ser residue normally binds (Fig. 15-38b). This 
prevents GSK3 from binding the priming site of a real 
substrate, thereby inactivating the enzyme and tipping 
the balance in favor of dephosphorylation of glycogen 
synthase by PP1. Glycogen phosphorylase can also af- 
fect the phosphorylation of glycogen synthase: active 
glycogen phosphorylase directly inhibits PP1, prevent- 
ing it from activating glycogen synthase (Fig. 15-37). 

Although first discovered in its role in glycogen me- 
tabolism (hence the name glycogen synthase kinase), 
GSK3 clearly has a much broader role than the regula- 
tion of glycogen synthase. It mediates signaling by in- 
sulin and other growth factors and nutrients, and it acts 
in the specification of cell fates during embryonic devel- 
opment. Among its targets are cytoskeletal proteins and 
proteins essential for mRNA and protein synthesis. 
These targets, like glycogen synthase, must first un- 



dergo a priming phosphorylation by another protein ki- 
nase before they can be phosphorylated by GSK3. 

Phosphoprotein Phosphatase 1 is Centra! to 
Glycogen Metabolism 

A single enzyme, PP1, can remove phosphoryl groups 
from all three of the enzymes phosphorylated in re- 
sponse to glucagon (liver) and epinephrine (liver and 
muscle): phosphorylase kinase, glycogen phosphory- 
lase, and glycogen synthase. Insulin stimulates glycogen 
synthesis by activating PP1 and by inactivating GSK3. 

Phosphoprotein phosphatase 1 does not exist free in 
the cytosol, but is tightly bound to its target proteins by one 
of a family of glycogen-targeting proteins that bind 
glycogen and each of the three enzymes, glycogen phos- 
phorylase, phosphorylase kinase, and glycogen synthase 
(Fig. 15-40). PP1 is itself subject to covalent and al- 
losteric regulation: it is inactivated when phosphorylated by 
PKA and is allosterically activated by glucose 6-phosphate. 

Allosteric and Hormonal Signals Coordinate Carbohydrate 
Metabolism Globally 

Having looked at the mechanisms that regulate individ- 
ual enzymes, we can now consider the overall shifts in 
carbohydrate metabolism that occur in the well-fed 
state, during fasting, and in the fight-or-flight response — 
signaled by insulin, glucagon, and epinephrine, respec- 
tively. We need to contrast two cases in which regulation 
serves different ends: (1) the role of hepatocytes in sup- 
plying glucose to the blood, and (2) the selfish use of car- 
bohydrate fuels by nonhepatic tissues, typified by skeletal 
muscle (myocytes), to support their own activities. 

After ingestion of a carbohydrate-rich meal, the ele- 
vation of blood glucose triggers insulin release 
(Fig. 15-41, top). In a hepatocyte, insulin has two imme- 



Insulin 



FIGURE 15-39 The path from insulin to GSK3 and 
glycogen synthase. Insulin binding to its receptor 
activates a tyrosine protein kinase in the receptor, 
which phosphorylates insulin receptor substrate- 1 
(IRS-1). The phosphotyrosine in this protein is then 
bound by phosphatidylinositol 3-kinase (PI-3K), 
which converts phosphatidylinositol 4,5-bisphos- 
phate (PIP 2 ) in the membrane to phosphatidylinosi- 
tol 3,4,5-trisphosphate (PIP 3 ). A protein kinase 
(PDK-1) that is activated when bound to PIP 3 acti- 
vates a second protein kinase (PKB), which phos- 
phorylates glycogen synthase kinase 3 (GSK3) in its 
pseudosubstrate region, inactivating it by the mech- 
anisms shown in Figure 15-38b. The inactivation of 
GSK3 allows phosphoprotein phosphatase 1 (PP1) to 
dephosphorylate and thus activate glycogen syn- 
thase. In this way insulin stimulates glycogen synthe- 
sis. (See Fig. 12-1 6 for more details on insulin action.) 
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FIGURE 15-40 Glycogen-targeting protein Gm. The 

glycogen-targeting protein Gm is one of a family of proteins 
that bind other proteins (including PP1) to glycogen parti- 
cles. Gm can be phosphorylated at two different sites in re- 
sponse to insulin or epinephrine. © Insulin-stimulated 
phosphorylation of Gm site 1 activates PP1, which dephos- 
phorylates phosphorylase kinase, glycogen phosphorylase, 
and glycogen synthase (not shown). (2) Epinephrine- 
stimulated phosphorylation of Gm site 2 causes dissoci- 
ation of PP1 from the glycogen particle, preventing its 
access to glycogen phosphorylase and glycogen syn- 
thase. PKA also phosphorylates a protein (inhibitor 1) 
that, when phosphorylated, inhibits PP1 By these means, 
insulin inhibits glycogen breakdown and stimulates 
glycogen synthesis, and epinephrine (or glucagon in the 
liver) has the opposite effects. 
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FIGURE 15-41 Regulation of carbohydrate metabolism 
in the liver. Arrows indicate causal relationships between 
the changes they connect. For example, an arrow from lA 
to TB means that a decrease in A causes an increase in B. 
Pink arrows connect events that result from high blood 
glucose; blue arrows connect events that result from low 
blood glucose. 
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diate effects: it inactivates GSK3, acting through the cas- 
cade shown in Figure 15-39, and activates a protein phos- 
phatase, perhaps PP1. These two actions fully activate 
glycogen synthase. PP1 also inactivates glycogen phos- 
phorylase a and phosphorylase kinase by dephosphory- 
lating both, effectively stopping glycogen breakdown. 
Glucose enters the hepatocyte through the high-capacity 
transporter GLUT2, always present in the plasma mem- 
brane, and the elevated intracellular glucose leads to dis- 
sociation of hexokinase IV (glucokinase) from its nuclear 
regulatory protein (Fig. 15-13). Hexokinase IV enters the 
cytosol and phosphorylates glucose, stimulating glycoly- 
sis and supplying the precursor for glycogen synthesis. 
Under these conditions, hepatocytes use the excess glu- 
cose in the blood to synthesize glycogen, up to the limit of 
about 10% of the total weight of the liver. 

Between meals, or during an extended fast, the drop 
in blood glucose triggers the release of glucagon, which, 
acting through the cascade shown in Figure 15-35, 
activates PKA. PKA mediates all the effects of glucagon 
(Fig. 15-41, bottom). It phosphorylates phosphorylase 
kinase, activating it and leading to the activation of glyco- 
gen phosphorylase. It phosphorylates glycogen synthase, 
inactivating it and blocking glycogen synthesis. It phos- 
phorylates PFK-2/FBPase-2, leading to a drop in the con- 
cen- tration of the regulator fructose 2,6-bisphosphate, 
which has the effect of inactivating the glycolytic enzyme 
PFK-1 and activating the gluconeogenic enzyme FBPase- 
1. And it phosphorylates and inactivates the glycolytic en- 
zyme pyruvate kinase. Under these conditions, the liver 
produces glucose 6-phosphate by glycogen breakdown 
and by gluconeogenesis, and it stops using glucose to fuel 
glycolysis or make glycogen, maximizing the amount of 
glucose it can release to the blood. This release of glucose 
is possible only in liver and kidney, because other tissues 
lack glucose 6-phosphatase (Fig. 15-28). 

The physiology of skeletal muscle differs from that 
of liver in three ways important to our discussion of 
metabolic regulation (Fig. 15-42): (1) muscle uses its 
stored glycogen only for its own needs; (2) as it goes 
from rest to vigorous contraction, muscle undergoes 
very large changes in its demand for ATP, which is 
supported by glycolysis; (3) muscle lacks the enzymatic 
machinery for gluconeogenesis. The regulation of carbo- 
hydrate metabolism in muscle reflects these differences 
from liver. First, myocytes lack receptors for glucagon. 
Second, the muscle isozyme of pyruvate kinase is not 
phosphorylated by PKA, so glycolysis is not turned off 
when [cAMP] is high. In fact, cAMP increases the rate of 
glycolysis in muscle, probably by activating glycogen 
phosphorylase. When epinephrine is released into the 
blood in a fight-or-flight situation, PKA is activated by 
the rise in [cAMP] and phosphorylates and activates 
glycogen phosphorylase kinase. The resulting phospho- 
rylation and activation of glycogen phosphorylase re- 
sults in faster glycogen breakdown. Epinephrine is not 
released under low-stress conditions, but with each 
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FIGURE 15-42 Difference in the regulation of carbohydrate metabo- 
lism in liver and muscle. In liver, either glucagon (indicating low 
blood glucose) or epinephrine (signaling the need to fight or flee) has 
the effect of maximizing the output of glucose into the bloodstream. 
In muscle, epinephrine increases glycogen breakdown and glycolysis, 
which together provide fuel to produce the ATP needed for muscle 
contraction, 



neuronal stimulation of muscle contraction, cytosolic 
[Ca 2+ ] rises briefly and activates phosphorylase kinase 
through its calmodulin subunit. 

Elevated insulin triggers increased glycogen synthe- 
sis in myocytes by activating PP1 and inactivating GSK3. 
Unlike hepatocytes, myocytes have a reserve of GLUT4 
sequestered in intracellular vesicles. Insulin triggers their 
movement to the plasma membrane (see Fig. 12-16), 
where they allow increased glucose uptake. In response 
to insulin, therefore, myocytes help to lower blood 
glucose by increasing their rates of glucose uptake, 
glycogen synthesis, and glycolysis. 

Carbohydrate and Lipid Metabolism Are Integrated by 
Hormonal and Allosteric Mechanisms 

As complex as the regulation of carbohydrate metabo- 
lism is, it is far from the whole story of fuel metabolism. 
The metabolism of fats and fatty acids is very closely 
tied to that of carbohydrates. Hormonal signals such as 
insulin and changes in diet or exercise are equally im- 
portant in regulating fat metabolism and integrating it 
with that of carbohydrates. We return to this overall 
metabolic integration in mammals in Chapter 23, after 
first considering the metabolic pathways for fats and 
amino acids (Chapters 17 and 18). The message we wish 
to convey here is that metabolic pathways are overlaid 
with complex regulatory controls that are exquisitely 
sensitive to changes in metabolic circumstances. These 
mechanisms act to adjust the flow of metabolites 
through various metabolic pathways, as needed by the 
cell and organism, and to do so without causing major 
changes in the concentrations of intermediates shared 
with other pathways. 
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SUMMARY 15.5 



Coordinated Regulation 
of Glycogen Synthesis 
and Breakdown 



Glycogen phosphorylase is activated in response to 
glucagon or epinephrine, which raise [cAMP] and 
activate PKA. PKA phosphorylates and activates 
phosphorylase kinase, which converts glycogen 
phosphorylase b to its active a form. Phosphoprotein 
phosphatase 1 (PP1) reverses the phosphorylation 
of glycogen phosphorylase a, inactivating it. Glucose 
binds to the liver isozyme of glycogen phosphorylase 
a, favoring its dephosphorylation and inactivation. 

Glycogen synthase a is inactivated by 
phosphorylation catalyzed by GSK3. Insulin 
blocks GSK3. PP1, which is activated by insulin, 
reverses the inhibition by dephosphorylating 
glycogen synthase b. 

Insulin increases glucose uptake into myocytes and 
adipocytes by triggering movement of the glucose 
transporter GLUT4 to the plasma membrane. 

Insulin stimulates the synthesis of hexokinases 
II and IV, PFK-1, pyruvate kinase, and several 
enzymes involved in lipid synthesis. Insulin 
stimulates glycogen synthesis in muscle and liver. 

In liver, glucagon stimulates glycogen breakdown 
and gluconeogenesis while blocking glycolysis, 
thereby sparing glucose for export to the brain 
and other tissues. 

In muscle, epinephrine stimulates glycogen 
breakdown and glycolysis, providing ATP to 
support contraction. 
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Problems 



1. Measurement of Intracellular Metabolite Concentra- 
tions Measuring the concentrations of metabolic intermediates 
in a living cell presents great experimental difficulties— usually a 
cell must be destroyed before metabolite concentrations can be 
measured. Yet enzymes catalyze metabolic interconversions very 
rapidly, so a common problem associated with these types of 
measurements is that the findings reflect not the physiological 
concentrations of metabolites but the equilibrium concentra- 
tions A reliable experimental technique requires all enzyme-cat- 
alyzed reactions to be instantaneously stopped in the intact 
tissue so that the metabolic intermediates do not undergo 
change. This objective is accomplished by rapidly compressing 
the tissue between large aluminum plates cooled with liquid ni- 
trogen (-190 °C), a process called freeze-clamping. After 
freezing, which stops enzyme action instantly, the tissue is pow- 
dered and the enzymes are inactivated by precipitation with per- 
chloric acid. The precipitate is removed by centrifugation, and 
the clear supernatant extract is analyzed for metabolites. To cal- 
culate intracellular concentrations, the intracellular volume is 
determined from the total water content of the tissue and a 
measurement of the extracellular volume. 

The intracellular concentrations of the substrates and 
products of the phosphofructokinase-1 reaction in isolated rat 
heart tissue are given in the table below. 



Metabolite 



Concentration (jtM)* 



Fructose 6-phosphate 
Fructose 1,6-bisphosphate 
ATP 
ADP 



87.0 
22.0 
11,400 
1,320 



Source: From Williamson, J.R. (1965) Glycolytic control mechanisms I: 
inhibition of glycolysis by acetate and pyruvate in the isolated, perfused rat 
heart J. Biol. Chem. 240, 2308-2321. 

♦Calculated as /xmol/mL of intracellular water. 



(a) Calculate Q, [fructose l,6-bisphosphate][ADP]/[fruc- 
tose 6-phosphate] [ATP], for the PFK-1 reaction under physio- 
logical conditions. 

(b) Given a AG' 0 for the PFK-1 reaction of -14.2 kJ/mol, 
calculate the equilibrium constant for this reaction. 

(c) Compare the values of Q and K' m , Is the physiological 
reaction near or far from equilibrium? Explain. What does this 
experiment suggest about the role of PFK-1 as a regulatory 
enzyme? 

2. Are All Metabolic Reactions at Equilibrium? 

(a) Phosphoenolpyruvate (PEP) is one of the two phos- 
phoryl group donors in the synthesis of ATP during glycolysis. 
In human erythrocytes, the steady-state concentration of ATP 
is 2.24 mM, that of ADP is 0.25 mM, and that of pyruvate is 
0.051 mM. Calculate the concentration of PEP at 25 °C, assum- 
ing that the pyruvate kinase reaction (see Fig. 13-13) is at 
equilibrium in the cell. 

(b) The physiological concentration of PEP in human ery- 
throcytes is 0.023 mM. Compare this with the value obtained in 
(a). Explain the significance of this difference. 



3. Effect of O a Supply on Glycolytic Rates The regulated 
steps of glycolysis in intact cells can be identified by studying 
the catabolism of glucose in whole tissues or organs. For ex- 
ample, the glucose consumption by heart muscle can be mea- 
sured by artificially circulating blood through an isolated intact 
heart and measuring the concentration of glucose before and 
after the blood passes through the heart. If the circulating 
blood is deoxygenated, heart muscle consumes glucose at a 
steady rate. When oxygen is added to the blood, the rate of 
glucose consumption drops dramatically, then is maintained at 
the new, lower rate. Explain. 

4. Regulation of PFK-1 The effect of ATP on the allosteric 
enzyme PFK-1 is shown below. For a given concentration of 
fructose 6-phosphate, the PFK-1 activity increases with in- 
creasing concentrations of ATP, but a point is reached beyond 
which increasing the concentration of ATP inhibits the enzyme. 





100 r 








80 - 


o 






60 - 


>> 




V 

o 


40 - 


cd 




PFK-1 


20 - 
0 - 




[ATP] 

(a) Explain how ATP can be both a substrate and an in- 
hibitor of PFK-1 , How is the enzyme regulated by ATP? 

(b) In what ways is glycolysis regulated by ATP levels? 

(c) The inhibition of PFK-1 by ATP is diminished when the 
ADP concentration is high, as shown in the illustration. How 
can this observation be explained? 

5. Cellular Glucose Concentration The concentration of 
glucose in human blood plasma is maintained at about 5 mM. 
The concentration of free glucose inside a myocyte is much 
lower. Why is the concentration so low in the cell? What hap- 
pens to glucose after entry into the cell? Glucose is adminis- 
tered intravenously as a food source in certain clinical 
situations. Given that the transformation of glucose to glucose 
6-phosphate consumes ATP, why not administer intravenous 
glucose 6-phosphate instead? 

6. Enzyme Activity and Physiological Function The 7 max 
of the glycogen phosphorylase from skeletal muscle is much 
greater than the 7 max of the same enzyme from liver tissue. 

(a) What is the physiological function of glycogen phos- 
phorylase in skeletal muscle? In liver tissue? 

(b) Why does the 7 max of the muscle enzyme need to be 
greater than that of the liver enzyme? 

7. Glycogen Phosphorylase Equilibrium Glycogen phos- 
phorylase catalyzes the removal of glucose from glycogen. The 
AG'° for this reaction is 3.1 kJ/mol 

(a) Calculate the ratio of [Pi] to [glucose 1 -phosphate] 
when the reaction is at equilibrium. (Hint: The removal of 
glucose units from glycogen does not change the glycogen 
concentration.) 
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(b) The measured ratio [PJ/tglucose 1 -phosphate] in my- 
ocytes under physiological conditions is more than 100:1. 
What does this indicate about the direction of metabolite flow 
through the glycogen phosphorylase reaction in muscle? 

(c) Why are the equilibrium and physiological ratios dif- 
ferent? What is the possible significance of this difference? 

8. Regulation of Glycogen Phosphorylase In muscle tis- 
sue, the rate of conversion of glycogen to glucose 6-phosphate 
is determined by the ratio of phosphorylase a (active) to phos- 
phorylase b (less active) . Determine what happens to the rate 
of glycogen breakdown if a muscle preparation containing 
glycogen phosphorylase is treated with (a) phosphorylase ki- 
nase and ATP; (b) PP1; (c) epinephrine. 

9. Glycogen Breakdown in Rabbit Muscle The intracellu- 
lar use of glucose and glycogen is tightly regulated at four 
points, To compare the regulation of glycolysis when oxygen is 
plentiful and when it is depleted, consider the utilization of 
glucose and glycogen by rabbit leg muscle in two physiological 
settings: a resting rabbit, with low ATP demands, and a rabbit 
that sights its mortal enemy, the coyote, and dashes into its 
burrow. For each setting, determine the relative levels (high, 
intermediate, or low) of AMP, ATP, citrate, and acetyl-CoA and 
describe how these levels affect the flow of metabolites 
through glycolysis by regulating specific enzymes. In periods 
of stress, rabbit leg muscle produces much of its ATP by anaer- 
obic glycolysis (lactate fermentation) and very little by oxida- 
tion of acetyl-CoA derived from fat breakdown, 

10. Glycogen Breakdown in Migrating Birds Unlike the 
rabbit with its short dash, migratory birds require energy for 
extended periods of time. For example, ducks generally fly 
several thousand miles during their annual migration. The 
flight muscles of migratory birds have a high oxidative capac- 
ity and obtain the necessary ATP through the oxidation of 
acetyl-CoA (obtained from fats) via the citric acid cycle, Com- 
pare the regulation of muscle glycolysis during short-term in- 
tense activity, as in the fleeing rabbit, and during extended 
activity, as in the migrating duck. Why must the regulation in 
these two settings be different? 

S3 11. Enzyme Defects in Carbohydrate Metabolism 

Kifl Summaries of four clinical case studies follow. For each 
case detennine which enzyme is defective and designate the ap- 
propriate treatment, from the lists provided at the end of the 
problem. Justify your choices. Answer the questions contained 
in each case study. (You may need to refer to information in 
Chapter 14.) 

Case A The patient develops vomiting and diarrhea 
shortly after milk ingestion. A lactose tolerance test is admin- 
istered. (The patient ingests a standard amount of lactose, and 
the glucose and galactose concentrations of blood plasma are 
measured at intervals In individuals with normal carbohydrate 
metabolism, the levels increase to a maximum in about 1 hour, 
then decline.) The patient's blood glucose and galactose con- 
centrations do not increase during the test. Why do blood glu- 
cose and galactose increase and then decrease during the test 
in healthy individuals? Why do they fail to rise in the patient? 



Case B The patient develops vomiting and diarrhea after 
ingestion of milk. His blood is found to have a low concentra- 
tion of glucose but a much higher than normal concentration 
of reducing sugars. The urine tests positive for galactose. Why 
is the concentration of reducing sugar in the blood high? Why 
does galactose appear in the urine? 

Case C The patient complains of painful muscle cramps 
when performing strenuous physical exercise but has no other 
symptoms. A muscle biopsy indicates a muscle glycogen 
concentration much higher than normal. Why does glycogen 
accumulate? 

Case D The patient is lethargic, her liver is enlarged, and 
a biopsy of the liver shows large amounts of excess glycogen. 
She also has a lower than normal blood glucose level. What is 
the reason for the low blood glucose in this patient? 

Defective Enzyme 

(a) Muscle PFK-1 

(b) Phosphomannose isomerase 

(c) Galactose 1 -phosphate uridylyltransf erase 

(d) Liver glycogen phosphorylase 

(e) Triose kinase 

(f ) Lactase in intestinal mucosa 

(g) Maltase in intestinal mucosa 

(h) Muscle debranching enzyme 
Treatment 

1 . Jogging 5 km each day 
2 r Fat-free diet 

3. Low-lactose diet 

4. Avoiding strenuous exercise 

5. Large doses of niacin (the precursor of NAD + ) 

6. Frequent feedings (smaller portions) of a normal diet 

| 12. Effects of Insufficient Insulin in a Person 
miM with Diabetes A man with insulin-dependent dia- 
betes is brought to the emergency room in a near-comatose 
state. While vacationing in an isolated place, he lost his insulin 
medication and has not taken any insulin for two days. 

(a) For each tissue listed below, is each pathway faster, 
slower, or unchanged in this patient, compared with the nor- 
mal level when he is getting appropriate amounts of insulin? 

(b) For each pathway, describe at least one control mech- 
anism responsible for the change you predict. 

Tissue and Pathways 

1. Adipose: fatty acid synthesis 

2. Muscle: glycolysis; fatty acid synthesis; glycogen syn- 
thesis 

3 Liver: glycolysis; gluconeogenesis; glycogen synthesis; 
fatty acid synthesis; pentose phosphate pathway 

B9 13. Blood Metabolites in Insulin Insufficiency 

mum For the patient described in Problem 12, predict the 
levels of the following metabolites in his blood before treat- 
ment in the emergency room, relative to levels maintained 
during adequate insulin treatment: (a) glucose; (b) ketone 
bodies; (c) free fatty acids. 

14. Metabolic Effects of Mutant Enzymes Predict and ex- 
plain the effect on glycogen metabolism of each of the following 
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defects caused by mutation: (a) loss of the cAMP-binding site 
on the regulatory subunit of protein kinase A (PKA) ; (b) loss of 
the protein phosphatase inhibitor (inhibitor 1 in Fig. 15-40); 
(c) overexpression of phosphorylase b kinase in liver; (d) de- 
fective glucagon receptors in liver. 

15. Hormonal Control of Metabolic Fuel Between your 
evening meal and breakfast, your blood glucose drops and 
your liver becomes a net producer rather than consumer of 
glucose. Describe the hormonal basis for this switch, and ex- 
plain how the hormonal change triggers glucose production by 
the liver. 

16. Altered Metabolism in Genetically Manipulated 
Mice Researchers can manipulate the genes of a mouse so 
that a single gene in a single tissue either produces an inactive 
protein (a "knockout" mouse) or produces a protein that is 
always (constitutively) active. What effects on metabolism 
would you predict for mice with the following genetic changes: 

(a) knockout of glycogen debranching enzyme in the liver; 

(b) knockout of hexokinase IV in liver; (c) knockout of 
FBPase-2 in liver; (d) constitutively active FBPase-2 in liver; 
(e) constitutively active AMPK in muscle; (f) constitutively 
active ChREBP in liver? 

Data Analysis Problem 

17. Optimal Glycogen Structure Muscle cells need rapid 
access to large amounts of glucose during heavy exercise 
This glucose is stored in liver and skeletal muscle in polymeric 
form as particles of glycogen. The typical glycogen particle 
contains about 55,000 glucose residues (see Fig. 15-33b). 
Melendez-Hevia, Waddell, and Shelton (1993) explored some 
theoretical aspects of the structure of glycogen, as described 
in this problem. 

(a) The cellular concentration of glycogen in liver is about 
0.01 jjm> What cellular concentration of free glucose would be 
required to store an equivalent amount of glucose? Why would 
this concentration of free glucose present a problem for the 
ceU? 

Glucose is released from glycogen by glycogen phosphory- 
lase, an enzyme that can remove glucose molecules, one at a time, 
from one end of a glycogen chain. Glycogen chains are branched 
(see Figs 15-26 and 15-33b), and the degree of branching— the 
number of branches per chain — has a powerful influence on the 
rate at which glycogen phosphorylase can release glucose. 

(b) Why would a degree of branching that was too low 
(i.e., below an optimum level) reduce the rate of glucose re- 
lease? (Hint: Consider the extreme case of no branches in a 
chain of 55,000 glucose residues.) 



(c) Why would a degree of branching that was too high 
also reduce the rate of glucose release? (Hint: Think of the 
physical constraints.) 

Melendez-Hevia and colleagues did a series of calculations 
and found that two branches per chain (see Fig. 15-33b) was 
optimal for the constraints described above. This is what is 
found in glycogen stored in muscle and liver. 

To determine the optimum number of glucose residues 
per chain, Melendez-Hevia and coauthors considered two key 
parameters that define the structure of a glycogen particle: 
t = the number of tiers of glucose chains in a particle (the mol- 
ecule in Fig. 15-33b has five tiers); g c = the number of glu- 
cose residues in each chain. They set out to find the values of 
t and g c that would maximize three quantities: (1) the 
amount of glucose stored in the particle (G T ) per unit vol- 
ume; (2) the number of unbranched glucose chains (C A ) per 
unit volume (i.e., number of chains in the outermost tier, 
readily accessible to glycogen phosphorylase); and (3) the 
amount of glucose available to phosphorylase in these un- 
branched chains (G PT ) . 

(d) Show that C A = 2 t ~ 1 . This is the number of chains 
available to glycogen phosphorylase before the action of the 
debranching enzyme. 

(e) Show that C T , the total number of chains in the parti- 
cle, is given by C T = 2 l - 1. Thus G T = # C (C T ) = g c & 1), 
the total number of glucose residues in the particle. 

(f) Glycogen phosphorylase cannot remove glucose from 
glycogen chains that are shorter than five glucose residues. 
Show that G PT = (g c - 4)(2 f ~ This is the amount of glu- 
cose readily available to glycogen phosphorylase. 

(g) Based on the size of a glucose residue and the location 
of branches, the thickness of one tier of glycogen is 0.12e/ c nm + 
0.35 nm. Show that the volume of a particle, V S} is given by the 

equation V s = |tt£ 3 (O.120 c + 0.35) 3 nm 3 , 

Melendez-Hevia and coauthors then determined the opti- 
mum values of t and g c — those that gave the maximum value of 
a quality function, /, that maximizes G T , C A , and G PT , while 
GjG^Gpx 

minimizing V s :f= — . They found that the optimum 

value of g c is independent of t. 

(h) Choose a value of t between 5 and 15 and find the op- 
timum value of g c . How does this compare with the g c found in 
liver glycogen (see Fig. 15-33b)? (Hint: You may find it useful 
to use a spreadsheet program.) 

Reference 

Melendez-Hevia, E., Waddell, T.G., & Shelton, E.D. (1993) 
Optimization of molecular design in the evolution of metabolism: the 
glycogen molecule. Biochem J. 295, 477-483. 



If citrate is added the rate of respiration is often increased ... the ex- 
tra oxygen uptake is by far greater than can be accounted for by the 
complete oxidation of citrate . . . Since citric acid reacts catalytically 
in the tissue it is probable that it is removed by a primary reaction but 
regenerated by a subsequent reaction. 

— H. A. Krebs and W. A. Johnson, article in Enzymologia, 1937 
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A swe saw in Chapter 14, some cells obtain energy 
£i (ATP) by fermentation, breaking down glucose in 

1 1 the absence of oxygen. For most eukaryotic cells 
and many bacteria, which live under aerobic conditions 
and oxidize their organic fuels to carbon dioxide and wa- 
ter, glycolysis is but the first stage in the complete oxi- 
dation of glucose. Rather than being reduced to lactate, 
ethanol, or some other fermentation product, the pyru- 
vate produced by glycolysis is further oxidized to H 2 0 
and C0 2 . This aerobic phase of catabolism is called res- 
piration. In the broader physiological or macroscopic 
sense, respiration refers to a multicellular organism's 
uptake of 0 2 and release of C0 2 . Biochemists and cell bi- 
ologists, however, use the term in a narrower sense to 
refer to the molecular processes by which cells consume 

0 2 and produce C0 2 — processes more precisely termed 
cellular respiration. 

Cellular respiration occurs in three major stages 
(Fig. 16-1). In the first, organic fuel molecules — 
glucose, fatty acids, and some amino acids — are oxi- 
dized to yield two-carbon fragments in the form of the 
acetyl group of acetyl-coenzyme A (acetyl-CoA). In 
the second stage, the acetyl groups are fed into the 
citric acid cycle, which enzymatically oxidizes them to 
C0 2 ; the energy released is conserved in the reduced 
electron carriers NADH and FADH 2 . In the third stage 
of respiration, these reduced coenzymes are them- 
selves oxidized, giving up protons (H + ) and electrons. 



The electrons are transferred to 0 2 — the final electron 
acceptor — via a chain of electron-carrying molecules 
known as the respiratory chain. In the course of elec- 
tron transfer, the large amount of energy released is 
conserved in the form of ATP, by a process called ox- 
idative phosphorylation (Chapter 19). Respiration is 
more complex than glycolysis and is believed to have 
evolved much later, after the appearance of cyanobac- 
teria. The metabolic activities of cyanobacteria ac- 
count for the rise of oxygen levels in the earth's 
atmosphere, a dramatic turning point in evolutionary 
history. 

We consider first the conversion of pyruvate to 
acetyl groups, then the entry of those groups into the 
citric acid cycle, also called the tricarboxylic acid 
(TCA) cycle or the Krebs cycle (after its discov- 
erer, Hans Krebs). We next examine the cycle reac- 
tions and the enzymes that catalyze them. Because 
intermediates of the citric acid cycle are also si- 
phoned off as biosynthetic precursors, we go on to 
consider some ways in which these intermediates are 
replenished. The citric acid cycle is a hub in metabo- 
lism, with degradative pathways leading in and ana- 
bolic pathways leading out, 
and it is closely regulated in 
coordination with other path- 
ways. The chapter ends with 
a description of the glyoxy- 
late pathway, a metabolic se- 
quence in some organisms 
that employs several of the 
same enzymes and reactions 
used in the citric acid cycle, 
bringing about the net syn- 
thesis of glucose from stored 
triacylglycerols. 




Hans Krebs, 1900-1981 
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FIGURE 16-1 Catabolism of proteins, fats, and carbohydrates in the 
three stages of cellular respiration. Stage 1: oxidation of fatty acids, 
glucose, and some amino acids yields acetyl-CoA. Stage 2: oxidation of 
acetyl groups in the citric acid cycle includes four steps in which elec- 
trons are abstracted. Stage 3: electrons carried by NADH and FADH 2 
are funneled into a chain of mitochondrial (or, in bacteria, plasma 
membrane-bound) electron carriers — the respiratory chain— ultimately 
reducing 0 2 to H 2 0. This electron flow drives the production of ATP. 

16.1 Production of Acetyl-CoA 
(Activated Acetate) 

In aerobic organisms, glucose and other sugars, fatty 
acids, and most amino acids are ultimately oxidized 
to C0 2 and H 2 0 via the citric acid cycle and the 



respiratory chain. Before entering the citric acid cy- 
cle, the carbon skeletons of sugars and fatty acids 
are degraded to the acetyl group of acetyl-CoA, the 
form in which the cycle accepts most of its fuel in- 
put. Many amino acid carbons also enter the cycle 
this way, although several amino acids are degraded 
to other cycle intermediates. Here we focus on how 
pyruvate, derived from glucose and other sugars by 
glycolysis, is oxidized to acetyl-CoA and C0 2 by the 
pyruvate dehydrogenase (PDH) complex, a clus- 
ter of enzymes — multiple copies of each of three en- 
zymes — located in the mitochondria of eukaryotic 
cells and in the cytosol of bacteria. 

A careful examination of this enzyme complex is re- 
warding in several respects. The PDH complex is a clas- 
sic, much-studied example of a multienzyme complex in 
which a series of chemical intermediates remain bound 
to the enzyme molecules as a substrate is transformed 
into the final product. Five cofactors, four derived from 
vitamins, participate in the reaction mechanism. The 
regulation of this enzyme complex also illustrates how 
a combination of covalent modification and allosteric 
reg-ulation results in precisely regulated flux through 
a metabolic step. Finally, the PDH complex is the proto- 
type for two other important enzyme complexes: 
a-ketoglutarate dehydrogenase, of the citric acid cycle, 
and the branched-chain a-keto acid dehydrogenase, 
of the oxidative pathways of several amino acids (see 
Fig, 18-28). The remarkable similarity in the protein 
structure, cofactor requirements, and reaction mecha- 
nisms of these three complexes doubtless reflects a 
common evolutionary origin. 

Pyruvate Is Oxidized to Acetyl-CoA and (0 2 

The overall reaction catalyzed by the pyruvate dehydro- 
genase complex is an oxidative decarboxylation, an 

irreversible oxidation process in which the carboxyl 
group is removed from pyruvate as a molecule of C0 2 
and the two remaining carbons become the acetyl group 
of acetyl-CoA (Fig. 16-2). The NADH formed in this re- 
action gives up a hydride ion (:H~) to the respiratory 
chain (Fig. 16-1), which carries the two electrons to 
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FIGURE 16-2 Overall reaction catalyzed by the pyruvate dehydrogenase 
complex. The five coenzymes participating in this reaction, and the three 
enzymes that make up the enzyme complex, are discussed in the text. 
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oxygen or, in anaerobic microorganisms, to an alterna- 
tive electron acceptor such as nitrate or sulfate. The 
transfer of electrons from NADH to oxygen ultimately 
generates 2.5 molecules of ATP per pair of electrons. 
The irreversibility of the PDH complex reaction has 
been demonstrated by isotopic labeling experiments: 
the complex cannot reattach radioactively labeled C0 2 
to acetyl-CoA to yield carboxyl-labeled pyruvate. 

The Pyruvate Dehydrogenase Complex Requires 
Five Coenzymes 

The combined dehydrogenation and decarboxylation of 
pyruvate to the acetyl group of acetyl-CoA (Fig. 16-2) 
requires the sequential action of three different en- 
zymes and five different coenzymes or prosthetic 
groups — thiamine pyrophosphate (TPP), flavin adenine 
dinucleotide (FAD), coenzyme A (CoA, sometimes de- 
noted CoA-SH, to emphasize the role of the — SH 
group), nicotinamide adenine dinucleotide (NAD), and 
lipoate. Four different vitamins required in human nutri- 
tion are vital components of this system: thiamine (in 
TPP), riboflavin (in FAD), niacin (in NAD), and pan- 
tothenate (in CoA). We have already described the roles 
of FAD and NAD as electron carriers (Chapter 13), and 
we have encountered TPP as the coenzyme of pyruvate 
decarboxylase (see Fig. 14-14). 

Coenzyme A (Fig. 16-3) has a reactive thiol ( — SH) 
group that is critical to the role of CoA as an acyl carrier 
in a number of metabolic reactions. Acyl groups are co- 
valently linked to the thiol group, forming thioesters. 
Because of their relatively high standard free energies of 
hydrolysis (see Figs 13-16, 13-17), thioesters have a 
high acyl group transfer potential and can donate their 
acyl groups to a variety of acceptor molecules. The acyl 



group attached to coenzyme A may thus be thought of 
as "activated" for group transfer. 

The fifth cofactor of the PDH complex, lipoate 
(Fig. 16-4), has two thiol groups that can undergo 
reversible oxidation to a disulfide bond ( — S — S — ), 
similar to that between two Cys residues in a protein. 
Because of its capacity to undergo oxidation-reduction 
reactions, lipoate can serve both as an electron (hydro- 
gen) carrier and as an acyl carrier, as we shall see. 
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FIGURE 16-4 Lipoic acid (lipoate) in amide linkage with a Lys residue. 

The I ipoy I lysyl moiety is the prosthetic group of dihydrolipoyl 
transacetylase (E 2 of the PDH complex). The lipoyl group occurs in ox- 
idized (disulfide) and reduced (dithiol) forms and acts as a carrier of 
both hydrogen and an acetyl (or other acyl) group. 
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The Pyruvate Dehydrogenase Complex Consists 
of Three Distinct Enzymes 

The PDH complex contains three enzymes — pyruvate 
dehydrogenase (EJ, dihydrolipoyl transacetylase 
(E 2 ), and dihydrolipoyl dehydrogenase (E 3 ) — each 
present in multiple copies. The number of copies of each 
enzyme and therefore the size of the complex varies 
among species. The PDH complex isolated from mam- 
mals is about 50 nm in diameter — more than five times 
the size of an entire ribosome and big enough to be visu- 
alized with the electron microscope (Fig. 16-5a). In 
the bovine enzyme, 60 identical copies of E 2 form a pen- 
tagonal dodecahedron (the core) with a diameter of 
about 25 nm (Fig. 16-5b). (The core of the Escherichia 
coli enzyme contains 24 copies of E 2 .) E 2 is the point of 
connection for the prosthetic group lipoate, attached 
through an amide bond to the e-amino group of a Lys 
residue (Fig. 16-4). E 2 has three functionally distinct 
domains (Fig. 16-5c): the amino-terminal lipoyl do- 
main, containing the lipoyl-Lys residue (s); the central 
E x - and E 3 -binding domain; and the inner-core acyl- 
transferase domain, which contains the acyltrans- 
f erase active site. The yeast PDH complex has a single 
lipoyl domain with a lipoate attached, but the mam- 
malian complex has two, and E. coli has three (Fig. 
16-5c). The domains of E 2 are separated by linkers, se- 
quences of 20 to 30 amino acid residues, rich in Ala and 
Pro and interspersed with charged residues; these link- 
ers tend to assume their extended forms, holding the 
three domains apart. 

The active site of E 1 has bound TPP, and that of 
E 3 has bound FAD. Also part of the complex are two 

FIGURE 16-5 The pyruvate dehydrogenase complex, (a) Cryoelectron 
micrograph of PDH complexes isolated from bovine kidney. In cryoelec- 
tron microscopy, biological samples are viewed at extremely low tem- 
peratures; this avoids potential artifacts introduced by the usual process 
of dehydrating, fixing, and staining, (b) Three-dimensional image of 
PDH complex, showing the subunit structure: E n , pyruvate dehydroge- 
nase; E 2 , dihydrolipoyl transacetylase; and E 3 , dihydrolipoyl dehydroge- 
nase. This image is reconstructed by analysis of a large number of images 
such as those in (a), combined with crystallographic studies of individ- 
ual subunits. The core (green) consists of 60 molecules of E 2 , arranged in 
20 trimers to form a pentagonal dodecahedron. The lipoyl domain of E 2 
(blue) reaches outward to touch the active sites of E-i molecules (yellow) 
arranged on the E 2 core. Several E 3 subunits (red) are also bound to the 
core, where the swinging arm on E 2 can reach their active sites. An as- 
terisk marks the site where a lipoyl group is attached to the lipoyl do- 
main of E 2 . To make the structure clearer, about half of the complex has 
been cut away from the front. This model was prepared by Z. H. Zhou 
and colleagues (2001); in another model, proposed by J. L. S. Milne and 
colleagues (2002), the E 3 subunits are located more toward the periph- 
ery (see Further Reading), (c) E 2 consists of three types of domains linked 
by short polypeptide linkers: a catalytic acyltransferase domain; a bind- 
ing domain, involved in the binding of E 2 to and E 3 ; and one or more 
(depending on the species) lipoyl domains. 
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regulatory proteins, a protein kinase and a phospho- 
protein phosphatase, discussed below. This basic E r 
E 2 -E 3 structure has been conserved during evolution 
and used in a number of similar metabolic reactions, 
including the oxidation of a-ketoglutarate in the citric 
acid cycle (described below) and the oxidation of a- 
keto acids derived from the breakdown of the 
branched-chain amino acids valine, isoleucine, and 
leucine (see Fig. 18-28). Within a given species, E 3 of 
PDH is identical to E 3 of the other two enzyme com- 
plexes. The attachment of lipoate to the end of a Lys 
side chain in E 2 produces a long, flexible arm that 
can move from the active site of Ej to the active sites 
of E 2 and E 3 , a distance of perhaps 5 nm or more. 

In Substrate Channeling, Intermediates Never Leave 
the Enzyme Surface 

Figure 16-6 shows schematically how the pyruvate 
dehydrogenase complex carries out the five consecu- 
tive reactions in the decarboxylation and dehydro- 
genation of pyruvate. Step ® is essentially identical to 
the reaction catalyzed by pyruvate decarboxylase (see 
Fig. 14-14c); C-l of pyruvate is released as C0 2) and 
C-2, which in pyruvate has the oxidation state of an 
aldehyde, is attached to TPP as a hydroxyethyl group. 
This first step is the slowest and therefore limits 
the rate of the overall reaction. It is also the point at 
which the PDH complex exercises its substrate speci- 
ficity. In step (2) the hydroxyethyl group is oxidized to 
the level of a carboxylic acid (acetate). The two electrons 



removed in this reaction reduce the — S — S — of a lipoyl 
group on E 2 to two thiol ( — SH) groups. The acetyl 
moiety produced in this oxidation-reduction reaction 
is first esterified to one of the lipoyl — SH groups, then 
transesterified to CoA to form acetyl-CoA (step (S)). 
Thus the energy of oxidation drives the formation of a 
high-energy thioester of acetate. The remaining reac- 
tions catalyzed by the PDH complex (by E 3 , in steps @ 
and (5)) are electron transfers necessary to regenerate 
the oxidized (disulfide) form of the lipoyl group of E 2 
to prepare the enzyme complex for another round of 
oxidation. The electrons removed from the hydrox- 
yethyl group derived from pyruvate pass through FAD 
to NAD + . 

Central to the mechanism of the PDH complex are 
the swinging lipoyllysyl arms of E 2 , which accept from 
E x the two electrons and the acetyl group derived from 
pyruvate, passing them to E 3 . All these enzymes and 
coenzymes are clustered, allowing the intermediates to 
react quickly without diffusing away from the surface of 
the enzyme complex. The five-reaction sequence shown 
in Figure 16-6 is thus an example of substrate chan- 
neling. The intermediates of the multistep sequence 
never leave the complex, and the local concentration of 
the substrate of E 2 is kept very high. Channeling also 
prevents theft of the activated acetyl group by other 
enzymes that use this group as substrate. As we shall 
see, a similar tethering mechanism for the channeling of 
substrate between active sites is used in some other en- 
zymes, with lipoate, biotin, or a CoA-like moiety serving 
as cofactors. 
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FIGURE 16-6 Oxidative decarboxylation of pyruvate to acetyl-CoA by 
the PDH complex. The fate of pyruvate is traced in red. In step (T) pyru- 
vate reacts with the bound thiamine pyrophosphate (TPP) of pyruvate 
dehydrogenase (E 1 ) / undergoing decarboxylation to the hydroxyethyl 
derivative (see Fig. 14-14), Pyruvate dehydrogenase also carries out 
step @, the transfer of two electrons and the acetyl group from TPP to 
the oxidized form of the lipoyllysyl group of the core enzyme, dihy- 
drolipoyl transacetylase (E 2 ), to form the acetyl thioester of the reduced 
lipoyl group. Step (3) is a transesterification in which the — SH group 



of CoA replaces the — SH group of E 2 to yield acetyl-CoA and the fully 
reduced (dithiol) form of the lipoyl group. In step @ dihydrolipoyl 
dehydrogenase (E 3 ) promotes transfer of two hydrogen atoms from the 
reduced lipoyl groups of E 2 to the FAD prosthetic group of E 3 , restoring 
the oxidized form of the lipoyllysyl group of E 2 . In step © the reduced 
FADH 2 of E 3 transfers a hydride ion to NAD + , forming NADH. The 
enzyme complex is now ready for another catalytic cycle. (Subunit 
colors correspond to those in Fig. 1 6-5b.) 
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I As one might predict, mutations in the genes for 
H the subunits of the PDH complex, or a dietary 
thiamine deficiency, can have severe consequences. Thi- 
amine-deficient animals are unable to oxidize pyruvate 
normally. This is of particular importance to the brain, 
which usually obtains all its energy from the aerobic ox- 
idation of glucose in a pathway that necessarily includes 
the oxidation of pyruvate. Beriberi, a disease that re- 
sults from thiamine deficiency, is characterized by loss 
of neural function. This disease occurs primarily in pop- 
ulations that rely on a diet consisting mainly of white 
(polished) rice, which lacks the hulls in which most of 
the thiamine of rice is found. People who habitually con- 
sume large amounts of alcohol can also develop thi- 
amine deficiency, because much of their dietary intake 
consists of the vitamin-free "empty calories" of distilled 
spirits. An elevated level of pyruvate in the blood is 
often an indicator of defects in pyruvate oxidation due 
to one of these causes. ■ 



SUMMARY 16.1 Production of Acetyl-CoA 
(Activated Acetate) 

■ Pyruvate, the product of glycolysis, is converted 
to acetyl-CoA, the starting material for the citric 
acid cycle, by the pyruvate dehydrogenase 
complex. 

■ The PDH complex is composed of multiple copies 
of three enzymes: pyruvate dehydrogenase, E : 
(with its bound cofactor TPP); dihydrolipoyl 
transacetylase, E 2 (with its covalently bound lipoyl 
group); and dihydrolipoyl dehydrogenase, E 3 (with 
its cofactors FAD and NAD). 

■ Ej catalyzes first the decarboxylation of pyruvate, 
producing hydroxyethyl-TPP, and then the 
oxidation of the hydroxyethyl group to an acetyl 
group. The electrons from this oxidation reduce 
the disulfide of lipoate bound to E 2 , and the acetyl 
group is transferred into thioester linkage with 
one — SH group of reduced lipoate. 

■ E 2 catalyzes the transfer of the acetyl group to 
coenzyme A, forming acetyl-CoA. 

■ E 3 catalyzes the regeneration of the disulfide 
(oxidized) form of lipoate; electrons pass first to 
FAD, then to NAD + . 

■ The long lipoyllysyl arm swings from the active 
site of E : to E 2 to E 3; tethering the intermediates 
to the enzyme complex to allow substrate 
channeling. 

■ The organization of the PDH complex is very 
similar to that of the enzyme complexes that 
catalyze the oxidation of a-ketoglutarate and 
the branched-chain a-keto acids. 



16.2 Reactions of the Citric Acid Cycle 

We are now ready to trace the process by which acetyl- 
CoA undergoes oxidation. This chemical transforma- 
tion is carried out by the citric acid cycle, the first 
cyclic pathway we have encountered (Fig. 16-7). To 
begin a turn of the cycle, acetyl-CoA donates its acetyl 
group to the four-carbon compound oxaloacetate to 
form the six-carbon citrate. Citrate is then transformed 
into isocitrate, also a six-carbon molecule, which is de- 
hydrogenated with loss of C0 2 to yield the five-carbon 
compound a-ketoglutarate (also called oxoglutarate). 
a-Ketoglutarate undergoes loss of a second molecule 
of C0 2 and ultimately yields the four-carbon compound 
succinate. Succinate is then enzymatically converted 
in three steps into the four-carbon oxaloacetate — 
which is then ready to react with another molecule of 
acetyl-CoA. In each turn of the cycle, one acetyl group 
(two carbons) enters as acetyl-CoA and two molecules 
of C0 2 leave; one molecule of oxaloacetate is used to 
form citrate and one molecule of oxaloacetate is re- 
generated. No net removal of oxaloacetate occurs; one 
molecule of oxaloacetate can theoretically bring about 
oxidation of an infinite number of acetyl groups, and, 
in fact, oxaloacetate is present in cells in very low con- 
centrations. Four of the eight steps in this process are 
oxidations, in which the energy of oxidation is very ef- 
ficiently conserved in the form of the reduced coen- 
zymes NADH and FADH 2 . 

As noted earlier, although the citric acid cycle is 
central to energy-yielding metabolism its role is not lim- 
ited to energy conservation. Four- and five-carbon inter- 
mediates of the cycle serve as precursors for a wide 
variety of products. To replace intermediates removed 
for this purpose, cells employ anaplerotic (replenishing) 
reactions, which are described below. 

Eugene Kennedy and Albert Lehninger showed in 
1948 that, in eukaryotes, the entire set of reactions of the 
citric acid cycle takes place in mitochondria. Isolated mi- 
tochondria were found to contain not only all the en- 
zymes and coenzymes required for the citric acid cycle, 
but also all the enzymes and proteins necessary for the 
last stage of respiration — electron transfer and ATP syn- 
thesis by oxidative phosphorylation. As we shall see in 
later chapters, mitochondria also contain the enzymes for 
the oxidation of fatty acids and some amino acids to 
acetyl-CoA, and the oxidative degradation of other amino 
acids to a-ketoglutarate, succinyl-CoA, or oxaloacetate. 
Thus, in nonphotosynthetic eukaryotes, the mitochon- 
drion is the site of most energy-yielding oxidative reactions 
and of the coupled synthesis of ATP. In photosynthetic eu- 
karyotes, mitochondria are the major site of ATP produc- 
tion in the dark, but in daylight chloroplasts produce most 
of the organism's ATP. In most bacteria, the enzymes of the 
citric acid cycle are in the cytosol, and the plasma mem- 
brane plays a role analogous to that of the inner mitochon- 
drial membrane in ATP synthesis (Chapter 19). 
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FIGURE 16-7 Reactions of the citric acid cycle. The carbon atoms 
shaded in pink are those derived from the acetate of acetyl-CoA in the 
first turn of the cycle; these are not the carbons released as C0 2 in the 
first turn. Note that in succinate and fumarate, the two-carbon group 
derived from acetate can no longer be specifically denoted; because 
succinate and fumarate are symmetric molecules, C-1 and C-2 are in- 
distinguishable from C-4 and C-3. The number beside each reaction 



step corresponds to a numbered heading on pages 622-628. The red 
arrows show where energy is conserved by electron transfer to FAD or 
NAD + , forming FADH 2 or NADH + H + . Steps ©, ®, and @ are 
essentially irreversible in the cell; all other steps are reversible. The 
product of step (5) may be either ATP or GTP, depending on which 
succinyl-CoA synthetase isozyme is the catalyst. 



The Citric Acid Cycle Has Eight Steps 

In examining the eight successive reaction steps of the cit- 
ric acid cycle, we place special emphasis on the chemical 



transformations taking place as citrate formed from 
acetyl-CoA and oxaloacetate is oxidized to yield C0 2 and 
the energy of this oxidation is conserved in the reduced 
coenzymes NADH and FADH 2 . 
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(£) Formation of Citrate The first reaction of the cy- 
cle is the condensation of acetyl-CoA with oxaloac- 
etate to form citrate, catalyzed by citrate synthase: 
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In this reaction the methyl carbon of the acetyl group is 
joined to the carbonyl group (C-2) of oxaloacetate. Citroyl- 
CoA is a transient intermediate formed on the active site of 
the enzyme (see Fig. 16-9). It rapidly undergoes hydrolysis 
to free CoA and citrate, which are released from the active 
site. The hydrolysis of this high-energy thioester intermedi- 
ate makes the forward reaction highly exergonic. The large, 
negative standard free-energy change of the citrate syn- 
thase reaction is essential to the operation of the cycle be- 
cause, as noted earlier, the concentration of oxaloacetate is 
normally very low. The CoA liberated in this reaction is re- 
cycled to participate in the oxidative decarboxylation of an- 
other molecule of pyruvate by the PDH complex. 

Citrate synthase from mitochondria has been crystal- 
lized and visualized by x-ray diffraction in the presence 
and absence of its substrates and inhibitors (Fig. 16-8). 
Each subunit of the homodimeric enzyme is a single 
polypeptide with two domains, one large and rigid, the 
other smaller and more flexible, with the active site be- 
tween them. Oxaloacetate, the first substrate to bind to 
the enzyme, induces a large conformational change in the 
flexible domain, creating a binding site for the second sub- 
strate, acetyl-CoA. When citroyl-CoA has formed in the 
enzyme active site, another conformational change brings 
about thioester hydrolysis, releasing CoA-SH. This in- 
duced fit of the enzyme first to its substrate and then to its 
reaction intermediate decreases the likelihood of prema- 
ture and unproductive cleavage of the thioester bond of 
acetyl-CoA. Kinetic studies of the enzyme are consistent 
with this ordered bisubstrate mechanism (see Fig. 6-13). 
The reaction catalyzed by citrate synthase is essentially a 
Claisen condensation (p. 497), involving a thioester 
(acetyl-CoA) and a ketone (oxaloacetate) (Fig. 16-9). 

(2) Formation of Isocitrate via cis-Aconitate The 
enzyme aconitase (more formally, aconitate hy- 

dratase) catalyzes the reversible transformation of cit- 
rate to isocitrate, through the intermediary formation 
of the tricarboxylic acid cis-aconitate, which normally 
does not dissociate from the active site. Aconitase can 
promote the reversible addition of H 2 0 to the double 
bond of enzyme-bound cis-aconitate in two different 
ways, one leading to citrate and the other to isocitrate: 
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FIGURE 16-8 Structure of citrate synthase. The flexible domain of 
each subunit undergoes a large conformational change on binding 
oxaloacetate, creating a binding site for acetyl-CoA. (a) Open form of 
the enzyme alone (PDB ID 5CSC); (b) closed form with bound ox- 
aloacetate and a stable analog of acetyl-CoA (carboxymethyl-CoA) 
(derived from PDB ID 5CTS). In these representations one subunit is 
colored tan and one green 
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Although the equilibrium mixture at pH 7.4 and 25°C 
contains less than 10% isocitrate, in the cell the reaction 
is pulled to the right because isocitrate is rapidly 
consumed in the next step of the cycle, lowering its 
steady-state concentration. Aconitase contains an iron- 
sulfur center (Fig. 16-10), which acts both in the 
binding of the substrate at the active site and in the cat- 
alytic addition or removal of H 2 0. In iron-depleted cells, 
aconitase loses its iron-sulfur center and acquires a new 
role in the regulation of iron homeostasis. Aconitase is 
one of many enzymes known to "moonlight" in a second 
role (Box 16-1). 

(3) Oxidation of Isocitrate to a-Ketoglutarate and 

C0 2 In the next step, isocitrate dehydrogenase 

catalyzes oxidative decarboxylation of isocitrate to form 
a-ketoglutarate (Fig. 16-11). Mn 2+ in the active site 
interacts with the carbonyl group of the intermediate 
oxalosuccinate, which is formed transiently but does not 
leave the binding site until decarboxylation converts it 
to a-ketoglutarate. Mn 2+ also stabilizes the enol formed 
transiently by decarboxylation. 

There are two different forms of isocitrate dehydroge- 
nase in all cells, one requiring NAD + as electron acceptor 



MECHANISM FIGURE 16-9 Citrate synthase. In the mammalian citrate 
synthase reaction, oxaloacetate binds first, in a strictly ordered reac- 
tion sequence. This binding triggers a conformation change that opens 
up the binding site for acetyl-CoA. Oxaloacetetate is specifically ori- 
ented in the active site of citrate synthase by interaction of its two car- 
boxylates with two positively charged Arg residues (not shown here). 
§ Citrate Synthase Mechanism 
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FIGURE 16-10 Iron-sulfur center in aconitase. The iron-sulfur center is 
in red, the citrate molecule in blue. Three Cys residues of the enzyme 
bind three iron atoms; the fourth iron is bound to one of the carboxyl 
groups of citrate and also interacts noncovalently with a hydroxyl 
group of citrate (dashed bond). A basic residue (:B) in the enzyme 
helps to position the citrate in the active site. The iron-sulfur center acts 
in both substrate binding and catalysis. The general properties of iron- 
sulfur proteins are discussed in Chapter 19 (see Fig. 19-5). 
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BOX 1 6-1 Moonlighting Enzymes: Proteins with More Than One Job 



The "one gene-one enzyme" dictum, put forward by 
George Beadle and Edward Tatum in 1940 (see Chap- 
ter 24), went unchallenged for much of the twentieth 
century, as did the associated assumption that each pro- 
tein had only one role. But in recent years, many strik- 
ing exceptions to this simple formula have been 
discovered — cases in which a single protein encoded by 
a single gene clearly does more than one job in the cell. 
Aconitase is one such protein: it acts both as an enzyme 
and as a regulator of protein synthesis. 

Eukaryotic cells have two isozymes of aconitase. The 
mitochondrial isozyme converts citrate to isocitrate in the 
citric acid cycle. The cytosolic isozyme has two distinct 
functions. It catalyzes the conversion of citrate to isoci- 
trate, providing the substrate for a cytosolic isocitrate de- 
hydrogenase that generates NADPH as reducing power 
for fatty acid synthesis and other anabolic processes in 
the cytosol. It also has a role in cellular iron homeostasis. 

All cells must obtain iron for the activity of the many 
proteins that require it as a cofactor. In humans, severe 
iron deficiency results in anemia, an insufficient supply 
of erythrocytes and a reduced oxygen-carrying capacity 
that can be life-threatening. Too much iron is also harm- 
ful: it accumulates in and damages the liver in hemochro- 
matosis and other diseases. Iron obtained in the diet is 
carried in the blood by the protein transferrin and en- 
ters cells via endocytosis mediated by the transferrin 
receptor. Once inside cells, iron is used in the synthesis 
of hemes, cytochromes, Fe-S proteins, and other Fe- 



dependent proteins, and excess iron is stored bound to 
the protein ferritin. The levels of transferrin, transferrin 
receptor, and ferritin are therefore crucial to cellular iron 
homeostasis. The synthesis of these three proteins is reg- 
ulated in response to iron availability — and aconitase, in 
its "moonlighting" job, plays a key regulatory role. 

Aconitase has an essential Fe-S cluster at its active 
site (see Fig. 16-10). When a cell is depleted of iron, this 
Fe-S cluster is disassembled and the enzyme loses its 
aconitase activity But the apoenzyme (apo-aconitase, 
lacking its Fe-S cluster) so formed has now acquired its 
second activity — the ability to bind to specific sequences 
in the mRNAs for the transferrin receptor and ferritin, 
thus regulating protein synthesis at the translational level. 
Two iron regulatory proteins, IRP1 and IRP2, were in- 
dependently discovered as regulators of iron metabolism. 
As it turned out, IRP1 is identical to cytosolic apo-aconi- 
tase, and IRP2 is very closely related to IRP1 in structure 
and function, but unlike IRP1 it cannot be converted to en- 
zymatically active aconitase. Both IRP1 and IRP2 bind to 
regions in the mRNAs encoding ferritin and the transferrin 
receptor, with effects on iron mobilization and iron uptake. 
These mRNA sequences are part of hairpin structures 
(p. 285) called iron response elements (IREs), located 
at the 5' and 3' ends of the mRNAs (Fig. 1). When bound 



Low [iron] 



IRP bound to iron response element (IRE)? 




IRE 



Ferritin mRNA 

■AAA(A)„ 3' 



Ferritin mRNA translation Repressed Activated 
Ferritin synthesis Decreased Increased 



FIGURE 1 Effect of IRP1 and IRP2 on the 
mRNAs for ferritin and the transferrin 
receptor. 
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MECHANISM FIGURE 16-11 Isocitrate dehydrogenase. In this reaction, 
the substrate, isocitrate, loses one carbon by oxidative decarboxyla- 



tion. See Fig. 1 4-1 3 for more information on hydride transfer reactions 
involving NAD + and NADP + . 
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to the 5 '-untranslated IRE sequence in the ferritin mRNA, 
IRPs block ferritin synthesis; when bound to the 3'- 
untranslated IRE sequences in the transferrin receptor 
mRNA, they stabilize the mRNA, preventing its degrada- 
tion and thus allowing the synthesis of more copies of the 
receptor protein per mRNA molecule. So, in iron-deficient 
cells, iron uptake becomes more efficient and iron storage 
(bound to ferritin) is reduced. When cellular iron concen- 
trations return to normal levels, IRP1 is converted to 
aconitase, and IRP2 undergoes proteolytic degradation, 
ending the low-iron response. 

The enzymatically active aconitase and the moon- 
lighting, regulatory apo-aconitase have different struc- 
tures. As the active aconitase, the protein has two lobes 
that close around the Fe-S cluster; as IRP1, the two 
lobes open, exposing the mRNA-binding site (Fig. 2). 

Aconitase is just one of a growing list of enzymes 
known (or believed) to moonlight in a second role. Many 
of the glycolytic enzymes are included in this group. 
Pyruvate kinase acts in the nucleus to regulate the tran- 
scription of genes that respond to thyroid hormone. 
Glyceraldehyde 3-phosphate dehydrogenase moonlights 
both as uracil DNA glycosylase, effecting the repair of 
damaged DNA, and as a regulator of histone H2B tran- 
scription. The crystallins in the lens of the vertebrate 
eye are several moonlighting glycolytic enzymes, includ- 
ing phosphoglycerate kinase, those phosphate iso- 
merase, and lactate dehydrogenase. 

Until recently, the discovery that a protein has more 
than one function was largely a matter of serendipity: 
two groups of investigators studying two unrelated 
questions discovered that "their" proteins had similar 
properties, compared them carefully, and found them to 
be identical. With the growth of annotated protein and 
DNA sequence databases, researchers can now deliber- 
ately look for moonlighting proteins by searching the 
databases for any other protein with the same sequence 
as the one under study, but with a different function. 
This also means that in the databases, a protein anno- 



tated as having a given function doesn't necessarily have 
only that function. Protein moonlighting may also ex- 
plain some puzzling findings: experiments in which a 
protein with a known function is made inactive by a mu- 
tation, and the resulting mutant organisms show a phe- 
notype with no obvious relation to that function. 



(a) 



Fe-S 




FIGURE 2 Two forms of cytosolic aconitase/IRP1 with two distinct 
functions, (a) In aconitase, the two major lobes are closed and the 
Fe-S cluster is buried; the protein has been made transparent here to 
show the Fe-S cluster (PDB ID 2B3Y). (b) In IRP1, the lobes open 
up, exposing a binding site for the mRNA hairpin of the substrate 
(PDB 1D2IPY). 



and the other requiring NADP + . The overall reactions are 
otherwise identical. In eukaryotic cells, the NAD-depend- 
ent enzyme occurs in the mitochondrial matrix and serves 
in the citric acid cycle. The main function of the NADP-de- 
pendent enzyme, found in both the mitochondrial matrix 
and the cytosol, may be the generation of NADPH, which 
is essential for reductive anabolic reactions. 

(J) Oxidation of a-Ketoglutarate to Succinyl-CoA 

and CO a The next step is another oxidative decarboxy- 
lation, in which a-ketoglutarate is converted to succinyl- 
CoA and C0 2 by the action of the a-ketoglutarate 
dehydrogenase complex; NAD + serves as electron 



acceptor and CoA as the carrier of the succinyl group. 
The energy of oxidation of a-ketoglutarate is conserved 
in the formation of the thioester bond of succinyl-CoA: 
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This reaction is virtually identical to the pyruvate 
dehydrogenase reaction discussed above, and the a- 
ketoglutarate dehydrogenase complex closely resem- 
bles the PDH complex in both structure and function. It 
includes three enzymes, homologous to E 1; E 2 , and E 3 of 
the PDH complex, as well as enzyme-bound TPP, bound 
lipoate, FAD, NAD, and coenzyme A. Both complexes 
are certainly derived from a common evolutionary ances- 
tor. Although the E l components of the two complexes 
are structurally similar, their amino acid sequences dif- 
fer and, of course, they have different binding specifici- 
ties: E x of the PDH complex binds pyruvate, and E l of 
the a-ketoglutarate dehydrogenase complex binds a- 
ketoglutarate. The E 2 components of the two complexes 
are also very similar, both having covalently bound lipoyl 
moieties. The subunits of E 3 are identical in the two en- 
zyme complexes. 

(§) Conversion of Succinyl-CoA to Succinate 

Succinyl-CoA, like acetyl-CoA, has a thioester bond with 
a strongly negative standard free energy of hydrolysis 
(AG'° ~ -36 kJ/mol). In the next step of the citric acid 
cycle, energy released in the breakage of this bond is used 
to drive the synthesis of a phosphoanhydride bond in 
either GTP or ATP, with a net AG' 0 of only -2.9 kJ/mol. 
Succinate is formed in the process: 




AG' 0 



COCT 
Succinate 

-2.9kJ/mol 



The enzyme that catalyzes this reversible reaction is 
called succinyl-CoA synthetase or succinic thioki- 
nase; both names indicate the participation of a nucleo- 
side triphosphate in the reaction (Box 16-2). 

This energy-conserving reaction involves an in- 
termediate step in which the enzyme molecule itself 
becomes phosphorylated at a His residue in the active 
site (Fig. 16-12a). This phosphoryl group, which has 



FIGURE 16-12 The succinyl-CoA synthetase reaction, (a) In step (T) a 
phosphoryl group replaces the CoA of succinyl-CoA bound to the en- 
zyme, forming a high-energy acyl phosphate. In step @ the succinyl 
phosphate donates its phosphoryl group to a His residue of the en- 
zyme, forming a high-energy phosphohistidyl enzyme, In step (3) the 
phosphoryl group is transferred from the His residue to the terminal 
phosphate of GDP (or ADP), forming GTP (or ATP), (b) Active site of 
succinyl-CoA synthetase of E. coli (derived from PDB ID 1SCU). The 
active site includes part of both the a (blue) and the (3 (brown) sub- 
units The power helices (blue, brown) place the partial positive 
charges of the helix dipole near the phosphate group of (g)-His 246 in 
the a chain, stabilizing the phosphohistidyl enzyme. The bacterial and 
mammalian enzymes have similar amino acid sequences and three- 
dimensional structures. 
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Synthases and Synthetases; Ligases and Lyases; Kinases, Phosphatases, and 
Phosphorylases: Yes, the Names Are Confusing! 



Citrate synthase is one of many enzymes that catalyze 
condensation reactions, yielding a product more chemi- 
cally complex than its precursors. Synthases catalyze 
condensation reactions in which no nucleoside triphos- 
phate (ATP, GTP, and so forth) is required as an energy 
source. Synthetases catalyze condensations that do 
use ATP or another nucleoside triphosphate as a source 
of energy for the synthetic reaction. Succinyl-CoA syn- 
thetase is such an enzyme. Ligases (from the Latin lig- 
are, "to tie together") are enzymes that catalyze 
condensation reactions in which two atoms are joined, 
using ATP or another energy source. (Thus synthetases 
are ligases.) DNA ligase, for example, closes breaks in 
DNA molecules, using energy supplied by either ATP or 
NAD + ; it is widely used in joining DNA pieces for 
genetic engineering. Ligases are not to be confused with 
lyases, enzymes that catalyze cleavages (or, in the 
reverse direction, additions) in which electronic re- 
arrangements occur. The PDH complex, which oxida- 
tively cleaves C0 2 from pyruvate, is a member of the 
large class of lyases. 

The name kinase is applied to enzymes that trans- 
fer a phosphoryl group from a nucleoside triphosphate 
such as ATP to an acceptor molecule — a sugar (as in 
hexokinase and glucokinase), a protein (as in glycogen 
phosphorylase kinase), another nucleotide (as in nucle- 
oside diphosphate kinase), or a metabolic intermediate 
such as oxaloacetate (as in PEP carboxykinase). The re- 
action catalyzed by a kinase is a phosphorylation. On 
the other hand, phosphor oly sis is a displacement reac- 
tion in which phosphate is the attacking species and 
becomes covalently attached at the point of bond break- 
age. Such reactions are catalyzed by phosphorylases. 
Glycogen phosphorylase, for example, catalyzes the phos- 
phorolysis of glycogen, producing glucose 1-phosphate. 
Dephosphorylation, the removal of a phosphoryl group 
from a phosphate ester, is catalyzed by phosphatases, 



with water as the attacking species. Fructose bisphos- 
phatase-1 converts fructose 1,6-bisphosphate to fruc- 
tose 6-phosphate in gluconeogenesis, and phosphorylase 
a phosphatase removes phosphoryl groups from phos- 
phoserine in phosphorylated glycogen phosphorylase. 
Whew! 

Unfortunately, these descriptions of enzyme types 
overlap, and many enzymes are commonly called by two 
or more names. Succinyl-CoA synthetase, for example, 
is also called succinate thiokinase; the enzyme is both a 
synthetase in the citric acid cycle and a kinase when 
acting in the direction of succinyl-CoA synthesis. This 
raises another source of confusion in the naming of 
enzymes. An enzyme may have been discovered by the 
use of an assay in which, say, A is converted to B. The 
enzyme is then named for that reaction. Later work may 
show, however, that in the cell, the enzyme functions 
primarily in converting B to A. Commonly, the first name 
continues to be used, although the metabolic role of the 
enzyme would be better described by naming it for the 
reverse reaction. The glycolytic enzyme pyruvate kinase 
illustrates this situation (p. 538). To a beginner in bio- 
chemistry, this duplication in nomenclature can be be- 
wildering. International committees have made heroic 
efforts to systematize the nomenclature of enzymes 
(see Table 6-3 for a brief summary of the system), but 
some systematic names have proved too long and cum- 
bersome and are not frequently used in biochemical 
conversation. 

We have tried throughout this book to use the 
enzyme name most commonly used by working 
biochemists and to point out cases in which an enzyme 
has more than one widely used name. For current infor- 
mation on enzyme nomenclature, refer to the recom- 
mendations of the Nomenclature Committee of the 
International Union of Biochemistry and Molecular Biol- 
ogy (www.chem.qmw.ac.uk/iubmb/nomenclature/) . 



a high group transfer potential, is transferred to ADP 
(or GDP) to form ATP (or GTP). Animal cells have 
two isozymes of succinyl-CoA synthetase, one spe- 
cific for ADP and the other for GDP. The enzyme has 
two sub units, a (M r 32,000), which has the (£^His 
residue (His 246 ) and the binding site for CoA, and /3 
(M r 42,000), which confers specificity for either ADP 
or GDP. The active site is at the interface between 
subunits. The crystal structure of succinyl-CoA syn- 
thetase reveals two "power helices" (one from each 
subunit), oriented so that their electric dipoles situ- 
ate partial positive charges close to the negatively 
charged (P)-His (Fig. 16-12b), stabilizing the phos- 
phoenzyme intermediate. (Recall the similar role of 
helix dipoles in stabilizing K + ions in the K + channel; 
see Fig. 11-48.) 



The formation of ATP (or GTP) at the expense of the 
energy released by the oxidative decarboxylation of a- 
ketoglutarate is a substrate-level phosphorylation, like 
the synthesis of ATP in the glycolytic reactions catalyzed 
by glyceraldehyde 3-phosphate dehydrogenase and 
pyruvate kinase (see Fig. 14-2). The GTP formed by suc- 
cinyl-CoA synthetase can donate its terminal phosphoryl 
group to ADP to form ATP, in a reversible reaction cat- 
alyzed by nucleoside diphosphate kinase (p. 510): 

GTP + ADP > GDP + ATP AG' 0 = 0 kJ/mol 

Thus the net result of the activity of either isozyme of 
succinyl-CoA synthetase is the conservation of energy as 
ATP There is no change in free energy for the nucleoside 
diphosphate kinase reaction; ATP and GTP are energeti- 
cally equivalent. 
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(6) Oxidation of Succinate to Fumarate The succi- 
nate formed from succinyl-CoA is oxidized to fumarate 
by the fiavoprotein succinate dehydrogenase: 



COO" 

I 

CH 2 
CH 2 

coo- 

Succinate 



FAD 



FADH 2 



duhydruijiltui 



.COO" 

C 



OOC H 



Fumarate 



AG'° = 0 kJ/mol 



In eukaryotes, succinate dehydrogenase is tightly bound 
to the mitochondrial inner membrane; in bacteria, to the 
plasma membrane. The enzyme contains three different 
iron-sulfur clusters and one molecule of covalently 
bound FAD (see Fig. 19-10). Electrons pass from suc- 
cinate through the FAD and iron-sulfur centers before 
entering the chain of electron carriers in the mitochon- 
drial inner membrane (the plasma membrane in bacte- 
ria) . Electron flow from succinate through these carriers 
to the final electron acceptor, 0 2j is coupled to the syn- 
thesis of about 1.5 ATP molecules per pair of electrons 
(respiration-linked phosphorylation). Malonate, an ana- 
log of succinate not normally present in cells, is a strong 
competitive inhibitor of succinate dehydrogenase, and 
its addition to mitochondria blocks the activity of the 
citric acid cycle. 
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© Hydration of Fumarate to Malate The reversible 
hydration of fumarate to L-malate is catalyzed by fu- 
marase (formally, fumarate hydratase). The transi- 
tion state in this reaction is a carbanion: 
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This enzyme is highly stereospecific; it catalyzes hydra- 
tion of the trans double bond of fumarate but not the cis 
double bond of maleate (the cis isomer of fumarate). 
In the reverse direction (from L-malate to fumarate), 
fumarase is equally stereospecific: D-malate is not a 
substrate. 
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(8) Oxidation of Malate to Oxaloacetate In the last 
reaction of the citric acid cycle, NAD-linked L-malate 
dehydrogenase catalyzes the oxidation of L-malate to 
oxaloacetate: 
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The equilibrium of this reaction lies far to the left under 
standard thermodynamic conditions, but in intact cells 
oxaloacetate is continually removed by the highly exer- 
gonic citrate synthase reaction (step ® of Fig. 16-7). 
This keeps the concentration of oxaloacetate in the cell 
extremely low (<10" 6 m), pulling the malate dehydro- 
genase reaction toward the formation of oxaloacetate. 

Although the individual reactions of the citric acid cycle 
were initially worked out in vitro, using minced muscle 
tissue, the pathway and its regulation have also been 
studied extensively in vivo. By using radioactively la- 
beled precursors such as [ 14 C]pyruvate and [ 14 C]acetate, 
researchers have traced the fate of individual carbon 
atoms through the citric acid cycle. Some of the earliest 
experiments with isotopes produced an unexpected re- 
sult, however, which aroused considerable controversy 
about the pathway and mechanism of the citric acid cy- 
cle. In fact, these experiments at first seemed to show 
that citrate was not the first tricarboxylic acid to be 
formed. Box 16-3 gives some details of this episode in 
the history of citric acid cycle research. Metabolic flux 
through the cycle can now be monitored in living tissue 
by using 13 C-labeled precursors and whole-tissue NMR 
spectroscopy. Because the NMR signal is unique to the 
compound containing the 13 C, biochemists can trace the 
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BOX 1 6-3 Citrate: A Symmetric Molecule That Reacts Asymmetrically 



When compounds enriched in the heavy-carbon isotope 
13 C and the radioactive carbon isotopes n C and 14 C be- 
came available about 60 years ago, they were soon put to 
use in tracing the pathway of carbon atoms through the 
citric acid cycle. One such experiment initiated the con- 
troversy over the role of citrate. Acetate labeled in the car- 
boxyl group (designated [1- 14 C] acetate) was incubated 
aerobically with an animal tissue preparation. Acetate is 
enzymatically converted to acetyl-CoA in animal tissues, 
and the pathway of the labeled carboxyl carbon of the 
acetyl group in the cycle reactions could thus be traced. 
a-Ketoglutarate was isolated from the tissue after incuba- 
tion, then degraded by known chemical reactions to estab- 
lish the position(s) of the isotopic carbon. 

Condensation of unlabeled oxaloacetate with 
carboxyl-labeled acetate would be expected to produce 
citrate labeled in one of the two primary carboxyl groups. 
Citrate is a symmetric molecule, its two terminal carboxyl 
groups being chemically indistinguishable. Therefore, half 
the labeled citrate molecules were expected to yield a-ke- 
toglutarate labeled in the a-carboxyl group and the other 
half to yield a-ketoglutarate labeled in the y-carboxyl 
group; that is, the a-ketoglutarate isolated was expected 
to be a mixture of the two types of labeled molecules 



(Fig. 1, pathways (T) and (2)). Contrary to this expecta- 
tion, the labeled a-ketoglutarate isolated from the tissue 
suspension contained 14 C only in the y-carboxyl group 
(Fig. 1, pathway (V)). The investigators concluded that 
citrate (or any other symmetric molecule) could not be an 
intermediate in the pathway from acetate to a-ketoglu- 
tarate. Rather, an asymmetric tricarboxylic acid, presum- 
ably cis-aconitate or isocitrate, must be the first product 
formed from condensation of acetate and oxaloacetate. 

In 1948, however, Alexander Ogston pointed out 
that although citrate has no chiral center (see Fig. 1-19), 
it has the potential to react asymmetrically if an enzyme 
with which it interacts has an active site that is asym- 
metric. He suggested that the active site of aconitase 
may have three points to which the citrate must be 
bound and that the citrate must undergo a specific 
three-point attachment to these binding points. As seen 
in Figure 2, the binding of citrate to three such points 
could happen in only one way, and this would account for 
the formation of only one type of labeled a-ketoglutarate. 
Organic molecules such as citrate that have no chiral 
center but are potentially capable of reacting asymmet- 
rically with an asymmetric active site are now called 
prochiral molecules. 
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FIGURE 1 Incorporation of the isotopic carbon ( 14 C) of the labeled 
acetyl group into a-ketoglutarate by the citric acid cycle. The carbon 
atoms of the entering acetyl group are shown in red. 



T CH 2 - 14 COO" 

ii I 

CH 2 
0= C-COO 

Only this product 
was formed. 

"CHs-COO" 

CH-COO 

O^C- 1 '*. 00 

This second form 
of labeled 

u-ketoglutarate was 
also expected, but 
was not formed. 



CH 2 COO" 



HO 




Susceptible 
bond 



CH-COO" 



Z 

I 

Y 



FIGURE 2 The prochiral nature of citrate, (a) Structure of citrate; (b) 
schematic representation of citrate: X = — OH; Y = — COO"; 
Z = — CH 2 COO~ (c) Correct complementary fit of citrate to the bind- 
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ing site of aconitase. There is only one way in which the three specified 
groups of citrate can fit on the three points of the binding site. Thus 
only one of the two — CH 2 COO~ groups is bound by aconitase. 
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movement of precursor carbons into each cycle inter- 
mediate and into compounds derived from the interme- 
diates. This technique has great promise for studies of 
regulation of the citric acid cycle and its interconnec- 
tions with other metabolic pathways such as glycolysis. 

The Energy of Oxidations in the Cycle 
Is Efficiently Conserved 

We have now covered one complete turn of the citric 
acid cycle (Fig. 16-13). A two-carbon acetyl group en- 
tered the cycle by combining with oxaloacetate. Two 
carbon atoms emerged from the cycle as C0 2 from the 
oxidation of isocitrate and a-ketoglutarate. The energy 
released by these oxidations was conserved in the re- 
duction of three NAD + and one FAD and the production 
of one ATP or GTP. At the end of the cycle a molecule of 
oxaloacetate was regenerated. Note that the two carbon 
atoms appearing as C0 2 are not the same two carbons 
that entered in the form of the acetyl group; additional 
turns around the cycle are required to release these car- 
bons as C0 2 (Fig. 16-7). 

Although the citric acid cycle directly generates only 
one ATP per turn (in the conversion of succinyl-CoA to 
succinate), the four oxidation steps in the cycle provide a 
large flow of electrons into the respiratory chain via NADH 
and FADH 2 and thus lead to formation of a large number 
of ATP molecules during oxidative phosphorylation. 

We saw in Chapter 14 that the energy yield from the 
production of two molecules of pyruvate from one mole- 
cule of glucose in glycolysis is 2 ATP and 2 NADH. In ox- 
idative phosphorylation (Chapter 19), passage of two 
electrons from NADH to 0 2 drives the formation of about 
2.5 ATP, and passage of two electrons from FADH 2 to 0 2 
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FIGURE 16-13 Products of one turn of the citric acid cycle. At each 
turn of the cycle, three NADH, one FADH 2/ one GTP (or ATP), and two 
C0 2 are released in oxidative decarboxylation reactions. Here and in 
several following figures, all cycle reactions are shown as proceeding 
in one direction only, but keep in mind that most of the reactions are 
reversible (see Fig. 16-7), 

yields about 1.5 ATP. This stoichiometry allows us to cal- 
culate the overall yield of ATP from the complete oxida- 
tion of glucose. When both pyruvate molecules are 
oxidized to 6 C0 2 via the pyruvate dehydrogenase com- 
plex and the citric acid cycle, and the electrons are trans- 
ferred to 0 2 via oxidative phosphorylation, as many as 32 
ATP are obtained per glucose (Table 16-1). In round 
numbers, this represents the conservation of 32 X 30.5 
kJ/mol = 976 kJ/mol, or 34% of the theoretical maximum 



TABLE 16-1 



Reaction 



Stoichiometry of Coenzyme Reduction and ATP Formation in the Aerobic Oxidation of Glucose via Glycolysis, 
the Pyruvate Dehydrogenase Complex Reaction, the Citric Acid Cyde, and Oxidative Phosphorylation 

Number of ATP or reduced Number of ATP 

coenzyme directly formed ultimately formed* 



Glucose > glucose 6-phosphate 

Fructose 6-phosphate > fructose 1,6-bisphosphate 

2 Glyceraldehyde 3-phosphate ► 2 1,3-bisphosphoglycerate 

2 1,3-Bisphosphoglycerate ► 2 3-phosphoglycerate 

2 Phosphoenolpyruvate > 2 pyruvate 

2 Pyruvate > 2 acetyl-CoA 

2 Isocitrate ► 2 a-ketoglutarate 

2 a-Ketoglutarate > 2 succinyl-CoA 

2 Succinyl-CoA > 2 succinate 

2 Succinate > 2 fumarate 

2 Malate ► 2 oxaloacetate 

Total 



-1 ATP 
-1 ATP 

2 NADH 

2 ATP 

2 ATP 

2 NADH 

2 NADH 

2 NADH 

2 ATP (or 2 GTP) 
2 FADH 2 
2 NADH 



-1 
-1 
3 or 

2 
2 
5 
5 
5 
2 
3 
5 

30-32 



*This is calculated as 2.5 ATP per NADH and 1.5 ATP per FADH 2 . A negative value indicates consumption. 

+ This number is either 3 or 5, depending on the mechanism used to shuttle NADH equivalents from the cytosol to the mitochondrial matrix; see Figures 19-30 and 19-31, 
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of about 2,840 kJ/mol available from the complete oxida- 
tion of glucose. These calculations employ the standard 
free-energy changes; when corrected for the actual free 
energy required to form ATP within cells (see Worked 
Example 13-2, p. 503), the calculated efficiency of the 
process is closer to 65%. 

Why is the Oxidation of Acetate So Compiicated? 

The eight-step cyclic process for oxidation of simple 
two-carbon acetyl groups to C0 2 may seem unnecessar- 
ily cumbersome and not in keeping with the biological 
principle of maximum economy. The role of the citric 
acid cycle is not confined to the oxidation of acetate, 
however. This pathway is the hub of intermediary me- 
tabolism. Four- and five-carbon end products of many 
catabolic processes feed into the cycle to serve as fuels. 
Oxaloacetate and a-ketoglutarate, for example, are pro- 
duced from aspartate and glutamate, respectively, when 
proteins are degraded. Under some metabolic circum- 
stances, intermediates are drawn out of the cycle to be 
used as precursors in a variety of biosynthetic pathways. 

The citric acid cycle, like all other metabolic path- 
ways, is the product of evolution, and much of this 
evolution occurred before the advent of aerobic organ- 
isms. It does not necessarily represent the shortest 
pathway from acetate to C0 2 , but it is the pathway 
that has, over time, conferred the greatest selective 
advantage. Early anaerobes most probably used some 



Acetyl-CoA 



PEP 
or 

pyruvate 



C0 2 

N 



Oxaloacetate 



/ 

Malate 

\ 

Fumarate 

\ 

Succinate 



Citrate 

\ 

Isocitrate 



i 



a-Ketoglutarate 



Succinyl-CoA ' 



Biosynthetic 

products 
(amino acids, 
nucleotides, 
heme, etc.) 



FIGURE 16-14 Biosynthetic precursors produced by an incomplete 
citric acid cycle in anaerobic bacteria. These anaerobes lack a- 
ketoglutarate dehydrogenase and therefore cannot carry out the 
complete citric acid cycle. a-Ketoglutarate and succinyl-CoA serve 
as precursors in a variety of biosynthetic pathways. (See Fig. 16-13 
for the "normal" direction of these reactions in the citric acid cycle.) 



of the reactions of the citric acid cycle in linear biosyn- 
thetic processes. In fact, some modern anaerobic mi- 
croorganisms use an incomplete citric acid cycle as a 
source of, not energy, but biosynthetic precursors 
(Fig. 16-14). These organisms use the first three re- 
actions of the cycle to make a-ketoglutarate but, lack- 
ing a-ketoglutarate dehydrogenase, they cannot carry 
out the complete set of citric acid cycle reactions. 
They do have the four enzymes that catalyze the re- 
versible conversion of oxaloacetate to succinyl-CoA 
and can produce malate, fumarate, succinate, and suc- 
cinyl-CoA from oxaloacetate in a reversal of the "nor- 
mal" (oxidative) direction of flow through the cycle. 
This pathway is a fermentation, with the NADH pro- 
duced by isocitrate oxidation recycled to NAD + by re- 
duction of oxaloacetate to succinate. 

With the evolution of cyanobacteria that produced 
0 2 from water, the earth's atmosphere became aerobic 
and organisms were under selective pressure to develop 
aerobic metabolism, which, as we have seen, is much 
more efficient than anaerobic fermentation. 

Citric Acid Cycle Components Are important 
Biosynthetic Intermediates 

In aerobic organisms, the citric acid cycle is an amphi- 
bolic pathway, one that serves in both catabolic and 
anabolic processes. Besides its role in the oxidative ca- 
tabolism of carbohydrates, fatty acids, and amino acids, 
the cycle provides precursors for many biosynthetic 
pathways (Fig. 16-15), through reactions that served the 
same purpose in anaerobic ancestors. a-Ketoglutarate 
and oxaloacetate can, for example, serve as precursors 
of the amino acids aspartate and glutamate by simple 
transamination (Chapter 22). Through aspartate and glu- 
tamate, the carbons of oxaloacetate and a-ketoglutarate 
are then used to build other amino acids, as well as 
purine and pyrimidine nucleotides. Oxaloacetate is con- 
verted to glucose in gluconeogenesis (see Fig. 15-11). 
Succinyl-CoA is a central intermediate in the synthesis 
of the porphyrin ring of heme groups, which serve as 
oxygen carriers (in hemoglobin and myoglobin) and 
electron carriers (in cytochromes) (see Fig. 22-23). 
And the citrate produced in some organisms is used 
commercially for a variety of purposes (Box 16-4). 

Anaplerotic Reactions Replenish Citric Acid 
Cycle Intermediates 

As intermediates of the citric acid cycle are removed 
to serve as biosynthetic precursors, they are replen- 
ished by anaplerotic reactions (Fig. 16-15; Table 
16-2). Under normal circumstances, the reactions by 
which cycle intermediates are siphoned off into other 
pathways and those by which they are replenished 
are in dynamic balance, so that the concentrations of 
the citric acid cycle intermediates remain almost 
constant. 
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FIGURE 16-15 Role of the citric acid cycle in anabolism. Intermedi- 
ates of the citric acid cycle are drawn off as precursors in many biosyn- 



thetic pathways. Shown in red are four anaplerotic reactions that re- 
plenish depleted cycle intermediates (see Table 16-2), 



Table 16-2 shows the most common anaplerotic reac- 
tions, all of which, in various tissues and organisms, convert 
either pyruvate or phosphoenolpyruvate to oxaloacetate or 
malate. The most important anaplerotic reaction in mam- 
malian liver and kidney is the reversible carboxylation of 
pyruvate by C0 2 to form oxaloacetate, catalyzed by pyru- 
vate carboxylase. When the citric acid cycle is deficient in 
oxaloacetate or any other intermediates, pyruvate is car- 
boxylated to produce more oxaloacetate. The enzymatic 
addition of a carboxyl group to pyruvate requires energy, 
which is supplied by ATP— the free energy required to at- 
tach a carboxyl group to pyruvate is about equal to the free 
energy available from ATP. 



TABLE 16-2 



Reaction 



Anaplerotic Reactions 



Pyruvate carboxylase is a regulatory enzyme and is 
virtually inactive in the absence of acetyl-CoA, its posi- 
tive allosteric modulator. Whenever acetyl-CoA, the fuel 
for the citric acid cycle, is present in excess, it stimu- 
lates the pyruvate carboxylase reaction to produce more 
oxaloacetate, enabling the cycle to use more acetyl-CoA 
in the citrate synthase reaction. 

The other anaplerotic reactions shown in Table 
16-2 are also regulated to keep the level of intermedi- 
ates high enough to support the activity of the citric acid 
cycle. Phosphoenolpyruvate (PEP) carboxylase, for ex- 
ample, is activated by the glycolytic intermediate fruc- 
tose 1,6-bisphosphate, which accumulates when the 



Tissue(s)/organism(s) 



Pyruvate + HC0 3 4- ATP I - oxaloacetate + ADP + P i 

Phosphoenolpyruvate + C0 2 + GDP , - oxaloacetate + GTP 



Phosphoenolpyruvate + HC0 3 , 
Pyruvate + HCO3 + NAD(P)H ■ 



- oxaloacetate 4- Pi 



t malate + NAD(P) H 



Liver, kidney 

Heart, skeletal muscle 

Higher plants, yeast, bacteria 

Widely distributed in eukaryotes 
and bacteria 
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BOX 16-4 



Citrate Synthase, Soda Pop, and the World Food Supply 



Citrate has a number of important industrial applica- 
tions. A quick examination of the ingredients in most 
soft drinks reveals the common use of citric acid to pro- 
vide a tart or fruity flavor. Citric acid is also used as a 
plasticizer and foam inhibitor in the manufacture of cer- 
tain resins, as a mordant to brighten colors, and as an 
antioxidant to preserve the flavors of foods. Citric acid is 
produced industrially by growing the fungus As- 
pergillus niger in the presence of an inexpensive sugar 
source, usually beet molasses. Culture conditions are 
designed to inhibit the reactions of the citric acid cycle 
such that citrate accumulates. 

On a grander scale, citric acid may one day play a 
spectacular role in the alleviation of world hunger. With 
its three negatively charged carboxyl groups, citrate is a 
good chelator of metal ions, and some plants exploit this 
property by releasing citrate into the soil, where it binds 
metal ions and prevents their absorption by the plant. Of 
particular importance is the aluminum ion (Al 3+ ) , which 
is toxic to many plants and causes decreased crop yields 
on 30% to 40% of the world's arable land. Aluminum is 
the most abundant metal in the earth's crust, yet it oc- 
curs mostly in chemical compounds, such as Al(OH) 3 , 
that are biologically inert. However, when soil pH is less 
than 5, Al 3+ becomes soluble and thus can be absorbed 



by plant roots. Acidic soil and Al 3+ toxicity are most 
prevalent in the tropics, where maize yields can be de- 
pressed by as much as 80% . In Mexico, Al 3+ toxicity lim- 
its papaya production to 20,000 hectares, instead of the 
3 million hectares that could theoretically be cultivated. 
One solution would be to raise soil pH with lime, but this 
is economically and environmentally unsound. An alter- 
native would be to breed AT 3 + -resistant plants. Naturally 
resistant plants do exist, and these provide the means 
for a third solution: transferring resistance to crop 
plants by genetic engineering. 

A group of researchers in Mexico has genetically engi- 
neered tobacco and papaya plants to express elevated lev- 
els of bacterial citrate synthase. These plants secrete five 
to six times their normal amount of Al 3+ -chelating citric 
acid and can grow in soils with Al 3+ levels 10 times those 
at which control plants can grow. This degree of resistance 
would allow Mexico to grow papaya on the 3 million 
hectares of land currently rendered unsuitable by Al 3+ . 

Given projected levels of population growth, world 
food production must more than triple in the next 50 years 
to adequately feed 9.6 billion people. A long-term solution 
may turn on increasing crop productivity on the arable 
land affected by aluminum toxicity, and citric acid may 
play an important role in achieving this goal. 



citric acid cycle operates too slowly to process the pyru- 
vate generated by glycolysis. 

Biotin in Pyruvate Carboxylase Carries 
C0 2 Groups 

The pyruvate carboxylase reaction requires the vitamin 
biotin (Fig. 16-16), which is the prosthetic group of 
the enzyme. Biotin plays a key role in many carboxyla- 
tion reactions. It is a specialized carrier of one-carbon 
groups in their most oxidized form: C0 2 . (The transfer 
of one-carbon groups in more reduced forms is medi- 
ated by other cofactors, notably tetrahydrofolate and 
S-adenosylmethionine, as described in Chapter 18.) 
Carboxyl groups are activated in a reaction that con- 
sumes ATP and joins C0 2 to enzyme-bound biotin. This 
"activated" C0 2 is then passed to an acceptor (pyruvate 
in this case) in a carboxylation reaction. 

Pyruvate carboxylase has four identical subunits, 
each containing a molecule of biotin covalently at- 
tached through an amide linkage to the e-amino group 
of a specific Lys residue in the enzyme active site. 
Carboxylation of pyruvate proceeds in two steps (Fig. 
16-16): first, a carboxyl group derived from HCO3 is 
attached to biotin, then the carboxyl group is trans- 
ferred to pyruvate to form oxaloacetate. These two 



steps occur at separate active sites; the long flexible 
arm of biotin transfers activated carboxyl groups from 
the first active site (on one monomer of the tetramer) 
to the second (on the adjacent monomer), function- 
ing much like the long lipoyllysyl arm of E 2 in the 
PDH complex (Fig. 16-6) and the long arm of the 
CoA-like moiety in the acyl carrier protein involved 
in fatty acid synthesis (see Fig. 21-5); these are 
compared in Figure 16-17. Lipoate, biotin, and pan- 
tothenate all enter cells on the same transporter; all 
become covalently attached to proteins by similar re- 
actions; and all provide a flexible tether that allows 
bound reaction intermediates to move from one ac- 
tive site to another in an enzyme complex, without 
dissociating from it — all, that is, participate in sub- 
strate channeling. 

Biotin is a vitamin required in the human diet; it is 
abundant in many foods and is synthesized by intestinal 
bacteria. Biotin deficiency is rare, but can sometimes be 
caused by a diet rich in raw eggs. Egg whites contain a 
large amount of the protein avidin (M T 70,000), which 
binds very tightly to biotin and prevents its absorption in 
the intestine. The avidin of egg whites may be a defense 
mechanism for the potential chick embryo, inhibiting 
the growth of bacteria. When eggs are cooked, avidin is 
denatured (and thereby inactivated) along with all other 
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MECHANISM FIGURE 16-16 The role of biotin in the reaction cat- 
alyzed by pyruvate carboxylase. Biotin is attached to the enzyme 
through an amide bond with the £--amino group of a Lys residue, form- 
ing biotinyl-enzyme. Biotin-mediated carboxylation reactions occur in 
two phases, generally catalyzed in separate active sites on the enzyme 
as exemplified by the pyruvate carboxylase reaction. In the first phase 
(steps® to (3)), bicarbonate is converted to the more activated C0 2 , 
and then used to carboxylate biotin. The biotin acts as a carrier to 
transport the C0 2 from one active site to another on an adjacent 
monomer of the tetrameric enzyme (step @). In the second phase 
(steps (5) to ©), catalyzed in this second active site, the C0 2 reacts 
with pyruvate to form oxaloacetate. 
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egg white proteins. Purified avidin is a useful reagent in 
biochemistry and cell biology. A protein that contains 
covalently bound biotin (derived experimentally or pro- 
duced in vivo) can be recovered by affinity chromato- 
graphy (see Fig. 3-1 7c) based on biotin's strong affinity 



for avidin. The protein is then eluted from the column 
with an excess of free biotin. The very high affinity of bi- 
otin for avidin is also used in the laboratory in the form 
of a molecular glue that can hold two structures to- 
gether (see Fig. 19-28). 
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FIGURE 16-17 Biological tethers. The cofactors lipoate, biotin, and 
the combination of j8-mercaptoethylamine and pantothenate form 
long, flexible arms (blue) on the enzymes to which they are covalently 
bound, acting as tethers that move intermediates from one active site 
to the next. The group shaded pink is in each case the point of attach- 
ment of the activated intermediate to the tether. 

SUMMARY 16.2 Reactions of the Citric 
Acid Cycle 

■ The citric acid cycle (Krebs cycle, TCA cycle) is 
a nearly universal central catabolic pathway in 
which compounds derived from the breakdown 
of carbohydrates, fats, and proteins are oxidized 
to C0 2 , with most of the energy of oxidation 
temporarily held in the electron carriers FADH 2 
and NADH. During aerobic metabolism, these 
electrons are transferred to 0 2 and the energy 
of electron flow is trapped as ATP. 

■ Acetyl-CoA enters the citric acid cycle (in the 
mitochondria of eukaryotes, the cytosol of bacteria) 
as citrate synthase catalyzes its condensation with 
oxaloacetate to form citrate. 

■ In seven sequential reactions, including two 
decarboxylations, the citric acid cycle converts 
citrate to oxaloacetate and releases two C0 2 . The 
pathway is cyclic in that the intermediates of the 
cycle are not used up; for each oxaloacetate 
consumed in the path, one is produced. 

■ For each acetyl-CoA oxidized by the citric acid 
cycle, the energy gain consists of three molecules 
of NADH, one FADH 2 , and one nucleoside 
triphosphate (either ATP or GTP). 

■ Besides acetyl-CoA, any compound that gives 
rise to a four- or five-carbon intermediate of the 



citric acid cycle — for example, the breakdown 
products of many amino acids — can be oxidized by 
the cycle. 

■ The citric acid cycle is amphibolic, serving in both 
catabolism and anabolism; cycle intermediates can 
be drawn off and used as the starting material for 
a variety of biosynthetic products. 

m When intermediates are shunted from the citric 
acid cycle to other pathways, they are replenished 
by several anaplerotic reactions, which produce 
four-carbon intermediates by carboxylation of 
three-carbon compounds; these reactions are 
catalyzed by pyruvate carboxylase, PEP 
carboxykinase, PEP carboxylase, and malic 
enzyme. Enzymes that catalyze carboxylations 
commonly employ biotin to activate C0 2 and to 
carry it to acceptors such as pyruvate or 
phosphoenolpyruvate. 

1 6.3 Regulation of the Citric Acid Cycle 

As we have seen in Chapter 15, the regulation of key en- 
zymes in metabolic pathways, by allosteric effectors and 
by covalent modification, ensures the production of in- 
termediates at the rates required to keep the cell in a 
stable steady state while avoiding wasteful overproduc- 
tion. The flow of carbon atoms from pyruvate into and 
through the citric acid cycle is under tight regulation at 
two levels: the conversion of pyruvate to acetyl-CoA, the 
starting material for the cycle (the pyruvate dehydroge- 
nase complex reaction), and the entry of acetyl-CoA 
into the cycle (the citrate synthase reaction). Acetyl- 
CoA is also produced by pathways other than the PDH 
complex reaction — most cells produce acetyl-CoA from 
the oxidation of fatty acids and certain amino acids — and 
the availability of intermediates from these other path- 
ways is important in the regulation of pyruvate oxidation 
and of the citric acid cycle. The cycle is also regulated 
at the isocitrate dehydrogenase and a-ketoglutarate 
dehydrogenase reactions. 

Production of Acetyl-CoA by the Pyruvate 
Dehydrogenase Complex Is Regulated by Allosteric 
and Covalent Mechanisms 

The PDH complex of mammals is strongly inhibited by 
ATP and by acetyl-CoA and NADH, the products of the 
reaction catalyzed by the complex (Fig. 16-18). The 
allosteric inhibition of pyruvate oxidation is greatly en- 
hanced when long-chain fatty acids are available. AMP, 
CoA, and NAD + , all of which accumulate when too little 
acetate flows into the citric acid cycle, allosterically ac- 
tivate the PDH complex. Thus, this enzyme activity is 
turned off when ample fuel is available in the form of fatty 
acids and acetyl-CoA and when the cell's [ATP]/[ADP] and 
[NADH]/[NAD + ] ratios are high, and it is turned on again 
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FIGURE 16-18 Regulation of metabolite flow from the 
PDH complex through the citric acid cycle in mam- 
mals. The PDH complex is allosterically inhibited when 
[ATP]/[ADP], [NADH]/[NAD + ], and [acetyl-CoA]/[CoA] 
ratios are high, indicating an energy-sufficient metabolic 
state. When these ratios decrease, allosteric activation of 
pyruvate oxidation results. The rate of flow through the 
citric acid cycle can be limited by the availability of the 
citrate synthase substrates, oxaloacetate and acetyl-CoA, 
or of NAD + , which is depleted by its conversion to 
NADH, slowing the three NAD-dependent oxidation 
steps. Feedback inhibition by succinyl-CoA, citrate, and 
ATP also slows the cycle by inhibiting early steps. In 
muscle tissue, Ca 2+ signals contraction and, as shown 
here, stimulates energy-yielding metabolism to replace 
the ATP consumed by contraction. 
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when energy demands are high and the cell requires 
greater flux of acetyl-CoA into the citric acid cycle. 

In mammals, these allosteric regulatory mecha- 
nisms are complemented by a second level of regulation: 
covalent protein modification. The PDH complex is in- 
hibited by reversible phosphorylation of a specific Ser 
residue on one of the two subunits of As noted ear- 
lier, in addition to the enzymes E x , E 2 , and E 3 , the mam- 
malian PDH complex contains two regulatory proteins 
whose sole purpose is to regulate the activity of the 
complex. A specific protein kinase phosphorylates and 
thereby inactivates E 1; and a specific phosphoprotein 
phosphatase removes the phosphoryl group by hydro- 
lysis and thereby activates E 2 . The kinase is allosterically 
activated by ATP: when [ATP] is high (reflecting a suffi- 
cient supply of energy), the PDH complex is inactivated 
by phosphorylation of E : . When [ATP] declines, kinase 
activity decreases and phosphatase action removes the 
phosphoryl groups from E 1; activating the complex. 

The PDH complex of plants, located in the mito- 
chondrial matrix and in plastids, is inhibited by its prod- 
ucts, NADH and acetyl-CoA. The plant mitochondrial 
enzyme is also regulated by reversible phosphorylation; 
pyruvate inhibits the kinase, thus activating the PDH 
complex, and NH4 stimulates the kinase, causing inacti- 
vation of the complex. The PDH complex of E. coli is 



under allosteric regulation similar to that of the mam- 
malian enzyme, but it does not seem to be regulated by 
phosphorylation. 

The Citric Acid Cycle Is Regulated at Its Three 
Exergonic Steps 

The flow of metabolites through the citric acid cycle is 
under stringent regulation. Three factors govern the 
rate of flux through the cycle: substrate availability, inhi- 
bition by accumulating products, and allosteric feed- 
back inhibition of the enzymes that catalyze early steps 
in the cycle. 

Each of the three strongly exergonic steps in the 
cycle — those catalyzed by citrate synthase, isocitrate 
dehydrogenase, and a-ketoglutarate dehydrogenase 
(Fig. 16-18) — can become the rate-limiting step under 
some circumstances. The availability of the substrates 
for citrate synthase (acetyl-CoA and oxaloacetate) 
varies with the metabolic state of the cell and sometimes 
limits the rate of citrate formation. NADH, a product of 
isocitrate and a-ketoglutarate oxidation, accumulates 
under some conditions, and at high [NADH]/[NAD + ] 
both dehydrogenase reactions are severely inhibited by 
mass action. Similarly, in the cell, the malate dehydroge- 
nase reaction is essentially at equilibrium (that is, it is 
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substrate-limited), and when [NADH]/[NAD + ] is high 
the concentration of oxaloacetate is low, slowing the 
first step in the cycle. Product accumulation inhibits all 
three limiting steps of the cycle: succinyl-CoA inhibits 
a-ketoglutarate dehydrogenase (and also citrate syn- 
thase); citrate blocks citrate synthase; and the end 
product, ATP, inhibits both citrate synthase and isoci- 
trate dehydrogenase. The inhibition of citrate synthase 
by ATP is relieved by ADP, an allosteric activator of this 
enzyme. In vertebrate muscle, Ca 2+ , the signal for con- 
traction and for a concomitant increase in demand for 
ATP, activates both isocitrate dehydrogenase and a- 
ketoglutarate dehydrogenase, as well as the PDH com- 
plex. In short, the concentrations of substrates and 
intermediates in the citric acid cycle set the flux through 
this pathway at a rate that provides optimal concentra- 
tions of ATP and NADH. 

Under normal conditions, the rates of glycolysis and 
of the citric acid cycle are integrated so that only as 
much glucose is metabolized to pyruvate as is needed to 
supply the citric acid cycle with its fuel, the acetyl 
groups of acetyl-CoA. Pyruvate, lactate, and acetyl-CoA 
are normally maintained at steady-state concentrations. 
The rate of glycolysis is matched to the rate of the citric 
acid cycle not only through its inhibition by high levels 
of ATP and NADH, which are common to both the gly- 
colytic and respiratory stages of glucose oxidation, but 
also by the concentration of citrate. Citrate, the product 
of the first step of the citric acid cycle, is an important 
allosteric inhibitor of phosphofructokinase-1 in the gly- 
colytic pathway (see Fig. 15-14). 

Substrate Channeling through Multienzyme 
Complexes May Occur in the Citric Acid Cycle 

Although the enzymes of the citric acid cycle are 
usually described as soluble components of the mito- 
chondrial matrix (except for succinate dehydrogenase, 
which is membrane-bound), growing evidence sug- 
gests that within the mitochondrion these enzymes ex- 
ist as multienzyme complexes. The classic approach of 
enzymology — purification of individual proteins from 
extracts of broken cells — was applied with great suc- 
cess to the citric acid cycle enzymes. However, the first 
casualty of cell breakage is higher-level organization 
within the cell — the noncovalent, reversible interaction 
of one protein with another, or of an enzyme with some 
structural component such as a membrane, micro- 
tubule, or microfilament. When cells are broken open, 
their contents, including enzymes, are diluted 100- or 
1,000-fold (Fig. 16-19). 

Several types of evidence suggest that, in cells, 
multienzyme complexes ensure efficient passage of the 
product of one enzyme reaction to the next enzyme in 
the pathway. Such complexes are called metabolons. 
Certain enzymes of the citric acid cycle have been iso- 
lated together as supramolecular complexes, or have 
been found associated with the inner mitochondrial 
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FIGURE 16-19 Dilution of a solution containing a noncovalent protein 
complex — such as one consisting of three enzymes — favors dissocia- 
tion of the complex into its constituents. 

membrane, or have been shown to diffuse in the mito- 
chondrial matrix more slowly than expected for the indi- 
vidual protein in solution. There is strong evidence for 
substrate channeling through multienzyme complexes 
in other metabolic pathways, and many enzymes 
thought of as "soluble" probably function in the cell as 
highly organized complexes that channel intermediates. 
We will encounter other examples of channeling when 
we discuss the biosynthesis of amino acids and nu- 
cleotides in Chapter 22. 

Some Mutations in Enzymes of the Citric Acid 
Cycle Lead to Cancer 

When the mechanisms for regulating a pathway 
such as the citric acid cycle are overwhelmed by 
a major metabolic perturbation, the result can be seri- 
ous disease. Mutations in citric acid cycle enzymes are 
very rare in humans and other mammals, but those that 
do occur are devastating. Genetic defects in the fu- 
marase gene lead to tumors of smooth muscle (leiomas) 
and kidney; mutations in succinate dehydrogenase lead 
to tumors of the adrenal gland (pheochromocytomas). 
In cultured cells with these mutations, fumarate (in the 
case of fumarase mutations) and, to a lesser extent, suc- 
cinate (in the case of succinate dehydrogenase muta- 
tions) accumulate, and this accumulation induces the 
hypoxia-inducible transcription factor HIF-la (see Box 
14-1). The mechanism of tumor formation may be the 
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production of a pseudohypoxic state. In cells with these 
mutations, there is an up-regulation of genes normally 
regulated by HIF-la. These effects of mutations in the 
fumarase and succinate dehydrogenase genes define 
them as tumor suppressor genes (see p. 474). ■ 

SUMMARY 16.3 Regulation of the Citric 
Acid Cycle 

■ The overall rate of the citric acid cycle is controlled 
by the rate of conversion of pyruvate to acetyl-CoA 
and by the flux through citrate synthase, isocitrate 
dehydrogenase, and a-ketoglutarate dehydrogenase. 
These fluxes are largely determined by the 
concentrations of substrates and products: the 
end products ATP and NADH are inhibitory, 

and the substrates NAD + and ADP are 
stimulatory. 

■ The production of acetyl-CoA for the citric acid 
cycle by the PDH complex is inhibited allosterically 
by metabolites that signal a sufficiency of 
metabolic energy (ATP, acetyl-CoA, NADH, 

and fatty acids) and stimulated by metabolites 
that indicate a reduced energy supply (AMP, 
NAD + ,CoA). 

■ Complexes of consecutive enzymes in a pathway 
allow substrate channeling between them. 

16.4 The Glyoxylate Cycle 

Vertebrates cannot convert fatty acids, or the acetate 
derived from them, to carbohydrates. Conversion of 
phosphoenolpyruvate to pyruvate (p. 538) and of pyru- 
vate to acetyl-CoA (Fig. 16-2) are so exergonic as to be 
essentially irreversible. If a cell cannot convert acetate 
into phosphoenolpyruvate, acetate cannot serve as the 
starting material for the gluconeogenic pathway, which 
leads from phosphoenolpyruvate to glucose (see Fig. 
15-11). Without this capacity, then, a cell or organism is 
unable to convert fuels or metabolites that are degraded 
to acetate (fatty acids and certain amino acids) into car- 
bohydrates. 

As noted in the discussion of anaplerotic reactions 
(Table 16-2), phosphoenolpyruvate can be synthesized 
from oxaloacetate in the reversible reaction catalyzed 
by PEP carboxykinase: 

Oxaloacetate + GTP ^=*= 

phosphoenolpyruvate + C0 2 + GDP 

Because the carbon atoms of acetate molecules that enter 
the citric acid cycle appear eight steps later in oxaloac- 
etate, it might seem that this pathway could generate ox- 
aloacetate from acetate and thus generate phospho- 
enolpyruvate for gluconeogenesis. However, as an exami- 
nation of the stoichiometry of the citric acid cycle shows, 
there is no net conversion of acetate to oxaloacetate; in 
vertebrates, for every two carbons that enter the cycle as 



acetyl-CoA, two leave as C0 2 . In many organisms other 
than vertebrates, the glyoxylate cycle serves as a mecha- 
nism for converting acetate to carbohydrate. 

The Glyoxylate Cycle Produces Four-Carbon 
Compounds from Acetate 

In plants, certain invertebrates, and some microorgan- 
isms (including E coli and yeast) acetate can serve 
both as an energy-rich fuel and as a source of phospho- 
enolpyruvate for carbohydrate synthesis. In these 
organisms, enzymes of the glyoxylate cycle catalyze 
the net conversion of acetate to succinate or other four- 
carbon intermediates of the citric acid cycle: 

2 Acetyl-CoA + NAD + + 2H 2 0 > 

succinate + 2CoA + NADH + H + 

In the glyoxylate cycle, acetyl-CoA condenses with ox- 
aloacetate to form citrate, and citrate is converted to isoc- 
itrate, exactly as in the citric acid cycle. The next step, 
however, is not the breakdown of isocitrate by isocitrate 
dehydrogenase but the cleavage of isocitrate by isoci- 
trate lyase, forming succinate and glyoxylate. The gly- 
oxylate then condenses with a second molecule of 
acetyl-CoA to yield malate, in a reaction catalyzed by 
malate synthase. The malate is subsequently oxidized 
to oxaloacetate, which can condense with another mole- 
cule of acetyl-CoA to start another turn of the cycle (Fig. 
16-20). Each turn of the glyoxylate cycle consumes two 
molecules of acetyl-CoA and produces one molecule of 
succinate, which is then available for biosynthetic pur- 
poses. The succinate maybe converted through fumarate 
and malate into oxaloacetate, which can then be con- 
verted to phosphoenolpyruvate by PEP carboxykinase, 
and thus to glucose by gluconeogenesis. Vertebrates do 
not have the enzymes specific to the glyoxylate cycle 
(isocitrate lyase and malate synthase) and therefore can- 
not bring about the net synthesis of glucose from lipids. 

In plants, the enzymes of the glyoxylate cycle are 
sequestered in membrane-bounded organelles called 
glyoxysomes, which are specialized peroxisomes (Fig. 
16-21). Those enzymes common to the citric acid and 
glyoxylate cycles have two isozymes, one specific to mi- 
tochondria, the other to glyoxysomes. Glyoxysomes are 
not present in all plant tissues at all times. They develop 
in lipid-rich seeds during germination, before the devel- 
oping plant acquires the ability to make glucose by pho- 
tosynthesis. In addition to glyoxylate cycle enzymes, 
glyoxysomes contain all the enzymes needed for the 
degradation of the fatty acids stored in seed oils (see 
Fig. 17-13). Acetyl-CoA formed from lipid breakdown 
is converted to succinate via the glyoxylate cycle, and 
the succinate is exported to mitochondria, where citric 
acid cycle enzymes transform it to malate. A cytosolic 
isozyme of malate dehydrogenase oxidizes malate to ox- 
aloacetate, a precursor for gluconeogenesis. Germinat- 
ing seeds can therefore convert the carbon of stored 
lipids into glucose. 
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FIGURE 16-20 Glyoxylate cycle. The citrate synthase, aconitase, and 
malate dehydrogenase of the glyoxylate cycle are isozymes of the citric 
acid cycle enzymes; isocitrate lyase and malate synthase are unique to the 
glyoxylate cycle. Notice that two acetyl groups (pink) enter the cycle and 
four carbons leave as succinate (blue). The glyoxylate cycle was eluci- 
dated by Hans Kornberg and Neil Madsen in the laboratory of Hans Krebs. 
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The Citric Acid and Glyoxylate Cycles Are 
Coordinately Regulated 

In germinating seeds, the enzymatic transformations of 
dicarboxylic and tricarboxylic acids occur in three in- 
tracellular compartments: mitochondria, glyoxysomes, 
and the cytosol. There is a continuous interchange of 
metabolites among these compartments (Fig. 16-22). 

The carbon skeleton of oxaloacetate from the citric 
acid cycle (in the mitochondrion) is carried to the 
glyoxysome in the form of aspartate. Aspartate is con- 
verted to oxaloacetate, which condenses with acetyl-CoA 
derived from fatty acid breakdown. The citrate thus 
formed is converted to isocitrate by aconitase, then split 
into glyoxylate and succinate by isocitrate lyase. The suc- 
cinate returns to the mitochondrion, where it reenters 
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FIGURE 16-21 Electron micrograph of a germinating cucumber seed, 
showing a glyoxysome, mitochondria, and surrounding lipid bodies. 



FIGURE 16-22 Relationship between the glyoxylate and citric acid cycles. The reactions of the 
glyoxylate cycle (in glyoxysomes) proceed simultaneously with, and mesh with, those of the cit- 
ric acid cycle (in mitochondria), as intermediates pass between these compartments. The con- 
version of succinate to oxaloacetate is catalyzed by citric acid cycle enzymes. The oxidation of 
fatty acids to acetyl-CoA is described in Chapter 1 7; the synthesis of hexoses from oxaloacetate 
is described in Chapter 20. 




640 J The Citric Acid Cycle 



the citric acid cycle and is transformed into malate, which 
enters the cytosol and is oxidized (by cytosolic malate de- 
hydrogenase) to oxaloacetate. Oxaloacetate is converted 
via gluconeogenesis into hexoses and sucrose, which can 
be transported to the growing roots and shoot. Four dis- 
tinct pathways participate in these conversions: fatty acid 
breakdown to acetyl-CoA (in glyoxysomes), the glyoxy- 
late cycle (in glyoxysomes), the citric acid cycle (in mito- 
chondria), and gluconeogenesis (in the cytosol). 

The sharing of common intermediates requires that 
these pathways be coordinately regulated. Isocitrate is a 
crucial intermediate, at the branch point between the gly- 
oxylate and citric acid cycles (Fig. 16-23). Isocitrate dehy- 
drogenase is regulated by covalent modification: a specific 
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FIGURE 16-23 Coordinated regulation of glyoxylate and citric acid cy- 
cles. Regulation of isocitrate dehydrogenase activity determines the par- 
titioning of isocitrate between the glyoxylate and citric acid cycles. When 
the enzyme is inactivated by phosphorylation (by a specific protein ki- 
nase), isocitrate is directed into biosynthetic reactions via the glyoxylate 
cycle When the enzyme is activated by dephosphorylation (by a specific 
phosphatase), isocitrate enters the citric acid cycle and ATP is produced. 



protein kinase phosphorylates and thereby inactivates the 
dehydrogenase. This inactivation shunts isocitrate to the 
glyoxylate cycle, where it begins the synthetic route to- 
ward glucose. A phosphoprotein phosphatase removes the 
phosphoryl group from isocitrate dehydrogenase, reacti- 
vating the enzyme and sending more isocitrate through the 
energy-yielding citric acid cycle. The regulatory protein ki- 
nase and phosphoprotein phosphatase are separate enzy- 
matic activities of a single polypeptide. 

Some bacteria, including E. coli, have the full comple- 
ment of enzymes for the glyoxylate and citric acid cycles in 
the cytosol and can therefore grow on acetate as their sole 
source of carbon and energy. The phosphoprotein phos- 
phatase that activates isocitrate dehydrogenase is stimu- 
lated by intermediates of the citric acid cycle and glycolysis 
and by indicators of reduced cellular energy supply (Fig. 
16-23). The same metabolites inhibit the protein kinase 
activity of the bifunctional polypeptide. Thus, the accumu- 
lation of intermediates of the central energy-yielding path- 
ways — indicating energy depletion — results in the activa- 
tion of isocitrate dehydrogenase. When the concentration 
of these regulators falls, signaling a sufficient flux through 
the energy-yielding citric acid cycle, isocitrate dehydroge- 
nase is inactivated by the protein kinase. 

The same intermediates of glycolysis and the citric 
acid cycle that activate isocitrate dehydrogenase are 
allosteric inhibitors of isocitrate lyase. When energy- 
yielding metabolism is sufficiently fast to keep the 
concentrations of glycolytic and citric acid cycle inter- 
mediates low, isocitrate dehydrogenase is inactivated, 
the inhibition of isocitrate lyase is relieved, and isoci- 
trate flows into the glyoxylate pathway, to be used in the 
biosynthesis of carbohydrates, amino acids, and other 
cellular components. 

SUMMARY 16.4 The Glyoxylate Cycle 

■ The glyoxylate cycle is active in the germinating 
seeds of some plants and in certain microorganisms 
that can live on acetate as the sole carbon source. 
In plants, the pathway takes place in glyoxysomes 
in seedlings. It involves several citric acid cycle 
enzymes and two additional enzymes: isocitrate 
lyase and malate synthase. 

■ In the glyoxylate cycle, the bypassing of the two 
decarboxylation steps of the citric acid cycle 
makes possible the net formation of succinate, 
oxaloacetate, and other cycle intermediates from 
acetyl-CoA. Oxaloacetate thus formed can be used 
to synthesize glucose via gluconeogenesis. 

m Vertebrates lack the glyoxylate cycle and cannot 
synthesize glucose from acetate or the fatty acids 
that give rise to acetyl-CoA. 

n The partitioning of isocitrate between the citric 
acid cycle and the glyoxylate cycle is controlled at 
the level of isocitrate dehydrogenase, which is 
regulated by reversible phosphorylation. 
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1. Balance Sheet for the Citric Acid Cycle The citric acid 
cycle has eight enzymes: citrate synthase, aconitase, isocitrate 
dehydrogenase, a-ketoglutarate dehydrogenase, succinyl-CoA 
synthetase, succinate dehydrogenase, fumarase, and malate 
dehydrogenase. 

(a) Write a balanced equation for the reaction catalyzed 
by each enzyme, 

(b) Name the cofactor(s) required by each enzyme reaction. 

(c) For each enzyme determine which of the following 
describes the type of reaction(s) catalyzed: condensation 
(carbon-carbon bond formation); dehydration (loss of water); 
hydration (addition of water); decarboxylation (loss of C0 2 ); 
oxidation-reduction; substrate -level phosphorylation; isomer- 
ization. 

(d) Write a balanced net equation for the catabolism of 
acetyl-CoA to C0 2 . 

2. Net Equation for Glycolysis and the Citric Acid 
Cycle Write the net biochemical equation for the metabolism 
of a molecule of glucose by glycolysis and the citric acid cycle, 
including all cofactors. 

3. Recognizing Oxidation and Reduction Reactions One 

biochemical strategy of many living organisms is the stepwise 
oxidation of organic compounds to C0 2 and H 2 0 and the con- 
servation of a major part of the energy thus produced in the 
form of ATP. It is important to be able to recognize oxidation- 
reduction processes in metabolism. Reduction of an organic 
molecule results from the hydrogenation of a double bond 
(Eqn 1, below) or of a single bond with accompanying cleav- 
age (Eqn 2). Conversely, oxidation results from dehydrogena- 
tion. In biochemical redox reactions, the coenzymes NAD and 



FAD dehydrogenate/hydrogenate organic molecules in the 
presence of the proper enzymes. 
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For each of the metabolic transformations in (a) through (h) , 
determine whether oxidation or reduction has occurred. Bal- 
ance each transformation by inserting H — H and, where neces- 
sary, H 2 0. 
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4. Relationship between Energy Release and the Oxi- 
dation State of Carbon A eukaryotic cell can use glucose 
(C 6 H 12 0 6 ) and hexanoic acid (C 6 H 14 0 2 ) as fuels for cellular 
respiration. On the basis of their structural formulas, which 
substance releases more energy per gram on complete com- 
bustion to C0 2 and H 2 0? 

5. Nicotinamide Coenzymes as Reversible Redox 
Carriers The nicotinamide coenzymes (see Fig. 13-24) can 
undergo reversible oxidation-reduction reactions with specific 
substrates in the presence of the appropriate dehydrogenase. 
In these reactions, NADH + H + serves as the hydrogen 
source, as described in Problem 3. Whenever the coenzyme is 
oxidized, a substrate must be simultaneously reduced: 



Substrate + NADH + H + 

Oxidized Reduced 



product + NAD + 

Reduced Oxidized 



For each of the reactions in (a) through (f), determine 
whether the substrate has been oxidized or reduced or is un- 
changed in oxidation state (see Problem 3). If a redox change 
has occurred, balance the reaction with the necessary amount 
of NAD + , NADH, H + , and H 2 0. The objective is to recognize 
when a redox coenzyme is necessary in a metabolic reaction. 
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6. Pyruvate Dehydrogenase Cofactors and Mechanism 

Describe the role of each cofactor involved in the reaction cat- 
alyzed by the pyruvate dehydrogenase complex. 

7. Thiamine Deficiency Individuals with a thiamine- 
deficient diet have relatively high levels of pyruvate in their 
blood. Explain this in biochemical terms, 

8. Isocitrate Dehydrogenase Reaction What type of 
chemical reaction is involved in the conversion of isocitrate to 
a-ketoglutarate? Name and describe the role of any cofactors. 
What other reaction(s) of the citric acid cycle are of this same 
type? 

9. Stimulation of Oxygen Consumption by Oxaloac- 
etate and Malate In the early 1930s, Albert Szent-Gyorgyi 
reported the interesting observation that the addition of small 
amounts of oxaloacetate or malate to suspensions of minced 
pigeon breast muscle stimulated the oxygen consumption of 
the preparation. Surprisingly, the amount of oxygen consumed 
was about seven times more than the amount necessary for 
complete oxidation (to C0 2 and H 2 0) of the added oxaloac- 
etate or malate. Why did the addition of oxaloacetate or malate 
stimulate oxygen consumption? Why was the amount of oxy- 
gen consumed so much greater than the amount necessary to 
completely oxidize the added oxaloacetate or malate? 

10. Formation of Oxaloacetate in a Mitochondrion In 

the last reaction of the citric acid cycle, malate is dehydro- 
genated to regenerate the oxaloacetate necessary for the en- 
try of acetyl-CoA into the cycle: 



L-Malate + NAD + 



oxaloacetate + NADH + H + 

AG'° = 30.0kJ/mol 



(a) Calculate the equilibrium constant for this reaction at 
25°C. 

(b) Because AG'° assumes a standard pH of 7, the equili- 
brium constant calculated in (a) corresponds to 



[oxaloacetate] [NADH] 
~~ [L-malate][NAD + ] 



The measured concentration of L-malate in rat liver mito- 
chondria is about 0.20 mM when [NAD + ]/[NADH] is 10. Cal- 
culate the concentration of oxaloacetate at pH 7 in these 
mitochondria. 

(c) To appreciate the magnitude of the mitochondrial ox- 
aloacetate concentration, calculate the number of oxaloac- 
etate molecules in a single rat liver mitochondrion, Assume the 
mitochondrion is a sphere of diameter 2.0 jLtm. 
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11. Cofactors for the Citric Acid Cycle Suppose you have 
prepared a mitochondrial extract that contains all of the solu- 
ble enzymes of the matrix but has lost (by dialysis) all the low 
molecular weight cofactors. What must you add to the extract 
so that the preparation will oxidize acetyl-CoA to C0 2 ? 

12. Riboflavin Deficiency How would a riboflavin deficiency 
affect the functioning of the citric acid cycle? Explain your 
answer. 

13. Oxaloacetate Pool What factors might decrease the 
pool of oxaloacetate available for the activity of the citric acid 
cycle? How can the pool of oxaloacetate be replenished? 

14. Energy Yield from the Citric Acid Cycle The reac- 
tion catalyzed by succinyl-CoA synthetase produces the high- 
energy compound GTP. How is the free energy contained in 
GTP incorporated into the cellular ATP pool? 

15. Respiration Studies in Isolated Mitochondria Cel- 
lular respiration can be studied in isolated mitochondria by 
measuring oxygen consumption under different conditions. If 
0.01 m sodium malonate is added to actively respiring mito- 
chondria that are using pyruvate as fuel source, respiration 
soon stops and a metabolic intermediate accumulates. 

(a) What is the structure of this intermediate? 

(b) Explain why it accumulates. 

(c) Explain why oxygen consumption stops. 

(d) Aside from removal of the malonate, how can this inhi- 
bition of respiration be overcome? Explain. 

16. Labeling Studies in Isolated Mitochondria The 

metabolic pathways of organic compounds have often been 
delineated by using a radioactively labeled substrate and 
following the fate of the label. 

(a) How can you determine whether glucose added to a 
suspension of isolated mitochondria is metabolized to C0 2 
and H 2 0? 

(b) Suppose you add a brief pulse of [3- 14 C]pyruvate (la- 
beled in the methyl position) to the mitochondria. After one 
turn of the citric acid cycle, what is the location of the 14 C in 
the oxaloacetate? Explain by tracing the 14 C label through the 
pathway. How many turns of the cycle are required to release 
all the [3- 14 C]pyruvate as C0 2 ? 

17. Pathway of C0 2 in Gluconeogenesis In the first by- 
pass step of gluconeogenesis, the conversion of pyruvate to 
phosphoenolpyruvate (PEP), pyruvate is carboxylated by 
pyruvate carboxylase to oxaloacetate, which is subsequently 
decarboxylated to PEP by PEP carboxykinase (Chapter 14). 
Because the addition of C0 2 is directly followed by the loss of 
C0 2 , you might expect that in tracer experiments, the 14 C of 
14 C0 2 would not be incorporated into PEP, glucose, or any in- 
termediates in gluconeogenesis. However, investigators find 
that when a rat liver preparation synthesizes glucose in the 
presence of 14 C0 2 , 14 C slowly appears in PEP and eventually at 
C-3 and C-4 of glucose. How does the 14 C label get into the 
PEP and glucose? (Hint: During gluconeogenesis in the pres- 
ence of 14 C0 2 , several of the four-carbon citric acid cycle inter- 
mediates also become labeled.) 



18. [1- C]Glucose Catabolism An actively respiring bac- 
terial culture is briefly incubated with [l- 14 C]glucose, and the 
glycolytic and citric acid cycle intermediates are isolated. 
Where is the 14 C in each of the intermediates listed below? 
Consider only the initial incorporation of 14 C, in the first pass 
of labeled glucose through the pathways. 

(a) Fructose 1,6-bisphosphate 

(b) Glyceraldehyde 3-phosphate 

(c) Phosphoenolpyruvate 

(d) Acetyl-CoA 

(e) Citrate 

(f) a-Ketoglutarate 

(g) Oxaloacetate 

I 19. Role of the Vitamin Thiamine People with 
Hfl beriberi, a disease caused by thiamine deficiency, have 
elevated levels of blood pyruvate and a-ketoglutarate, espe- 
cially after consuming a meal rich in glucose. How are these ef- 
fects related to a deficiency of thiamine? 

20. Synthesis of Oxaloacetate by the Citric Acid Cycle 

Oxaloacetate is formed in the last step of the citric acid cycle 
by the NAD + -dependent oxidation of L-malate. Can a net syn- 
thesis of oxaloacetate from acetyl-CoA occur using only the 
enzymes and cofactors of the citric acid cycle, without deplet- 
ing the intermediates of the cycle? Explain. How is oxaloac- 
etate that is lost from the cycle (to biosynthetic reactions) 
replenished? 

21. Oxaloacetate Depletion Mammalian liver can carry 
out gluconeogenesis using oxaloacetate as the starting mate- 
rial (Chapter 14). Would the operation of the citric acid cycle 
be affected by extensive use of oxaloacetate for gluconeogen- 
esis? Explain your answer. 

22. Mode of Action of the Rodenticide Fluoroacetate 

Fluoroacetate, prepared commercially for rodent control, is 
also produced by a South African plant. After entering a cell, 
fluoroacetate is converted to fluoroacetyl-CoA in a reaction 
catalyzed by the enzyme acetate thiokinase: 

F— CH 2 COO" + CoA-SH + ATP > 

F — CH 2 C — S-CoA + AMP + PPi 

0 

The toxic effect of fluoroacetate was studied in an experiment 
using intact isolated rat heart. After the heart was perfused 
with 0.22 mM fluoroacetate, the measured rate of glucose up- 
take and glycolysis decreased, and glucose 6-phosphate and 
fructose 6-phosphate accumulated. Examination of the citric 
acid cycle intermediates revealed that their concentrations 
were below normal, except for citrate, with a concentration 10 
times higher than normal. 

(a) Where did the block in the citric acid cycle occur? 
What caused citrate to accumulate and the other cycle inter- 
mediates to be depleted? 

(b) Fluoroacetyl-CoA is enzymatically transformed in the 
citric acid cycle. What is the structure of the end product of 
fluoroacetate metabolism? Why does it block the citric acid cy- 
cle? How might the inhibition be overcome? 
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(c) In the heart perfusion experiments, why did glucose 
uptake and glycolysis decrease? Why did hexose monophos- 
phates accumulate? 

(d) Why is fluoroacetate poisoning fatal? 

23. Synthesis of L-Malate in Wine Making The tartness 
of some wines is due to high concentrations of L-malate. Write 
a sequence of reactions showing how yeast cells synthesize 
L-malate from glucose under anaerobic conditions in the pres- 
ence of dissolved C0 2 (HCO3 ),, Note that the overall reaction 
for this fermentation cannot involve the consumption of nicoti- 
namide coenzymes or citric acid cycle intermediates. 

24. Net Synthesis of a-Ketoglutarate a-Ketoglutarate 
plays a central role in the biosynthesis of several amino acids. 
Write a sequence of enzymatic reactions that could result in 
the net synthesis of a-ketoglutarate from pyruvate. Your pro- 
posed sequence must not involve the net consumption of other 
citric acid cycle intermediates. Write an equation for the over- 
all reaction and identify the source of each reactant. 

25. Amphibolic Pathways Explain, giving examples, what is 
meant by the statement that the citric acid cycle is amphibolic, 

26. Regulation of the Pyruvate Dehydrogenase Complex 

In animal tissues, the rate of conversion of pyruvate to acetyl- 
CoA is regulated by the ratio of active, phosphorylated to inac- 
tive, unphosphorylated PDH complex. Determine what happens 
to the rate of this reaction when a preparation of rabbit muscle 
mitochondria containing the PDH complex is treated with (a) 
pyruvate dehydrogenase kinase, ATP, and NADH; (b) pyruvate 
dehydrogenase phosphatase and Ca 2+ ; (c) malonate 

27. Commercial Synthesis of Citric Acid Citric acid is 
used as a flavoring agent in soft drinks, fruit juices, and many 
other foods. Worldwide, the market for citric acid is valued at 
hundreds of millions of dollars per year. Commercial produc- 
tion uses the mold Aspergillus niger, which metabolizes su- 
crose under carefully controlled conditions. 

(a) The yield of citric acid is strongly dependent on the 
concentration of FeCl 3 in the culture medium, as indicated in 
the graph. Why does the yield decrease when the concentra- 
tion of Fe 3+ is above or below the optimal value of 0.5 mg/L? 
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(b) Write the sequence of reactions by which A. niger 
synthesizes citric acid from sucrose. Write an equation for the 
overall reaction, 

(c) Does the commercial process require the culture 
medium to be aerated — that is, is this a fermentation or an 
aerobic process? Explain, 



28. Regulation of Citrate Synthase In the presence of 
saturating amounts of oxaloacetate, the activity of citrate syn- 
thase from pig heart tissue shows a sigmoid dependence on 
the concentration of acetyl-CoA, as shown in the graph. When 
succinyl-CoA is added, the curve shifts to the right and the sig- 
moid dependence is more pronounced. 
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On the basis of these observations, suggest how succinyl-CoA 
regulates the activity of citrate synthase. (Hint: See Fig, 6-34.) 
Why is succinyl-CoA an appropriate signal for regulation of the 
citric acid cycle? How does the regulation of citrate synthase 
control the rate of cellular respiration in pig heart tissue? 

29. Regulation of Pyruvate Carboxylase The carboxyla- 
tion of pyruvate by pyruvate carboxylase occurs at a very low 
rate unless acetyl-CoA, a positive allosteric modulator, is pres- 
ent. If you have just eaten a meal rich in fatty acids (triacyl- 
glycerols) but low in carbohydrates (glucose), how does this 
regulatory property shut down the oxidation of glucose to C0 2 
and H 2 0 but increase the oxidation of acetyl-CoA derived from 
fatty acids? 

30. Relationship between Respiration and the Citric 
Acid Cycle Although oxygen does not participate directly 
in the citric acid cycle, the cycle operates only when 0 2 is 
present. Why? 

31. Effect of [NADH]/[NAD + ] on the Citric Acid Cycle 

How would you expect the operation of the citric acid cycle to 
respond to a rapid increase in the [NADH]/[NAD + ] ratio in the 
mitochondrial matrix? Why? 

32. Thermodynamics of Citrate Synthase Reaction in 
Cells Citrate is formed by the condensation of acetyl-CoA 
with oxaloacetate, catalyzed by citrate synthase: 

Oxaloacetate + acetyl-CoA + H 2 0 ► citrate + CoA + H + 

In rat heart mitochondria at pH 7,0 and 25 °C, the concentra- 
tions of reactants and products are: oxaloacetate, 1 /am; acetyl- 
CoA, 1 /j,m; citrate, 220 fiu; and CoA, 65 /am. The standard 
free-energy change for the citrate synthase reaction is -32.2 
kJ/mol. What is the direction of metabolite flow through the 
citrate synthase reaction in rat heart cells? Explain. 

33. Reactions of the Pyruvate Dehydrogenase Complex 

Two of the steps in the oxidative decarboxylation of pyruvate 
(steps @ and © in Fig. 16-6) do not involve any of the three 
carbons of pyruvate yet are essential to the operation of the 
PDH complex. Explain. 
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34. Citric Acid Cycle Mutants There are many cases of hu- 
man disease in which one or another enzyme activity is lacking 
due to genetic mutation. However, cases in which individuals 
lack one of the enzymes of the citric acid cycle are extremely 
rare Why? 

35. Partitioning between the Citric Acid and Glyoxy- 
late Cycles In an organism (such as E coll) that has both the 
citric acid cycle and the glyoxylate cycle, what determines 
which of these pathways isocitrate will enter? 

Data Analysis Problem 

36. How the Citric Acid Cycle Was Determined The de- 
tailed biochemistry of the citric acid cycle was determined by 
several researchers over a period of decades, In a 1937 article, 
Krebs and Johnson summarized their work and the work of 
others in the first published description of this pathway. 

The methods used by these researchers were very differ- 
ent from those of modern biochemistry. Radioactive tracers 
were not commonly available until the 1940s, so Krebs and 
other researchers had to use nontracer techniques to work out 
the pathway. Using freshly prepared samples of pigeon breast 
muscle, they determined oxygen consumption by suspending 
minced muscle in buffer in a sealed flask and measuring the 
volume (in /xL) of oxygen consumed under different condi- 
tions. They measured levels of substrates (intermediates) by 
treating samples with acid to remove contaminating proteins, 
then assaying the quantities of various small organic mole- 
cules. The two key observations that led Krebs and colleagues 
to propose a citric acid cycle as opposed to a linear pathway 
(like that of glycolysis) were made in the following experiments. 

Experiment I. They incubated 460 mg of minced muscle 
in 3 mL of buffer at 40 °C for 150 minutes. Addition of citrate 
increased 0 2 consumption by 893 (jlL compared with samples 
without added citrate They calculated, based on the 0 2 con- 
sumed during respiration of other carbon- containing com- 
pounds, that the expected 0 2 consumption for complete 
respiration of this quantity of citrate was only 302 /xL. 

Experiment II. They measured 0 2 consumption by 460 mg 
of minced muscle in 3 mL of buffer when incubated with 
citrate and/or with 1-phosphoglycerol (glycerol 1-phosphate; 
this was known to be readily oxidized by cellular respiration) 
at 40 °C for 140 minutes. The results are shown in the table. 

Sample Substrate(s) added jtL 0 2 absorbed 

1 No extra 342 

2 0.3 mL 0.2 m 

1-phosphoglycerol 757 

3 0.15 mL 0.02 m citrate 431 

4 0.3 mL 0.2 m 

1-phosphoglycerol and 

0.15 mL 0.02 m citrate 1,385 



(a) Why is 0 2 consumption a good measure of cellular 
respiration? 

(b) Why does sample 1 (unsupplemented muscle tissue) 
consume some oxygen? 

(c) Based on the results for samples 2 and 3, can you con- 
clude that 1-phosphoglycerol and citrate serve as substrates 
for cellular respiration in this system? Explain your reasoning. 

(d) Krebs and colleagues used the results from these ex- 
periments to argue that citrate was "catalytic" — that it helped 
the muscle tissue samples metabolize 1-phosphoglycerol 
more completely. How would you use their data to make this 
argument? 

(e) Krebs and colleagues further argued that citrate was 
not simply consumed by these reactions, but had to be regen- 
erated. Therefore, the reactions had to be a cycle rather than 
a linear pathway. How would you make this argument? 

Other researchers had found that arsenate (AsOf~) in- 
hibits a-ketoglutarate dehydrogenase and that malonate in- 
hibits succinate dehydrogenase. 

(f) Krebs and coworkers found that muscle tissue sam- 
ples treated with arsenate and citrate would consume citrate 
only in the presence of oxygen; and under these conditions, 
oxygen was consumed. Based on the pathway in Figure 16-7, 
what was the citrate converted to in this experiment, and why 
did the samples consume oxygen? 

In their article, Krebs and Johnson further reported the 
following. (1) In the presence of arsenate, 5.48 mmol of citrate 
was converted to 5.07 mmol of a-ketoglutarate. (2) In the 
presence of malonate, citrate was quantitatively converted to 
large amounts of succinate and small amounts of a-ketoglu- 
tarate. (3) Addition of oxaloacetate in the absence of oxygen 
led to production of a large amount of citrate; the amount was 
increased if glucose was also added. 

Other workers had found the following pathway in similar 
muscle tissue preparations: 

Succinate > fumarate > malate > 

oxaloacetate > pyruvate 

(g) Based only on the data presented in this problem, 
what is the order of the intermediates in the citric acid cycle? 
How does this compare with Figure 16-7? Explain your 
reasoning. 

(h) Why was it important to show the quantitative con- 
version of citrate to a-ketoglutarate? 

The Krebs and Johnson article also contains other data 
that filled in most of the missing components of the cycle. The 
only component left unresolved was the molecule that reacted 
with oxaloacetate to form citrate. 

Reference 
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Jack Sprat could eat no fat, 

His wife could eat no lean, 

And so between them both you see, 

They licked the platter clean. 



-John Clarke, Paroemiologia Anglo-Latina 
(Proverbs English and Latin), 1639 
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The oxidation of long-chain fatty acids to acetyl-CoA 
is a central energy -yielding pathway in many organ- 
isms and tissues. In mammalian heart and liver, for 
example, it provides as much as 80% of the energetic 
needs under all physiological circumstances. The elec- 
trons removed from fatty acids during oxidation pass 
through the respiratory chain, driving ATP synthesis; 
the acetyl-CoA produced from the fatty acids may be 
completely oxidized to C0 2 in the citric acid cycle, re- 
sulting in further energy conservation. In some species 
and in some tissues, the acetyl-CoA has alternative 
fates. In liver, acetyl-CoA may be converted to ketone 
bodies — water-soluble fuels exported to the brain and 
other tissues when glucose is not available. In higher 
plants, acetyl-CoA serves primarily as a biosynthetic 
precursor, only secondarily as fuel Although the biolog- 
ical role of fatty acid oxidation differs from organism to 
organism, the mechanism is essentially the same. The 
repetitive four-step process, called )3 oxidation, by 
which fatty acids are converted into acetyl-CoA is the 
main topic of this chapter. 

In Chapter 10 we described the properties of tri- 
acylglycerols (also called triglycerides or neutral fats) 
that make them especially suitable as storage fuels. The 
long alkyl chains of their constituent fatty acids are es- 
sentially hydrocarbons, highly reduced structures with 
an energy of complete oxidation (—38 kJ/g) more than 
twice that for the same weight of carbohydrate or pro- 
tein. This advantage is compounded by the extreme in- 
solubility of lipids in water; cellular triacylglycerols 
aggregate in lipid droplets, which do not raise the os- 
molarity of the cytosol, and they are unsolvated. (In 
storage polysaccharides, by contrast, water of solvation 
can account for two-thirds of the overall weight of 
the stored molecules.) And because of their relative 



chemical inertness, triacylglycerols can be stored in 
large quantity in cells without the risk of undesired 
chemical reactions with other cellular constituents. 

The properties that make triacylglycerols good stor- 
age compounds, however, present problems in their role 
as fuels. Because they are insoluble in water, ingested 
triacylglycerols must be emulsified before they can be 
digested by water-soluble enzymes in the intestine, and 
triacylglycerols absorbed in the intestine or mobilized 
from storage tissues must be carried in the blood bound 
to proteins that counteract their insolubility. To over- 
come the relative stability of the C— C bonds in a fatty 
acid, the carboxyl group at C-l is activated by attach- 
ment to coenzyme A, which allows stepwise oxidation of 
the fatty acyl group at the C-3, or /3, position— hence the 
name p oxidation. 

We begin this chapter with a brief discussion of the 
sources of fatty acids and the routes by which they travel 
to the site of their oxidation, with special emphasis on 
the process in vertebrates. We then describe the chemi- 
cal steps of fatty acid oxidation in mitochondria. The 
complete oxidation of fatty acids to C0 2 and H 2 0 takes 
place in three stages: the oxidation of long-chain fatty 
acids to two-carbon fragments, in the form of acetyl-CoA 
Q3 oxidation); the oxidation of acetyl-CoA to C0 2 in the 
citric acid cycle (Chapter 16); and the transfer of elec- 
trons from reduced electron carriers to the mitochon- 
drial respiratory chain (Chapter 19), In this chapter we 
focus on the first of these stages. We begin our discussion 
of /3 oxidation with the simple case in which a fully satu- 
rated fatty acid with an even number of carbon atoms is 
degraded to acetyl-CoA. We then look briefly at the extra 
transformations necessary for the degradation of unsatu- 
rated fatty acids and fatty acids with an odd number of 
carbons. Finally, we discuss variations on the /3-oxidation 
theme in specialized organelles— peroxisomes and gly- 
oxysomes — and two less common pathways of fatty acid 
catabolism, o) and a oxidation. The chapter concludes 
with a description of an alternative fate for the acetyl- 
CoA formed by jS oxidation in vertebrates: the produc- 
tion of ketone bodies in the liver. 
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17.1 Digestion, Mobilization, and 
Transport of Fats 

Cells can obtain fatty acid fuels from three sources: fats 
consumed in the diet, fats stored in cells as lipid 
droplets, and fats synthesized in one organ for export 
to another. Some species use all three sources under 
various circumstances, others use one or two. Verte- 
brates, for example, obtain fats in the diet, mobilize fats 
stored in specialized tissue (adipose tissue, consisting 
of cells called adipocytes), and, in the liver, convert ex- 
cess dietary carbohydrates to fats for export to other 
tissues. On average, 40% or more of the daily energy re- 
quirement of humans in highly industrialized countries 
is supplied by dietary triacylglycerols (although most 
nutritional guidelines recommend no more than 30% of 
daily caloric intake from fats). Triacylglycerols provide 
more than half the energy requirements of some organs, 
particularly the liver, heart, and resting skeletal muscle. 
Stored triacylglycerols are virtually the sole source of 
energy in hibernating animals and migrating birds. Pro- 



tists obtain fats by consuming organisms lower in the 
food chain, and some also store fats as cytosolic lipid 
droplets. Vascular plants mobilize fats stored in seeds 
during germination, but do not otherwise depend on fats 
for energy. 

Dietary Fats Are Absorbed in the Small Intestine 

In vertebrates, before ingested triacylglycerols can be 
absorbed through the intestinal wall they must be 
converted from insoluble macroscopic fat particles to 
finely dispersed microscopic micelles. This solubiliza- 
tion is carried out by bile salts, such as taurocholic acid 
(p. 357), which are synthesized from cholesterol in the 
liver, stored in the gallbladder, and released into the 
small intestine after ingestion of a fatty meal. Bile salts 
are amphipathic compounds that act as biological de- 
tergents, converting dietary fats into mixed micelles of 
bile salts and triacylglycerols (Fig. 17-1, step ®). Mi- 
celle formation enormously increases the fraction of 
lipid molecules accessible to the action of water-solu- 
ble lipases in the intestine, and lipase action converts 
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FIGURE 17-1 Processing of dietary lipids in vertebrates. Digestion fatty acids released from triacylglycerols are packaged and delivered to 
and absorption of dietary lipids occur in the small intestine, and the muscle and adipose tissues. The eight steps are discussed in the text. 



triacylglycerols to monoacylglycerols (monoglyc- 
erides) and diacylglycerols (diglycerides), free fatty 
acids, and glycerol (step (2)). These products of lipase 
action diffuse into the epithelial cells lining the intes- 
tinal surface (the intestinal mucosa) (step (3)) , where 
they are reconverted to triacylglycerols and packaged 
with dietary cholesterol and specific proteins into 
lipoprotein aggregates called chylomicrons (Fig. 17-2; 
see also Fig. 17-1, step (4)). 

Apolipoproteins are lipid-binding proteins in the 
blood, responsible for the transport of triacylglycerols, 
phospholipids, cholesterol, and cholesteryl esters be- 
tween organs. Apolipoproteins ( u apo" means "detached" 
or "separate," designating the protein in its lipid-free 
form) combine with lipids to form several classes of 
lipoprotein particles, spherical aggregates with hy- 
drophobic lipids at the core and hydrophilic protein side 
chains and lipid head groups at the surface. Various com- 
binations of lipid and protein produce particles of different 
densities, ranging from chylomicrons and very-low-density 
lipoproteins (VLDL) to very-high-density lipoproteins 
(VHDL), which can be separated by ultracentrifugation. 
The structures of these lipoprotein particles and their 
roles in lipid transport are detailed in Chapter 21. 

The protein moieties of lipoproteins are recognized 
by receptors on cell surfaces. In lipid uptake from the in- 
testine, chylomicrons, which contain apolipoprotein C-II 
(apoC-II), move from the intestinal mucosa into the lym- 
phatic system, and then enter the blood, which carries 
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FIGURE 17-2 Molecular structure of a chylomicron. The surface is a 
layer of phospholipids, with head groups facing the aqueous phase 
Triacylglycerols sequestered in the interior (yellow) make up more than 
80% of the mass. Several apolipoproteins that protrude from the sur- 
face (B-48, C-lll, C-II) act as signals in the uptake and metabolism of 
chylomicron contents. The diameter of chylomicrons ranges from 
about 1 00 to 500 nm, 
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them to muscle and adipose tissue (Fig. 17-1, step (5)). 
In the capillaries of these tissues, the extracellular enzyme 
lipoprotein lipase, activated by apoC-II, hydrolyzes 
triacylglycerols to fatty acids and glycerol (step ©), 
which are taken up by cells in the target tissues (step 
®). In muscle, the fatty acids are oxidized for energy; in 
adipose tissue, they are reesterified for storage as tria- 
cylglycerols (step ®). 

The remnants of chylomicrons, depleted of most of 
their triacylglycerols but still containing cholesterol and 
apolipoproteins, travel in the blood to the liver, where 
they are taken up by endocytosis, mediated by recep- 
tors for their apolipoproteins. Triacylglycerols that enter 
the liver by this route may be oxidized to provide energy 
or to provide precursors for the synthesis of ketone bod- 
ies, as described in Section 17.3. When the diet contains 
more fatty acids than are needed immediately for fuel or 
as precursors, the liver converts them to triacylglyc- 
erols, which are packaged with specific apolipoproteins 
into VLDLs. The VLDLs are transported in the blood to 
adipose tissues, where the triacylglycerols are removed 
and stored in lipid droplets within adipocytes. 

Hormones Trigger Mobilization 
of Stored Triacylglycerols 

Neutral lipids are stored in adipocytes (and in steroid- 
synthesizing cells of the adrenal cortex, ovary, and testes) 
in the form of lipid droplets, with a core of sterol esters 
and triacylglycerols surrounded by a monolayer of phos- 
pholipids. The surface of these droplets is coated with 
perilipins, a family of proteins that restrict access to lipid 
droplets, preventing untimely lipid mobilization. When 
hormones signal the need for metabolic energy, triacyl- 
glycerols stored in adipose tissue are mobilized (brought 
out of storage) and transported to tissues (skeletal mus- 
cle, heart, and renal cortex) in which fatty acids can be ox- 
idized for energy production. The hormones epinephrine 
and glucagon, secreted in response to low blood glucose 
levels, activate the enzyme adenylyl cyclase in the 
adipocyte plasma membrane (Fig. 17-3), which produces 
the intracellular second messenger cyclic AMP (cAMP; see 
Fig. 12-4). Cyclic AMP-dependent protein kinase (PKA) 
phosphorylates perilipin A, and the phosphorylated per- 
ilipin causes hormone-sensitive lipase in the cytosol to 
move to the lipid droplet surface, where it can begin hy- 
drolyzing triacylglycerols to free fatty acids and glycerol. 
PKA also phosphorylates hormone-sensitive lipase, dou- 
bling or tripling its activity, but the more than 50-fold in- 
crease in fat mobilization triggered by epinephrine is due 
primarily to perilipin phosphorylation. Cells with defective 
perilipin genes have almost no response to increases in 
cAMP concentration; their hormone-sensitive lipase does 
not associate with lipid droplets. 

As hormone-sensitive lipase hydrolyzes triacylglyc- 
erol in adipocytes, the fatty acids thus released (free 
fatty acids, FFA) pass from the adipocyte into the blood, 
where they bind to the blood protein serum albumin. 
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FIGURE 17-3 Mobilization of triacylglycerols stored in adipose tissue. 

When low levels of glucose in the blood trigger the release of glucagon, 
© the hormone binds its receptor in the adipocyte membrane and thus 
© stimulates adenylyl cyclase, via a G protein, to produce cAMP. This 
activates PKA, which phosphorylates © the hormone-sensitive lipase 
and @ perilipin molecules on the surface of the lipid droplet. Phospho- 
rylation of perilipin permits hormone-sensitive lipase access to the sur- 
face of the lipid droplet, where © it hydrolyzes triacylglycerols to free 
fatty acids. ® Fatty acids leave the adipocyte, bind serum albumin in 
the blood, and are carried in the blood; they are released from the albu- 
min and © enter a myocyte via a specific fatty acid transporter. ® In 
the myocyte, fatty acids are oxidized to C0 2 , and the energy of oxida- 
tion is conserved in ATP, which fuels muscle contraction and other en- 
ergy-requiring metabolism in the myocyte. 

This protein (M r 66,000), which makes up about half of 
the total serum protein, noncovalently binds as many as 
10 fatty acids per protein monomer. Bound to this soluble 
protein, the otherwise insoluble fatty acids are carried to 
tissues such as skeletal muscle, heart, and renal cortex. In 
these target tissues, fatty acids dissociate from albumin 
and are moved by plasma membrane transporters into 
cells to serve as fuel. 

About 95% of the biologically available energy of tri- 
acylglycerols resides in their three long-chain fatty 
acids; only 5% is contributed by the glycerol moiety. The 
glycerol released by lipase action is phosphorylated by 
glycerol kinase (Fig. 17-4), and the resulting glycerol 
3-phosphate is oxidized to dihydroxyacetone phos- 
phate. The glycolytic enzyme those phosphate iso- 
merase converts this compound to glyceraldehyde 
3-phosphate, which is oxidized via glycolysis. 
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FIGURE 17-4 Entry of glycerol into the glycolytic pathway. 

Fatty Acids Are Activated and Transported 
into Mitochondria 

The enzymes of fatty acid oxidation in animal cells are 
located in the mitochondrial matrix, as demonstrated in 
1948 by Eugene P. Kennedy and Albert Lehninger. The 
fatty acids with chain lengths of 12 or fewer carbons 
enter mitochondria without the help of membrane 
transporters. Those with 14 or more carbons, which 
constitute the majority of the FFA obtained in the diet 
or released from adipose tissue, cannot pass directly 
through the mitochondrial membranes — they must first 
undergo the three enzymatic reactions of the carnitine 
shuttle. The first reaction is catalyzed by a family of 
isozymes (different isozymes specific for fatty acids hav- 
ing short, intermediate, or long carbon chains) present 
in the outer mitochondrial membrane, the acyl-CoA 
synthetases, which promote the general reaction 

Fatty acid + CoA + ATP fatty acyl-CoA + AMP + PPi 

Thus, acyl-CoA synthetases catalyze the formation of a 
thioester linkage between the fatty acid carboxyl group and 
the thiol group of coenzyme A to yield a fatty acyl-CoA, 
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MECHANISM FIGURE 17-5 Conversion of a fatty acid to a fatty 
acyl-CoA. The conversion is catalyzed by fatty acyl-CoA synthetase 
and inorganic pyrophosphatase. Fatty acid activation by formation of 



the fatty acyl-CoA derivative occurs in two steps. The overall reaction 
is highly exergonic. ff| Fatty Acyl-CoA Synthetase Mechanism 



coupled to the cleavage of ATP to AMP and PPi. (Recall the 
description of this reaction in Chapter 13, to illustrate how 
the free energy released by cleavage of phosphoanhydride 
bonds in ATP could be coupled to the formation of a high- 
energy compound; p. 508.) The reaction occurs in two 
steps and involves a fatty acyl-adenylate intermediate 
(Fig. 17-5). 

Fatty acyl-CoAs, like acetyl-CoA, are high-energy 
compounds; their hydrolysis to FFA and CoA has a large, 
negative standard free-energy change (AG'°~-31 
kJ/mol). The formation of a fatty acyl-CoA is made more 
favorable by the hydrolysis of two high-energy bonds in 
ATP; the pyrophosphate formed in the activation reaction 
is immediately hydrolyzed by inorganic pyrophosphatase 
(left side of Fig. 17-5), which pulls the preceding activa- 
tion reaction in the direction of fatty acyl-CoA formation. 
The overall reaction is 

Fatty acid + CoA + ATP ► 

fatty acyl-CoA + AMP + 2P d (17-1) 
AG'° = -34kJ/mol 

Fatty acyl-CoA esters formed at the cytosolic side 
of the outer mitochondrial membrane can be trans- 
ported into the mitochondrion and oxidized to produce 
ATP, or they can be used in the cytosol to synthesize 
membrane lipids. Fatty acids destined for mitochondrial 
oxidation are transiently attached to the hydroxyl group 
of carnitine to form fatty acyl-carnitine — the second 
reaction of the shuttle. This transesteiification is catalyzed 
by carnitine acyltransferase I (M T 88,000), in the outer 



membrane. Either the acyl-CoA passes through the outer 
membrane and is converted to the carnitine ester in the in- 
termembrane space (Fig. 17-6), or the carnitine ester is 
formed on the cytosolic face of the outer membrane, then 
moved across the outer membrane to the intermembrane 
space— the current evidence does not reveal which. In ei- 
ther case, passage into the intermembrane space (the 
space between the outer and inner membranes) occurs 
through large pores (formed by the protein porin) in the 
outer membrane. The fatty acyl-carnitine ester then en- 
ters the matrix by facilitated diffusion through the acyl- 
carnitine/carnitine transporter of the inner mito- 
chondrial membrane (Fig. 17-6). 
CH 3 

CH 3 -N^CH 2 -CH-CH 2 -COCr 

CH 3 OH 
Carnitine 

In the third and final step of the carnitine shuttle, the 
fatty acyl group is enzymatically transferred from carni- 
tine to intramitochondrial coenzyme A by carnitine acyl- 
transferase II. This isozyme, located on the inner face of 
the inner mitochondrial membrane, regenerates fatty 
acyl-CoA and releases it, along with free carnitine, into 
the matrix (Fig. 17-6). Carnitine reenters the intermem- 
brane space via the acyl-carnitine/carnitine transporter. 

This three-step process for transferring fatty acids 
into the mitochondrion— esterification to CoA, transes- 
terification to carnitine followed by transport, and 
transesterification back to CoA— links two separate 
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FIGURE 17-6 Fatty acid entry into mitochondria via the acyl-carnitine/ 
carnitine transporter. After fatty acyl-carnitine is formed at the outer 
membrane or in the intermembrane space, it moves into the matrix by 
facilitated diffusion through the transporter in the inner membrane. In 
the matrix, the acyl group is transferred to mitochondrial coenzyme A, 



freeing carnitine to return to the intermembrane space through the 
same transporter. Acyltransferase I is inhibited by malonyl-CoA, the 
first intermediate in fatty acid synthesis {see Fig. 21-1). This inhibition 
prevents the simultaneous synthesis and degradation of fatty acids. 



pools of coenzyme A and of fatty acyl-CoA, one in the 
cytosol, the other in mitochondria. These pools have dif- 
ferent functions. Coenzyme A in the mitochondrial ma- 
trix is largely used in oxidative degradation of pyruvate, 
fatty acids, and some amino acids, whereas cytosolic 
coenzyme A is used in the biosynthesis of fatty acids 
(see Fig. 21-10). Fatty acyl-CoA in the cytosolic pool 
can be used for membrane lipid synthesis or can be 
moved into the mitochondrial matrix for oxidation and 
ATP production. Conversion to the carnitine ester com- 
mits the fatty acyl moiety to the oxidative fate. 

The carnitine-mediated entry process is the rate- 
limiting step for oxidation of fatty acids in mitochondria 
and, as discussed later, is a regulation point. Once inside 
the mitochondrion, the fatty acyl-CoA is acted upon by 
a set of enzymes in the matrix. 

SUMMARY 17.1 Digestion, Mobilization, 
and Transport of Fats 

■ The fatty acids of triacylglycerols furnish a large 
fraction of the oxidative energy in animals. Dietary 
triacylglycerols are emulsified in the small intestine 
by bile salts, hydrolyzed by intestinal lipases, 
absorbed by intestinal epithelial cells, reconverted 
into triacylglycerols, then formed into chylomicrons 
by combination with specific apolipoproteins. 

■ Chylomicrons deliver triacylglycerols to tissues, 
where lipoprotein lipase releases free fatty acids 
for entry into cells. Triacylglycerols stored in 
adipose tissue are mobilized by a hormone-sensitive 
triacylglycerol lipase. The released fatty acids bind 
to serum albumin and are carried in the blood to 
the heart, skeletal muscle, and other tissues that 
use fatty acids for fuel. 



■ Once inside cells, fatty acids are activated at the 
outer mitochondrial membrane by conversion to 
fatty acyl-CoA thioesters. Fatty acyl-CoA to be 
oxidized enters mitochondria in three steps, via 
the carnitine shuttle. 

17.2 Oxidation of Fatty Acids 

As noted earlier, mitochondrial oxidation of fatty acids 
takes place in three stages (Fig. 17-7). In the first 
stage— jS oxidation — fatty acids undergo oxidative re- 
moval of successive two-carbon units in the form of 
acetyl-CoA, starting from the carboxyl end of the fatty 
acyl chain. For example, the 16-carbon palmitic acid 
(palmitate at pH 7) undergoes seven passes through the 
oxidative sequence, in each pass losing two carbons as 
acetyl-CoA. At the end of seven cycles the last two car- 
bons of palmitate (originally C-15 and C-16) remain as 
acetyl-CoA. The overall result is the conversion of the 
16-carbon chain of palmitate to eight two-carbon acetyl 
groups of acetyl-CoA molecules. Formation of each 
acetyl-CoA requires removal of four hydrogen atoms 
(two pairs of electrons and four H + ) from the fatty acyl 
moiety by dehydrogenases. 

In the second stage of fatty acid oxidation, the 
acetyl groups of acetyl-CoA are oxidized to C0 2 in the 
citric acid cycle, which also takes place in the mito- 
chondrial matrix. Acetyl-CoA derived from fatty acids 
thus enters a final common pathway of oxidation with 
the acetyl-CoA derived from glucose via glycolysis and 
pyruvate oxidation (see Fig. 16-1). The first two 
stages of fatty acid oxidation produce the reduced 
electron carriers NADH and FADH 2 , which in the third 
stage donate electrons to the mitochondrial respira- 
tory chain, through which the electrons pass to oxygen 
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FIGURE 17-7 Stages of fatty acid oxidation. Stage 1 : A long-chain fatty 
acid is oxidized to yield acetyl residues in the form of acetyl-CoA. This 
process is called p oxidation. Stage 2: The acetyl groups are oxidized 
to C0 2 via the citric acid cycle. Stage 3: Electrons derived from the 
oxidations of stages 1 and 2 pass to 0 2 via the mitochondrial respira- 
tory chain, providing the energy for ATP synthesis by oxidative 
phosphorylation. 

with the concomitant phosphorylation of ADP to ATP 
(Fig. 17-7). The energy released by fatty acid oxida- 
tion is thus conserved as ATP. 

We now take a closer look at the first stage of fatty 
acid oxidation, beginning with the simple case of a satu- 
rated fatty acyl chain with an even number of carbons, 
then turning to the slightly more complicated cases of 
unsaturated and odd-number chains. We also consider 
the regulation of fatty acid oxidation, the /3-oxidative 
processes as they occur in organelles other than mito- 
chondria, and, finally, two less-general modes of fatty 
acid catabolism, a oxidation and &> oxidation. 

The j3 Oxidation of Saturated Fatty Acids Has Four 
Basic Steps 

Four enzyme-catalyzed reactions make up the first stage 
of fatty acid oxidation (Fig. 17-8a). First, dehydro- 
genation of fatty acyl-CoA produces a double bond be- 
tween the a and /3 carbon atoms (C-2 and C-3), yielding 



a £rans-A 2 -enoyl-CoA (the symbol A 2 designates the 
position of the double bond; you may want to review 
fatty acid nomenclature, p. 343.) Note that the new 
double bond has the trans configuration, whereas the 
double bonds in naturally occurring unsaturated fatty 
acids are normally in the cis configuration. We consider 
the significance of this difference later. 
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FIGURE 17-8 The ^-oxidation pathway, (a) In each pass through this 
four-step sequence, one acetyl residue (shaded in pink) is removed in 
the form of acetyl-CoA from the carboxyl end of the fatty acyl chain- 
in this example palmitate (C 16 ), which enters as palmitoyl-CoA. (b) Six 
more passes through the pathway yield seven more molecules of 
acetyl-CoA, the seventh arising from the last two carbon atoms of the 
1 6-carbon chain. Eight molecules of acetyl-CoA are formed in all. 
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This first step is catalyzed by three isozymes of 
acyl-CoA dehydrogenase, each specific for a range of 
fatty-acyl chain lengths: very-long-chain acyl-CoA dehy- 
drogenase (VLCAD) , acting on fatty acids of 12 to 18 
carbons; medium-chain (MCAD) , acting on fatty acids of 
4 to 14 carbons; and short-chain (SCAD), acting on fatty 
acids of 4 to 8 carbons. All three isozymes are flavopro- 
teins with FAD (see Fig. 13-27) as a prosthetic group. 
The electrons removed from the fatty acyl-CoA are 
transferred to FAD, and the reduced form of the dehy- 
drogenase immediately donates its electrons to an elec- 
tron carrier of the mitochondrial respiratory chain, the 
electron-transferring flavoprotein (ETF) (see Fig. 
19-8). The oxidation catalyzed by an acyl-CoA dehydro- 
genase is analogous to succinate dehydrogenation in the 
citric acid cycle (p. 628); in both reactions the enzyme is 
bound to the inner membrane, a double bond is intro- 
duced into a carboxylic acid between the a and f3 car- 
bons, FAD is the electron acceptor, and electrons from 
the reaction ultimately enter the respiratory chain and 
pass to 0 2 , with the concomitant synthesis of about 1.5 
ATP molecules per electron pair. 

In the second step of the jS-oxidation cycle (Fig. 
17-8a), water is added to the double bond of the trans- A 2 - 
enoyl-CoA to form the l stereoisomer of /Miydroxyacyl- 
CoA (3-hydroxyacyl-CoA). This reaction, catalyzed 
by enoyl-CoA hydratase, is formally analogous to the 
fumarase reaction in the citric acid cycle, in which H 2 0 
adds across an a-fi double bond (p. 628). 

In the third step, L-0-hydroxyacyl-CoA is dehy- 
drogenated to form 0-ketoacyl-CoA, by the action of 
/Miydroxyacyl-CoA dehydrogenase; NAD + is the 
electron acceptor. This enzyme is absolutely specific for 
the L stereoisomer of hydroxyacyl-CoA. The NADH 
formed in the reaction donates its electrons to NADH 
dehydrogenase, an electron carrier of the respiratory 
chain, and ATP is formed from ADP as the electrons 
pass to 0 2 . The reaction catalyzed by £-hydroxyacyl- 
CoA dehydrogenase is closely analogous to the malate 
dehydrogenase reaction of the citric acid cycle (p. 628). 

The fourth and last step of the ^-oxidation cycle is 
catalyzed by acyl-CoA acetyltransf erase, more com- 
monly called thiolase, which promotes reaction of /3- 
ketoacyl-CoA with a molecule of free coenzyme A to 
split off the carboxyl-terminal two-carbon fragment of 
the original fatty acid as acetyl-CoA. The other product 
is the coenzyme A thioester of the fatty acid, now short- 
ened by two carbon atoms (Fig. 17-8a). This reaction is 
called thiolysis, by analogy with the process of hydroly- 
sis, because the /3-ketoacyl-CoA is cleaved by reaction 
with the thiol group of coenzyme A. 

The last three steps of this four-step sequence are 
catalyzed by either of two sets of enzymes, with the en- 
zymes employed depending on the length of the fatty 
acyl chain. For fatty acyl chains of 12 or more carbons, 
the reactions are catalyzed by a multienzyme complex 
associated with the inner mitochondrial membrane, the 



trifunctional protein (TFP). TFP is a heterooctamer 
of a 4 /3 4 subunits. Each a subunit contains two activi- 
ties, the enoyl-CoA hydratase and the /3-hydroxyacyl- 
CoA dehydrogenase; the f3 subunits contain the thiolase 
activity. This tight association of three enzymes may al- 
low efficient substrate channeling from one active site 
to the next, without diffusion of the intermediates away 
from the enzyme surface. When TFP has shortened the 
fatty acyl chain to 12 or fewer carbons, further oxida- 
tions are catalyzed by a set of four soluble enzymes in 
the matrix. 

As noted earlier, the single bond between methyl- 
ene ( — CH 2 — ) groups in fatty acids is relatively stable. 
The /3-oxidation sequence is an elegant mechanism for 
destabilizing and breaking these bonds. The first three 
reactions of (3 oxidation create a much less stable C — C 
bond, in which the a carbon (C-2) is bonded to two 
carbonyl carbons (the /3-ketoacyl-CoA intermediate). 
The ketone function on the 0 carbon (C-3) makes it a 
good target for nucleophilic attack by the — SH of 
coenzyme A, catalyzed by thiolase. The acidity of the a 
hydrogen and the resonance stabilization of the car- 
banion generated by the departure of this hydrogen 
make the terminal— CH 2 — CO — S-CoA a good leaving 
group, facilitating breakage of the a-(3 bond. 

The Four j8-0xidation Steps Are Repeated to Yield 
Acetyl-CoAand ATP 

In one pass through the /3-oxidation sequence, one mol- 
ecule of acetyl-CoA, two pairs of electrons, and four pro- 
tons (H + ) are removed from the long-chain fatty 
acyl-CoA, shortening it by two carbon atoms. The equa- 
tion for one pass, beginning with the coenzyme A ester 
of our example, palmitate, is 

Palmitoyl-CoA + CoA + FAD + NAD + + H 2 0 > 

myristoyl-CoA + acetyl-CoA + FADH 2 + NADH + H + 

(17-2) 

Following removal of one acetyl-CoA unit from palmi- 
toyl-CoA, the coenzyme A thioester of the shortened 
fatty acid (now the 14-carbon myristate) remains. The 
myristoyl-CoA can now go through another set of four /3- 
oxidation reactions, exactly analogous to the first, to 
yield a second molecule of acetyl-CoA and lauroyl-CoA, 
the coenzyme A thioester of the 12-carbon laurate. Alto- 
gether, seven passes through the /3-oxidation sequence 
are required to oxidize one molecule of palmitoyl-CoA to 
eight molecules of acetyl-CoA (Fig. 17-8b). The overall 
equation is 

Palmitoyl-CoA + 7CoA + 7FAD + 7NAD + + 7H 2 0 > 

8 acetyl-CoA + 7FADH 2 + 7NADH + 7H + (17-3) 

Each molecule of FADH 2 formed during oxidation of the 
fatty acid donates a pair of electrons to ETF of the res- 
piratory chain, and about 1.5 molecules of ATP are gen- 
erated during the ensuing transfer of each electron pair 
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to 0 2 . Similarly, each molecule of NADH formed delivers 
a pair of electrons to the mitochondrial NADH dehydro- 
genase, and the subsequent transfer of each pair of 
electrons to 0 2 results in formation of about 2.5 mole- 
cules of ATP. Thus four molecules of ATP are formed for 
each two-carbon unit removed in one pass through the 
sequence. Note that water is also produced in this 
process. Transfer of electrons from NADH or FADH 2 to 
0 2 yields one H 2 0 per electron pair. Reduction of 0 2 by 
NADH also consumes one H + per NADH molecule: 
NADH + H + + \0 2 > NAD + + H 2 0. In hibernat- 
ing animals, fatty acid oxidation provides metabolic 
energy, heat, and water — all essential for survival of 
an animal that neither eats nor drinks for long periods 
(Box 17-1). Camels obtain water to supplement the 
meager supply available in their natural environment by 
oxidation of fats stored in their hump. 



The overall equation for the oxidation of palmitoyl- 
CoA to eight molecules of acetyl-CoA, including the 
electron transfers and oxidative phosphorylations, is 

Palmitoyl-CoA + 7CoA + 70 2 + 28Pi + 28ADP ► 

8 acetyl-CoA + 28ATP + 7H 2 0 (17-4) 

Acetyl-CoA Can Be Further Oxidized 
in the Citric Acid Cycle 

The acetyl-CoA produced from the oxidation of fatty acids 
can be oxidized to C0 2 and H 2 0 by the citric acid cycle. The 
following equation represents the balance sheet for the sec- 
ond stage in the oxidation of palmitoyl-CoA, together with 
the coupled phosphorylations of the third stage: 

8 Acetyl-CoA + 160 2 + 80Pi + 80ADP ► 

8CoA + 80ATP + 16C0 2 + 16H 2 0 (17-5) 



BOX 17-1 Fat Bears Carry Out fi Oxidation in Their Sleef 



Many animals depend on fat stores for energy during 
hibernation, during migratory periods, and in other situa- 
tions involving radical metabolic adjustments. One of the 
most pronounced adjustments of fat metabolism occurs 
in hibernating grizzly bears. These animals remain in a 
continuous state of dormancy for periods as long as seven 
months. Unlike most hibernating species, the bear main- 
tains a body temperature of between 32 and 35 °C, close to 
the normal (nonhibernating) level. Although expending 
about 25,000 kJ/day (6,000 kcal/day), the bear does not 
eat, drink, urinate, or defecate for months at a time. 

Experimental studies have shown that hibernating 
grizzly bears use body fat as their sole fuel. Fat oxidation 
yields sufficient energy for maintenance of body tem- 
perature, active synthesis of amino acids and proteins, 
and other energy-requiring activities, such as membrane 
transport. Fat oxidation also releases large amounts of 



water, as described in the text, which replenishes water 
lost in breathing. The glycerol released by degradation 
of triacylglycerols is converted into blood glucose by 
gluconeogenesis. Urea formed during breakdown of 
amino acids is reabsorbed in the kidneys and recycled, 
the amino groups reused to make new amino acids for 
maintaining body proteins. 

Bears store an enormous amount of body fat in 
preparation for their long sleep. An adult grizzly con- 
sumes about 38,000 kJ/day during the late spring and 
summer, but as winter approaches it feeds 20 hours a day, 
consuming up to 84,000 kJ daily. This change in feeding is 
a response to a seasonal change in hormone secretion. 
Large amounts of triacylglycerols are formed from the 
huge intake of carbohydrates during the fattening-up 
period. Other hibernating species, including the tiny 
dormouse, also accumulate large amounts of body fat. 




A grizzly bear prepares its hibernation nest, near the McNeil River in Canada. 
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^^3QQ| : :.y|eld of ATP during Oxidation of One Molecule of PalmitoyKoA to C0 2 and H 2 0 

Number of NADH or Number of ATP 

Enzyme catalyzing the oxidation step FADH 2 formed ultimately formed* 



Acyl-CoA dehydrogenase 


7 FADH 2 


10.5 


/3-Hydroxyacyl-CoA dehydrogenase 


7 NADH 


17.5 


Isocitrate dehydrogenase 


8 NADH 


20 


a-Ketoglutarate dehydrogenase 


8 NADH 


20 


Succinyl-CoA synthetase 




8 f 


Succinate dehydrogenase 


8 FADH 2 


12 


Malate dehydrogenase 


8 NADH 


20 


Total 




108 



*These calculations assume that mitochondrial oxidative phosphorylation produces 1.5 ATP per FADH 2 oxidized and 2.5 ATP per NADH oxidized. 
^TP produced directly in this step yields ATP in the reaction catalyzed by nucleoside diphosphate kinase (p. 510). 



Combining Equations 17-4 and 17-5, we obtain the 
overall equation for the complete oxidation of palmitoyl- 
CoA to carbon dioxide and water: 



Palmitoyl-CoA + 230 2 + 108Pi + 108ADP > 

CoA + 108ATP + 16C0 2 + 23H 2 0 



(17-6) 



Table 17-1 summarizes the yields of NADH, FADH 2 , and 
ATP in the successive steps of palmitoyl-CoA oxidation. 
Note that because the activation of palmitate to palmitoyl- 
CoA breaks both phosphoanhydride bonds in ATP 
(Fig. 17-5), the energetic cost of activating a fatty acid 
is equivalent to two ATP, and the net gain per molecule 
of palmitate is 106 ATP. The standard free-energy 
change for the oxidation of palmitate to C0 2 and H 2 0 is 
about 9,800 kJ/mol. Under standard conditions, the en- 
ergy recovered as the phosphate bond energy of ATP 
is 106 X 30.5 kJ/mol = 3,230 kJ/mol, about 33% of the 
theoretical maximum. However, when the free-energy 
changes are calculated from actual concentrations of 
reactants and products under intracellular conditions 
(see Worked Example 13-2, p. 503), the free-energy 
recovery is more than 60%; the energy conservation is 
remarkably efficient. 

Oxidation of Unsaturated Fatty Acids Requires 
Two Additional Reactions 

The fatty acid oxidation sequence just described is typi- 
cal when the incoming fatty acid is saturated (that is, 
has only single bonds in its carbon chain). However, 
most of the fatty acids in the triacylglycerols and phos- 
pholipids of animals and plants are unsaturated, having 
one or more double bonds. These bonds are in the cis 
configuration and cannot be acted upon by enoyl-CoA 
hydratase, the enzyme catalyzing the addition of H 2 0 to 
the trans double bond of the A 2 -enoyl-CoA generated 
during /3 oxidation. Two auxiliary enzymes are needed 
for p oxidation of the common unsaturated fatty acids: 
an isomerase and a reductase. We illustrate these auxil- 
iary reactions with two examples. 



Oleate is an abundant 18-carbon monounsaturated 
fatty acid with a cis double bond between C-9 and C-10 
(denoted A 9 ). In the first step of oxidation, oleate is con- 
verted to oleoyl-CoA and, like the saturated fatty acids, 
enters the mitochondrial matrix via the carnitine shuttle 
(Fig. 17-6). Oleoyl-CoA then undergoes three passes 
through the fatty acid oxidation cycle to yield three mol- 
ecules of acetyl-CoA and the coenzyme A ester of a A 3 , 
12-carbon unsaturated fatty acid, czs-A 3 -dodecenoyl-CoA 
(Fig. 17-9). This product cannot serve as a substrate for 
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FIGURE 17-9 Oxidation of a monounsaturated fatty acid. Oleic acid, 
as oleoyl-CoA (A 9 ), is the example used here. Oxidation requires an 
additional enzyme, enoyl-CoA isomerase, to reposition the double 
bond, converting the cis isomer to a trans isomer, a normal intermediate 
in jS oxidation. 
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enoyl-CoA hydratase, which acts only on trans double 
bonds. The auxiliary enzyme A 3 ,A 2 -enoyl-CoA iso- 

merase isomerizes the c£s-A 3 -enoyl-CoA to the trans- 
A 2 -enoyl-CoA, which is converted by enoyl-CoA 
hydratase into the corresponding L-/3-hydroxyacyl-CoA 
(£rans-A 2 -dodecenoyl-CoA). This intermediate is now 
acted upon by the remaining enzymes of /3 oxidation to 
yield acetyl-CoA and the coenzyme A ester of a 10-carbon 
saturated fatty acid, decanoyl-CoA. The latter undergoes 
four more passes through the pathway to yield five more 
molecules of acetyl-CoA. Altogether, nine acetyl-CoAs 
are produced from one molecule of the 18-carbon oleate. 

The other auxiliary enzyme (a reductase) is re- 
quired for oxidation of polyunsaturated fatty acids — for 
example, the 18-carbon linoleate, which has a cis- 
A 9 ,cis-A 12 configuration (Fig. 17-10). Linoleoyl-CoA 
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FIGURE 17-10 Oxidation of a polyunsaturated fatty acid. The exam- 
ple here is linoleic acid, as linoleoyl-CoA (A 9,12 ). Oxidation requires a 
second auxiliary enzyme in addition to enoyl-CoA isomerase: NADPH- 
dependent 2,4-dienoyl-CoA reductase. The combined action of these 
two enzymes converts a trans-A 2 ,c/s-A 4 -dienoyl-CoA intermediate to 
the frans-A 2 -enoyl-CoA substrate necessary for p oxidation. 



undergoes three passes through the /3-oxidation se- 
quence to yield three molecules of acetyl-CoA and the 
coenzyme A ester of a 12-carbon unsaturated fatty acid 
with a cis-k s ,cis-A6 configuration. This intermediate 
cannot be used by the enzymes of the /3-oxidation path- 
way; its double bonds are in the wrong position and have 
the wrong configuration (cis, not trans). However, the 
combined action of enoyl-CoA isomerase and 2,4- 
dienoyl-CoA reductase, as shown in Figure 17-10, al- 
lows reentry of this intermediate into the ^-oxidation 
pathway and its degradation to six acetyl-CoAs. The 
overall result is conversion of linoleate to nine molecules 
of acetyl-CoA. 

Complete Oxidation of Odd-Number Fatty Acids 
Requires Three Extra Reactions 

Although most naturally occurring lipids contain fatty 
acids with an even number of carbon atoms, fatty acids 
with an odd number of carbons are common in the lipids 
of many plants and some marine organisms. Cattle 
and other ruminant animals form large amounts of the 
three-carbon propionate (CH 3 — CH 2 — COO") during 
fermentation of carbohydrates in the rumen. The propi- 
onate is absorbed into the blood and oxidized by the 
liver and other tissues. And small quantities of propi- 
onate are added as a mold inhibitor to some breads and 
cereals, thus entering the human diet. 

Long-chain odd-number fatty acids are oxidized in 
the same pathway as the even-number acids, beginning 
at the carboxyl end of the chain. However, the substrate 
for the last pass through the /3-oxidation sequence is a 
fatty acyl-CoA with a five-carbon fatty acid. When this is 
oxidized and cleaved, the products are acetyl-CoA and 
propionyl-CoA. The acetyl-CoA can be oxidized in the 
citric acid cycle, of course, but propionyl-CoA enters a 
different pathway involving three enzymes. 

Propionyl-CoA is first carboxylated to form the d 
stereoisomer of methylmalonyl-CoA (Fig. 17-11) 
by propionyl-CoA carboxylase, which contains the 
cofactor biotin. In this enzymatic reaction, as in the 
pyruvate carboxylase reaction (see Fig. 16-16), C0 2 
(or its hydrated ion, HCO3 ) is activated by attach- 
ment to biotin before its transfer to the substrate, in 
this case the propionate moiety. Formation of the car- 
boxybiotin intermediate requires energy, which is pro- 
vided by the cleavage of ATP to ADP and Pi. The 
D-methylmalonyl-CoA thus formed is enzymatically 
epimerized to its l stereoisomer by methylmalonyl- 
CoA epimerase (Fig. 17-11). The L-methylmalonyl- 
CoA then undergoes an intramolecular rearrangement 
to form succinyl-CoA, which can enter the citric acid 
cycle. This rearrangement is catalyzed by methyl- 
malonyl-CoA mutase, which requires as its coen- 
zyme 5'-deoxyadenosylcobalamin, or coenzyme 
B 12 , which is derived from vitamin B 12 (cobalamin). 
Box 17-2 describes the role of coenzyme B 12 in this re- 
markable exchange reaction. 



BOX 17-2 Coenzyme B 12 ; A Radical Solution to a Perplexing Probler 



In the methylmalonyl-CoA mutase reaction (see Fig. 
17-11), the group — CO— S-CoAat C-2 of the original 
propionate exchanges position with a hydrogen atom 
at C-3 of the original propionate (Fig. la). Coenzyme 
B 12 is the cofactor for this reaction, as it is for almost 
all enzymes that catalyze reactions of this general 
type (Fig. lb). These coenzyme B 12 -dependent 
processes are among the very few enzymatic reac- 
tions in biology in which there is an exchange of an 
alkyl or substituted alkyl group (X) with a hydrogen 
atom on an adjacent carbon, with no mixing of the 
transferred hydrogen atom with the hydrogen of 
the solvent, H 2 0. How can the hydrogen atom move 
between two carbons without mixing with the enor- 
mous excess of hydrogen atoms in the solvent? 

Coenzyme B 12 is the cofactor form of vitamin B 12j 
which is unique among all the vitamins in that it con- 
tains not only a complex organic molecule but an es- 
sential trace element, cobalt. The complex corrin ring 
system of vitamin B 12 (colored blue in Fig. 2) , to which 
cobalt (as Co 3+ ) is coordinated, is chemically related to 
the porphyrin ring system of heme and heme proteins 
(see Fig. 5-1). A fifth coordination position of cobalt 
is filled by dimethylbenzimidazole ribonucleotide 
(shaded yellow), bound covalently by its 3 '-phosphate 
group to a side chain of the corrin ring, through 
aminoisopropanol. The formation of this complex cofac- 
tor occurs in one of only two known reactions in which 
triphosphate is cleaved from ATP (Fig. 3); the other re- 
action is the formation of S-adenosylmethionine from 
ATP and methionine (see Fig. 18-18). 

Vitamin B 12 as usually isolated is called cyano- 
cobalamin, because it contains a cyano group 
(picked up during purification) attached to cobalt in 
the sixth coordination posi- 
tion. In 5'-deoxyadenosyl- 
cobalamin, the cofactor for 
methylmalonyl-CoA mutase, 
the cyano group is replaced 
by the 5'-deoxyadenosyl 
group (red in Fig. 2), cova- 
lently bound through C-5' 
to the cobalt. The three- 
dimensional structure of the 
cofactor was determined by 
Dorothy Crowfoot Hodgkin 
in 1956, using x-ray crystal- 
lography. 

The key to understanding how coenzyme B 12 
catalyzes hydrogen exchange lies in the properties 
of the covalent bond between cobalt and C-5' of 
the deoxyadenosyl group (Fig. 2). This is a rela- 
tively weak bond; its bond dissociation energy is 
about 110 kJ/mol, compared with 348 kJ/mol for a 
typical C— C bond or 414 kJ/mol for a C— H bond. 
Merely illuminating the compound with visible 
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light is enough to break this Co— C bond. (This ex- 
treme photolability probably accounts for the ab- 
sence of vitamin B 12 in plants.) Dissociation 
produces a 5'-deoxyadenosyl radical and the Co 2+ 
form of the vitamin. The chemical function of 5'-de- 
oxyadenosylcobalamin is to generate free radicals in 
this way, thus initiating a series of transformations such 
as that illustrated in Figure 4— a postulated mechanism 
for the reaction catalyzed by methylmalonyl-CoA 
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mutase and several other coenzyme B 12 -dependent 
transformations. In this postulated mechanism, the 
migrating hydrogen atom never exists as a free 
species and is thus never free to exchange with the 
hydrogen of surrounding water molecules. 

Vitamin B 12 deficiency results in serious disease. 
This vitamin is not made by plants or animals and 
can be synthesized only by a few species of microorgan- 
isms. It is required by healthy people in only minute 
amounts, about 3 ^tg/day. The severe disease perni- 
cious anemia results from failure to absorb vitamin B 12 
efficiently from the intestine, where it is synthesized by 
intestinal bacteria or obtained from digestion of meat. 
Individuals with this disease do not produce sufficient 
amounts of intrinsic factor, a glycoprotein essential to 
vitamin B 12 absorption. The pathology in pernicious 
anemia includes reduced production of erythrocytes, re- 
duced levels of hemoglobin, and severe, progressive im- 
pairment of the central nervous system. Administration 
of large doses of vitamin B 12 alleviates these symptoms 
in at least some cases. ■ 
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FIGURE 17-11 Oxidation of propionyl-CoA produced by ft oxidation 
of odd-number fatty acids. The sequence involves the carboxylation of 
propionyl-CoA to D-methylmalonyl-CoA and conversion of the latter 
to succinyl-CoA. This conversion requires epimerization of d- to 
L-methylmalonyl-CoA, followed by a remarkable reaction in which 
substituents on adjacent carbon atoms exchange positions (see 
Box 17-2). 



Fatty Acid Oxidation Is Tightly Regulated 

Oxidation of fatty acids consumes a precious fuel, and it 
is regulated so as to occur only when the need for en- 
ergy requires it. In the liver, fatty acyl-CoA formed in 
the cytosol has two major pathways open to it: (1) /3 ox- 
idation by enzymes in mitochondria or (2) conversion 
into triacylglycerols and phospholipids by enzymes in 
the cytosol. The pathway taken depends on the rate 
of transfer of long-chain fatty acyl-CoA into mitochon- 
dria. The three-step process (carnitine shuttle) by 
which fatty acyl groups are carried from cytosolic fatty 
acyl-CoA into the mitochondrial matrix (Fig. 17-6) is 
rate-limiting for fatty acid oxidation and is an important 
point of regulation. Once fatty acyl groups have entered 
the mitochondrion, they are committed to oxidation to 
acetyl-CoA. 

Malonyl-CoA, the first intermediate in the cytosolic 
biosynthesis of long-chain fatty acids from acetyl-CoA 



(see Fig, 21-1), increases in concentration whenever the 
animal is well supplied with carbohydrate; excess glu- 
cose that cannot be oxidized or stored as glycogen is 
converted in the cytosol into fatty acids for storage as tri- 
acylglycerol. The inhibition of carnitine acyltransferase I 
by malonyl-CoA ensures that the oxidation of fatty acids 
is inhibited whenever the liver is amply supplied with 
glucose as fuel and is actively making triacylglycerols 
from excess glucose. 

O 

II 

~OOC— CH 2 — C—S-CoA 
Malonyl-CoA 

Two of the enzymes of /3 oxidation are also regu- 
lated by metabolites that signal energy sufficiency. 
When the [NADH]/[NAD + ] ratio is high, jS-hydroxyacyl- 
CoA dehydrogenase is inhibited; in addition, high con- 
centrations of acetyl-CoA inhibit thiolase (Fig. 17-12). 

Recall from Chapter 15 that during periods of vigor- 
ous muscle contraction or during fasting, the fall in [ATP] 
and the rise in [AMP] activate AMPK, the AMP-activated 
protein kinase. AMPK phosphorylates several target en- 
zymes, including acetyl-CoA carboxylase, which catalyzes 
malonyl-CoA synthesis. This phosphorylation and thus 
inhibition of acetyl-CoA carboxylase lowers the concen- 
tration of malonyl-CoA, relieving the inhibition of fatty 
acyl-carnitine transport into mitochondria (Fig. 17-12) 
and allowing /3 oxidation to replenish the supply of ATP 

Transcription Factors Turn on the Synthesis of 
Proteins for Lipid Catabolism 

In addition to the various short-term regulatory mecha- 
nisms that modulate the activity of existing enzymes, tran- 
scriptional regulation can change the number of molecules 
of the enzymes of fatty acid oxidation on a longer time scale, 
minutes to hours. The PPAR family of nuclear receptors 
are transcription factors that affect many metabolic 
processes in response to a variety of fatty acid-like ligands. 
(They were originally recognized as peroxisome prolifera- 
tor-activated receptors, then were found to function more 
broadly.) PPARa acts in muscle, adipose tissue, and liver to 
turn on a set of genes essential for fatty acid oxidation, in- 
cluding the fatty acid transporter, carnitine acyltransferases 
I and II, fatty acyl-CoA dehydrogenases for short, medium, 
long, and very long acyl chains, and related enzymes. This 
response is triggered when a cell or organism has an in- 
creased demand for energy from fat catabolism, such as 
during a fast between meals or under conditions of longer- 
term starvation. Glucagon, released in response to low 
blood glucose, can act through cAMP and the transcription 
factor CREB to turn on certain genes for lipid catabolism. 

Another situation that is accompanied by major 
changes in the expression of the enzymes of fatty acid 
oxidation is the transition from fetal to neonatal metab- 
olism in the heart. In the fetus the principal fuels are 
glucose and lactate, but in the neonatal heart, fatty acids 
are the main fuel. At the time of this transition, PPARa 
is activated and in turn activates the genes essential for 
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FIGURE 17-12 Coordinated regulation of fatty acid synthesis and 
breakdown. When the diet provides a ready source of carbohydrate 
as fuel, (B oxidation of fatty acids is unnecessary and is therefore down- 
regulated. Two enzymes are key to the coordination of fatty acid 
metabolism: acetyl-CoA carboxylase (ACC), the first enzyme in the 
synthesis of fatty acids (see Fig. 21-1), and carnitine acyltransferase I, 
which limits the transport of fatty acids into the mitochondrial matrix 
for (3 oxidation (see Fig, 1 7-6). Ingestion of a high-carbohydrate meal 
raises the blood glucose level and thus © triggers the release of in- 
sulin. @ Insulin-dependent protein phosphatase dephosphorylates 
ACC, activating it. (3) ACC catalyzes the formation of malonyl-CoA 
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(the first intermediate of fatty acid synthesis), and (4) malonyl-CoA in- 
hibits carnitine acyltransferase I, thereby preventing fatty acid entry 
into the mitochondrial matrix. 

When blood glucose levels drop between meals, © glucagon re- 
lease activates cAMP-dependent protein kinase (PKA), which @ phos- 
phorylates and inactivates ACC. The concentration of malonyl-CoA 
falls, the inhibition of fatty acid entry into mitochondria is relieved, 
and © fatty acids enter the mitochondrial matrix and @ become the 
major fuel. Because glucagon also triggers the mobilization of fatty 
acids in adipose tissue, a supply of fatty acids begins arriving in the 
blood. 



fatty acid metabolism. As we will see in Chapter 23, two 
other transcription factors in the PPAR family also play 
crucial roles in setting the enzyme complements — and 
therefore the metabolic activities — of specific tissues at 
particular times (see Fig. 23-42). 

The major site of fatty acid oxidation, at rest and 
during exercise, is skeletal muscle. Endurance training 
increases PPARa expression in muscle, leading to in- 
creased levels of fatty acid-oxidizing enzymes and 
increased oxidative capacity of the muscle. 

Genetic Defects in Fatty Acyl— CoA Dehydrogenases 
Cause Serious Disease 

H Stored triacylglycerols are typically the chief 
source of energy for muscle contraction, and an 
inability to oxidize fatty acids from triacylglycerols has 
serious consequences for health. The most common ge- 
netic defect in fatty acid catabolism in U.S. and northern 
European populations is due to a mutation in the gene 
encoding the medium-chain acyl-CoA dehydroge- 
nase (MCAD). Among northern Europeans, the fre- 
quency of carriers (individuals with this recessive 
mutation on one of the two homologous chromosomes) 
is about 1 in 40, and about 1 individual in 10,000 has the 
disease — that is, has two copies of the mutant MCAD al- 
lele and is unable to oxidize fatty acids of 6 to 12 carbons. 



The disease is characterized by recurring episodes of a 
syndrome that includes fat accumulation in the liver, 
high blood levels of octanoic acid, low blood glucose (hy- 
poglycemia), sleepiness, vomiting, and coma. The pat- 
tern of organic acids in the urine helps in the diagnosis of 
this disease: the urine commonly contains high levels of 
6-carbon to 10-carbon dicarboxylic acids (produced by co 
oxidation) and low levels of urinary ketone bodies (we 
discuss co oxidation below and ketone bodies in Section 
17.3). Although individuals may have no symptoms be- 
tween episodes, the episodes are very serious; mortality 
from this disease is 25% to 60% in early childhood. If the 
genetic defect is detected shortly after birth, the infant 
can be started on a low-fat, high-carbohydrate diet. With 
early detection and careful management of the diet — 
including avoiding long intervals between meals, to pre- 
vent the body from turning to its fat reserves for 
energy — the prognosis for these individuals is good. 

More than 20 other human genetic defects in fatty 
acid transport or oxidation have been documented, 
most much less common than the defect in MCAD. One 
of the most severe disorders results from loss of the 
long-chain /3-hydroxyacyl-CoA dehydrogenase activity 
of the trifunctional protein, TFP. Other disorders in- 
clude defects in the a or /3 subunits that affect all three 
activities of TFP and cause serious heart disease and ab- 
normal skeletal muscle. ■ 
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Peroxisomes Also Carry Out fi Oxidation 

The mitochondrial matrix is the major site of fatty acid 
oxidation in animal cells } but in certain cells other com- 
partments also contain enzymes capable of oxidizing 
fatty acids to acetyl-CoA, by a pathway similar to, but 
not identical with, that in mitochondria. In plant cells, 
the major site of /3 oxidation is not mitochondria but 
peroxisomes. 

In peroxisomes, membrane-enclosed organelles of 
animal and plant cells, the intermediates for /3 oxidation 
of fatty acids are coenzyme A derivatives, and the 
process consists of four steps, as in mitochondrial /3 ox- 
idation (Fig. 17-13): (1) dehydrogenation, (2) addition 
of water to the resulting double bond, (3) oxidation of 
the jS-hydroxyacyl-CoA to a ketone, and (4) thiolytic 
cleavage by coenzyme A. (The identical reactions also 
occur in glyoxysomes, as discussed below.) 

One difference between the peroxisomal and mito- 
chondrial pathways is in the chemistry of the first step. 
In peroxisomes, the flavoprotein acyl-CoA oxidase that 
introduces the double bond passes electrons directly to 
0 2 , producing H 2 0 2 (Fig. 17-13). This strong and poten- 
tially damaging oxidant is immediately cleaved to H 2 0 
and 0 2 by catalase. Recall that in mitochondria, the 
electrons removed in the first oxidation step pass 
through the respiratory chain to 0 2 to produce H 2 0, and 
this process is accompanied by ATP synthesis. In perox- 
isomes, the energy released in the first oxidative step of 
fatty acid breakdown is not conserved as ATP, but is 
dissipated as heat. 

A second important difference between mito- 
chondrial and peroxisomal /3 oxidation in mam- 
mals is in the specificity for fatty acyl-CoAs; the 
peroxisomal system is much more active on very-long- 
chain fatty acids such as hexacosanoic acid (26:0) and 
on branched-chain fatty acids such as phytanic acid and 
pristanic acid (see Fig. 17-17). These less-common 
fatty acids are obtained in the diet from dairy products, 
the fat of ruminant animals, meat, and fish. Their catab- 
olism in the peroxisome involves several auxiliary en- 
zymes unique to this organelle. The inability to oxidize 
these compounds is responsible for several serious hu- 
man diseases. Individuals with Zellweger syndrome 
are unable to make peroxisomes and therefore lack all 
the metabolism unique to that organelle. In X-linked 
adrenoleukodystrophy (XALD), peroxisomes fail to 
oxidize very-long-chain fatty acids, apparently for lack 
of a functional transporter for these fatty acids in the 
peroxisomal membrane. Both defects lead to accumula- 
tion in the blood of very-long-chain fatty acids, espe- 
cially 26:0. XALD affects young boys before the age of 
10 years, causing loss of vision, behavioral distur- 
bances, and death within a few years. ■ 

In mammals, high concentrations of fats in the diet 
result in increased synthesis of the enzymes of peroxi- 
somal jS oxidation in the liver. Liver peroxisomes do not 
contain the enzymes of the citric acid cycle and cannot 
catalyze the oxidation of acetyl-CoA to C0 2 . Instead, 
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FIGURE 17-13 Comparison of p oxidation in mitochondria and in 
peroxisomes and glyoxysomes. The peroxisomal/glyoxysomal system 
differs from the mitochondrial system in two respects: (1) in the first 
oxidative step electrons pass directly to 0 2/ generating H 2 0 2/ and (2) 
the NADH formed in the second oxidative step cannot be reoxidized 
in the peroxisome or glyoxysome, so reducing equivalents are 
exported to the cytosol, eventually entering mitochondria. The acetyl- 
CoA produced by peroxisomes and glyoxysomes is also exported; the 
acetate from glyoxysomes (organelles found only in germinating seeds) 
serves as a biosynthetic precursor (see Fig. 17-14). Acetyl-CoA pro- 
duced in mitochondria is further oxidized in the citric acid cycle. 



long-chain or branched fatty acids are catabolized to 
shorter-chain products, such as hexanoyl-CoA, which 
are exported to mitochondria and completely oxidized. 

Plant Peroxisomes and Glyoxysomes Use Acetyl-CoA 
from j8 Oxidation as a Biosynthetic Precursor 

In plants, fatty acid oxidation does not occur primarily 
in mitochondria (as noted earlier) but in the peroxi- 
somes of leaf tissue and in the glyoxysomes of germinat- 
ing seeds. Plant peroxisomes and glyoxysomes are 
similar in structure and function; glyoxysomes, which 
occur only in germinating seeds, may be considered spe- 
cialized peroxisomes. The biological role of ft oxidation 
in these organelles is to use stored lipids primarily to 
provide biosynthetic precursors, not energy. 
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During seed germination, stored triacylglycerols are 
converted into glucose, sucrose, and a wide variety of 
essential metabolites (Fig. 17-14). Fatty acids re- 
leased from the triacylglycerols are first activated to 
their coenzyme A derivatives and oxidized in gly- 
oxysomes by the same four-step process that takes 
place in peroxisomes (Fig. 17-13). The acetyl-CoA 
produced is converted via the glyoxylate cycle (see Fig. 
16-20) to four-carbon precursors for gluconeogenesis 
(see Fig. 14-19). Glyoxysomes, like peroxisomes, con- 
tain high concentrations of catalase, which converts the 
H 2 0 2 produced by £ oxidation to H 2 0 and 0 2 . 

The /J-Oxidation Enzymes of Different Organelles 
Have Diverged during Evolution 

Although the /3-oxidation reactions in mitochondria are 
essentially the same as those in peroxisomes and gly- 
oxysomes, the enzymes (isozymes) differ significantly 
between the two types of organelles. The differences 
apparently reflect an evolutionary divergence that oc- 
curred very early, with the separation of gram-positive 
and gram-negative bacteria (see Fig. 1-6). 

In mitochondria, the four /3-oxidation enzymes 
that act on short-chain fatty acyl-CoAs are separate, 
soluble proteins (as noted earlier), similar in structure 
to the analogous enzymes of gram-positive bacteria 
(Fig. 17-15a). The gram-negative bacteria have four 
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FIGURE 17-14 Triacylglycerols as glucose source in seeds. j0 Oxida- 
tion is one stage in a pathway that converts stored triacylglycerols to 
glucose in germinating seeds. For more detail, see Figure 16-22. 

activities in three soluble subunits (Fig. 17-1 5b), and 
the eukaryotic enzyme system that acts on long-chain 
fatty acids — the trifunctional protein, TFP — has three 
enzyme activities in two subunits that are membrane- 
associated (Fig. 17-15c). The /3-oxidation enzymes 
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FIGURE 17-15 The enzymes of p oxidation. Shown here are the different subunit struc- 
tures of the enzymes of oxidation in gram-positive and gram-negative bacteria, mito- 
chondria, and plant peroxisomes and glyoxysomes. Enz! is acyl-CoA dehydrogenase; 
Enz 2 , enoyl-CoA hydratase; Enz 3 , L-/3-hydroxyacyl-CoA dehydrogenase; Enz 4/ thiolase; 
Enz 5/ D-3-hydroxyacyl-CoA epimerase, and Enz 6 , A 3 ,A 2 -enoyl-CoA isomerase. (a) The 
four enzymes of jS oxidation in gram-positive bacteria are separate, soluble entities, as are 
those of the short-chain-specific system of mitochondria, (b) In gram-negative bacteria, 
the four enzyme activities reside in three polypeptides; Enz 2 and Enz 3 are parts of a single 
polypeptide chain, (c) The very-long-chain-specific system of mitochondria is also com- 
posed of three polypeptides, one of which includes the activities of Enz 2 and Enz 3 ; in this 
case, the system is bound to the inner mitochondrial membrane, (d) In the peroxisomal 
and glyoxysomal ^-oxidation systems of plants, EnZi and Enz 4 are separate polypeptides, 
but Enz 2 and Enz 3 , as well as two auxiliary enzymes (Enz 5 and Enz 6 ), are part of a single 
polypeptide chain: the multifunctional protein, MFP. 
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of plant peroxisomes and glyoxysomes, however, form 
a complex of proteins, one of which contains four 
enzymatic activities in a single polypeptide chain 
(Fig. 17-15d). The first enzyme, acyl-CoA oxidase, is a 
single polypeptide chain; the multifunctional protein 
(MFP) contains the second and third enzyme activities 
(enoyl-CoA hydratase and hydroxyacyl-CoA dehydro- 
genase) as well as two auxiliary activities needed for 
the oxidation of unsaturated fatty acids (D-3-hydroxy- 
acyl-CoA epimerase and A 3 ,A 2 -enoyl-CoA isomerase); 
the fourth enzyme, thiolase, is a separate, soluble 
polypeptide. 

It is interesting that the enzymes that catalyze es- 
sentially the reversal of jS oxidation in the synthesis of 
fatty acids are also organized differently in bacteria and 
eukaryotes; in bacteria, the seven enzymes needed for 
fatty acid synthesis are separate polypeptides, but in 
mammals, all seven activities are part of a single, huge 
polypeptide chain. One advantage to the cell in having 
several enzymes of the same pathway encoded in a sin- 
gle polypeptide chain is that this solves the problem of 
regulating the synthesis of enzymes that must interact 
functionally; regulation of the expression of one gene 
ensures production of the same number of active sites 
for all enzymes in the path. When each enzyme activity 
is on a separate polypeptide, some mechanism is re- 
quired to coordinate the synthesis of all the gene prod- 
ucts. The disadvantage of having several activities on 
the same polypeptide is that the longer the polypeptide 
chain, the greater is the probability of a mistake in its 
synthesis: a single incorrect amino acid in the chain may 
make all the enzyme activities in that chain useless. 
Comparison of the gene structures for these proteins in 
many species may shed light on the reasons for the se- 
lection of one or the other strategy in evolution. 

The oj Oxidation of Fatty Acids Occurs 
in the Endoplasmic Reticulum 

Although mitochondrial (3 oxidation, in which enzymes 
act at the carboxyl end of a fatty acid, is by far the most 
important catabolic fate for fatty acids in animal cells, 
there is another pathway in some species, including ver- 
tebrates, that involves oxidation of the co (omega) car- 
bon — the carbon most distant from the carboxyl group. 
The enzymes unique to to oxidation are located (in ver- 
tebrates) in the endoplasmic reticulum of liver and kid- 
ney, and the preferred substrates are fatty acids of 10 or 
12 carbon atoms. In mammals co oxidation is normally a 
minor pathway for fatty acid degradation, but when /3 ox- 
idation is defective (because of mutation or a carnitine 
deficiency, for example) it becomes more important. 

The first step introduces a hydroxyl group onto the 
co carbon (Fig. 17-16), The oxygen for this group 
comes from molecular oxygen (0 2 ) in a complex reac- 
tion that involves cytochrome P450 and the electron 
donor NADPH. Reactions of this type are catalyzed by 
mixed-function oxidases, described in Box 21-1. Two 
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substrate is usually a medium-chain fatty acid; shown here is lauric 
acid (lau rate). This pathway is generally not the major route for oxida- 
tive catabolism of fatty acids. 



more enzymes now act on the co carbon: alcohol dehy- 
drogenase oxidizes the hydroxyl group to an aldehyde, 
and aldehyde dehydrogenase oxidizes the aldehyde 
group to a carboxylic acid, producing a fatty acid with a 
carboxyl group at each end. At this point, either end can 
be attached to coenzyme A, and the molecule can enter 
the mitochondrion and undergo jS oxidation by the nor- 
mal route. In each pass through the ^-oxidation path- 
way, the "double-ended" fatty acid yields dicarboxylic 
acids such as succinic acid, which can enter the citric 
acid cycle, and adipic acid (Fig. 17-16). 

Phytanic Acid Undergoes a Oxidation in Peroxisomes 

The presence of a methyl group on the f3 carbon of 
a fatty acid makes [3 oxidation impossible, and 
these branched fatty acids are catabolized in peroxi- 
somes of animal cells by a oxidation. In the oxidation of 
phytanic acid, for example (Fig. 17-17), phytanoyl-CoA 
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FIGURE 17-17 The a oxidation of a branched-chain fatty acid (phy- 
tanic acid) in peroxisomes. Phytanic acid has a methyl-substituted /3 
carbon and therefore cannot undergo /3 oxidation. The combined 
action of the enzymes shown here removes the carboxyl carbon of 
phytanic acid to produce pristanic acid, in which the jS carbon is un- 
substituted, allowing oxidation. Notice that 0 oxidation of pristanic 
acid releases propionyl-CoA, not acetyl-CoA. This is further catabolized 
as in Figure 17-11. (The details of the reaction that produces pristanal 
remain controversial.) 



is hydroxylated on its a carbon, in a reaction that in- 
volves molecular oxygen; decarboxylated to form an 
aldehyde one carbon shorter; and then oxidized to the 
corresponding carboxylic acid, which now has no sub- 
stituent on the (3 carbon and can be oxidized further by 
(3 oxidation. Refsum's disease, resulting from a genetic 
defect in phytanoyl-CoA hydroxylase, leads to very high 
blood levels of phytanic acid and severe neurological 
problems, including blindness and deafness. ■ 



SUMMARY 17.2 Oxidation of Fatty Acids 

m In the first stage of ft oxidation, four reactions 

remove each acetyl-CoA unit from the carboxyl end 
of a saturated fatty acyl-CoA: (1) dehydrogenation 
of the a and /3 carbons (C-2 and C-3) by 
FAD-linked acyl-CoA dehydrogenases, 
(2) hydration of the resulting trans- A 2 double 
bond by enoyl-CoA hydratase, (3) dehydrogenation 
of the resulting L-/3-hydroxyacyl-CoA by NAD-linked 
/3-hydroxyacyl-CoA dehydrogenase, and (4) CoA- 
requiring cleavage of the resulting /3-ketoacyl-CoA 
by thiolase, to form acetyl-CoA and a fatty 
acyl-CoA shortened by two carbons. The shortened 
fatty acyl-CoA then reenters the sequence. 

■ In the second stage of fatty acid oxidation, the 
acetyl-CoA is oxidized to C0 2 in the citric acid 
cycle. A large fraction of the theoretical yield of 
free energy from fatty acid oxidation is recovered 
as ATP by oxidative phosphorylation, the final 
stage of the oxidative pathway. 

■ Malonyl-CoA, an early intermediate of fatty acid 
synthesis, inhibits carnitine acyltransferase I, 
preventing fatty acid entry into mitochondria. 
This blocks fatty acid breakdown while synthesis 
is occurring. 

■ Genetic defects in the medium-chain acyl-CoA 
dehydrogenase result in serious human disease, 
as do mutations in other components of the 
/3-oxidation system. 

■ Oxidation of unsaturated fatty acids requires two 
additional enzymes: enoyl-CoA isomerase and 
2,4-dienoyl-CoA reductase. Odd-number fatty acids 
are oxidized by the /3-oxidation pathway to yield 
acetyl-CoA and a molecule of propionyl-CoA. This 
is carboxylated to methylmalonyl-CoA, which is 
isomerized to succinyl-CoA in a reaction catalyzed 
by methylmalonyl-CoA mutase, an enzyme 
requiring coenzyme B 12 . 

* Peroxisomes of plants and animals, and glyoxysomes 
of plants, carry out (5 oxidation in four steps similar 
to those of the mitochondrial pathway in animals. 
The first oxidation step, however, transfers 
electrons directly to 0 2 , generating H 2 0 2 . 
Peroxisomes of animal tissues specialize in the 
oxidation of very-long-chain fatty acids and 
branched fatty acids. In glyoxysomes, in 
germinating seeds, /3 oxidation is one step in 
the conversion of stored lipids into a variety of 
intermediates and products. 

■ The reactions of co oxidation, occurring in the 
endoplasmic reticulum, produce dicarboxylic fatty 
acyl intermediates, which can undergo f3 oxidation 
at either end to yield short dicarboxylic acids such 
as succinate. 
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17.3 Ketone Bodies 

In humans and most other mammals, acetyl-CoA formed 
in the liver during oxidation of fatty acids can either en- 
ter the citric acid cycle (stage 2 of Fig. 17-7) or undergo 
conversion to the "ketone bodies," acetone, acetoac- 
etate, and D-)8-hydroxybutyrate, for export to other 
tissues. (The term "bodies" is a historical artifact; the 
term is occasionally applied to insoluble particles, but 
these compounds are quite soluble in blood and urine.) 
Acetone, produced in smaller quantities than the other 
ketone bodies, is exhaled. Acetoacetate and D-/3-hy- 
droxybutyrate are transported by the blood to tissues 
other than the liver (extrahepatic tissues), where they 
are converted to acetyl-CoA and oxidized in the citric 
acid cycle, providing much of the energy required by tis- 
sues such as skeletal and heart muscle and the renal 
cortex. The brain, which preferentially uses glucose as 
fuel, can adapt to the use of acetoacetate or D-/3-hydrox- 
ybutyrate under starvation conditions, when glucose is 
unavailable. The production and export of ketone bod- 
ies from the liver to extrahepatic tissues allows contin- 
ued oxidation of fatty acids in the liver when acetyl-CoA 
is not being oxidized in the citric acid cycle. 
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Ketone Bodies, Formed in the Liver, Are Exported 
to Other Organs as Fuel 

The first step in the formation of acetoacetate, occur- 
ring in the liver (Fig. 17-18), is the enzymatic conden- 
sation of two molecules of acetyl-CoA, catalyzed by 
thiolase; this is simply the reversal of the last step of /3 
oxidation. The acetoacetyl-CoA then condenses with 
acetyl-CoA to form )3-hydroxy-j8-methylglutaryl-CoA 
(HMG-CoA), which is cleaved to free acetoacetate and 
acetyl-CoA. The acetoacetate is reversibly reduced by 
D-jS-hydroxybutyrate dehydrogenase, a mitochondrial 
enzyme, to D-jS-hy droxybutyrate. This enzyme is spe- 
cific for the D stereoisomer; it does not act on L-/3- 
hydroxyacyl-CoAs and is not to be confused with 
L-jS-hydroxyacyl-CoA dehydrogenase of the /3-oxidation 
pathway. 

In healthy people, acetone is formed in very small 
amounts from acetoacetate, which is easily de- 
carboxylated, either spontaneously or by the action of 
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FIGURE 17-18 Formation of ketone bodies from acetyl-CoA. Healthy, 
well-nourished individuals produce ketone bodies at a relatively low 
rate. When acetyl -CoA accumulates (as in starvation or untreated dia- 
betes, for example), thiolase catalyzes the condensation of two acetyl- 
CoA molecules to acetoacetyl-CoA, the parent compound of the three 
ketone bodies. The reactions of ketone body formation occur in the 
matrix of liver mitochondria. The six-carbon compound /3-hydroxy-/3- 
methylglutaryl-CoA (HMG-CoA) is also an intermediate of sterol 
biosynthesis, but the enzyme that forms HMG-CoA in that pathway is 
cytosolic. HMG-CoA lyase is present only in the mitochondrial matrix. 

acetoacetate decarboxylase (Fig. 17-18). Because 
individuals with untreated diabetes produce large quan- 
tities of acetoacetate, their blood contains significant 
amounts of acetone, which is toxic. Acetone is volatile 
and imparts a characteristic odor to the breath, which is 
sometimes useful in diagnosing diabetes. ■ 

In extrahepatic tissues, D-j3-hydroxybutyrate is oxi- 
dized to acetoacetate by D-jS-hydroxybutyrate dehydro- 
genase (Fig. 17-19). The acetoacetate is activated to its 
coenzyme A ester by transfer of CoA from succinyl-CoA, 
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FIGURE 17-19 D-0-Hydroxybutyrate as a fuel. D-/3-Hydroxybutyrate, 
synthesized in the liver, passes into the blood and thus to other tissues, 
where it is converted in three steps to acetyl-CoA. It is first oxidized to 
acetoacetate, which is activated with coenzyme A donated from suc- 
cinyl-CoA, then split by thiolase. The acetyl-CoA thus formed is used 
for energy production. 

an intermediate of the citric acid cycle (see Fig. 16-7), 
in a reaction catalyzed by /3-ketoacyl-CoA trans- 
ferase, also called thiophorase. The acetoacetyl-CoA is 
then cleaved by thiolase to yield two acetyl-CoAs, which 
enter the citric acid cycle. Thus the ketone bodies are 
used as fuels in all tissues except liver, which lacks thio- 
phorase. The liver is therefore a producer of ketone bod- 
ies for the other tissues, but not a consumer. 

The production and export of ketone bodies by the 
liver allows continued oxidation of fatty acids with only 
minimal oxidation of acetyl-CoA. When intermediates of 
the citric acid cycle are being siphoned off for glucose 
synthesis by gluconeogenesis, for example, oxidation of 
cycle intermediates slows — and so does acetyl-CoA oxi- 
dation. Moreover, the liver contains only a limited 
amount of coenzyme A, and when most of it is tied up in 
acetyl-CoA, /3 oxidation slows for want of the free coen- 
zyme. The production and export of ketone bodies frees 
coenzyme A, allowing continued fatty acid oxidation. 

Ketone Bodies Are Overproduced in Diabetes 
and during Starvation 

Starvation and untreated diabetes mellitus lead 
to overproduction of ketone bodies, with several 
associated medical problems. During starvation, gluco- 
neogenesis depletes citric acid cycle intermediates, 



diverting acetyl-CoA to ketone body production (Fig. 
17-20). In untreated diabetes, when the insulin level 
is insufficient, extrahepatic tissues cannot take up 
glucose efficiently from the blood, either for fuel or for 
conversion to fat. Under these conditions, levels of 
malonyl-CoA (the starting material for fatty acid syn- 
thesis) fall, inhibition of carnitine acyltransferase I is 
relieved, and fatty acids enter mitochondria to be de- 
graded to acetyl-CoA — which cannot pass through the 
citric acid cycle because cycle intermediates have 
been drawn off for use as substrates in gluconeogene- 
sis. The resulting accumulation of acetyl-CoA acceler- 
ates the formation of ketone bodies beyond the 
capacity of extrahepatic tissues to oxidize them. The 
increased blood levels of acetoacetate and D-/3-hy- 
droxybutyrate lower the blood pH, causing the condi- 
tion known as acidosis. Extreme acidosis can lead to 
coma and in some cases death. Ketone bodies in the 
blood and urine of individuals with untreated diabetes 
can reach extraordinary levels — a blood concentration 
of 90 mg/100 mL (compared with a normal level of <3 
mg/100 mL) and urinary excretion of 5,000 mg/24 hr 
(compared with a normal rate of <125 mg/24 hr). This 
condition is called ketosis. 

Individuals on very low-calorie diets, using the fats 
stored in adipose tissue as their major energy source, 
also have increased levels of ketone bodies in their blood 
and urine. These levels must be monitored to avoid the 
dangers of acidosis and ketosis (ketoacidosis). ■ 
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FIGURE 17-20 Ketone body formation and export from the liver. 

Conditions that promote gluconeogenesis (untreated diabetes, severely 
reduced food intake) slow the citric acid cycle (by drawing off oxaloac- 
etate) and enhance the conversion of acetyl-CoA to acetoacetate. The 
released coenzyme A allows continued f3 oxidation of fatty acids. 
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SUMMARY 17.3 Ketone Bodies 

■ The ketone bodies — acetone, acetoacetate, and 
D-jS-hydroxybutyrate — are formed in the liver. The 
latter two compounds serve as fuel molecules 

in extrahepatic tissues, through oxidation to 
acetyl-CoA and entry into the citric acid cycle. 

■ Overproduction of ketone bodies in uncontrolled 
diabetes or severely reduced calorie intake can lead 
to acidosis or ketosis. 
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1. Energy in Triacylglycerols On a per-carbon basis, where 
does the largest amount of biologically available energy in tri- 
acylglycerols reside: in the fatty acid portions or the glycerol 
portion? Indicate how knowledge of the chemical structure of 
triacylglycerols provides the answer. 

2. Fuel Reserves in Adipose Tissue Triacylglycerols, with 
their hydrocarbon-like fatty acids, have the highest energy 
content of the major nutrients. 

(a) If 15% of the body mass of a 70.0 kg adult consists of 
triacylglycerols, what is the total available fuel reserve, in both 
kilojoules and kilocalories, in the form of triacylglycerols? 
Recall that 1.00 kcal = 4.18 kJ. 

(b) If the basal energy requirement is approximately 
8,400 kJ/day (2,000 kcal/day), how long could this person sur- 
vive if the oxidation of fatty acids stored as triacylglycerols 
were the only source of energy? 

(c) What would be the weight loss in pounds per day 
under such starvation conditions (1 lb = 0.454 kg)? 

3. Common Reaction Steps in the Fatty Acid Oxidation 
Cycle and Citric Acid Cycle Cells often use the same 
enzyme reaction pattern for analogous metabolic conversions. 
For example, the steps in the oxidation of pyruvate to acetyl- 
CoA and of a-ketoglutarate to succinyl-CoA, although catalyzed 
by different enzymes, are very similar. The first stage of fatty 
acid oxidation follows a reaction sequence closely resembling a 
sequence in the citric acid cycle. Use equations to show the 
analogous reaction sequences in the two pathways. 



4. p Oxidation: How Many Cycles? How many cycles of p 
oxidation are required for the complete oxidation of activated 
oleic acid, 18:1 (A 9 )? 

5. Chemistry of the Acyl-CoA Synthetase Reaction 

Fatty acids are converted to their coenzyme A esters in a 
reversible reaction catalyzed by acyl-CoA synthetase: 



R— COO" + ATP + CoA 



O 

R— C— CoA + AMP + PR 



(a) The enzyme-bound intermediate in this reaction has 
been identified as the mixed anhydride of the fatty acid and 
adenosine monophosphate (AMP), acyl-AMP: 



O 



O 



-c-o-p-o— ch 2 n 
I 

O" 




OH OH 



Write two equations corresponding to the two steps of the 
reaction catalyzed by acyl-CoA synthetase. 

(b) The acyl-CoA synthetase reaction is readily re- 
versible, with an equilibrium constant near 1. How can this re- 
action be made to favor formation of fatty acyl-CoA? 

6. Intermediates in Oleic Acid Oxidation What is the 
structure of the partially oxidized fatty acyl group that is 
formed when oleic acid, 18:1 (A 9 ), has undergone three cycles 
of p oxidation? What are the next two steps in the continued 
oxidation of this intermediate? 

7. p Oxidation of an Odd-Chain Fatty Acid What are the 
direct products of p oxidation of a fully saturated, straight- 
chain fatty acid of 11 carbons? 

8. Oxidation of Tritiated Palmitate Palmitate uni- 
formly labeled with tritium ( 3 H) to a specific activity of 
2.48 X 10 s counts per minute (cpm) per micromole of 
palmitate is added to a mitochondrial preparation that oxi- 
dizes it to acetyl-CoA. The acetyl-CoA is isolated and hy- 
drolyzed to acetate. The specific activity of the isolated 
acetate is 1,00 X 10 7 cpm/jiimol. Is this result consistent 
with the p -oxidation pathway? Explain, What is the final 
fate of the removed tritium? 

9. Compartmentation in p Oxidation Free palmitate is 
activated to its coenzyme A derivative (palmitoyl-CoA) in the 
cytosol before it can be oxidized in the mitochondrion. If 
palmitate and [ 14 C]coenzyme A are added to a liver ho- 
mogenate, palmitoyl-CoA isolated from the cytosolic fraction 
is radioactive, but that isolated from the mitochondrial frac- 
tion is not. Explain. 

10. Comparative Biochemistry: Energy-Generating 
Pathways in Birds One indication of the relative importance 
of various ATP-producing pathways is the V max of certain 
enzymes of these pathways. The values of 7 max of several 
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enzymes from the pectoral muscles (chest muscles used for 
flying) of pigeon and pheasant are listed below. 

v 

' max 

Omol substrate/min/g tissue) 



Enzyme 


Pigeon 


Pheasant 


Hexokinase 


3.0 


2.3 


Glycogen phosphorylase 


18 0 


120.0 


Phosphofructokinase- 1 


24.0 


143.0 


Citrate synthase 


100.0 


15.0 


Triacylglycerol lipase 


0.07 


0.01 



(a) Discuss the relative importance of glycogen metabo- 
lism and fat metabolism in generating ATP in the pectoral mus- 
cles of these birds. 

(b) Compare oxygen consumption in the two birds. 

(c) Judging from the data in the table, which bird is the 
long-distance flyer? Justify your answer. 

(d) Why were these particular enzymes selected for com- 
parison? Would the activities of triose phosphate isomerase 
and malate dehydrogenase be equally good bases for compari- 
son? Explain. 

11. Mutant Carnitine Acyltransf erase What changes in 
metabolic pattern would result from a mutation in the muscle 
carnitine acyltransferase I in which the mutant protein has lost 
its affinity for malonyl-CoA but not its catalytic activity? 

BS 12. Effect of Carnitine Deficiency An individual de- 
Kifl veloped a condition characterized by progressive muscu- 
lar weakness and aching muscle cramps. The symptoms were 
aggravated by fasting, exercise, and a high-fat diet. The ho- 
mogenate of a skeletal muscle specimen from the patient oxi- 
dized added oleate more slowly than did control homogenates, 
consisting of muscle specimens from healthy individuals. When 
carnitine was added to the patient's muscle homogenate, the rate 
of oleate oxidation equaled that in the control homogenates. The 
patient was diagnosed as having a carnitine deficiency. 

(a) Why did added carnitine increase the rate of oleate 
oxidation in the patient's muscle homogenate? 

(b) Why were the patient's symptoms aggravated by fast- 
ing, exercise, and a high-fat diet? 

(c) Suggest two possible reasons for the deficiency of 
muscle carnitine in this individual. 

13. Fatty Acids as a Source of Water Contrary to legend, 
camels do not store water in their humps, which actually con- 
sist of large fat deposits. How can these fat deposits serve as a 
source of water? Calculate the amount of water (in liters) that 
a camel can produce from 1.0 kg of fat. Assume for simplicity 
that the fat consists entirely of tripalmitoylglycerol. 

14. Petroleum as a Microbial Food Source Some mi- 
croorganisms of the genera Nocardia and Pseudomonas can 
grow in an environment where hydrocarbons are the only food 
source. These bacteria oxidize straight-chain aliphatic hydro- 
carbons, such as octane, to their corresponding carboxylic 
acids: 

CH 3 (CH 2 ) 6 CH 3 + NAD + + 0 2 

CH 3 (CH 2 ) 6 COOH + NADH + H + 



How could these bacteria be used to clean up oil spills? What 
would be some of the limiting factors to the efficiency of this 
process? 

15. Metabolism of a Straight-Chain Phenylated Fatty 
Acid A crystalline metabolite was isolated from the urine of a 
rabbit that had been fed a straight-chain fatty acid containing 
a terminal phenyl group: 

^^CH 2 -(CH 2 )„-COO" 

A 302 mg sample of the metabolite in aqueous solution was 
completely neutralized by 22,2 mL of 0.100 M NaOH. 

(a) What is the probable molecular weight and structure 
of the metabolite? 

(b) Did the straight-chain fatty acid contain an even or an 
odd number of methylene ( — CH 2 — ) groups (i.e., is n even or 
odd)? Explain. 

16. Fatty Acid Oxidation in Uncontrolled Diabetes 

| When the acetyl-CoA produced during /3 oxidation in the 
liver exceeds the capacity of the citric acid cycle, the excess acetyl- 
CoA forms ketone bodies — acetone, acetoacetate, and D-/3-hydrox- 
ybutyrate. This occurs in severe, uncontrolled diabetes: because 
the tissues cannot use glucose, they oxidize large amounts of fatty 
acids instead. Although acetyl-CoA is not toxic, the mitochondrion 
must divert the acetyl-CoA to ketone bodies. What problem would 
arise if acetyl-CoA were not converted to ketone bodies? How does 
the diversion to ketone bodies solve the problem? 

17. Consequences of a High-Fat Diet with No Carbohy- 
drates Suppose you had to subsist on a diet of whale blubber 
and seal blubber, with little or no carbohydrate. 

(a) What would be the effect of carbohydrate deprivation 
on the utilization of fats for energy? 

(b) If your diet were totally devoid of carbohydrate, 
would it be better to consume odd- or even-numbered fatty 
acids? Explain. 

18. Even- and Odd-Chain Fatty Acids in the Diet In a 

laboratory experiment, two groups of rats are fed two different 
fatty acids as their sole source of carbon for a month. The first 
group gets heptanoic acid (7:0), and the second gets octanoic 
acid (8:0). After the experiment, a striking difference is seen 
between the two groups. Those in the first group are healthy 
and have gained weight, whereas those in the second group 
are weak and have lost weight as a result of losing muscle 
mass. What is the biochemical basis for this difference? 

19. Metabolic Consequences of Ingesting w-Fluoro- 
oleate The shrub Dichapetalum toxicarium, native to 
Sierra Leone, produces <w-fluorooleate, which is highly toxic to 
warm-blooded animals. 

H H 

I I 

F— CH 2 — (CH 2 ) 7 -C=C— (CH 2 ) 7 — COO" 
o>-Fluorooleate 

This substance has been used as an arrow poison, and pow- 
dered fruit from the plant is sometimes used as a rat poison 
(hence the plant's common name, ratsbane). Why is this sub- 
stance so toxic? (Hint: Review Chapter 16, Problem 22.) 
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20. Mutant Acetyl-CoA Carboxylase What would be the 
consequences for fat metabolism of a mutation in acetyl-CoA 
carboxylase that replaced the Ser residue normally phospho- 
rylated by AMPK to an Ala residue? What might happen if the 
same Ser were replaced by Asp? (Hint: See Fig. 17-12.) 

21. Effect of PDE Inhibitor on Adipocytes How would an 
adipocyte's response to epinephrine be affected by the addi- 
tion of an inhibitor of cAMP phosphodiesterase (PDE)? (Hint: 
See Fig. 12-4.) 

22. Role of FAD as Electron Acceptor Acyl-CoA dehydro- 
genase uses enzyme-bound FAD as a prosthetic group to dehy- 
drogenate the a and /3 carbons of fatty acyl-CoA. What is the 
advantage of using FAD as an electron acceptor rather than 
NAD + ? Explain in terms of the standard reduction potentials 
for the Enz-FAD/F ADH 2 (E to = -0219V) and NAD+/NADH 
(E'° = -0.320V) half-reactions. 

23. ft Oxidation of Arachidic Acid How many turns of the 
fatty acid oxidation cycle are required for complete oxidation 
of arachidic acid (see Table 10-1) to acetyl-CoA? 

24. Fate of Labeled Propionate If [3- 14 C]propionate ( 14 C 
in the methyl group) is added to a liver homogenate, 14 C-la- 
beled oxaloacetate is rapidly produced. Draw a flow chart for 
the pathway by which propionate is transformed to oxaloac- 
etate, and indicate the location of the 14 C in oxaloacetate. 

25. Phytanic Acid Metabolism When phytanic acid uni- 
formly labeled with 14 C is fed to a mouse, radioactivity can be 
detected in malate, a citric acid cycle intermediate, within 
minutes. Draw a metabolic pathway that could account for 
this Which of the carbon atoms in malate would contain 14 C 
label? 

26. Sources of H 2 0 Produced in Oxidation The com 

plete oxidation of palmitoyl-CoA to carbon dioxide and water 
is represented by the overall equation 



Palmitoyl-CoA + 230 2 + 108P, + 108ADP > 

CoA + 16C0 2 + 108ATP 

Water is also produced in the reaction 



23H 9 0 



ADP + ^ > ATP + H 2 0 

but is not included as a product in the overall equation. Why? 

27. Biological Importance of Cobalt In cattle, deer, sheep, 
and other ruminant animals, large amounts of propionate are 
produced in the rumen through the bacterial fermentation of 
ingested plant matter. Propionate is the principal source of glu- 
cose for these animals, via the route propionate -> oxaloacetate 
— » glucose, In some areas of the world, notably Australia, rumi- 
nant animals sometimes show symptoms of anemia with con- 
comitant loss of appetite and retarded growth, resulting from 
an inability to transform propionate to oxaloacetate. This con- 
dition is due to a cobalt deficiency caused by very low cobalt 
levels in the soil and thus in plant matter. Explain. 

28. Fat Loss during Hibernation Bears expend about 
25 X 10 6 J/day during periods of hibernation, which may last 
as long as seven months. The energy required to sustain life is 
obtained from fatty acid oxidation, How much weight loss (in 



kilograms) has occurred after seven months? How might 
ketosis be minimized during hibernation? (Assume the oxida- 
tion of fat yields 38 kJ/g.) 



Data Analysis Problem 



29. p Oxidation of Trans Fats Unsaturated fats with trans 
double bonds are commonly referred to as "trans fats." There 
has been much discussion about the effects of dietary trans 
fats on health. In their investigations of the effects of trans 
fatty acid metabolism on health, Yu and colleagues (2004) 
showed that a model trans fatty acid was processed differently 
from its cis isomer. They used three related 18-carbon fatty 
acids to explore the difference in p oxidation between cis and 
trans isomers of the same-size fatty acid. 

O 



OH 



Stearic acid 
(octadecenoic acid) 




Oleic acid 
(cis-A 9 -octadecenoic acid) 




Elaidic acid 
(£rarcs-A 9 -octadecenoic acid) 

The researchers incubated the coenzyme A derivative of 
each acid with rat liver mitochondria for 5 minutes, then sepa- 
rated the remaining CoA derivatives in each mixture by HPLC 
(high-performance liquid chromatography). The results are 
shown below, with separate panels for the three experiments. 
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In the figure, IS indicates an internal standard (pentade- 
canoyl-CoA) added to the mixture, after the reaction, as a 
molecular marker. The researchers abbreviated the CoA 
derivatives as follows: stearoyl-CoA, Ci 8 -CoA; czs-A 5 -tetrade- 
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cenoyl-CoA, cA 5 C 14 -CoA; oleoyl-CoA, cA 9 C 18 -CoA; trans-k 5 - 
tetradecenoyl-CoA, tA 5 C 14 -CoA; and elaidoyl-CoA, tA 9 C 18 - 
CoA. 



O 



ds-A 5 -Tetradecenoyl-CoA 



£rarc.s-A 5 -Tetradecenoyl-CoA 



S-CoA 



S-CoA 



(a) Why did Yu and colleagues need to use CoA deriva- 
tives rather than the free fatty acids in these experiments? 

(b) Why were no lower molecular weight CoA derivatives 
found in the reaction with stearoyl-CoA? 

(c) How many rounds of j3 oxidation would be required 
to convert the oleoyl-CoA and the elaidoyl-CoA to c£s-A 5 -tetrade- 
cenoyl-CoA and £rcms-A 5 -tetradecenoyl-CoA, respectively? 

There are two forms of the enzyme acyl-CoA dehydrogenase 
(see Fig, 17-8a): long-chain acyl-CoA dehydrogenase (LCAD) and 
very-long-chain acyl-CoA dehydrogenase (VLCAD). Yu and co- 
workers measured the kinetic parameters of both enzymes. 
They used the CoA derivatives of three fatty acids: tetradecanoyl- 
CoA (C 14 -CoA), cis-A 5 -tetradecenoyl-CoA (cA 5 C 14 -CoA), and 
£r<ms-A 5 -tetradecenoyl-CoA (tA 5 C 14 -CoA). The results are shown 
below. (See Chapter 6 for definitions of the kinetic parameters.) 
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(d) For LCAD, the K m differs dramatically for the cis and 
trans substrates. Provide a plausible explanation for this ob- 
servation in terms of the structures of the substrate molecules. 
(Hint: You may want to refer to Fig. 10-2.) 

(e) The kinetic parameters of the two enzymes are rele- 
vant to the differential processing of these fatty acids only if 
the LCAD or VLCAD reaction (or both) is the rate-limiting 
step in the pathway. What evidence is there to support this 
assumption? 

(f) How do these different kinetic parameters explain the 
different levels of the CoA derivatives found after incubation 
of rat liver mitochondria with stearoyl-CoA, oleoyl-CoA, and 
elaidoyl-CoA (shown in the three-panel figure)? 

Yu and coworkers measured the substrate specificity of 
rat liver mitochondrial thioesterase, which hydrolyzes acyl- 
CoA to CoA and free fatty acid (see Chapter 21). This enzyme 
was approximately twice as active with C 14 -CoA thioesters as 
with Ci 8 -CoA thioesters. 

(g) Other research has suggested that free fatty acids 
can pass through membranes. In their experiments, Yu 
and colleagues found £rcms-A 5 -tetradecenoic acid outside 
mitochondria (i.e., in the medium) that had been incu- 
bated with elaidoyl-CoA. Describe the pathway that led to 
this extramitochondrial £rcms-A 5 -tetradecenoic acid. Be 
sure to indicate where in the cell the various transforma- 
tions take place, as well as the enzymes that catalyze the 
transformations. 

(h) It is often said in the popular press that "trans fats are 
not broken down by your cells and instead accumulate in your 
body." In what sense is this statement correct and in what 
sense is it an oversimplification? 

Reference 

Yu, W., Liang, X., Ensenauer, R., Vockley, J., Sweetman, L., & 
Schultz, H. (2004) Leaky jS-oxidation of a irtms-fatty acid. J. Biol. 
Chem 279,52,160-52,167 




1 chose the study of the synthesis of urea in the liver because it ap- 
peared to be a relatively simple problem. 

— Hans Krebs, article in Perspectives in Biology and 

Medicine, 1970 
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We now turn our attention to the amino acids, the 
final class of biomolecules that, through their 
oxidative degradation, make a significant contri- 
bution to the generation of metabolic energy. The frac- 
tion of metabolic energy obtained from amino acids, 
whether they are derived from dietary protein or from 
tissue protein, varies greatly with the type of organism 
and with metabolic conditions. Carnivores can obtain 
(immediately following a meal) up to 90% of their en- 
ergy requirements from amino acid oxidation, whereas 
herbivores may fill only a small fraction of their energy 
needs by this route. Most microorganisms can scavenge 
amino acids from their environment and use them as 
fuel when required by metabolic conditions. Plants, 
however, rarely if ever oxidize amino acids to provide 
energy; the carbohydrate produced from C0 2 and H 2 0 
in photosynthesis is generally their sole energy source. 
Amino acid concentrations in plant tissues are carefully 
regulated to just meet the requirements for biosynthesis 
of proteins, nucleic acids, and other molecules needed 
to support growth. Amino acid catabolism does occur in 
plants, but its purpose is to produce metabolites for 
other biosynthetic pathways. 

In animals, amino acids undergo oxidative degrada- 
tion in three different metabolic circumstances: 

1 . During the normal synthesis and degradation of 
cellular proteins (protein turnover; Chapter 27), 
some amino acids that are released from protein 



breakdown and are not needed for new protein 
synthesis undergo oxidative degradation. 

2. When a diet is rich in protein and the ingested 
amino acids exceed the body's needs for protein 
synthesis, the surplus is catabolized; amino acids 
cannot be stored. 

3. During starvation or in uncontrolled diabetes 
mellitus, when carbohydrates are either unavailable 
or not properly utilized, cellular proteins are used 
as fuel. 

Under all these metabolic conditions, amino acids lose 
their amino groups to form a-keto acids, the "carbon 
skeletons" of amino acids. The a-keto acids undergo ox- 
idation to C0 2 and H 2 0 or, often more importantly, pro- 
vide three- and four-carbon units that can be converted 
by gluconeogenesis into glucose, the fuel for brain, 
skeletal muscle, and other tissues. 

The pathways of amino acid catabolism are quite 
similar in most organisms. The focus of this chapter is 
on the pathways in vertebrates, because these have re- 
ceived the most research attention. As in carbohydrate 
and fatty acid catabolism, the processes of amino acid 
degradation converge on the central catabolic path- 
ways, with the carbon skeletons of most amino acids 
finding their way to the citric acid cycle. In some 
cases the reaction pathways of amino acid breakdown 
closely parallel steps in the catabolism of fatty acids 
(Chapter 17). 

One important feature distinguishes amino acid 
degradation from other catabolic processes described to 
this point: every amino acid contains an amino group, 
and the pathways for amino acid degradation therefore 
include a key step in which the a-amino group is sepa- 
rated from the carbon skeleton and shunted into the 
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FIGURE 18-1 Overview of amino acid catabolism in mammals. The amino groups and the 
carbon skeleton take separate but interconnected pathways. 



pathways of amino group metabolism (Fig. 18-1). We 
deal first with amino group metabolism and nitrogen 
excretion, then with the fate of the carbon skeletons 
derived from the amino acids; along the way we see how 
the pathways are interconnected. 

18.1 Metabolic Fates of Amino Groups 

Nitrogen, N 2 , is abundant in the atmosphere but is too 
inert for use in most biochemical processes. Because 
only a few microorganisms can convert N 2 to biologically 
useful forms such as NH 3 (Chapter 22), amino groups 
are carefully husbanded in biological systems. 

Figure 18-2a provides an overview of the catabolic 
pathways of ammonia and amino groups in vertebrates. 
Amino acids derived from dietary protein are the source 
of most amino groups. Most amino acids are metabolized 
in the liver. Some of the ammonia generated in this 
process is recycled and used in a variety of biosynthetic 
pathways; the excess is either excreted directly or con- 
verted to urea or uric acid for excretion, depending on 
the organism (Fig. 18-2b). Excess ammonia generated 
in other (extrahepatic) tissues travels to the liver (in the 
form of amino groups, as described below) for conver- 
sion to the excretory form. 



Glutamate and glutamine play especially critical 
roles in nitrogen metabolism, acting as a kind of general 
collection point for amino groups. In the cytosol of he- 
patocytes, amino groups from most amino acids are 
transferred to a-ketoglutarate to form glutamate, which 
enters mitochondria and gives up its amino group to 
form NH4 . Excess ammonia generated in most other 
tissues is converted to the amide nitrogen of glutamine, 
which passes to the liver, then into liver mitochondria. 
Glutamine or glutamate or both are present in higher 
concentrations than other amino acids in most tissues. 

In skeletal muscle, excess amino groups are generally 
transferred to pyruvate to form alanine, another impor- 
tant molecule in the transport of amino groups to the liver. 

We begin with a discussion of the breakdown of di- 
etary proteins, then give a general description of the 
metabolic fates of amino groups. 

Dietary Protein Is Enzymatically Degraded to 
Amino Adds 

In humans, the degradation of ingested proteins to 
their constituent amino acids occurs in the gastroin- 
testinal tract. Entry of dietary protein into the stomach 
stimulates the gastric mucosa to secrete the hormone 
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FIGURE 18-2 Amino group catabolism. (a) Overview of catabolism of 
amino groups (shaded) in vertebrate liver, (b) Excretory forms of nitro- 
gen. Excess NHj is excreted as ammonia (microbes, bony fishes), urea 
(most terrestrial vertebrates), or uric acid (birds and terrestrial reptiles). 

gastrin, which in turn stimulates the secretion of hy- 
drochloric acid by the parietal cells and pepsinogen by 
the chief cells of the gastric glands (Fig. 18-3a). The 
acidic gastric juice (pH 1.0 to 2.5) is both an antiseptic, 
killing most bacteria and other foreign cells, and a de- 
naturing agent, unfolding globular proteins and ren- 
dering their internal peptide bonds more accessible to 
enzymatic hydrolysis. Pepsinogen (M T 40,554), an in- 
active precursor, or zymogen (p. 227), is converted to 
active pepsin (M T 34,614) by an autocatalytic cleavage 
(a cleavage mediated by the pepsinogen itself) that oc- 
curs only at low pH. In the stomach, pepsin hydrolyzes 
ingested proteins at peptide bonds on the amino- 
terminal side of the aromatic amino acid residues Phe, 
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Notice that the carbon atoms of urea and uric acid are highly oxidized; 
the organism discards carbon only after extracting most of its available 
energy of oxidation. 



Trp, and Tyr (see Table 3-7), cleaving long polypeptide 
chains into a mixture of smaller peptides. 

As the acidic stomach contents pass into the small in- 
testine, the low pH triggers secretion of the hormone se- 
cretin into the blood. Secretin stimulates the pancreas to 
secrete bicarbonate into the small intestine to neutralize 
the gastric HC1, abruptly increasing the pH to about 7. 
(All pancreatic secretions pass into the small intestine 
through the pancreatic duct.) The digestion of proteins 
now continues in the small intestine. Arrival of amino 
acids in the upper part of the intestine (duodenum) 
causes release into the blood of the hormone cholecys- 
tokinin, which stimulates secretion of several pancreatic 
enzymes with activity optima at pH 7 to 8. Trypsinogen, 
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FIGURE 18-3 Part of the human digestive 
(gastrointestinal) tract, (a) The parietal cells 
and chief cells of the gastric glands secrete 
their products in response to the hormone 
gastrin. Pepsin begins the process of protein 
degradation in the stomach, (b) The cyto- 
plasm of exocrine cells is completely filled 
with rough endoplasmic reticulum, the site 
of synthesis of the zymogens of many diges- 
tive enzymes. The zymogens are concen- 
trated in membrane-enclosed transport 
particles called zymogen granules. When an 
exocrine cell is stimulated, its plasma mem- 
brane fuses with the zymogen granule 
membrane and zymogens are released into 
the lumen of the collecting duct by exocyto- 
sis. The collecting ducts ultimately lead to 
the pancreatic duct and thence to the small 
intestine, (c) Amino acids are absorbed 
through the epithelial cell layer (intestinal 
mucosa) of the villi and enter the capillaries. 
Recall that the products of lipid hydrolysis in 
the small intestine enter the lymphatic system 
after their absorption by the intestinal mucosa 
(see Fig. 17-1). 



chymotrypsinogen, and procarboxypeptidases A 

and B — the zymogens of trypsin, chymotrypsin, and 
carboxypeptidases A and B — are synthesized and se- 
creted by the exocrine cells of the pancreas (Fig. 18-3b). 
Trypsinogen is converted to its active form, trypsin, by 
enteropeptidase, a proteolytic enzyme secreted by in- 
testinal cells. Free trypsin then catalyzes the conversion 
of additional trypsinogen to trypsin (see Fig. 6-38). 
Trypsin also activates chymotrypsinogen, the procar- 
boxypeptidases, and proelastase. 

Why this elaborate mechanism for getting active di- 
gestive enzymes into the gastrointestinal tract? Synthe- 
sis of the enzymes as inactive precursors protects the 
exocrine cells from destructive proteolytic attack. The 
pancreas further protects itself against self-digestion by 
making a specific inhibitor, a protein called pancreatic 
trypsin inhibitor (p. 227), that effectively prevents 
premature production of active proteolytic enzymes 
within the pancreatic cells. 



Trypsin and chymotrypsin further hydrolyze the pep- 
tides that were produced by pepsin in the stomach. This 
stage of protein digestion is accomplished very efficiently, 
because pepsin, trypsin, and chymotrypsin have different 
amino acid specificities (see Table 3-7). Degradation of 
the short peptides in the small intestine is then completed 
by other intestinal peptidases. These include carboxypep- 
tidases A and B (both of which are zinc-containing 
enzymes), which remove successive carboxyl-terminal 
residues from peptides, and an aminopeptidase that 
hydrolyzes successive amino-terminal residues from 
short peptides. The resulting mixture of free amino acids 
is transported into the epithelial cells lining the small 
intestine (Fig. 18-3c), through which the amino acids 
enter the blood capillaries in the villi and travel to the 
liver. In humans, most globular proteins from animal 
sources are almost completely hydrolyzed to amino acids 
in the gastrointestinal tract, but some fibrous proteins, 
such as keratin, are only partly digested. In addition, the 
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protein content of some plant foods is protected against 
breakdown by indigestible cellulose husks. 

Acute pancreatitis is a disease caused by ob- 
struction of the normal pathway by which pan- 
creatic secretions enter the intestine. The zymogens of 
the proteolytic enzymes are converted to their catalyti- 
cally active forms prematurely, inside the pancreatic 
cells, and attack the pancreatic tissue itself. This causes 
excruciating pain and damage to the organ that can 
prove fatal. ■ 

Pyridoxal Phosphate Participates in the Transfer of 
tt-Amino Groups to a-Ketoglutarate 

The first step in the catabolism of most L-amino acids, 
once they have reached the liver, is removal of the a- 
amino groups, promoted by enzymes called aminotrans- 
ferases or transaminases. In these transamination 

reactions, the a-amino group is transferred to the a- 
carbon atom of a-ketoglutarate, leaving behind the 
corresponding a-keto acid analog of the amino acid 
(Fig. 18-4). There is no net deamination (loss of amino 
groups) in these reactions, because the a-ketoglutarate 
becomes aminated as the a-amino acid is deaminated. 
The effect of transamination reactions is to collect the 
amino groups from many different amino acids in the 
form of L-glutamate. The glutamate then functions as 
an amino group donor for biosynthetic pathways or for 
excretion pathways that lead to the elimination of 
nitrogenous waste products. 

Cells contain different types of aminotransferases. 
Many are specific for a-ketoglutarate as the amino group 
acceptor but differ in their specificity for the L-amino 
acid. The enzymes are named for the amino group donor 
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FIGURE 18-4 Enzyme-catalyzed transaminations. In many amino- 
transferase reactions, a-ketoglutarate is the amino group acceptor. All 
aminotransferases have pyridoxal phosphate (PLP) as cofactor. Al- 
though the reaction is shown here in the direction of transfer of the 
amino group to a-ketoglutarate, it is readily reversible. 



(alanine aminotransferase, aspartate aminotransferase, 
for example). The reactions catalyzed by aminotrans- 
ferases are freely reversible, having an equilibrium con- 
stant of about 1.0 (AG' « 0 kJ/mol). 

All aminotransferases have the same prosthetic 
group and the same reaction mechanism. The prosthetic 
group is pyridoxal phosphate (PLP), the coenzyme 
form of pyridoxine, or vitamin B 6 . We encountered pyri- 
doxal phosphate in Chapter 15, as a coenzyme in the 
glycogen phosphorylase reaction, but its role in that re- 
action is not representative of its usual coenzyme func- 
tion. Its primary role in cells is in the metabolism of 
molecules with amino groups. 

Pyridoxal phosphate functions as an intermediate 
carrier of amino groups at the active site of aminotrans- 
ferases. It undergoes reversible transformations be- 
tween its aldehyde form, pyridoxal phosphate, which 
can accept an amino group, and its aminated form, pyri- 
doxamine phosphate, which can donate its amino group 
to an a-keto acid (Fig. 18-5a). Pyridoxal phosphate is 
generally covalently bound to the enzyme's active site 
through an aldimine (Schiff base) linkage to the e-amino 
group of a Lys residue (Fig. 18-5b, d). 

Pyridoxal phosphate participates in a variety of re- 
actions at the a, /3, and y carbons (C-2 to C-4) of amino 
acids. Reactions at the a carbon (Fig. 18-6) include 
racemizations (interconverting L- and D-amino acids) 
and decarboxylations, as well as transaminations. Pyri- 
doxal phosphate plays the same chemical role in each of 
these reactions. A bond to the a carbon of the substrate 
is broken, removing either a proton or a carboxyl group. 
The electron pair left behind on the a carbon would 
form a highly unstable carbanion, but pyridoxal phos- 
phate provides resonance stabilization of this intermedi- 
ate (Fig. 18-6 inset). The highly conjugated structure 
of PLP (an electron sink) permits derealization of the 
negative charge. 

Aminotransferases (Fig. 18-5) are classic examples 
of enzymes catalyzing bimolecular Ping-Pong reactions 
(see Fig. 6-13b), in which the first substrate reacts and 
the product must leave the active site before the second 
substrate can bind. Thus the incoming amino acid binds 
to the active site, donates its amino group to pyridoxal 
phosphate, and departs in the form of an a-keto acid. 
The incoming a-keto acid then binds, accepts the amino 
group from pyridoxamine phosphate, and departs in the 
form of an amino acid. As described in Box 18-1 on 
page 678, measurement of the alanine aminotransferase 
and aspartate aminotransferase levels in blood serum is 
important in some medical diagnoses. 

Glutamate Releases Its Amino Group As Ammonia 
in the Liver 

As we have seen, the amino groups from many of the 
a-amino acids are collected in the liver in the form of the 
amino group of L-glutamate molecules. These amino 
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FIGURE 18-5 Pyridoxal phosphate, the prosthetic group of amino- 
transferases, (a) Pyridoxal phosphate (PLP) and its aminated form, 
pyridoxamine phosphate, are the tightly bound coenzymes of amino- 
transferases. The functional groups are shaded, (b) Pyridoxal phos- 
phate is bound to the enzyme through noncovalent interactions and a 
Schiff-base (aldimine) linkage to a Lys residue at the active site. The 
steps in the formation of a Schiff base from a primary amine and a car- 




bonyl group are detailed in Figure 14-5. (c) PLP (red) bound to one of 
the two active sites of the dimeric enzyme aspartate aminotransferase, 
a typical aminotransferase; (d) close-up view of the active site, with 
PLP (red, with yellow phosphorus) in aldimine linkage with the side 
chain of Lys 258 (purple); (e) another close-up view of the active site, 
with PLP linked to the substrate analog 2-methylaspartate (green) via a 
Schiff base (PDB ID 1 AJS). 



BOX 



18-1 



MEDICINE 



Assays for Tissue Damage 



Analyses of certain enzyme activities in blood serum 
give valuable diagnostic information for several disease 
conditions. 

Alanine aminotransferase (ALT; also called gluta- 
mate-pyruvate transaminase, GPT) and aspartate 
aminotransferase (AST; also called glutamate-oxaloac- 
etate transaminase, GOT) are important in the diagnosis 
of heart and liver damage caused by heart attack, drug 
toxicity, or infection. After a heart attack, a variety of 
enzymes, including these aminotransferases, leak from 
injured heart cells into the bloodstream. Measurements 
of the blood serum concentrations of the two amino- 
transferases by the SGPT and SGOT tests (S for 
serum) — and of another enzyme, creatine kinase, by 
the SCK test — can provide information about the 
severity of the damage. Creatine kinase is the first 



heart enzyme to appear in the blood after a heart attack; 
it also disappears quickly from the blood. GOT is the next 
to appear, and GPT follows later. Lactate dehydrogenase 
also leaks from injured or anaerobic heart muscle. 

The SGOT and SGPT tests are also important in 
occupational medicine, to determine whether people 
exposed to carbon tetrachloride, chloroform, or other 
industrial solvents have suffered liver damage. Liver de- 
generation caused by these solvents is accompanied by 
leakage of various enzymes from injured hepatocytes 
into the blood. Aminotransferases are most useful in the 
monitoring of people exposed to these chemicals, be- 
cause these enzyme activities are high in liver and thus 
are likely to be among the proteins leaked from dam- 
aged hepatocytes; also, they can be detected in the 
bloodstream in very small amounts. 
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MECHANISM FIGURE 18-6 Some amino acid transformations at the a 
carbon that are facilitated by pyridoxal phosphate. Pyridoxal phos- 
phate is generally bonded to the enzyme through a Schiff base, also 
called an internal aldimine. This activated form of PLP readily under- 
goes transimination to form a new Schiff base (external aldimine) with 
the a-amino group of the substrate amino acid (see Fig. 18-5b, d). 
Three alternative fates for the external aldimine are shown: (A) transam- 
ination, (§) racemization, and ©decarboxylation The PLP-amino acid 
Schiff base is in conjugation with the pyridine ring, an electron sink that 
permits derealization of an electron pair to avoid formation of an 



unstable carbanion on the a carbon (inset). A quinonoid intermediate is 
involved in all three types of reactions. The transamination route® is 
especially important in the pathways described in this chapter. The 
pathway highlighted here (shown left to right) represents only part of the 
overall reaction catalyzed by aminotransferases. To complete the 
process, a second a-keto acid replaces the one that is released, and 
this is converted to an amino acid in a reversal of the reaction steps 
(right to left). Pyridoxal phosphate is also involved in certain reactions 
at the /3 and y carbons of some amino acids (not shown). 0 Pyridoxal 
Phosphate Reaction Mechanisms 



groups must next be removed from glutamate to pre- 
pare them for excretion. In hepatocytes, glutamate is 
transported from the cytosol into mitochondria, where it 
undergoes oxidative deamination catalyzed by l- 
glutamate dehydrogenase (M r 330,000). In mammals, 
this enzyme is present in the mitochondrial matrix. It is 
the only enzyme that can use either NAD + or NADP + as 
the acceptor of reducing equivalents (Fig. 18-7). 

The combined action of an aminotransferase and 
glutamate dehydrogenase is referred to as trans- 



deamination. A few amino acids bypass the trans- 
deamination pathway and undergo direct oxidative 
deamination. The fate of the NH4 produced by any of 
these deamination processes is discussed in detail in 
Section 18.2. The a-ketoglutarate formed from gluta- 
mate deamination can be used in the citric acid cycle 
and for glucose synthesis. 

Glutamate dehydrogenase operates at an impor- 
tant intersection of carbon and nitrogen metabolism. 
An allosteric enzyme with six identical sub units, its 
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FIGURE 18-7 Reaction catalyzed by glutamate dehydrogenase. The glu- 
taniate dehydrogenase of mammalian liver has the unusual capacity to 
use either NAD + or NADP + as cofactor. The glutamate dehydrogenases 
of plants and microorganisms are generally specific for one or the other. 
The mammalian enzyme is allosterically regulated by GTP and ADR 
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activity is influenced by a complicated array of al- 
losteric modulators. The best-studied of these are 
the positive modulator ADP and the negative modu- 
lator GTP. The metabolic rationale for this regulatory 
pattern has not been elucidated in detail. Mutations 
that alter the allosteric binding site for GTP or other- 
wise cause permanent activation of glutamate dehy- 
drogenase lead to a human genetic disorder called 
hyperinsulinism-hyperammonemia syndrome, charac- 
terized by elevated levels of ammonia in the blood- 
stream and hypoglycemia. 

Giutamine Transports Ammonia in the Bloodstream 

Ammonia is quite toxic to animal tissues (we examine 
some possible reasons for this toxicity later), and the 
levels present in blood are regulated. In many tissues, 
including the brain, some processes such as nucleotide 
degradation generate free ammonia. In most animals 
much of the free ammonia is converted to a nontoxic 
compound before export from the extrahepatic 
tissues into the blood and transport to the liver or kid- 
neys. For this transport function, glutamate, critical to 
intracellular amino group metabolism, is supplanted 
by L-glutamine. The free ammonia produced in tissues is 
combined with glutamate to yield giutamine by the ac- 
tion of giutamine synthetase. This reaction requires 
ATP and occurs in two steps (Fig. 18-8). First, gluta- 
mate and ATP react to form ADP and a y-glutamyl phos- 
phate intermediate, which then reacts with ammonia to 
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FIGURE 18-8 Ammonia transport in the form of giutamine. Excess 
ammonia in tissues is added to glutamate to form giutamine, a process 
catalyzed by giutamine synthetase. After transport in the bloodstream, 
the giutamine enters the liver and NH4 is liberated in mitochondria by 
the enzyme glutaminase. 



produce giutamine and inorganic phosphate. Giutamine 
is a nontoxic transport form of ammonia; it is normally 
present in blood in much higher concentrations than 
other amino acids. Giutamine also serves as a source of 
amino groups in a variety of biosynthetic reactions. Giu- 
tamine synthetase is found in all organisms, always play- 
ing a central metabolic role. In microorganisms, the 
enzyme serves as an essential portal for the entry of 
fixed nitrogen into biological systems. (The roles of 
giutamine and giutamine synthetase in metabolism are 
further discussed in Chapter 22.) 

In most terrestrial animals, giutamine in excess of 
that required for biosynthesis is transported in the blood 
to the intestine, liver, and kidneys for processing. In 
these tissues, the amide nitrogen is released as ammo- 
nium ion in the mitochondria, where the enzyme glut- 
aminase converts giutamine to glutamate and NH4 
(Fig. 18-8). The NH4 from intestine and kidney is 
transported in the blood to the liver. In the liver, the 
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ammonia from all sources is disposed of by urea synthe- 
sis. Some of the glutamate produced in the glutaminase 
reaction may be further processed in the liver by gluta- 
mate dehydrogenase, releasing more ammonia and pro- 
ducing carbon skeletons for metabolic fuel. However, 
most glutamate enters the transamination reactions re- 
quired for amino acid biosynthesis and other processes 
(Chapter 22). 

In metabolic acidosis (p. 667) there is an increase 
in glutamine processing by the kidneys. Not all 
the excess NH4 thus produced is released into the 
bloodstream or converted to urea; some is excreted 
directly into the urine. In the kidney, the NH4 forms 
salts with metabolic acids, facilitating their removal in 
the urine. Bicarbonate produced by the decarboxylation 
of a-ketoglutarate in the citric acid cycle can also serve 
as a buffer in blood plasma. Taken together, these effects 
of glutamine metabolism in the kidney tend to counter- 
act acidosis. ■ 

Alanine Transports Ammonia from Skeletal Muscles to 
the Liver 

Alanine also plays a special role in transporting amino 
groups to the liver in a nontoxic form, via a pathway 
called the glucose-alanine cycle (Fig. 18-9). In 

muscle and certain other tissues that degrade amino 
acids for fuel, amino groups are collected in the form 
of glutamate by transamination (Fig. 18-2a). Gluta- 
mate can be converted to glutamine for transport to 
the liver, as described above, or it can transfer its a- 
amino group to pyruvate, a readily available product of 
muscle glycolysis, by the action of alanine amino- 
transferase (Fig. 18-9). The alanine so formed 
passes into the blood and travels to the liver. In the cy- 
tosol of hepatocytes, alanine aminotransferase trans- 
fers the amino group from alanine to a-ketoglutarate, 
forming pyruvate and glutamate. Glutamate can then 
enter mitochondria, where the glutamate dehydroge- 
nase reaction releases NH4 (Fig. 18-7), or can un- 
dergo transamination with oxaloacetate to form 
aspartate, another nitrogen donor in urea synthesis, as 
we shall see. 

The use of alanine to transport ammonia from 
skeletal muscles to the liver is another example of the 
intrinsic economy of living organisms. Vigorously con- 
tracting skeletal muscles operate anaerobically, pro- 
ducing pyruvate and lactate from glycolysis as well as 
ammonia from protein breakdown. These products 
must find their way to the liver, where pyruvate and lac- 
tate are incorporated into glucose, which is returned to 
the muscles, and ammonia is converted to urea for ex- 
cretion. The glucose-alanine cycle, in concert with the 
Cori cycle (see Box 14-2 and Fig. 23-20), accom- 
plishes this transaction. The energetic burden of gluco- 
neogenesis is thus imposed on the liver rather than the 
muscle, and all available ATP in muscle is devoted to 
muscle contraction. 
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FIGURE 18-9 Glucose-alanine cycle. Alanine serves as a carrier of 
ammonia and of the carbon skeleton of pyruvate from skeletal muscle 
to liver. The ammonia is excreted and the pyruvate is used to produce 
glucose, which is returned to the muscle. 



Ammonia Is Toxic to Animals 

The catabolic production of ammonia poses a 
serious biochemical problem, because ammonia 
is very toxic. The molecular basis for this toxicity is 
not entirely understood. The terminal stages of ammo- 
nia intoxication in humans are characterized by onset 
of a comatose state accompanied by cerebral edema 
(an increase in the brain's water content) and in- 
creased cranial pressure, so research and speculation 
on ammonia toxicity have focused on this tissue. Spec- 
ulation centers on a potential depletion of ATP in 
brain cells. 

Ridding the cytosol of excess ammonia requires re- 
ductive animation of a-ketoglutarate to glutamate by 
glutamate dehydrogenase (the reverse of the reaction 
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described earlier; Fig. 18-7) and conversion of gluta- 
mate to glutamine by glutamine synthetase. Both en- 
zymes are present at high levels in the brain, although 
the glutamine synthetase reaction is almost certainly 
the more important pathway for removal of ammonia. 
High levels of NH4 lead to increased levels of glutamine, 
which acts as an osmotically active solute (osmolyte) in 
brain astrocytes, star-shaped cells of the nervous system 
that provide nutrients, support, and insulation for neu- 
rons. This triggers an uptake of water into the astrocytes 
to maintain osmotic balance, leading to swelling of the 
cells and the brain, leading to coma. 

Depletion of glutamate in the glutamine synthetase 
reaction may have additional effects on the brain. Glu- 
tamate and its derivative y-aminobutyrate (GABA; see 
Fig. 22-29) are important neurotransmitters; the sensi- 
tivity of the brain to ammonia may reflect a depletion of 
neurotransmitters as well as changes in cellular osmotic 
balance. ■ 

As we close this discussion of amino group metabo- 
lism, note that we have described several processes that 
deposit excess ammonia in the mitochondria of hepato- 
cytes (Fig. 18-2). We now look at the fate of that ammonia. 

SUMMARY 18.1 Metabolic Fates of Amino 
Groups 

■ Humans derive a small fraction of their oxidative 
energy from the catabolism of amino acids. Amino 
acids are derived from the normal breakdown 
(recycling) of cellular proteins, degradation of 
ingested proteins, and breakdown of body proteins 
in lieu of other fuel sources during starvation or in 
uncontrolled diabetes mellitus. 

■ Proteases degrade ingested proteins in the stomach 
and small intestine. Most proteases are initially 
synthesized as inactive zymogens. 

■ An early step in the catabolism of amino acids 
is the separation of the amino group from the 
carbon skeleton. In most cases, the amino group is 
transferred to a-ketoglutarate to form glutamate. 
This transamination reaction requires the coenzyme 
pyridoxal phosphate. 

■ Glutamate is transported to liver mitochondria, 
where glutamate dehydrogenase liberates the 
amino group as ammonium ion (NH4). Ammonia 
formed in other tissues is transported to the liver 
as the amide nitrogen of glutamine or, in transport 
from skeletal muscle, as the amino group of 
alanine. 

■ The pyruvate produced by deamination of alanine 
in the liver is converted to glucose, which is 
transported back to muscle as part of the glucose- 
alanine cycle. 



18.2 Nitrogen Excretion and the Urea Cycle 

If not reused for the synthesis of new amino acids or 
other nitrogenous products, amino groups are channeled 
into a single excretory end product (Fig. 18-10). Most 
aquatic species, such as the bony fishes, are am- 
monotelic, excreting amino nitrogen as ammonia. The 
toxic ammonia is simply diluted in the surrounding wa- 
ter. Terrestrial animals require pathways for nitrogen 
excretion that minimize toxicity and water loss. Most ter- 
restrial animals are ureotelic, excreting amino nitrogen 
in the form of urea; birds and reptiles are uricotelic, ex- 
creting amino nitrogen as uric acid. (The pathway of uric 
acid synthesis is described in Fig. 22-45.) Plants recycle 
virtually all amino groups, and nitrogen excretion occurs 
only under very unusual circumstances. 

In ureotelic organisms, the ammonia deposited in 
the mitochondria of hepatocytes is converted to urea in 
the urea cycle. This pathway was discovered in 1932 by 
Hans Krebs (who later also discovered the citric acid cy- 
cle) and a medical student associate, Kurt Henseleit. 
Urea production occurs almost exclusively in the liver 
and is the fate of most of the ammonia channeled there. 
The urea passes into the bloodstream and thus to the 
kidneys and is excreted into the urine. The production 
of urea now becomes the focus of our discussion. 

Urea Is Produced from Ammonia in Five Enzymatic Steps 

The urea cycle begins inside liver mitochondria, but 
three of the subsequent steps take place in the cy- 
tosol; the cycle thus spans two cellular compartments 
(Fig. 18-10). The first amino group to enter the urea 
cycle is derived from ammonia in the mitochondrial 
matrix — NHj arising by the pathways described above. 
The liver also receives some ammonia via the portal 
vein from the intestine, from the bacterial oxidation of 
amino acids. Whatever its source, the NH4 generated 
in liver mitochondria is immediately used, together with 
C0 2 (as HCO3 ) produced by mitochondrial respiration, to 



FIGURE 18-10 Urea cycle and reactions that feed amino groups into the 

cycle. The enzymes catalyzing these reactions (named in the text) are 
distributed between the mitochondrial matrix and the cytosol. One 
amino group enters the urea cycle as carbamoyl phosphate, formed in 
the matrix; the other enters as aspartate, formed in the matrix by transam- 
ination of oxaloacetate and glutamate, catalyzed by aspartate amino- 
transferase. The urea cycle consists of four steps. (T) Formation of 
citrulline from ornithine and carbamoyl phosphate (entry of the first 
amino group); the citrulline passes into the cytosol. (5) Formation of 
argininosuccinate through a citrullyl-AMP intermediate (entry of the sec- 
ond amino group). (§) Formation of arginine from argininosuccinate; 
this reaction releases fumarate, which enters the citric acid cycle. 
(4) Formation of urea; this reaction also regenerates ornithine. The path- 
ways by which NH4 arrives in the mitochondrial matrix of hepatocytes 
were discussed in Section 18,1 , 
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form carbamoyl phosphate in the matrix (Fig. 18-1 la; 
see also Fig. 18-10). This ATP-dependent reaction is 
catalyzed by carbamoyl phosphate synthetase I, a 

regulatory enzyme (see below). The mitochondrial form 
of the enzyme is distinct from the cytosolic (II) form, 
which has a separate function in pyrimidine biosynthesis 
(Chapter 22). 

The carbamoyl phosphate, which functions as an 
activated carbamoyl group donor, now enters the urea 
cycle. The cycle has four enzymatic steps. First, car- 
bamoyl phosphate donates its carbamoyl group to 
ornithine to form citrulline, with the release of 
(Fig. 18-10, step (D). Ornithine plays a role resembling 
that of oxaloacetate in the citric acid cycle, accepting ma- 
terial at each turn of the cycle. The reaction is catalyzed 
by ornithine transcarbamoylase, and the citrulline 
passes from the mitochondrion to the cytosol. 

The second amino group now enters from aspartate 
(generated in mitochondria by transamination and 
transported into the cytosol) by a condensation reaction 
between the amino group of aspartate and the ureido 
(carbonyl) group of citrulline, forming argininosucci- 
nate (step (2) in Fig. 18-10). This cytosolic reaction, cat- 
alyzed by argininosuccinate synthetase, requires 
ATP and proceeds through a citrullyl-AMP intermediate 
(Fig. 18-1 lb). The argininosuccinate is then cleaved by 
argininosuccinase (step (3) in Fig. 18-10) to form free 



arginine and fumarate, the latter entering mitochondria 
to join the pool of citric acid cycle intermediates. This is 
the only reversible step in the urea cycle. In the last re- 
action of the urea cycle (step @), the cytosolic enzyme 
arginase cleaves arginine to yield urea and ornithine. 
Ornithine is transported into the mitochondrion to initi- 
ate another round of the urea cycle. 

As we noted in Chapter 16, the enzymes of many 
metabolic pathways are clustered (p. 619), with the 
product of one enzyme reaction being channeled di- 
rectly to the next enzyme in the pathway. In the urea cy- 
cle, the mitochondrial and cytosolic enzymes seem to be 
clustered in this way. The citrulline transported out of 
the mitochondrion is not diluted into the general pool of 
metabolites in the cytosol but is passed directly to the 
active site of argininosuccinate synthetase. This chan- 
neling between enzymes continues for argininosucci- 
nate, arginine, and ornithine. Only urea is released into 
the general cytosolic pool of metabolites. 

The Citric Acid and Urea Cycles Can Be Linked 

Because the fumarate produced in the argininosuccinase 
reaction is also an intermediate of the citric acid cycle, 
the cycles are, in principle, interconnected — in a process 
dubbed the "Krebs bicycle" (Fig. 18-12). However, each 
cycle can operate independently, and communication 
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MECHANISM FIGURE 18-11 Nitrogen-acquiring reactions in the syn- 
thesis of urea. The urea nitrogens are acquired in two reactions, each 
requiring ATP. (a) In the reaction catalyzed by carbamoyl phosphate 
synthetase I, the first nitrogen enters from ammonia. The terminal phos- 
phate groups of two molecules of ATP are used to form one molecule 
of carbamoyl phosphate. In other words, this reaction has two activa- 



tion steps (0 and (J)). $ Carbamoyl Phosphate Synthetase I Mech- 
anism (b) In the reaction catalyzed by argininosuccinate synthetase, 
the second nitrogen enters from aspartate. Activation of the ureido 
oxygen of citrulline in step (T) sets up the addition of aspartate in 
step (5). £ Argininosuccinate Synthetase Mechanism 
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FIGURE 18-12 Links between the urea cycle and citric acid cycle. 

The interconnected cycles have been called the "Krebs bicycle." The 
pathways linking the citric acid and urea cycles are known as the 
aspartate-argininosuccinate shunt; these effectively link the fates of the 
amino groups and the carbon skeletons of amino acids. The intercon- 
nections are even more elaborate than the arrows suggest. For example, 

between them depends on the transport of key interme- 
diates between the mitochondrion and cytosol. Several 
enzymes of the citric acid cycle, including fumarase (fu- 
marate hydratase) and malate dehydrogenase (p. 628), 
are also present as isozymes in the cytosol. The fu- 
marate generated in cytosolic arginine synthesis can 
therefore be converted to malate in the cytosol, and 
these intermediates can be further metabolized in the 
cytosol or transported into mitochondria for use in the 
citric acid cycle. Aspartate formed in mitochondria by 
transamination between oxaloacetate and glutamate 
can be transported to the cytosol, where it serves as ni- 
trogen donor in the urea cycle reaction catalyzed by 
argininosuccinate synthetase. These reactions, making 
up the aspartate-argininosuccinate shunt, provide 
metabolic links between the separate pathways by 
which the amino groups and carbon skeletons of amino 
acids are processed. 

The Activity of the Urea Cycle Is Regulated at Two Levels 

The flux of nitrogen through the urea cycle in an indi- 
vidual animal varies with diet. When the dietary intake is 
primarily protein, the carbon skeletons of amino acids 
are used for fuel, producing much urea from the excess 



some citric acid cycle enzymes, such as fumarase and malate dehydro- 
genase, have both cytosolic and mitochondrial isozymes. Fumarate 
produced in the cytosol— whether by the urea cycle, purine biosynthe- 
sis, or other processes — can be converted to cytosolic malate, which 
is used in the cytosol or transported into mitochondria (via the malate- 
aspartate shuttle; see Fig. 19-29) to enter the citric acid cycle. 

amino groups. During prolonged starvation, when 
breakdown of muscle protein begins to supply much of 
the organism's metabolic energy, urea production also 
increases substantially. 

These changes in demand for urea cycle activity 
are met over the long term by regulation of the rates of 
synthesis of the four urea cycle enzymes and car- 
bamoyl phosphate synthetase I in the liver. All five 
enzymes are synthesized at higher rates in starving 
animals and in animals on very-high-protein diets than 
in well-fed animals eating primarily carbohydrates and 
fats. Animals on protein-free diets produce lower levels 
of urea cycle enzymes. 

On a shorter time scale, allosteric regulation of at least 
one key enzyme adjusts the flux through the urea cycle. 
The first enzyme in the pathway, carbamoyl phosphate 
synthetase I, is allosterically activated by AT-acetylgluta- 
mate, which is synthesized from acetyl-CoA and gluta- 
mate byiV-acetylglutamate synthase (Fig. 18-13). In 
plants and microorganisms this enzyme catalyzes the first 
step in the de novo synthesis of arginine from glutamate 
(see Fig. 22-10), but in mammals Af-acetylglutamate 
synthase activity in the liver has a purely regulatory 
function (mammals lack the other enzymes needed to 
convert glutamate to arginine). The steady-state levels 
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FIGURE 18-13 Synthesis of /V-acetylglutamate and its activation of 
carbamoyl phosphate synthetase I. 



of 7V-acetylglutamate are determined by the concentra- 
tions of glutamate and acetyl-CoA (the substrates for 
Af-acetylglutamate synthase) and arginine (an activator 
of A^-acetylglutamate synthase, and thus an activator of 
the urea cycle). 

Pathway Interconnections Reduce the Energetic Cost of 
Urea Synthesis 

If we consider the urea cycle in isolation, we see that the 
synthesis of one molecule of urea requires four high- 
energy phosphate groups (Fig. 18-10). Two ATP mole- 
cules are required to make carbamoyl phosphate, and 
one ATP to make argininosuccinate — the latter ATP un- 
dergoing a pyrophosphate cleavage to AMP and PP i; 
which is hydrolyzed to two P t . The overall equation of 
the urea cycle is 



2NHJ + HC0 3 + 3 ATP 4 " + H 2 0 — 
urea + 2ADP 3 " 



4Pf ~ + AMP 2 " + 2H+ 



However, the urea cycle also causes a net conversion of 
oxaloacetate to fumarate (via aspartate), and the regen- 
eration of oxaloacetate (Fig. 18-12) produces NADH in 
the malate dehydrogenase reaction. Each NADH mole- 
cule can generate up to 2.5 ATP during mitochondrial 
respiration (Chapter 19), greatly reducing the overall 
energetic cost of urea synthesis. 



Genetic Defects in the Urea Cycle Can Be 
Life-Threatening 

People with genetic defects in any enzyme in- 
volved in urea formation cannot tolerate pro- 
tein-rich diets. Amino acids ingested in excess of the 
minimum daily requirements for protein synthesis are 
deaminated in the liver, producing free ammonia that 
cannot be converted to urea and exported into the 
bloodstream, and, as we have seen, ammonia is highly 
toxic. The absence of a urea cycle enzyme can result in 
hyperammonemia or in the build-up of one or more 
urea cycle intermediates, depending on the enzyme 
that is missing. Given that most urea cycle steps are ir- 
reversible, the absent enzyme activity can often be 
identified by determining which cycle intermediate is 
present in especially elevated concentration in the 
blood and/or urine. Although the breakdown of amino 
acids can have serious health consequences in individ- 
uals with urea cycle deficiencies, a protein-free diet is 
not a treatment option. Humans are incapable of syn- 
thesizing half of the 20 common amino acids, and these 
essential amino acids (Table 18-1) must be pro- 
vided in the diet. 

A variety of treatments are available for individuals 
with urea cycle defects. Careful administration of the 
aromatic acids benzoate or phenylbutyrate in the diet 
can help lower the level of ammonia in the blood. Ben- 
zoate is converted to benzoyl-CoA, which combines 
with glycine to form hippurate (Fig. 18-14, left). The 
glycine used up in this reaction must be regenerated, 
and ammonia is thus taken up in the glycine synthase 
reaction. Phenylbutyrate is converted to phenylacetate 
by [3 oxidation. The phenylacetate is then converted to 
phenylacetyl-CoA, which combines with glutamine to 
form phenylacetylglutamine (Fig. 18-14, right). The re- 
sulting removal of glutamine triggers its further synthe- 
sis by glutamine synthetase (see Eqn 22-1) in a 



TABLE T8-1 



Nonessential and Essential Amino Adds 
for Humans and the Albino Rat 



Nonessential 


Conditionally 
essential* 


Essential 


Alanine 


Arginine 


Histidine 


Asparagine 


Cysteine 


Isoleucine 


Aspartate 


Glutamine 


Leucine 


Glutamate 


Glycine 


Lysine 


Serine 


Proline 


Methionine 




Tyrosine 


Phenylalanine 






Threonine 






Tryptophan 






Valine 



"■Required to some degree in young, growing animals, and/or sometimes during illness. 
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FIGURE 18-14 Treatment for deficiencies in urea cycle en- 
zymes. The aromatic acids benzoate and phenylbutyrate, ad- 
ministered in the diet, are metabolized and combine with glycine and 
glutamine, respectively. The products are excreted in the urine. Subse- 
quent synthesis of glycine and glutamine to replenish the pool of these 
intermediates removes ammonia from the bloodstream. 



reaction that takes up ammonia. Both hippurate and 
phenylacetylglutamine are nontoxic compounds that 
are excreted in the urine. The pathways shown in Fig- 
ure 18-14 make only minor contributions to normal 
metabolism, but they become prominent when aro- 
matic acids are ingested. 

Other therapies are more specific to a particular 
enzyme deficiency. Deficiency of AT-acetylglutamate 



synthase results in the absence of the normal activator 
of carbamoyl phosphate synthetase I (Fig. 18-13). This 
condition can be treated by administering carbamoyl 
glutamate, an analog of Af-acetylglutamate that is effec- 
tive in activating carbamoyl phosphate synthetase I. 



O 



COO" 



H 2 N-C-NH-C-H 

I 

CH 2 

CH 2 

COO" 
Carbamoyl glutamate 

Supplementing the diet with arginine is useful in treat- 
ing deficiencies of ornithine transcarbamoylase, argini- 
nosuccinate synthetase, and argininosuccinase. Many 
of these treatments must be accompanied by strict di- 
etary control and supplements of essential amino acids. 
In the rare cases of arginase deficiency, arginine, the 
substrate of the defective enzyme, must be excluded 
from the diet. ■ 



SUMMARY 18.2 



Nitrogen Excretion and 
the Urea Cycle 



■ Ammonia is highly toxic to animal tissues. In 

the urea cycle, ornithine combines with ammonia, 
in the form of carbamoyl phosphate, to form 
citrulline. A second amino group is transferred to 
citrulline from aspartate to form arginine— the 
immediate precursor of urea. Arginase catalyzes 
hydrolysis of arginine to urea and ornithine; 
thus ornithine is regenerated in each turn of 
the cycle. 

■ The urea cycle results in a net conversion of 
oxaloacetate to fumarate, both of which are 
intermediates in the citric acid cycle. The two 
cycles are thus interconnected. 

■ The activity of the urea cycle is regulated at the 
level of enzyme synthesis and by allosteric 
regulation of the enzyme that catalyzes the 
formation of carbamoyl phosphate. 

1 8.3 Pathways of Amino Acid Degradation 

The pathways of amino acid catabolism, taken to- 
gether, normally account for only 10% to 15% of the 
human body's energy production; these pathways are 
not nearly as active as glycolysis and fatty acid oxi- 
dation. Flux through these catabolic routes also 
varies greatly, depending on the balance between re- 
quirements for biosynthetic processes and the avail- 
ability of a particular amino acid. The 20 catabolic 
pathways converge to form only six major products, 



688 Amino Acid Oxidation and the Production of Urea 



all of which enter the citric acid cycle (Fig. 18-15). 
From here the carbon skeletons are diverted to glu- 
coneogenesis or ketogenesis or are completely oxi- 
dized to C0 2 and H 2 0. 

All or part of the carbon skeletons of seven amino 
acids are ultimately broken down to acetyl-CoA. Five 
amino acids are converted to a-ketoglutarate, four to 
succinyl-CoA, two to fumarate, and two to oxaloac- 
etate. Parts or all of six amino acids are converted to 
pyruvate, which can be converted to either acetyl-CoA 
or oxaloacetate. We later summarize the individual 
pathways for the 20 amino acids in flow diagrams, 
each leading to a specific point of entry into the citric 
acid cycle. In these diagrams the carbon atoms that 
enter the citric acid cycle are shown in color. Note that 
some amino acids appear more than once, reflecting 
different fates for different parts of their carbon skele- 
tons. Rather than examining every step of every path- 
way in amino acid catabolism, we single out for special 
discussion some enzymatic reactions that are particu- 
larly noteworthy for their mechanisms or their med- 
ical significance. 



Some Amino Acids Are Converted to Glucose, Others to 
Ketone Bodies 

The seven amino acids that are degraded entirely or in 
part to acetoacetyl-CoA and/or acetyl-CoA — phenylala- 
nine, tyrosine, isoleucine, leucine, tryptophan, threo- 
nine, and lysine — can yield ketone bodies in the liver, 
where acetoacetyl-CoA is converted to acetoacetate and 
then to acetone and /3-hydroxybutyrate (see Fig. 17-18). 
These are the ketogenic amino acids (Fig. 18-15). 
Their ability to form ketone bodies is particularly evi- 
dent in uncontrolled diabetes mellitus, in which the liver 
produces large amounts of ketone bodies from both 
fatty acids and the ketogenic amino acids. 

The amino acids that are degraded to pyruvate, 
a-ketoglutarate, succinyl-CoA, fumarate, and/or oxaloac- 
etate can be converted to glucose and glycogen by path- 
ways described in Chapters 14 and 15. They are the 
glucogenic amino acids. The division between keto- 
genic and glucogenic amino acids is not sharp; five amino 
acids — tryptophan, phenylalanine, tyrosine, threonine, 
and isoleucine — are both ketogenic and glucogenic. 
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FIGURE 18-15 Summary of amino acid catabolism. Amino acids are 
grouped according to their major degradative end product. Some 
amino acids are listed more than once because different parts of their 
carbon skeletons are degraded to different end products, The figure 
shows the most important catabolic pathways in vertebrates, but there 
are minor variations among vertebrate species. Threonine, for instance, 
is degraded via at least two different pathways (see Figs 1 8-1 9, 1 8-27), 



and the importance of a given pathway can vary with the organism and 
its metabolic conditions, The glucogenic and ketogenic amino acids 
are also delineated in the figure, by color shading. Notice that five of 
the amino acids are both glucogenic and ketogenic. The amino acids 
degraded to pyruvate are also potentially ketogenic. Only two amino 
acids, leucine and lysine, are exclusively ketogenic. 
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FIGURE 18-16 Some enzyme cofactors important in one-carbon transfer reactions. The nitrogen 
atoms to which one-carbon groups are attached in tetrahydrofolate are shown in blue. 



Catabolism of amino acids is particularly critical to the 
survival of animals with high-protein diets or during 
starvation. Leucine is an exclusively ketogenic amino 
acid that is very common in proteins. Its degradation 
makes a substantial contribution to ketosis under star- 
vation conditions. 

Several Enzyme Cofactors Play Important Roles in 
Amino Acid Catabolism 

A variety of interesting chemical rearrangements occur 
in the catabolic pathways of amino acids. It is useful to 
begin our study of these pathways by noting the classes 
of reactions that recur and introducing their enzyme 
cofactors. We have already considered one important 
class: transamination reactions requiring pyridoxal 
phosphate. Another common type of reaction in amino 
acid catabolism is one-carbon transfers, which usually 
involve one of three cofactors: biotin, tetrahydrofolate, 
or S-adenosylmethionine (Fig. 18-16). These cofac- 
tors transfer one-carbon groups in different oxidation 
states: biotin transfers carbon in its most oxidized state, 
C0 2 (see Fig. 14-18); tetrahydrofolate transfers one- 
carbon groups in intermediate oxidation states and 
sometimes as methyl groups; and S-adenosylmethionine 
transfers methyl groups, the most reduced state of car- 
bon. The latter two cofactors are especially important in 
amino acid and nucleotide metabolism. 

Tetrahydrofolate (H 4 folate), synthesized in bac- 
teria, consists of substituted pterin (6-methylpterin), 
p-aminobenzoate, and glutamate moieties (Fig. 18-16). 



H.N 



O 

Pterin 



a 



The oxidized form, folate, is a vitamin for mammals; it is 
converted in two steps to tetrahydrofolate by the en- 
zyme dihydrofolate reductase. The one-carbon group 
undergoing transfer, in any of three oxidation states, is 
bonded to N-5 or N-10 or both. The most reduced form 
of the cofactor carries a methyl group, a more oxidized 
form carries a methylene group, and the most oxidized 
forms carry a methenyl, formyl, or formimino group 
(Fig. 18-17). Most forms of tetrahydrofolate are inter- 
convertible and serve as donors of one-carbon units in a 
variety of metabolic reactions. The primary source of 
one-carbon units for tetrahydrofolate is the carbon re- 
moved in the conversion of serine to glycine, producing 
A^A/^-methylenetetrahydrofolate. 

Although tetrahydrofolate can carry a methyl 
group at N-5, the transfer potential of this methyl 
group is insufficient for most biosynthetic reactions. S- 
Adenosylmethionine (adoMet) is the preferred co- 
factor for biological methyl group transfers. It is 
synthesized from ATP and methionine by the action of 
methionine adenosyl transferase (Fig. 18-18, 
step ®). This reaction is unusual in that the nucle- 
ophilic sulfur atom of methionine attacks the 5' carbon 
of the ribose moiety of ATP rather than one of the 
phosphorus atoms. Triphosphate is released and is 
cleaved to P { and PPi on the enzyme, and the PPi is 
cleaved by inorganic pyrophosphatase; thus three 
bonds, including two bonds of high-energy phosphate 
groups, are broken in this reaction. The only other 
known reaction in which triphosphate is displaced 
from ATP occurs in the synthesis of coenzyme B 12 (see 
Box 17-2, Fig. 3). 

S-Adenosylmethionine is a potent alkylating agent 
by virtue of its destabilizing sulfonium ion. The methyl 
group is subject to attack by nucleophiles and is about 
1,000 times more reactive than the methyl group of 
A^-methyltetrahydrofolate. 
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FIGURE 18-17 Conversions of one-carbon units on tetrahydrofolate. 

The different molecular species are grouped according to oxidation 
state, with the most reduced at the top and most oxidized at the bot- 
tom. All species within a single shaded box are at the same oxidation 
state. The conversion of N 5 ,N 10 -methylenetetrahydrofolate to N 5 - 
methyltetrahydrofolate is effectively irreversible. The enzymatic trans- 
fer of formyl groups, as in purine synthesis (see Fig. 22-33) and in the 
formation of formylmethionine in bacteria (Chapter 27), generally uses 

Transfer of the methyl group from S-adenosylmethio- 
nine to an acceptor yields S-adenosylhomocysteiite 

(Fig. 18-18, step (D), which is subsequently broken down 
to homocysteine and adenosine (step ®). Methionine is 
regenerated by transfer of a methyl group to homocys- 
teine in a reaction catalyzed by methionine synthase 
(step (3)), and methionine is reconverted to S-adenosyl- 
methionine to complete an activated-methyl cycle. 



J 1 N H 
0 II n 



N 10 -formyltetrahydrofolate rather than N 5 -formyltetrahydrofolate. The 
latter species is significantly more stable and therefore a weaker donor 
of formyl groups. /V 5 -Formyltetrahydrofolate is a minor byproduct of 
the cyclohydrolase reaction, and can also form spontaneously. Con- 
version of N 5 -formyltetrahydrofolate to N 5 ,N 10 -methenyltetrahydrofo- 
late requires ATP, because of an otherwise unfavorable equilibrium. 
Note that N 5 -formiminotetrahydrofolate is derived from histidine in a 
pathway shown in Figure 18-26. 

One form of methionine synthase common in bacte- 
ria uses A^-methyltetrahydrofolate as a methyl donor. 
Another form of the enzyme present in some bacteria 
and mammals uses A^-methyltetrahydrofolate, but the 
methyl group is first transferred to cobalamin, derived 
from coenzyme B 12 , to form methylcobalamin as the 
methyl donor in methionine formation. This reaction 
and the rearrangement of L-methylmalonyl-CoA to 
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FIGURE 18-18 Synthesis of methionine and S-adenosylmethionine in 
an activated-methyl cycle. The steps are described in the text. In the 
methionine synthase reaction (step (?)), the methyl group is transferred 
to cobalamin to form methylcobalamin, which in turn is the methyl 

succinyl-CoA (see Box 17-2, Fig. la) are the only known 
coenzyme B 12 -dependent reactions in mammals. 

The vitamins B 12 and folate are closely linked in 
these metabolic pathways. The B 12 deficiency 
disease pernicious anemia is rare, seen only in indi- 
viduals who have a defect in the intestinal absorption 
pathways for this vitamin (see Box 17-2) or in strict 
vegetarians (B 12 is not present in plants). The disease 
progresses slowly, because only small amounts of vita- 
min B 12 are required and normal stores of B 12 in the liver 
can last three to five years. Symptoms include not only 
anemia but a variety of neurological disorders. 

The anemia can be traced to the methionine syn- 
thase reaction. As noted above, the methyl group of 
methylcobalamin is derived from A^ 5 -methyltetrahy- 
drofolate, and this is the only reaction in mammals 
that uses 7V 5 -methyltetrahydrofolate. The reaction 
converting the A/' 5 ,7V rl0 -methylene form to the AT 5 - 
methyl form of tetrahydrofolate is irreversible (Fig. 
18-17). Thus, if coenzyme B 12 is not available for 
the synthesis of methylcobalamin, metabolic folates 
become trapped in the A/^-methyl form. The anemia 
associated with vitamin B 12 deficiency is called mega- 
loblastic anemia. It manifests as a decline in the pro- 
duction of mature erythrocytes (red blood cells) and 
the appearance in the bone marrow of immature pre- 
cursor cells, or megaloblasts. Erythrocytes are grad- 
ually replaced in the blood by smaller numbers of 



donor in the formation of methionine. 5-Adenosylmethionine, which 
has a positively charged sulfur (and is thus a sulfonium ion), is a pow- 
erful methylating agent in several biosynthetic reactions. The methyl 
group acceptor (step (2)) is designated R. 

abnormally large erythrocytes called macrocytes. The 
defect in erythrocyte development is a direct conse- 
quence of the depletion of the A/ 5 ,A/' 10 -methylene- 
tetrahydrofolate, which is required for synthesis of the 
thymidine nucleotides needed for DNA synthesis (see 
Chapter 22). Folate deficiency, in which all forms of 
tetrahydrofolate are depleted, also produces anemia, 
for much the same reasons. The anemia symptoms of 
B 12 deficiency can be alleviated by administering either 
vitamin B 12 or folate. 

However, it is dangerous to treat pernicious anemia 
by folate supplementation alone, because the neurologi- 
cal symptoms of B 12 deficiency will progress. These 
symptoms do not arise from the defect in the methion- 
ine synthase reaction. Instead, the impaired methyl- 
malonyl-CoA mutase causes accumulation of unusual, 
odd-number fatty acids in neuronal membranes. The 
anemia associated with folate deficiency is thus often 
treated by administering both folate and vitamin B 12 , at 
least until the metabolic source of the anemia is unam- 
biguously defined. Early diagnosis of B 12 deficiency is 
important because some of its associated neurological 
conditions may be irreversible. 

Folate deficiency also reduces the availability of the 
A^-methyltetrahydrofolate required for methionine syn- 
thase function. This leads to a rise in homocysteine lev- 
els in blood, a condition linked to heart disease, 
hypertension, and stroke. High levels of homocysteine 
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may be responsible for 10% of all cases of heart disease. 
The condition is treated with folate supplements. ■ 

Tetrahydrobiopterin, another cofactor of amino 
acid catabolism, is similar to the pterin moiety of 
tetrahydrofolate, but it is not involved in one-carbon 
transfers; instead it participates in oxidation reactions. 
We consider its mode of action when we discuss phenyl- 
alanine degradation (see Fig. 18-24). 

Six Amino Acids Are Degraded to Pyruvate 

The carbon skeletons of six amino acids are converted 
in whole or in part to pyruvate. The pyruvate can then 
be converted to acetyl-CoA and eventually oxidized via 
the citric acid cycle, or to oxaloacetate and shunted 
into gluconeogenesis. The six amino acids are alanine, 
tryptophan, cysteine, serine, glycine, and threonine 
(Fig. 18-19). Alanine yields pyruvate directly on 
transamination with a-ketoglutarate, and the side chain 
of tryptophan is cleaved to yield alanine and thus pyru- 
vate. Cysteine is converted to pyruvate in two steps; one 
removes the sulfur atom, the other is a transamination. 



Serine is converted to pyruvate by serine dehydratase. 
Both the jS-hydroxyl and the a-amino groups of serine are 
removed in this single pyridoxal phosphate-dependent 
reaction (Fig. 18-20a). 

Glycine is degraded via three pathways, only one of 
which leads to pyruvate. Glycine is converted to serine 
by enzymatic addition of a hydroxymethyl group (Figs 
18-19, 18-20b). This reaction, catalyzed by serine hy- 
droxymethyl transferase, requires the coenzymes 
tetrahydrofolate and pyridoxal phosphate. The serine is 
converted to pyruvate as described above. In the second 
pathway, which predominates in animals, glycine under- 
goes oxidative cleavage to C0 2) NH4 , and a methylene 
group (— CH 2 — ) (Figs 18-19, 18-20c). This readily re- 
versible reaction, catalyzed by glycine cleavage en- 
zyme (also called glycine synthase), also requires 
tetrahydrofolate, which accepts the methylene group. In 
this oxidative cleavage pathway, the two carbon atoms of 
glycine do not enter the citric acid cycle. One carbon is 
lost as C0 2 and the other becomes the methylene group 
of A^,A^ 10 -methylenetetrahydrofolate (Fig. 18-17), a one- 
carbon group donor in certain biosynthetic pathways. 



FIGURE 18-19 Catabolic pathways for alanine, glycine, serine, 
cysteine, tryptophan, and threonine. The fate of the i ndole group 
of tryptophan is shown in Figure 18-21. Details of most of the re- 
actions involving serine and glycine are shown in Figure 18-20. 
The pathway for threonine degradation shown here accounts for 
only about a third of threonine catabolism {for the alternative 
pathway, see Fig. 18-27). Several pathways for cysteine degrada- 
tion lead to pyruvate. The sulfur of cysteine has several alternative 
fates, one of which is shown in Figure 22-15. Carbon atoms here 
and in subsequent figures are color-coded as necessary to trace 
their fates. 



NH 3 

CH 3 -CH~CH-( GO 



I 

OH I Tlnx niiu 

i but uiinf 'f NAD + 



NH 3 

CH 3 — C— CH— COO" 

O 2-Amino-3-ketobutyrate 



• ^ CoA 

^^^^^^^^^ 

^ Acetyl-CoA 



Glycine 



h'. iir .:>.; 
rin iJiyJ 



I 

CH 2 -COO~ 

AT 5 , ^-methylene 
f H 4 folate 

PLP 



NAD + NADH 



ck'avcigp 

t-Sl/ Villi? 



C0 2 + NH4 



* H 4 folate 




pHgjj; — CH — 00" 



Tryptophan 



N 
H 

4 steps 



NHq 

l 

CH 3 — CH— COO" 



HO — CH 2 — CH— COO" Serine 



atnni* 
i.k-tiVdi'iJLitb 




iimiiir mil 



PLP 



o 

II 2 

CH 3 — C— COO" <- 



Alanine | a-Ketoglutarate Glutamate Pyruvate 



SH NH 3 
CH 2 — CH— COO" 



Cysteine 



18.3 Pathways of Amino Acid Degradation [j>93j 



(a) 

Serine 
dehydratase 
reaction 



H 



O-H 



tion of OH. 



CH 9 



| PLP f — CH= NH— C HB 



The link 
to PLP 

is reversed. 



I 

coo- 

^(Enz) Lys 
^ (Enz) -PLP 



rCH 2 
H 2 N-C 



Rearrange- 
ment of the 
enamine 
forms an 
imine. 



Hydrolytic 
deamination 
yields 
pyruvate. 



® 



I 

COO" 
H + 



® 



CH 3 

I 3 
=C 
I 

COO" 
H 2 0 

^NHo 



CH 2 

I 

o=c 
cocr 

Pyruvate 



(b) 

Serine 

hydroxy me thy transferase 
reaction 



o 



I PLP h CH=NH-C"H^:B- 

I 

qqq- Covalently 

Proton bound serine 

abstraction at 
a carbon leads (l) 
to (3 elimina- 



| PLP |— CH— NH — C— H 



(c) 
Glycine 
cleavage enzyme 
reaction 

H 



coo- 



Abstr action of a 
proton forms a 
PLP-stabilized 
carbanion. 



Covalently 
bound glycine 



CD 



I RLPh CH— NH— -C- H 

i — , q 

0 o 1 



Covalently 
bound glycine 



[PLPh CH-NH— C— H HB- 



Decarboxylation 
of glycine 
generates a 
PLP-stabilized 
carbanion. 



CO,, 



H 



The carbanion 
attacks the 
methylene of 
2V 5 , iV i0 -methylene 
H 4 folate to 
yield serine. 



COO" 



PLP-stabilized 
carbanion 



[PLPl-CH=NH— C: -n 
H ) 



PLP-stabilized 
carbanion 



-A^,AT 10 -methylene 
H 4 folate 

k H, folate 



H + 



HO— CH 2 

PLPh CH— NH— C— H 
I 

coo- 



The carbanion 
attacks the 
oxidized lipoic 
acid component 
of enzyme H. 



® 



H 



The aldimine 
link to PLP is 
reversed, and 
the product 
is released. 



® 



^ (Enz)— Lys 
^(g)-PLP 



fPLPV CH=NH— C — S 

H > 

HS 



-(Enz^) 



CH 2 OH 

+ I 
H 3 N— C— H 

COO" 



The link 

to PLP 

is reversed. 



-(Enz)— Lys 



Enz) -PLP 



H 3 N-CH 2 -S 
HS 



Serine 



Methylene is 
transferred to 
H 4 folate with 
elimination of 
ammonia. 



>■ H4 folate 

v® 

^ jys .^-methylene 
folate 

^NHo 



m 

HS 



5H 



Lipoic acid is 
reoxidized. 



(EnzL) 



NAD + 



NADH 



lV(En^ 



MECHANISM FIGURE 18-20 Interplay of the pyridoxal phosphate and 
tetrahydrofolate cofactors in serine and glycine metabolism. The first 
step in each of these reactions (not shown) involves the formation of a 
covalent imine linkage between enzyme-bound PLP and the substrate 
amino acid — serine in (a), glycine in (b) and (c), (a) A PLP-catalyzed 
elimination of water in the serine dehydratase reaction (step (T)) begins 
the pathway to pyruvate, (b) In the serine hydroxymethyltransferase re- 
action, a PLP-stabilized carbanion (product of step @) is a key inter- 
mediate in the reversible transfer of the methylene group (as 



S 

_CH 2 — OH) from N 5 ,N 10 -methylenetetrahydrofolate to form serine, 
(c) The glycine cleavage enzyme is a multienzyme complex, with com- 
ponents P, H, T, and L. The overall reaction, which is reversible, con- 
verts glycine to C0 2 and NH4, with the second glycine carbon taken 
up by tetrahydrofolate to form /V 5 ,/V 10 -methyIenetetrahydrofolate. Pyri- 
doxal phosphate activates the a carbon of amino acids at critical stages 
in all these reactions, and tetrahydrofolate carries one-carbon units in 
two of them (see Figs 1 8-6, 1 8-1 7). 
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This second pathway for glycine degradation 
seems to be critical in mammals. Humans with se- 
rious defects in glycine cleavage enzyme activity suffer 
from a condition known as nonketotic hyperglycinemia. 
The condition is characterized by elevated serum levels 
of glycine, leading to severe mental deficiencies and 
death in very early childhood. At high levels, glycine is 
an inhibitory neurotransmitter, perhaps explaining the 
neurological effects of the disease. Many genetic defects 
of amino acid metabolism have been identified in hu- 
mans (Table 18-2). We will encounter several more in 
this chapter. ■ 

In the third and final pathway of glycine degrada- 
tion, the achiral glycine molecule is a substrate for the 
enzyme D-amino acid oxidase. The glycine is converted 
to glyoxylate, an alternative substrate for hepatic lactate 
dehydrogenase (p. 547). Glyoxylate is oxidized in an 
NAD + -dependent reaction to oxalate: 



NH 3 
CH 2 
COO" 



¥ R 3 
J- > 



i)-amino acid 
oxidase 



II 

CH 

I 

COO" 
Glyoxylate 



NAD' NADH 
^ ^ > 



COO" 

I 

COCT 
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The primary function of D-amino acid oxidase, 
present at high levels in the kidney, is thought to 
be the detoxification of ingested D-amino acids derived 
from bacterial cell walls and from grilled foodstuffs 
(high heat causes some spontaneous racemization of the 
L-amino acids in proteins). Oxalate, whether obtained in 
foods or produced enzymatically in the kidneys, has 
medical significance. Crystals of calcium oxalate ac- 
count for up to 75% of all kidney stones. ■ 

There are two significant pathways for threonine 
degradation. One pathway leads to pyruvate via glycine 
(Fig. 18-19). The conversion to glycine occurs in two 
steps, with threonine first converted to 2-amino- 
3-ketobutyrate by the action of threonine dehydroge- 
nase. This is a relatively minor pathway in humans, ac- 
counting for 10% to 30% of threonine catabolism, but is 
more important in some other mammals. The major path- 
way in humans leads to succinyl-CoA and is described 
later. 

In the laboratory, serine hydroxymethyltransferase 
will catalyze the conversion of threonine to glycine and 
acetaldehyde in one step, but this is not a significant 
pathway for threonine degradation in mammals. 



TABLE 18-2 
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Approximate 
incidence 
(per 100,000 








Medical condition 


births) 


Defective process 


Defective enzyme 


Symptoms and effects 


Albinism 


<3 


Melanin synthesis 


Tyrosine 3- 


Lack of pigmentation; 






from tyrosine 


monooxygenase 
(tyrosinase) 


white hair, pink skin 


Alkaptonuria 


<0.4 


Tyrosine degradation 


Homogentisate 
1,2-dioxygenase 


Dark pigment in urine; 
late-developing 
arthritis 


Argininemia 


<0.5 


Urea synthesis 


Arginase 


Mental retardation 


Argininosuccinic 


<1.5 


Urea synthesis 


Argininosuccinase 


Vomiting; convulsions 


acidemia 








Carbamoyl phosphate 


<0.5 


Urea synthesis 


Carbamoyl phosphate 


Lethargy; convulsions; 


synthetase 1 






synthetase 1 


early death 


deficiency 










Homocystinuria 


<0.5 


Methionine 


Cystathionine 


Faulty bone develop- 






degradation 


/3-synthase 


ment; mental 
retardation 


Maple syrup urine 


<0.4 


Isoleucine, leucine, 


Branched-chain 


Vomiting; convulsions; 


disease (branched- 




and valine 


a-keto acid 


mental retardation; 


chain ketoaciduria) 




degradation 


dehydrogenase 
complex 


early death 


Methylmalonic 


<0.5 


Conversion of 


Methylmalonyl- 


Vomiting; convulsions; 


acidemia 




propionyl-CoA to 
succinyl-CoA 


CoA mutase 


mental retardation; 
early death 


Phenylketonuria 


<8 


Conversion of 


Phenylalanine 


Neonatal vomiting; 






phenylalanine to 


hydroxylase 


mental retardation 






tyrosine 
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Seven Amino Acids Are Degraded to Acetyl-CoA 

Portions of the carbon skeletons of seven amino acids— 
tryptophan, lysine, phenylalanine, tyrosine, leucine, 
isoleucine, and threonine — yield acetyl-CoA and/or 
acetoacetyl-CoA, the latter being converted to acetyl-CoA 
(Fig. 18-21). Some of the final steps in the degradative 
pathways for leucine, lysine, and tryptophan resemble 
steps in the oxidation of fatty acids. Threonine (not shown 
in Fig. 18-21) yields some acetyhCoA via the minor path- 
way illustrated in Figure 18-19. 

The degradative pathways of two of these seven 
amino acids deserve special mention. Tryptophan break- 



down is the most complex of all the pathways of amino 
acid catabolism in animal tissues; portions of tryptophan 
(four of its carbons) yield acetyl-CoA via acetoacetyl- 
CoA. Some of the intermediates in tryptophan catabolism 
are precursors for the synthesis of other biomolecules 
(Fig. 18-22), including nicotinate, a precursor of NAD 
and NADP in animals; serotonin, a neurotransmitter in 
vertebrates; and indoleacetate, a growth factor in plants. 
Some of these biosynthetic pathways are described in 
more detail in Chapter 22 (see Figs 22-28, 22-29). 

The breakdown of phenylalanine is noteworthy be- 
cause genetic defects in the enzymes of this pathway 
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FIGURE 18-21 Catabolic pathways for tryptophan, lysine, phenylala- 
nine, tyrosine, leucine, and isoleucine. These amino acids donate 
some of their carbons (red) to acetyl-CoA. Tryptophan, phenylalanine, 
tyrosine, and isoleucine also contribute carbons (blue) to pyruvate or 



citric acid cycle intermediates. The phenylalanine pathway is described 
in more detail in Figure 1 8-23. The fate of nitrogen atoms is not traced 
in this scheme; in most cases they are transferred to a-ketoglutarate to 
form glutamate. 
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FIGURE 18-22 Tryptophan as precursor. The aromatic rings of trypto- 
phan give rise to nicotinate (niacin), indoleacetate, and serotonin. Col- 
ored atoms trace the source of the ring atoms in nicotinate. 



lead to several inheritable human diseases (Fig. 18-23), 
as discussed below. Phenylalanine and its oxidation 
product tyrosine (both with nine carbons) are degraded 
into two fragments, both of which can enter the citric 
acid cycle: four of the nine carbon atoms yield free ace- 
toacetate, which is converted to acetoacetyl-CoA and 
thus acetyl-CoA, and a second four-carbon fragment is 
recovered as fumarate. Eight of the nine carbons of 
these two amino acids thus enter the citric acid cycle; 
the remaining carbon is lost as C0 2 . Phenylalanine, after 
its hydroxylation to tyrosine, is also the precursor of 
dopamine, a neurotransmitter, and of norepinephrine 
and epinephrine, hormones secreted by the adrenal 
medulla (see Fig. 22-29). Melanin, the black pigment of 
skin and hair, is also derived from tyrosine. 

Phenylalanine Catabolism Is Genetically Defective in 
Some People 

BS Given that many amino acids are either neuro- 
£21 transmitters or precursors or antagonists of neu- 
rotransmitters, genetic defects of amino acid metabolism 
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FIGURE 18-23 Catabolic pathways for phenylalanine and ty- 
rosine. In humans these amino acids are normally converted 




OH 

CH 2 — COO 

Homogentisate 



1 2~d&EXygB*UM><! 



Alkaptonuria 

^-»H + 



"OOC— C=C-C— CH 2 — C— CH 2 — COO" 
H H || || 
O O 

Maleylacetoacetate 



maleylacetoacetate 
isomerase 



H 

"OOC-C=C— C— CH 2 — ( — CH 2 — COO" 
H || || 
O O 



Fumarylacetoacetate 
^H^O 

¥ Tyrosinemia 



H 



Fumarate 



"OOC— C=C— COO" + CH 3 — C— CH 2 — COO" 
H || 

O Acetoacetate 
Succinyl-CoA 

i \ 

^> Succinate 

CH 3 — C— CH 2 — C— S-CoA 
O O 

to acetoacetyl-CoA and fumarate. Genetic defects in many of these 
enzymes cause inheritable human diseases (shaded yellow). 
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FIGURE 18-24 Role of tetrahydrobiopterin in the phenylalanine hy- 
droxylase reaction. The H atom shaded pink is transferred directly 

can cause defective neural development and mental 
retardation. In most such diseases specific intermedi- 
ates accumulate. For example, a genetic defect in 
phenylalanine hydroxylase, the first enzyme in the 
catabolic pathway for phenylalanine (Fig. 18-23), is re- 
sponsible for the disease phenylketonuria (PKU), the 
most common cause of elevated levels of phenylalanine 
(hyperphenylalaninemia) . 

Phenylalanine hydroxylase (also called phenylala- 
nine-4-monooxygenase) is one of a general class of 
enzymes called mixed-function oxidases (see Box 
21-1), all of which catalyze simultaneous hydroxylation 
of a substrate by an oxygen atom of 0 2 and reduction of 
the other oxygen atom to H 2 0. Phenylalanine hydroxy- 
lase requires the cofactor tetrahydrobiopterin, which car- 
ries electrons from NADPH to 0 2 and becomes oxidized 
to dihydrobiopterin in the process (Fig. 18-24). It is 
subsequently reduced by the enzyme dihydrobiopterin 
reductase in a reaction that requires NADPH. 

In individuals with PKU, a secondary, normally little- 
used pathway of phenylalanine metabolism comes into 
play. In this pathway phenylalanine undergoes trans- 
amination with pyruvate to yield phenylpyruvate 
(Fig. 18-25). Phenylalanine and phenylpyruvate ac- 
cumulate in the blood and tissues and are excreted in 
the urine — hence the name "phenylketonuria." Much 
of the phenylpyruvate, rather than being excreted as 
such, is either decarboxylated to phenylacetate or re- 
duced to phenyllactate. Phenylacetate imparts a char- 
acteristic odor to the urine, which nurses have 
traditionally used to detect PKU in infants. The accu- 
mulation of phenylalanine or its metabolites in early 
life impairs normal development of the brain, causing 
severe mental retardation. This may be caused by ex- 
cess phenylalanine competing with other amino acids 
for transport across the blood-brain barrier, resulting 
in a deficit of required metabolites. 

Phenylketonuria was among the first inheritable 
metabolic defects discovered in humans. When this 
condition is recognized early in infancy, mental retarda- 
tion can largely be prevented by rigid dietary control. 
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from C-4 to C-3 in the reaction. This feature, discovered at the National 
Institutes of Health, is called the NIH shift. 

The diet must supply only enough phenylalanine and 
tyrosine to meet the needs for protein synthesis. Con- 
sumption of protein-rich foods must be curtailed. Nat- 
ural proteins, such as casein of milk, must first be 
hydrolyzed and much of the phenylalanine removed to 
provide an appropriate diet, at least through childhood. 
Because the artificial sweetener aspartame is a dipep- 
tide of aspartate and the methyl ester of phenylalanine 
(see Fig. l-23b), foods sweetened with aspartame bear 
warnings addressed to individuals on phenylalanine- 
controlled diets. 
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FIGURE 18-25 Alternative pathways for catabolism of 
phenylalanine in phenylketonuria. In PKU, phenylpyruvate 
accumulates in the tissues, blood, and urine. The urine may also 
contain phenylacetate and phenyllactate. 
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Phenylketonuria can also be caused by a defect in the 
enzyme that catalyzes the regeneration of tetrahydro- 
biopterin (Fig. 18-24). The treatment in this case is more 
complex than restricting the intake of phenylalanine and 
tyrosine. Tetrahydrobiopterin is also required for the for- 
mation of L-3,4-dihydroxyphenylalanine (L-dopa) and 5- 
hydroxytryptophan — precursors of the neurotransmitters 
norepinephrine and serotonin, respectively — and in 
phenylketonuria of this type, these precursors must be 
supplied in the diet. Supplementing the diet with 
tetrahydrobiopterin itself is ineffective because it is 
unstable and does not cross the blood-brain barrier. 

Screening newborns for genetic diseases can be 
highly cost-effective, especially in the case of PKU. The 
tests (no longer relying on urine odor) are relatively in- 
expensive, and the detection and early treatment of PKU 
in infants (eight to ten cases per 100,000 newborns) 
saves millions of dollars in later health care costs each 
year. More importantly, the emotional trauma avoided by 
early detection with these simple tests is inestimable. 

Another inheritable disease of phenylalanine catab- 
olism is alkaptonuria, in which the defective enzyme is 



homogentisate dioxygenase (Fig. 18-23). Less seri- 
ous than PKU, this condition produces few ill effects, al- 
though large amounts of homogentisate are excreted 
and its oxidation turns the urine black. Individuals with 
alkaptonuria are also prone to develop a form of arthri- 
tis. Alkaptonuria is of considerable historical interest. 
Archibald Garrod discovered in the early 1900s that this 
condition is inherited, and he traced the cause to the 
absence of a single enzyme. Garrod was the first to 
make a connection between an inheritable trait and an 
enzyme — a great advance on the path that ultimately 
led to our current understanding of genes and the 
information pathways described in Part III. ■ 

Five Amino Acids Are Converted to a-Ketoglutarate 

The carbon skeletons of five amino acids (proline, gluta- 
mate, glutamine, arginine, and histidine) enter the citric 
acid cycle as a-ketoglutarate (Fig. 18-26). Proline, 
glutamate, and glutamine have five-carbon skeletons. 
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FIGURE 18-26 Catabolic pathways for arginine, histidine; gluta- 
mate, glutamine, and proline. These amino acids are converted to 
a-ketoglutarate. The numbered steps in the histidine pathway are cat- 
alyzed by (T) histidine ammonia lyase, @ urocanate hydratase, (5) imi- 
dazolonepropionase, and ©glutamate formimino transferase. 
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The cyclic structure of proline is opened by oxidation of 
the carbon most distant from the carboxyl group to cre- 
ate a Schiff base, then hydrolysis of the Schiff base to a 
linear semialdehyde, glutamate y-semialdehyde. This 
intermediate is further oxidized at the same carbon to 
produce glutamate. The action of glutaminase, or any of 
several enzyme reactions in which glutamine donates its 
amide nitrogen to an acceptor, converts glutamine to 
glutamate. Transamination or deamination of glutamate 
produces a-ketoglutarate. 

Arginine and histidine contain five adjacent car- 
bons and a sixth carbon attached through a nitrogen 
atom. The catabolic conversion of these amino acids to 
glutamate is therefore slightly more complex than the 
path from proline or glutamine (Fig. 18-26). Arginine is 
converted to the five-carbon skeleton of ornithine in the 
urea cycle (Fig. 18-10), and the ornithine is transami- 
nated to glutamate y-semialdehyde. Conversion of histi- 
dine to the five-carbon glutamate occurs in a multistep 
pathway; the extra carbon is removed in a step that uses 
tetrahydrofolate as cofactor. 



Four Amino Acids Are Converted to Succinyl-CoA 

The carbon skeletons of methionine, isoleucine, threo- 
nine, and valine are degraded by pathways that yield suc- 
cinyl-CoA (Fig. 18-27), an intermediate of the citric acid 
cycle. Methionine donates its methyl group to one of sev- 
eral possible acceptors through S-adenosylmethionine, 
and three of its four remaining carbon atoms are con- 
verted to the propionate of propionyl-CoA, a precursor of 
succinyl-CoA. Isoleucine undergoes transairiination, fol- 
lowed by oxidative decarboxylation of the resulting a-keto 
acid. The remaining five-carbon skeleton is further oxi- 
dized to acetyl-CoA and propionyl-CoA. Valine undergoes 
transamination and decarboxylation, then a series of oxi- 
dation reactions that convert the remaining four carbons 
to propionyl-CoA. Some parts of the valine and isoleucine 
degradative pathways closely parallel steps in fatty acid 
degradation (see Fig. 17-8a). In human tissues, threo- 
nine is also converted in two steps to propionyl-CoA. This 
is the primary pathway for threonine degradation in hu- 
mans (see Fig. 18-19 for the alternative pathway). The 



FIGURE 18-27 Catabolic pathways for methionine, 
isoleucine, threonine, and valine. These amino acids are 
converted to succinyl-CoA; isoleucine also contributes 
two of its carbon atoms to acetyl-CoA (see Fig. 18-21). 
The pathway of threonine degradation shown here occurs 
in humans; a pathway found in other organisms is shown 
in Figure 18-19. The route from methionine to homocys- 
teine is described in more detail in Figure 18-18; the 
conversion of homocysteine to a-ketobutyrate in Figure 
22-14; the conversion of propionyl-CoA to succinyl- 
CoA in Figure 17-11. 
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mechanism of the first step is analogous to that catalyzed 
by serine dehydratase, and the serine and threonine dehy- 
dratases may actually be the same enzyme. 

The propionyl-CoA derived from these three amino 
acids is converted to succinyl-CoA by a pathway de- 
scribed in Chapter 17: carboxylation to methylmalonyl- 



CoA, epimerization of the methylmalonyl-CoA, and con- 
version to succinyl-CoA by the coenzyme In- 
dependent methylmalonyl-CoA mutase (see Fig. 17-11). 
In the rare genetic disease known as methylmalonic 
acidemia, methylmalonyl-CoA mutase is lacking — with 
serious metabolic consequences (Table 18-2; Box 18-2). 



BOX 



18-2 



MEDICINE 



Scientific Sleuths Solve a Murder Mystery 



Truth can sometimes be stranger than fiction — or at 
least as strange as a made-for-TV movie. Take, for ex- 
ample, the case of Patricia Stallings. Convicted of the 
murder of her infant son, she was sentenced to life in 
prison — but was later found innocent, thanks to the 
medical sleuthing of three persistent researchers. 

The story began in the summer of 1989 when 
Stallings brought her three-month-old son, Ryan, to the 
emergency room of Cardinal Glennon Children's Hospi- 
tal in St. Louis. The child had labored breathing, uncon- 
trollable vomiting, and gastric distress. According to the 
attending physician, a toxicologist, the child's symptoms 
indicated that he had been poisoned with ethylene gly- 
col, an ingredient of antifreeze, a conclusion apparently 
confirmed by analysis at a commercial lab. 

After he recovered, the child was placed in a foster 
home, and Stallings and her husband, David, were al- 
lowed to see him in supervised visits. But when the infant 
became ill, and subsequently died, after a visit in which 
Stallings had been briefly left alone with him, she was 
charged with first-degree murder and held without bail. 
At the time, the evidence seemed compelling as both the 
commercial lab and the hospital lab found large amounts 
of ethylene glycol in the boy's blood and traces of it in a 
bottle of milk Stallings had fed her son during the visit. 

But without knowing it, Stallings had performed a 
brilliant experiment. While in custody, she learned she 
was pregnant; she subsequently gave birth to another son, 
David Stallings Jr., in February 1990. He was placed im- 
mediately in a foster home, but within two weeks he 
started having symptoms similar to Ryan's. David was 
eventually diagnosed with a rare metabolic disorder called 
methylmalonic acidemia (MMA). A recessive genetic dis- 
order of amino acid metabolism, MMA affects about 1 in 
48,000 newborns and presents symptoms almost identical 
with those caused by ethylene glycol poisoning. 

Stallings couldn't possibly have poisoned her sec- 
ond son, but the Missouri state prosecutor's office was 
not impressed by the new developments and pressed 
forward with her trial anyway. The court wouldn't allow 
the MMA diagnosis of the second child to be introduced 
as evidence, and in January 1991 Patricia Stallings was 
convicted of assault with a deadly weapon and sen- 
tenced to life in prison. 

Fortunately for Stallings, however, William Sly, chair- 
man of the Department of Biochemistry and Molecular 



Biology at St. Louis University, and James Shoemaker, 
head of a metabolic screening lab at the university, got in- 
terested in her case when they heard about it from a tele- 
vision broadcast. Shoemaker performed his own analysis 
of Ryan's blood and didn't detect ethylene glycol. He and 
Sly then contacted Piero Rinaldo, a metabolic disease ex- 
pert at Yale University School of Medicine whose lab is 
equipped to diagnose MMA from blood samples. 

When Rinaldo analyzed Ryan's blood serum, he 
found high concentrations of methylmalonic acid, a 
breakdown product of the branched-chain amino acids 
isoleucine and valine, which accumulates in MMA pa- 
tients because the enzyme that should convert it to the 
next product in the metabolic pathway is defective. 
And particularly telling, he says, the child's blood and 
urine contained massive amounts of ketones, another 
metabolic consequence of the disease. Like Shoemaker, 
he did not find any ethylene glycol in a sample of the 
baby's bodily fluids. The bottle couldn't be tested, since 
it had mysteriously disappeared. Rinaldo's analyses 
convinced him that Ryan had died from MMA, but how 
to account for the results from two labs, indicating that 
the boy had ethylene glycol in his blood? Could they 
both be wrong? 

When Rinaldo obtained the lab reports, what he 
saw was, he says, "scary." One lab said that Ryan 
Stallings' blood contained ethylene glycol, even though 
the blood sample analysis did not match the lab's own 
profile for a known sample containing ethylene glycol. 
"This was not just a matter of questionable interpreta- 
tion. The quality of their analysis was unacceptable," 
Rinaldo says. And the second laboratory? According to 
Rinaldo, that lab detected an abnormal component in 
Ryan's blood and just "assumed it was ethylene glycol." 
Samples from the bottle had produced nothing unusual, 
says Rinaldo, yet the lab claimed evidence of ethylene 
glycol in that, too. 

Rinaldo presented his findings to the case's prosecu- 
tor, George McElroy, who called a press conference the 
very next day. "I no longer believe the laboratory data," 
he told reporters. Having concluded that Ryan Stallings 
had died of MMA after all, McElroy dismissed all charges 
against Patricia Stallings on September 20, 1991. 

*By Michelle Hoffman (1991). Science 253, 931. Copyright 1991 by the 
American Association for the Advancement of Science. 
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Branched-Chain Amino Acids Are Not Degraded 
in the Liver 

Although much of the catabolism of amino acids takes 
place in the liver, the three amino acids with branched 
side chains (leucine, isoleucine, and valine) are oxi- 
dized as fuels primarily in muscle, adipose, kidney, and 
brain tissue. These extrahepatic tissues contain an 
aminotransferase, absent in liver, that acts on all three 
branched-chain amino acids to produce the corre- 
sponding a-keto acids (Fig. 18-28). The branched- 
chain a-keto acid dehydrogenase complex then 
catalyzes oxidative decarboxylation of all three a-keto 
acids, in each case releasing the carboxyl group as 
C0 2 and producing the acyl-CoA derivative. This reac- 
tion is formally analogous to two other oxidative de- 
carboxylations encountered in Chapter 16: oxidation 
of pyruvate to acetyl-CoA by the pyruvate dehydroge- 
nase complex (see Fig. 16-6) and oxidation of a- 
ketoglutarate to succinyl-CoA by the a-ketoglutarate 
dehydrogenase complex (p. 625). In fact, all three en- 
zyme complexes are similar in structure and share es- 
sentially the same reaction mechanism. Five cofactors 
(thiamine pyrophosphate, FAD, NAD, lipoate, and 
coenzyme A) participate, and the three proteins in 
each complex catalyze homologous reactions. This is 
clearly a case in which enzymatic machinery that 
evolved to catalyze one reaction was "borrowed" by 
gene duplication and further evolved to catalyze simi- 
lar reactions in other pathways. 



Experiments with rats have shown that the 
branched-chain a-keto acid dehydrogenase complex is 
regulated by covalent modification in response to the 
content of branched-chain amino acids in the diet. With 
little or no excess dietary intake of branched-chain 
amino acids, the enzyme complex is phosphorylated and 
thereby inactivated by a protein kinase. Addition of ex- 
cess branched-chain amino acids to the diet results in 
dephosphorylation and consequent activation of the en- 
zyme. Recall that the pyruvate dehydrogenase complex 
is subject to similar regulation by phosphorylation and 
dephosphorylation (p. 636). 

There is a relatively rare genetic disease in which 
the three branched-chain a-keto acids (as well as 
their precursor amino acids, especially leucine) accu- 
mulate in the blood and "spill over" into the urine. This 
condition, called maple syrup urine disease because 
of the characteristic odor imparted to the urine by the 
a-keto acids, results from a defective branched-chain 
a-keto acid dehydrogenase complex. Untreated, the dis- 
ease results in abnormal development of the brain, men- 
tal retardation, and death in early infancy Treatment 
entails rigid control of the diet, limiting the intake of 
valine, isoleucine, and leucine to the minimum required 
to permit normal growth. ■ 

Asparagine and Aspartate Are Degraded to Oxaloacetate 

The carbon skeletons of asparagine and aspartate ul- 
timately enter the citric acid cycle as oxaloacetate. The 
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FIGURE 18-28 Catabolic pathways for the three branched- 
chain amino acids: valine, isoleucine, and leucine. All three 
pathways occur in extrahepatic tissues and share the first two enzymes, 
as shown here. The branched-chain a-keto acid dehydrogenase 



complex is analogous to the pyruvate and a-ketoglutarate dehydro- 
genase complexes and requires the same five cofactors (some not 
shown here). This enzyme is defective in people with maple syrup 
urine disease. 
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FIGURE 18-29 Catabolic pathway for asparagine and aspartate. 

Both amino acids are converted to oxaloacetate. 

enzyme asparaginase catalyzes the hydrolysis of as- 
paragine to aspartate, which undergoes transamination 
with a-ketoglutarate to yield glutamate and oxaloac- 
etate (Fig. 18-29). 

We have now seen how the 20 common amino acids, 
after losing their nitrogen atoms, are degraded by de- 
hydrogenation, decarboxylation, and other reactions 
to yield portions of their carbon backbones in the form 
of six central metabolites that can enter the citric acid 
cycle. Those portions degraded to acetyl-CoA are 
completely oxidized to carbon dioxide and water, with 
generation of ATP by oxidative phosphorylation. 

As was the case for carbohydrates and lipids, the 
degradation of amino acids results ultimately in the gen- 
eration of reducing equivalents (NADH and FADH 2 ) 
through the action of the citric acid cycle. Our survey of 
catabolic processes concludes in the next chapter with a 
discussion of respiration, in which these reducing equiv- 
alents fuel the ultimate oxidative and energy-generating 
process in aerobic organisms. 
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Pathways of Amino Acid 
Degradation 



After the removal of amino groups, the carbon 
skeletons of amino acids undergo oxidation to 
compounds that can enter the citric acid cycle 
for oxidation to C0 2 and H 2 0. The reactions 
of these pathways require several cof actors, 
including tetrahydrofolate and 5-adenosyl- 
methionine in one-carbon transfer reactions 



and tetrahydrobiopterin in the oxidation of 
phenylalanine by phenylalanine hydroxylase. 

Depending on their degradative end product, some 
amino acids can be converted to ketone bodies, 
some to glucose, and some to both. Thus amino 
acid degradation is integrated into intermediary 
metabolism and can be critical to survival under 
conditions in which amino acids are a significant 
source of metabolic energy. 

The carbon skeletons of amino acids enter the citric 
acid cycle through five intermediates: acetyl-CoA, 
a-ketoglutarate, succinyl-CoA, fumarate, and 
oxaloacetate. Some are also degraded to pyruvate, 
which can be converted to either acetyl-CoA or 
oxaloacetate. 

The amino acids producing pyruvate are alanine, 
cysteine, glycine, serine, threonine, and 
tryptophan. Leucine, lysine, phenylalanine, and 
tryptophan yield acetyl-CoA via acetoacetyl-CoA. 
Isoleucine, leucine, threonine, and tryptophan also 
form acetyl-CoA directly. 

Arginine, glutamate, glutamine, histidine, and 
proline produce a-ketoglutarate; isoleucine, 
methionine, threonine, and valine produce 
succinyl-CoA; four carbon atoms of phenylalanine 
and tyrosine give rise to fumarate; and asparagine 
and aspartate produce oxaloacetate. 

The branched-chain amino acids (isoleucine, 
leucine, and valine), unlike the other amino acids, 
are degraded only in extrahepatic tissues. 

Several serious human diseases can be traced 
to genetic defects in the enzymes of amino acid 
catabolism. 
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Problems 

1. Products of Amino Acid Transamination Name and 
draw the structure of the a-keto acid resulting when each of 
the following amino acids undergoes transamination with a- 
ketoglutarate: (a) aspartate, (b) glutamate, (c) alanine, (d) 
phenylalanine . 

2. Measurement of Alanine Aminotransferase Activity 

The activity (reaction rate) of alanine aminotransferase is usu- 
ally measured by including an excess of pure lactate dehydro- 
genase and NADH in the reaction system. The rate of alanine 
disappearance is equal to the rate of NADH disappearance 
measured spectrophotometrically, Explain how this assay 
works. 

3. Alanine and Glutamine in the Blood Normal human 
blood plasma contains all the amino acids required for the syn- 
thesis of body proteins, but not in equal concentrations. Ala- 
nine and glutamine are present in much higher concentrations 
than any other amino acids. Suggest why. 

4. Distribution of Amino Nitrogen If your diet is rich in 
alanine but deficient in aspartate, will you show signs of aspar- 
tate deficiency? Explain. 

5. Lactate versus Alanine as Metabolic Fuel: The Cost 
of Nitrogen Removal The three carbons in lactate and ala- 
nine have identical oxidation states, and animals can use ei- 
ther carbon source as a metabolic fuel. Compare the net ATP 
yield (moles of ATP per mole of substrate) for the complete 
oxidation (to C0 2 and H 2 0) of lactate versus alanine when the 
cost of nitrogen excretion as urea is included. 
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6. Ammonia Toxicity Resulting from an Arginine- 
Deficient Diet In a study conducted some years ago, cats 
were fasted overnight then given a single meal complete in all 
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amino acids except arginine Within 2 hours, blood ammonia 
levels increased from a normal level of 18 fig/h to 140 jug/L, 
and the cats showed the clinical symptoms of ammonia toxic- 
ity. A control group fed a complete amino acid diet or an amino 
acid diet in which arginine was replaced by ornithine showed 
no unusual clinical symptoms. 

(a) What was the role of fasting in the experiment? 

(b) What caused the ammonia levels to rise in the experi- 
mental group? Why did the absence of arginine lead to 
ammonia toxicity? Is arginine an essential amino acid in cats? 
Why or why not? 

(c) Why can ornithine be substituted for arginine? 

7. Oxidation of Glutamate Write a series of balanced equa- 
tions, and an overall equation for the net reaction, describing 
the oxidation of 2 mol of glutamate to 2 mol of a-ketoglutarate 
and 1 mol of urea. 

8. Transamination and the Urea Cycle Aspartate amino- 
transferase has the highest activity of all the mammalian liver 
aminotransferases. Why? 

9. The Case against the Liquid Protein Diet A 

weight-reducing diet heavily promoted some years ago 
required the daily intake of "liquid protein" (soup of hy- 
drolyzed gelatin), water, and an assortment of vitamins. All 
other food and drink were to be avoided. People on this diet 
typically lost 10 to 14 lb in the first week. 

(a) Opponents argued that the weight loss was almost en- 
tirely due to water loss and would be regained very soon after 
a normal diet was resumed. What is the biochemical basis for 
this argument? 

(b) A few people on this diet died. What are some of the 
dangers inherent in the diet, and how can they lead to death? 

10. Ketogenic Amino Acids Which amino acids are exclu- 
sively ketogenic? 

11. A Genetic Defect in Amino Acid Metabolism: A 
Case History A two-year-old child was taken to the hos- 
pital. His mother said that he vomited frequently, especially after 
feedings. The child's weight and physical development were be- 
low normal. His hair, although dark, contained patches of white. A 
urine sample treated with ferric chloride (FeCl 3 ) gave a green 
color characteristic of the presence of phenylpyruvate. Quantita- 
tive analysis of urine samples gave the results shown in the table, 



Concentration (raM) 



Substance 



Patient's urine 



Normal urine 



Phenylalanine 

Phenylpyruvate 

Phenyllactate 



7.0 
48 
10.3 



0.01 

0 

0 



(a) Suggest which enzyme might be deficient in this child. 
Propose a treatment. 

(b) Why does phenylalanine appear in the urine in large 
amounts? 

(c) What is the source of phenylpyruvate and phenyllac- 
tate? Why does this pathway (normally not functional) come 
into play when the concentration of phenylalanine rises? 



(d) Why does the boy's hair contain patches of white? 

12. Role of Cobalamin in Amino Acid Catabolism 

Pernicious anemia is caused by impaired absorption 
of vitamin B 12 . What is the effect of this impairment on the 
catabolism of amino acids? Are all amino acids equally affected? 
(Hint: See Box 17-2.) 

13. Vegetarian Diets Vegetarian diets can provide 
high levels of antioxidants and a lipid profile that can 

help prevent coronary disease. However, there can be some 
associated problems. Blood samples were taken from a large 
group of volunteer subjects who were vegans (strict vegetari- 
ans: no animal products), lactovegetarians (vegetarians who 
eat dairy products) , or omnivores (individuals with a normal, 
varied diet including meat). In each case, the volunteers had 
followed the diet for several years. The blood levels of both ho- 
mocysteine and methylmalonate were elevated in the vegan 
group, somewhat lower in the lactovegetarian group, and 
much lower in the omnivore group. Explain. 

14. Pernicious Anemia Vitamin B 12 deficiency can 
arise from a few rare genetic diseases that lead to low 

B 12 levels despite a normal diet that includes B 12 -rich meat 
and dairy sources. These conditions cannot be treated with 
dietary B 12 supplements, Explain. 

15. Pyridoxal Phosphate Reaction Mechanisms Threo- 
nine can be broken down by the enzyme threonine dehy- 
dratase, which catalyzes the conversion of threonine to 
a-ketobutyrate and ammonia. The enzyme uses PLP as a co- 
factor. Suggest a mechanism for this reaction, based on the 
mechanisms in Figure 18-6. Note that this reaction includes 
an elimination at the p carbon of threonine. 
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16. Pathway of Carbon and Nitrogen in Glutamate Me- 
tabolism When [2- 14 C, 15 N] glutamate undergoes oxidative 
degradation in the liver of a rat, in which atoms of the follow- 
ing metabolites will each isotope be found: (a) urea, (b) succi- 
nate, (c) arginine, (d) citrulline, (e) ornithine, (f) aspartate? 
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I I 
H CH 2 

coo- 

Labeled glutamate 

17. Chemical Strategy of Isoleucine Catabolism 

Isoleucine is degraded in six steps to propionyl-CoA and 
acetyl-CoA. 
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(a) The chemical process of isoleucine degradation in- 
cludes strategies analogous to those used in the citric acid cycle 
and the j8 oxidation of fatty acids. The intermediates of 
isoleucine degradation (I to V) shown below are not in the proper 
order. Use your knowledge and understanding of the citric acid 
cycle and ^-oxidation pathway to arrange the intermediates in 
the proper metabolic sequence for isoleucine degradation. 
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(b) For each step you propose, describe the chemical 
process, provide an analogous example from the citric acid 
cycle or /3-oxidation pathway (where possible), and indicate 
any necessary cofactors. 

18. Role of Pyridoxal Phosphate in Glycine Metabolism 

The enzyme serine hydroxymethyltransferase requires pyridoxal 
phosphate as cofactor. Propose a mechanism for the reaction cat- 
alyzed by this enzyme, in the direction of serine degradation 
(glycine production). (Hint: See Figs 18-19 and 18-20b.) 

19. Parallel Pathways for Amino Acid and Fatty Acid 
Degradation The carbon skeleton of leucine is degraded by a 
series of reactions closely analogous to those of the citric acid 
cycle and £ oxidation. For each reaction, (a) through (f), 
shown at right, indicate its type, provide an analogous exam- 
ple from the citric acid cycle or /3-oxidation pathway (where 
possible), and note any necessary cofactors. 
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Data Analysis Problem 



20. Mapie Syrup Urine Disease Figure 18-28 shows 
the pathway for the degradation of branched-chain 
amino acids and the site of the biochemical defect that causes 
maple syrup urine disease. The initial findings that eventually 
led to the discovery of the defect in this disease were presented 
in three papers published in the late 1950s and early 1960s. 
This problem traces the history of the findings from initial clin- 
ical observations to proposal of a biochemical mechanism. 

Menkes, Hurst, and Craig (1954) presented the cases of 
four siblings, all of whom died following a similar course of 
symptoms. In all four cases, the mother's pregnancy and the 
birth had been normal. The first 3 to 5 days of each child's life 
were also normal. But soon thereafter each child began having 
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convulsions, and the children died between the ages of 1 1 days 
and 3 months. Autopsy showed considerable swelling of the 
brain in all cases. The children's urine had a strong, unusual 
"maple syrup" odor, starting from about the third day of life. 

Menkes (1959) reported data collected from six more chil- 
dren. All showed symptoms similar to those described above, and 
died within 15 days to 20 months of birth. In one case, Menkes 
was able to obtain urine samples during the last months of the in- 
fant's life. When he treated the urine with 2,4-dinitrophenylhy- 
drazone, which forms colored precipitates with keto compounds, 
he found three a-keto acids in unusually large amounts: 



COO* 

I 

c=o 
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CH 2 
CH 3 -CH 
CH 3 

ff-Ketoisocaproate 



COO" 
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CH 3 -CH 
CH 3 



COO" 
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c=o 
I 

CH 3 — CH 
CH 2 
CH 3 



tf-Ketoisovalerate tf-Keto-/3-methyl-ft-valerate 



(a) These a-keto acids are produced by the deamination 
of amino acids. For each of the a-keto acids above, draw and 
name the amino acid from which it was derived. 

Dancis, Levitz, and Westall (1960) collected further data 
that led them to propose the biochemical defect shown in Fig- 
ure 18-28. In one case, they examined a patient whose urine 
first showed the maple syrup odor when he was 4 months old. 
At the age of 10 months (March 1956), the child was admitted 
to the hospital because he had a fever, and he showed grossly 
retarded motor development. At the age of 20 months (Janu- 
ary 1957), he was readmitted and was found to have the de- 
generative neurological symptoms seen in previous cases of 
maple syrup urine disease; he died soon after. Results of his 
blood and urine analyses are shown in the table below, along 
with normal values for each component. 



(b) The table includes taurine, an amino acid not nor- 
mally found in proteins. Taurine is often produced as a by- 
product of cell damage. Its structure is: 

O 

II 

H 3 N— CHo- CH 2 -S- O" 



Based on its structure and the information in this chapter, 
what is the most likely amino acid precursor of taurine? Ex- 
plain your reasoning. 

(c) Compared with the normal values given in the table, 
which amino acids showed significantly elevated levels in the pa- 
tient's blood in January 1957? Which ones in the patient's urine? 

Based on their results and their knowledge of the path- 
way shown in Figure 18-28, Dancis and coauthors concluded: 
"although it appears most likely to the authors that the pri- 
mary block is in the metabolic degradative pathway of the 
branched-chain amino acids, this cannot be considered estab- 
lished beyond question." 

(d) How do the data presented here support this con- 
clusion? 

(e) Which data presented here do not fit this model of 
maple syrup urine disease? How do you explain these seem- 
ingly contradictory data? 

(f) What data would you need to collect to be more secure 
in your conclusion? 
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Urine (mg/24 h) Plasma (mg/ml) 



Amino acid(s) 


Normal 


Patient 


Normal 


Patient 




Mar. 1956 


Jan. 1957 




Jan. 1957 


Alanine 


5-15 


0.2 


0.4 


3.0-4.8 


0.6 


Asparagine and glutamine 


5-15 


0.4 


0 


3 0-5 0 


2.0 


Aspartic acid 


1-2 


0.2 


1.5 


0 1-0 2 


0.04 


Arginine 


1 5-3 


03 


0.7 


0.8-1.4 


0.8 


Cystine 


2-4 


05 


0.3 


1.0-1.5 


0 


Glutamic acid 


1.5-3 


0.7 


1.6 


1.0-1.5 


0.9 


Glycine 


20-40 


46 


20.7 


1.0-2.0 


15 


Histidine 


8-15 


03 


47 


1.0-1.7 


07 


Isoleucine 


2-5 


2.0 


13.5 


0.8-1.5 


2.2 


Leucine 


3-8 


27 


39.4 


1.7-2.4 


14.5 


Lysine 


2-12 


16 


4.3 


1.5-2.7 


1.1 


Methionine 


2-5 


1.4 


1 4 


0 3-0.6 


2.7 


Ornithine 


1-2 


0 


1 3 


0.6-0.8 


05 


Phenylalanine 


2-A 


0.4 


2.6 


1.0-1.7 


08 


Proline 


2-4 


0.5 


0 3 


15-3 0 


0.9 


Serine 


5-15 


1 2 


0 


1.3-2.2 


0.9 


Taurine 


1-10 


0.2 


18.7 


0.9-1.8 


0.4 


Threonine 


5-10 


0.6 


0 


1.2-1.6 


03 


Tryptophan 


3-8 


0.9 


2.3 


Not measured 


0 


Tyrosine 


4-8 


0.3 


3.7 


1.5-2.3 


0.7 


Valine 


2-4 


1 6 


15.4 


2.0-3,0 


13.1 



If an idea presents itself to us, we must not reject it simply because it 
does not agree with the logical deductions of a reigning theory. 

—Claude Bernard, An Introduction to the Study of Experimental 

Medicine, 1813 

The aspect of the present position of consensus that I find most remark- 
able and admirable, is the altruism and generosity with which former 
opponents of the chemiosmotic hypothesis have not only come to 
accept it, but have actively promoted it to the status of a theory. 

—Peter Mitchell, Nobel Address, 1978 
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Oxidative phosphorylation is the culmination of 
energy-yielding metabolism in aerobic organ- 
isms. All oxidative steps in the degradation of 
carbohydrates, fats, and amino acids converge at this fi- 
nal stage of cellular respiration, in which the energy of 
oxidation drives the synthesis of ATP. Photophosphory- 
lation is the means by which photosynthetic organisms 



capture the energy of sunlight — the ultimate source of 
energy in the biosphere — and harness it to make ATP. 
Together, oxidative phosphorylation and photophospho- 
rylation account for most of the ATP synthesized by 
most organisms most of the time. 

In eukaryotes, oxidative phosphorylation occurs in 
mitochondria, photophosphorylation in chloroplasts. 
Oxidative phosphorylation involves the reduction of 0 2 
to H 2 0 with electrons donated by NADH and FADH 2 ; it 
occurs equally well in light or darkness. Photophospho- 
rylation involves the oxidation of H 2 0 to 0 2 , with 
NADP + as ultimate electron acceptor; it is absolutely 
dependent on the energy of light. Despite their differ- 
ences, these two highly efficient energy-converting 
processes have fundamentally similar mechanisms. 

Our current understanding of ATP synthesis in mi- 
tochondria and chloroplasts is based on the hypothesis, 
introduced by Peter Mitchell in 1961, that transmem- 
brane differences in proton concentration are the reser- 
voir for the energy extracted from biological oxidation 
reactions. This chemiosmotic theory has been ac- 
cepted as one of the great unifying principles of twenti- 
eth century biology. It provides insight into the processes 
of oxidative phosphorylation and photophosphorylation, 
and into such apparently disparate energy transduc- 
tions as active transport across membranes and the 
motion of bacterial flagella. 

Oxidative phosphorylation and photophosphoryla- 
tion are mechanistically similar in three respects. (1) 
Both processes involve the flow of electrons through a 
chain of membrane-bound carriers. (2) The free energy 
made available by this "downhill" (exergonic) electron 
flow is coupled to the "uphill" transport of protons 
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across a proton-impermeable membrane, conserving the 
free energy of fuel oxidation as a transmembrane elec- 
trochemical potential (p. 390). (3) The transmembrane 
flow of protons down their concentration gradient 
through specific protein channels provides the free en- 
ergy for synthesis of ATP, catalyzed by a membrane pro- 
tein complex (ATP synthase) that couples proton flow 
to phosphorylation of ADP 

The chapter begins with oxidative phosphorylation. 
We first describe the components of the electron-trans- 
fer chain, their organization into large functional com- 
plexes in the inner mitochondrial membrane, the path 
of electron flow through them, and the proton move- 
ments that accompany this flow. We then consider the 
remarkable enzyme complex that, by "rotational cataly- 
sis," captures the energy of proton flow in ATP, and the 
regulatory mechanisms that coordinate oxidative phos- 
phorylation with the many catabolic pathways by which 
fuels are oxidized. We also describe the roles that mito- 
chondria play in thermogenesis, steroid synthesis, and 
apoptosis. With this understanding of mitochondrial ox- 
idative phosphorylation, we turn to photophosphoryla- 
tion, looking first at the absorption of light by 
photosynthetic pigments, then at the light-driven flow 
of electrons from H 2 0 to NADP + and the molecular ba- 
sis for coupling electron and proton flow. We also con- 
sider the similarities of structure and mechanism 
between the ATP synthases of chloroplasts and mito- 
chondria, and the evolutionary basis for this conserva- 
tion of mechanism. 

OXIDATIVE PHOSPHORYLATION 

19,1 Electron-Transfer Reactions in 
Mitochondria 

The discovery in 1948 by Eugene Kennedy and Albert 
Lehninger that mitochondria are the site of oxidative phos- 
phorylation in eukaryotes marked the beginning of the 
modern phase of studies in biological energy transductions. 
Mitochondria, like gram-negative bacteria, have two 
membranes (Fig. 19-1). The outer mitochondrial mem- 
brane is readily permeable to small molecules (M T <5,000) 
and ions, which move freely 
through transmembrane chan- 
nels formed by a family of inte- 
gral membrane proteins called 
porins. The inner membrane is 
impermeable to most small mol- 
ecules and ions, including pro- 
tons (H + ); the only species that 
cross this membrane do so 
through specific transporters. 
The inner membrane bears the 
components of the respiratory 
chain and the ATP synthase. 
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• Many other enzymes 

•ATP, ADP, Pj, Mg 2 * Ca 2+ , K + 

• Many soluble metabolic 
intermediates 



FIGURE 19-1 Biochemical anatomy of a mitochondrion. The convolu- 
tions (cristae) of the inner membrane provide a very large surface area. 
The inner membrane of a single liver mitochondrion may have more 
than 10,000 sets of electron-transfer systems (respiratory chains) and 
ATP synthase molecules, distributed over the membrane surface. The 
mitochondria of heart muscle, which have more profuse cristae and 
thus a much larger area of inner membrane, contain more than three 
times as many sets of electron-transfer systems as liver mitochondria. 
The mitochondrial pool of coenzymes and intermediates is function- 
ally separate from the cytosolic pool. The mitochondria of inverte- 
brates, plants, and microbial eukaryotes are similar to those shown 
here, but with much variation in size, shape, and degree of convolu- 
tion of the inner membrane. 



The mitochondrial matrix, enclosed by the inner 
membrane, contains the pyruvate dehydrogenase com- 
plex and the enzymes of the citric acid cycle, the fatty 
acid £f-oxidation pathway, and the pathways of amino 
acid oxidation — all the pathways of fuel oxidation except 
glycolysis, which takes place in the cytosol. The selec- 
tively permeable inner membrane segregates the inter- 
mediates and enzymes of cytosolic metabolic pathways 
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from those of metabolic processes occurring in the ma- 
trix. However, specific transporters carry pyruvate, fatty 
acids, and amino acids or their a-keto derivatives into the 
matrix for access to the machinery of the citric acid cycle. 
ADP and Pi are specifically transported into the matrix as 
newly synthesized ATP is transported out. 

Electrons Are Funneled to Universal 
Electron Acceptors 

Oxidative phosphorylation begins with the entry of 
electrons into the respiratory chain. Most of these elec- 
trons arise from the action of dehydrogenases that col- 
lect electrons from catabolic pathways and funnel 
them into universal electron acceptors — nicotinamide 
nucleotides (NAD + or NADP + ) or flavin nucleotides 
(FMN or FAD). 

Nicotinamide nucleotide-linked dehydroge- 
nases catalyze reversible reactions of the following gen- 
eral types: 

Reduced substrate + NAD + 

oxidized substrate + NADH + H + 

Reduced substrate + NADP + 

oxidized substrate + NADPH + H + 

Most dehydrogenases that act in catabolism are specific 
for NAD 4 " as electron acceptor (Table 19-1). Some are 
in the cytosol, others are in mitochondria, and still oth- 
ers have mitochondrial and cytosolic isozymes. 

NAD-linked dehydrogenases remove two hydrogen 
atoms from their substrates. One of these is transferred 
as a hydride ion (:H~) to NAD + ; the other is released as 
H + in the medium (see Fig. 13-24). NADH and NADPH 
are water-soluble electron carriers that associate rev- 
ersibly with dehydrogenases. NADH carries electrons 



from catabolic reactions to their point of entry into the 
respiratory chain, the NADH dehydrogenase complex 
described below. NADPH generally supplies electrons to 
anabolic reactions. Cells maintain separate pools of 
NADPH and NADH, with different redox potentials. This 
is accomplished by holding the ratio of [reduced 
form]/[oxidized form] relatively high for NADPH and rel- 
atively low for NADH. Neither NADH nor NADPH can 
cross the inner mitochondrial membrane, but the elec- 
trons they carry can be shuttled across indirectly, as we 
shall see. 

Flavoproteins contain a very tightly, sometimes 
covalently, bound flavin nucleotide, either FMN or FAD 
(see Fig. 13-27). The oxidized flavin nucleotide can ac- 
cept either one electron (yielding the semiquinone 
form) or two (yielding FADH 2 or FMNH 2 ). Electron 
transfer occurs because the flavoprotein has a higher 
reduction potential than the compound oxidized. The 
standard reduction potential of a flavin nucleotide, un- 
like that of NAD or NADP, depends on the protein with 
which it is associated. Local interactions with func- 
tional groups in the protein distort the electron orbitals 
in the flavin ring, changing the relative stabilities of ox- 
idized and reduced forms. The relevant standard reduc- 
tion potential is therefore that of the particular 
flavoprotein, not that of isolated FAD or FMN. The 
flavin nucleotide should be considered part of the flavo- 
protein's active site rather than a reactant or product in 
the electron-transfer reaction. Because flavoproteins 
can participate in either one- or two-electron transfers, 
they can serve as intermediates between reactions in 
which two electrons are donated (as in dehydrogena- 
tions) and those in which only one electron is accepted 
(as in the reduction of a quinone to a hydroquinone, de- 
scribed below). 



^^^^^J Some Important Reactions Catalyzed by NAD(P)H-Linked Dehydrogenases 


Reaction* 


Location 1 


NAD-linked 




a-Ketoglutarate + CoA + NAD + succinyl-CoA + C0 2 + NADH + H + 


M 


L-Malate + NAD + =f=^ oxaloacetate + NADH + H + 


MandC 


Pyruvate + CoA + NAD 4 " acetyl-CoA + C0 2 + NADH + H + 


M 


Glyceraldehyde 3-phosphate + P 4 + NAD + =f=^ 1,3-bisphosphoglycerate + NADH + H + 


C 


Lactate + NAD + pyruvate + NADH + H + 


c 


0-Hydroxyacyl-CoA + NAD + ^f=^ /3-ketoacyl-CoA + NADH + H + 


M 



NADP-linked 

Glucose 6-phosphate + NADP + 6-phosphogluconate + NADPH + H + C 
NAD- or NADP-linked 

L-Glutamate + H 2 0 + NAD(P) + a-ketoglutarate + NH 4 + + NAD(P)H M 
Isocitrate + NAD(P) + a-ketoglutarate + C0 2 + NAD(P)H + H + M and C 



*These reactions and their enzymes are discussed in Chapters 14 through 18. 
*"M designates mitochondria; C, cytosol 
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Electrons Pass through a Series of Membrane-Bound 
Carriers 

The mitochondrial respiratory chain consists of a series 
of sequentially acting electron carriers, most of which 
are integral proteins with prosthetic groups capable of 
accepting and donating either one or two electrons. 
Three types of electron transfers occur in oxidative 
phosphorylation: (1) direct transfer of electrons, as in 
the reduction of Fe 3+ to Fe 2+ ; (2) transfer as a hydro- 
gen atom (H + + and (3) transfer as a hydride ion 
(:H~), which bears two electrons. The term reducing 
equivalent is used to designate a single electron equiv- 
alent transferred in an oxidation-reduction reaction. 

In addition to NAD and flavoproteins, three other 
types of electron-carrying molecules function in the 
respiratory chain: a hydrophobic quinone (ubiquinone) 
and two different types of iron-containing proteins 
(cytochromes and iron-sulfur proteins). Ubiquinone 
(also called coenzyme Q, or simply Q) is a lipid-soluble 
benzoquinone with a long isoprenoid side chain 
(Fig. 19-2). The closely related compounds plasto- 
quinone (of plant chloroplasts) and menaquinone (of 
bacteria) play roles analogous to that of ubiquinone, 
carrying electrons in membrane-associated electron- 
transfer chains. Ubiquinone can accept one electron to 
become the semiquinone radical (*QH) or two electrons 
to form ubiquinol (QH 2 ) (Fig. 19-2) and, like flavoprotein 
carriers, it can act at the junction between a two-electron 
donor and a one-electron acceptor. Because ubiquinone 
is both small and hydrophobic, it is freely diffusible within 
the lipid bilayer of the inner mitochondrial membrane 
and can shuttle reducing equivalents between other, less 
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FIGURE 19-2 Ubiquinone (Q, or coenzyme Q). Complete reduction 
of ubiquinone requires two electrons and two protons, and occurs in 
two steps through the semiquinone radical intermediate. 



mobile electron carriers in the membrane. And because it 
carries both electrons and protons, it plays a central role 
in coupling electron flow to proton movement. 

The cytochromes are proteins with characteristic 
strong absorption of visible light, due to their iron-con- 
taining heme prosthetic groups (Fig. 19-3). Mitochon- 
dria contain three classes of cytochromes, designated a, 
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FIGURE 19-3 Prosthetic groups of cytochromes. Each group 
consists of four five-membered, nitrogen-containing rings in 
a cyclic structure called a porphyrin. The four nitrogen atoms 
are coordinated with a central Fe ion, either Fe 2+ or Fe 3 + . 
Iron protoporphyrin IX is found in b-type cytochromes and in 
hemoglobin and myoglobin (see Fig. 4-1 6). Heme c is cova- 
lently bound to the protein of cytochrome c through 
thioether bonds to two Cys residues. Heme a, found in a- 
type cytochromes, has a long isoprenoid tail attached to one 
of the five-membered rings. The conjugated double-bond 
system (shaded pink) of the porphyrin ring accounts for the 
absorption of visible light by these hemes. 
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FIGURE 19-4 Absorption spectra of cytochrome c (cyt c) in its oxidized 
(red) and reduced (blue) forms. Also labeled are the characteristic a, (3, 
and y bands of the reduced form. 



b, and c, which are distinguished by differences in their 
light-absorption spectra. Each type of cytochrome in its 
reduced (Fe 2+ ) state has three absorption bands in the 
visible range (Fig. 19-4). The longest-wavelength band 
is near 600 nm in type a cytochromes, near 560 nm in 
type 5, and near 550 nm in type c. To distinguish among 
closely related cytochromes of one type, the exact ab- 
sorption maximum is sometimes used in the names, as in 
cytochrome 6 562 . 

The heme cofactors of a and b cytochromes are 
tightly, but not covalently, bound to their associated 
proteins; the hemes of c-type cytochromes are cova- 
lently attached through Cys residues (Fig. 19-3). As 
with the flavoproteins, the standard reduction potential 
of the heme iron atom of a cytochrome depends on its 
interaction with protein side chains and is therefore dif- 
ferent for each cytochrome. The cytochromes of type a 
and b and some of type c are integral proteins of the in- 
ner mitochondrial membrane. One striking exception is 



the cytochrome c of mitochondria, a soluble protein 
that associates through electrostatic interactions with 
the outer surface of the inner membrane. 

In iron-sulfur proteins, the iron is present not in 
heme but in association with inorganic sulfur atoms or 
with the sulfur atoms of Cys residues in the protein, or 
both. These iron-sulfur (Fe-S) centers range from sim- 
ple structures with a single Fe atom coordinated to four 
Cys — SH groups to more complex Fe-S centers with 
two or four Fe atoms (Fig. 19-5). Rieske iron-sulfur 
proteins (named after their discoverer, John S. Rieske) 
are a variation on this theme, in which one Fe atom 
is coordinated to two His residues rather than two 
Cys residues. All iron-sulfur proteins participate in one- 
electron transfers in which one iron atom of the iron- 
sulfur cluster is oxidized or reduced. At least eight Fe-S 
proteins function in mitochondrial electron transfer. 
The reduction potential of Fe-S proteins varies from 
-0.65 V to +0.45 V, depending on the microenviron- 
ment of the iron within the protein. 

In the overall reaction catalyzed by the mitochond- 
rial respiratory chain, electrons move from NADH, succi- 
nate, or some other primary electron donor through 
flavoproteins, ubiquinone, iron-sulfur proteins, and cy- 
tochromes, and finally to 0 2 . A look at the methods used 
to determine the sequence in which the carriers act is 
instructive, as the same general approaches have been 
used to study other electron-transfer chains, such as 
those of chloroplasts. 

First, the standard reduction potentials of the indi- 
vidual electron carriers have been determined experi- 
mentally (Table 19-2). We would expect the carriers to 
function in order of increasing reduction potential, be- 
cause electrons tend to flow spontaneously from 
carriers of lower E'° to carriers of higher E !0 . The order 
of carriers deduced by this method is NADH -> Q -> 
cytochrome b — > cytochrome C\ — » cytochrome c — > 
cytochrome a — » cytochrome a 3 — > 0 2 . Note, however, 
that the order of standard reduction potentials is not 
necessarily the same as the order of actual reduction 
potentials under cellular conditions, which depend on 
the concentration of reduced and oxidized forms (see 
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FIGURE 19-5 Iron-sulfur centers. The Fe-S centers of iron-sulfur pro- 
teins may be as simple as (a), with a single Fe ion surrounded by the 
S atoms of four Cys residues. Other centers include both inorganic and 
Cys S atoms, as in (b) 2Fe-2S or (c) 4Fe-4S centers, (d) The ferredoxin 
of the cyanobacterium Anabaena 7120 has one 2Fe~2S center (PDB ID 
1FRD); Fe is red, inorganic S is yellow, and the S of Cys is orange. 



(Note that in these designations only the inorganic S atoms are 
counted. For example, in the 2Fe-2S center (b), each Fe ion is actually 
surrounded by four S atoms.) The exact standard reduction potential of 
the iron in these centers depends on the type of center and its interac- 
tion with the associated protein. 
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TABLE 19-2 



Standard Reduction Potentials of Respiratory Chain and Related Electron Carriers 
Redox reaction (half-reaction) 



E'° (V) 



2H + + 2e~ > H 2 

NAD + + H + + 2e~ - 



> NADH 

NADP+ + H + + 2e~ > NADPH 

NADH dehydrogenase (FMN) + 2H + + 2e~ 
Ubiquinone + 2H + + 2e~ > ubiquinol 



NADH dehydrogenase (FMNH 2 ) 



Cytochrome b (Fe 3+ ) + e~ — 
Cytochrome c t (Fe 3+ ) + e~ — 
Cytochrome c (Fe 3+ ) + e~ — 
Cytochrome a (Fe 3+ ) + e~ — 
Cytochrome a 3 (Fe 3+ ) + e~ - 
|0 2 + 2H + + 2e~ > H 2 0 



■> cytochrome b (Fe ) 
~> cytochrome c t (Fe 2+ ) 



-> cytochrome c (Fe ) 



-> cytochrome a (Fe ) 
— » cytochrome a 3 (Fe 2+ ) 



-0.414 
-0.320 
-0.324 
-0.30 

0.045 

0.077 

0.22 

0.254 

0.29 

0.35 

0.8166 



Eqn 13-5, p. 515). A second method for determining the 
sequence of electron carriers involves reducing the en- 
tire chain of carriers experimentally by providing an 
electron source but no electron acceptor (no 0 2 ). When 
0 2 is suddenly introduced into the system, the rate at 
which each electron carrier becomes oxidized (mea- 
sured spectroscopically) reveals the order in which the 
carriers function. The carrier nearest 0 2 (at the end of 
the chain) gives up its electrons first, the second carrier 
from the end is oxidized next, and so on. Such experi- 
ments have confirmed the sequence deduced from stan- 
dard reduction potentials. 

In a final confirmation, agents that inhibit the flow of 
electrons through the chain have been used in combina- 
tion with measurements of the degree of oxidation of 
each carrier. In the presence of 0 2 and an electron 
donor, carriers that function before the inhibited step 
become fully reduced, and those that function after this 
step are completely oxidized (Fig. 19-6). By using 
several inhibitors that block different steps in the chain, 
investigators have determined the entire sequence; it is 
the same as deduced in the first two approaches. 



Electron Carriers Function in Multienzyme Complexes 

The electron carriers of the respiratory chain are 
organized into membrane-embedded supramolecular 
complexes that can be physically separated. Gentle 
treatment of the inner mitochondrial membrane with 
detergents allows the resolution of four unique electron- 
carrier complexes, each capable of catalyzing electron 
transfer through a portion of the chain (Table 19-3; 
Fig. 19-7). Complexes I and II catalyze electron trans- 
fer to ubiquinone from two different electron donors: 
NADH (Complex I) and succinate (Complex II). Com- 
plex III carries electrons from reduced ubiquinone to cy- 
tochrome c, and Complex IV completes the sequence by 
transferring electrons from cytochrome c to 0 2 . 

We now look in more detail at the structure and 
function of each complex of the mitochondrial respira- 
tory chain. 

Complex I: NADH to Ubiquinone Figure 19-8 illus- 
trates the relationship between Complexes I and II and 
ubiquinone. Complex I, also called NADH:ubiquinone 



I rotenone | 

NADH > Q - — > Cyt b > Cyt c x > Cyt c > Cyt (a + a 3 ) > 0 2 



I antimycin A | 

NADH > Q > Cyt b > Cyt c x > Cyt c > Cyt (a + a 3 ) > 0 2 



NADH > Q > Cyt b — > Cyt c x 

FIGURE 19-6 Method for determining the sequence of electron carri- 
ers. This method measures the effects of inhibitors of electron transfer 
on the oxidation state of each carrier. In the presence of an electron 



CN" or CO 



> Cyt c > Cyt (a + a B ) > 0 2 

donor and 0 2/ each inhibitor causes a characteristic pattern of oxidized/ 
reduced carriers: those before the block become reduced (blue), and 
those after the block become oxidized (pink). 
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The Protein Components of the Mitochondrial Electron-Transfer Chain 

Enzyme complex/protein Mass (kDa) Number of subunits* 



Prosthetic group(s) 



I NADH dehydrogenase 

II Succinate dehydrogenase 

III Ubiquinone: cytochrome c oxidoreductase 
Cytochrome c 1 

IV Cytochrome oxidase 



850 
140 
250 
13 
160 



43 (14) 

4 
11 

1 

13 (3^) 



FMN, Fe-S 
FAD, Fe-S 
Hemes, Fe-S 
Heme 

Hemes; Cu A , Cu B 



*Numbers of subunits in the bacterial equivalents in parentheses. 

Cytochrome c is not part of an enzyme complex; it moves between Complexes III and IV as a freely soluble protein. 
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Reactions catalyzed by isolated fractions in vitro 

FIGURE 19-7 Separation of functional complexes of the respiratory chain. 

The outer mitochondrial membrane is first removed by treatment with the 
detergent digitonin. Fragments of inner membrane are then obtained by os- 
motic rupture of the mitochondria, and the fragments are gently dissolved 
in a second detergent. The resulting mixture of inner membrane proteins is 
resolved by ion-exchange chromatography into different complexes (I 
through IV) of the respiratory chain, each with its unique protein composi- 
tion (see Table 19-3), and the enzyme ATP synthase (sometimes called 
Complex V). The isolated Complexes I through IV catalyze transfers be- 
tween donors (NADH and succinate), intermediate carriers (Q and cy- 
tochrome c), and 0 2 , as shown. In vitro, isolated ATP synthase has only 
ATP-hydrolyzing (ATPase), not ATP-synthesizing, activity. 
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FIGURE 19-8 Path of electrons from NADH, succinate, fatty acyl-CoA, 
and glycerol 3-phosphate to ubiquinone. Electrons from NADH pass 
through a flavoprotein to a series of iron-sulfur proteins (in Complex I) 
and then to Q. Electrons from succinate pass through a flavoprotein 
and several Fe-S centers (in Complex II) on the way to Q. Glycerol 3- 
phosphate donates electrons to a flavoprotein (glycerol 3-phosphate 
dehydrogenase) on the outer face of the inner mitochondrial membrane, 
from which they pass to Q. Acyl-CoA dehydrogenase (the first enzyme of 
/3 oxidation) transfers electrons to electron-transferring flavoprotein 
(ETF), from which they pass to Q via ETF:ubiquinone oxidoreductase. 



oxidoreductase or NADH dehydrogenase, is a large 
enzyme composed of 42 different polypeptide chains, 
including an FMN-containing flavoprotein and at least six 
iron-sulfur centers. High-resolution electron microscopy 
shows Complex I to be L-shaped, with one arm of the L in 
the membrane and the other extending into the matrix. 
As shown in Figure 19-9, Complex I catalyzes two simul- 
taneous and obligately coupled processes: (1) the exer- 
gonic transfer to ubiquinone of a hydride ion from NADH 
and a proton from the matrix, expressed by 



NADH + H + + Q 



NAD + + QH 2 



(19-1) 



and (2) the endergonic transfer of four protons from the 
matrix to the intermembrane space. Complex I is there- 
fore a proton pump driven by the energy of electron 
transfer, and the reaction it catalyzes is vectorial: it 
moves protons in a specific direction from one location 
(the matrix, which becomes negatively charged with the 



[7,4] 



Oxidative Phosphorylation and Photophosphorylation 



Intermembrane 
space (p side) 



Matrix 
(n side) 




NADH + H + 



departure of protons) to another (the intermembrane 
space, which becomes positively charged). To empha- 
size the vectorial nature of the process, the overall reac- 
tion is often written with subscripts that indicate the 



FIGURE 19-9 NADH:ubiquinone oxidoreductase (Complex I). Com- 
plex I catalyzes the transfer of a hydride ion from NADH to FMN, from 
which two electrons pass through a series of Fe-S centers to the iron- 
sulfur protein N-2 in the matrix arm of the complex. The domain that 
extends into the matrix has been crystallized and its structure solved 
(PDB ID 2FUG); the structure of the membrane domain of Complex I 
is not yet known. Electron transfer from N-2 to ubiquinone on the 
membrane arm forms QH 2 , which diffuses into the lipid bilayer. This 
electron transfer also drives the expulsion from the matrix of four pro- 
tons per pair of electrons. The detailed mechanism that couples elec- 
tron and proton transfer in Complex I is not yet known, but probably 
involves a Q cycle similar to that in Complex III in which QH 2 partic- 
ipates twice per electron pair (see Fig. 1 9-1 2). Proton flux produces an 
electrochemical potential across the inner mitochondrial membrane (n 
side negative, p side positive), which conserves some of the energy re- 
leased by the electron-transfer reactions. This electrochemical poten- 
tial drives ATP synthesis. 



location of the protons: p for the positive side of the in- 
ner membrane (the intermembrane space), N for the 
negative side (the matrix): 

NADH + 5HjJ + Q > NAD + + QH 2 + 4H^ (19-2) 

Amytal (a barbiturate drug), rotenone (a plant product 
commonly used as an insecticide), and piericidin A (an 
antibiotic) inhibit electron flow from the Fe-S centers of 
Complex I to ubiquinone (Table 19-4) and therefore 
block the overall process of oxidative phosphorylation. 

Ubiquinol (QH 2 , the fully reduced form; Fig. 19-2) 
diffuses in the inner mitochondrial membrane from 
Complex I to Complex III, where it is oxidized to Q in a 
process that also involves the outward movement of H + . 



TABLE 19-4 



Agents That Interfere with Oxidative Phosphorylation or Photophosphorylation 

T^e of interference Compound* Target/mode of action 



Inhibition of electron transfer 



Inhibition of ATP synthase 



Uncoupling of phosphorylation 
from electron transfer 



Inhibition of ATP-ADP exchange 



Cyanide 

Carbon monoxide 

Antimycin A 

Myxothiazol 
Rotenone 
Amytal 
Piericidin A 

DCMU 

Aurovertin 

Oligomycin 
Venturicidin 

DCCD 

FCCP 
DNP 

Valinomycin 
Thermogenin 

Atractyloside 



Inhibit cytochrome oxidase 

Blocks electron transfer from cytochrome b to cytochrome C\ 

Prevent electron transfer from Fe-S center to ubiquinone 

Competes with Q B for binding site in PSII 
Inhibits Fi 

Inhibit F 0 and CF Q 

Blocks proton flow through F Q and CF G 
Hydrophobic proton carriers 
K + ionophore 

In brown adipose tissue, forms proton-conducting pores in 
inner mitochondrial membrane 

Inhibits adenine nucleotide translocase 



*DCMU is 3-(3,4-dichlorophenyl)-l,l-dimethylurea; DCCD, dicyclohexylcarbodiimide; FCCR cyanide-p-trifluoromethoxyphenylhydrazone; DNR 2,4-dinitrophenol. 
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FIGURE 19-10 Structure of Complex II (succinate dehydrogenase). 

(PDB ID 1ZOY) This complex (shown here is the porcine heart en- 
zyme) has two transmembrane subunits, C and D; the cytoplasmic ex- 
tensions contain subunits A and B. Just behind the FAD in subunit A is 
the binding site for succinate. Subunit B has three sets of Fe-S centers; 
ubiquinone is bound to subunit B; and heme b is sandwiched between 
subunits C and D. Two phosphatidylethanolamine molecules are so 
tightly bound to subunit D that they show up in the crystal structure. 
Electrons move (blue arrows) from succinate to FAD, then through the 
three Fe-S centers to ubiquinone. The heme b is not on the main path 
of electron transfer but protects against the formation of reactive oxy- 
gen species (ROS) by electrons that go astray. 

Complex II: Succinate to Ubiquinone We encoun- 
tered Complex II in Chapter 16 as succinate dehy- 
drogenase, the only membrane-bound enzyme in the 
citric acid cycle (p. 628). Although smaller and simpler 
than Complex I, it contains five prosthetic groups of two 
types and four different protein subunits (Fig. 19-10). 
Subunits C and D are integral membrane proteins, each 
with three transmembrane helices. They contain a heme 
group, heme b, and a binding site for ubiquinone, the fi- 
nal electron acceptor in the reaction catalyzed by Com- 
plex II. Subunits A and B extend into the matrix; they 
contain three 2Fe-2S centers, bound FAD, and a binding 
site for the substrate, succinate. The path of electron 
transfer from the succinate-binding site to FAD, then 
through the Fe-S centers to the Q-binding site, is more 
than 40 A long, but none of the individual electron- 
transfer distances exceeds about 11 A — a reasonable 
distance for rapid electron transfer (Fig. 19-10). 
BH The heme b of Complex II is apparently not in the 
BH direct path of electron transfer; it may serve in- 
stead to reduce the frequency with which electrons "leak" 
out of the system, moving from succinate to molecular 



oxygen to produce the reactive oxygen species (ROS) 
hydrogen peroxide (H 2 0 2 ) and the superoxide radical 

(*0 2 ~), as described below Humans with point mutations 
in Complex II subunits near heme b or the quinone-bind- 
ing site suffer from hereditary paraganglioma. This inher- 
ited condition is characterized by benign tumors of the 
head and neck, commonly in the carotid body, an organ 
that senses 0 2 levels in the blood. These mutations result 
in greater production of ROS and perhaps greater tissue 
damage during succinate oxidation. ■ 

Other substrates for mitochondrial dehydrogenases 
pass electrons into the respiratory chain at the level of 
ubiquinone, but not through Complex II. The first step 
in the /3 oxidation of fatty acyl-CoA, catalyzed by the 
flavoprotein acyl-CoA dehydrogenase (see Fig. 17-8), 
involves transfer of electrons from the substrate to the 
FAD of the dehydrogenase, then to electron-transfer- 
ring flavoprotein (ETF), which in turn passes its elec- 
trons to ETF:ubiquinone oxidoreductase (Fig. 19-8). 
This enzyme transfers electrons into the respiratory 
chain by reducing ubiquinone. Glycerol 3-phosphate, 
formed either from glycerol released by triacylglycerol 
breakdown or by the reduction of dihydroxyacetone 
phosphate from glycolysis, is oxidized by glycerol 3- 
phosphate dehydrogenase (see Fig. 17^). This en- 
zyme is a flavoprotein located on the outer face of the 
inner mitochondrial membrane, and like succinate de- 
hydrogenase and acyl-CoA dehydrogenase it channels 
electrons into the respiratory chain by reducing 
ubiquinone (Fig. 19-8). The important role of glycerol 
3-phosphate dehydrogenase in shuttling reducing 
equivalents from cytosolic NADH into the mitochondrial 
matrix is described in Section 19.2 (see Fig. 19-30). The 
effect of each of these electron-transferring enzymes is 
to contribute to the pool of reduced ubiquinone. QH 2 
from all these reactions is reoxidized by Complex III. 

Complex III: Ubiquinone to Cytochrome c The 

next respiratory complex, Complex III, also called cy- 
tochrome bc x complex or ubiquinone:cytochrome c 
oxidoreductase, couples the transfer of electrons from 
ubiquinol (QH 2 ) to cytochrome c with the vectorial 
transport of protons from the matrix to the intermem- 
brane space. The determinations of the complete struc- 
ture of this huge complex (Fig. 19-11) and of Complex 
IV (below) by x-ray crystallography, achieved between 
1995 and 1998, were landmarks in the study of mito- 
chondrial electron transfer, providing the structural 
framework to integrate the many biochemical observa- 
tions on the functions of the respiratory complexes. 

The functional unit of Complex III is a dimer, with 
the two monomeric units of cytochrome b surrounding a 
"cavern" in the middle of the membrane, in which 
ubiquinone is free to move from the matrix side of the 
membrane (site Q N on one monomer) to the intermem- 
brane space (site Q P of the other monomer) as it shuttles 
electrons and protons across the inner mitochondrial 
membrane (Fig. 19-1 lb). 
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FIGURE 19-11 Cytochrome 6d complex (Complex III). The complex 
is a dimer of identical monomers, each with 11 different subunits. 
(a) The functional core of each monomer is three subunits: cytochrome 
b (green) with its two hemes (b H and b L ); the Rieske iron-sulfur protein 
(purple) with its 2Fe-2S centers; and cytochrome c, (blue) with its 
heme (PDB ID 1BGY). (b) This cartoon view of the complex shows 
how cytochrome and the Rieske iron-sulfur protein project from the 
p surface and can interact with cytochrome c (not part of the functional 
complex) in the intermembrane space. The complex has two distinct 
binding sites for ubiquinone, Q N and Q P , which correspond to the sites 
of inhibition by two drugs that block oxidative phosphorylation. An- 
timycin A, which blocks electron flow from heme b H to Q, binds at 
Q N , close to heme b H on the n (matrix) side of the membrane, Myxoth- 

Based on the structure of Complex III and detailed 
biochemical studies of the redox reactions, a reasonable 
model, the Q cycle, has been proposed for the passage 
of electrons and protons through the complex. The 
net equation for the redox reactions of the Q cycle 
(Fig. 19-12) is 

QH 2 + 2 cytCi (oxidized) + 2H^ > 

Q + 2 cyt c ± (reduced) + 4H^ (19-3) 

The Q cycle accommodates the switch between the two- 
electron carrier ubiquinone and the one-electron 
carriers— cytochromes b m2 , 5 566 , c u and c— and explains 
the measured stoichiometry of four protons translocated 
per pair of electrons passing through Complex III to cy- 
tochrome c. Although the path of electrons through this 
segment of the respiratory chain is complicated, the net 
effect of the transfer is simple: QH 2 is oxidized to Q and 
two molecules of cytochrome c are reduced. 

Cytochrome c is a soluble protein of the intermem- 
brane space. After its single heme accepts an electron 




iazol, which prevents electron flow from QH 2 to the Rieske iron-sulfur 
protein, binds at Q P/ near the 2Fe-2S center and heme b L on the p side. 
The dimeric structure is essential to the function of Complex III. The in- 
terface between monomers forms two caverns, each containing a Q P 
site from one monomer and a Q N site from the other. The ubiquinone 
intermediates move within these sheltered caverns. 

Complex III crystallizes in two distinct conformations (not shown). 
In one, the Rieske Fe-S center is close to its electron acceptor, the heme 
of cytochrome c 1; but relatively distant from cytochrome b and the 
QH 2 -binding site at which the Rieske Fe-S center receives electrons. In 
the other, the Fe-S center has moved away from cytochrome q and to- 
ward cytochrome b. The Rieske protein is thought to oscillate between 
these two conformations as it is first reduced, then oxidized. 

from Complex III, cytochrome c moves to Complex IV to 
donate the electron to a binuclear copper center. 

Complex IV: Cytochrome c to 0 2 In the final step of 
the respiratory chain, Complex IV, also called cy- 
tochrome oxidase, carries electrons from cytochrome 
c to molecular oxygen, reducing it to H 2 0. Complex IV is 
a large enzyme (13 subunits; M r 204,000) of the inner 
mitochondrial membrane. Bacteria contain a form that 
is much simpler, with only three or four subunits, but 
still capable of catalyzing both electron transfer and pro- 
ton pumping. Comparison of the mitochondrial and bac- 
terial complexes suggests that three subunits are critical 
to the function (Fig. 19-13). 

Mitochondrial subunit II contains two Cu ions com- 
plexed with the — SH groups of two Cys residues in a 
binuclear center (Cu A ; Fig. 19-1 3b) that resembles the 
2Fe-2S centers of iron-sulfur proteins. Subunit I con- 
tains two heme groups, designated a and a 3 , and an- 
other copper ion (Cu B ). Heme a s and Cu B form a second 
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QH 2 + Q + cyt c 1 (oxidized) > 

*Q~ + Q + 2Hp + cyt c 1 (reduced) 



Cytc 




QH 2 



QH 2 +*Q + 2Hn + cyt c 1 (oxidized) > 

QH 2 + 2Hp + Q + cyt c r (reduced) 



Net equation: QH 2 + 2 cyt c 1 (oxidized) + 2HJ 



FIGURE 19-12 The Q cycle, shown in two stages. The path of electrons 
through Complex 111 is shown by blue arrows. In the first stage (left), Q 
on the n side is reduced to the semiquinone radical, which in the sec- 
ond stage (right) is converted to QH 2 . Meanwhile, on the p side of the 
membrane, two molecules of QH 2 are oxidized to Q, releasing two 



Q + 2 cyt Ci (reduced) + 4Hp 



protons per Q molecule (four protons in all) into the intermembrane 
space. Each QH 2 donates one electron (via the Rieske Fe-S center) to 
cytochrome q, and one electron (via cytochrome b) to a molecule of 
Q near the n side, reducing it in two steps to QH 2 . This reduction also 
uses two protons per Q, which are taken up from the matrix. 
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FIGURE 19-13 Structure of cytochrome oxidase (Complex IV). This 
complex from bovine mitochondria has 1 3 subunits, but only four core 
proteins are shown here (PDB ID 10CC). (a) Complex IV, with four 
subunits in each of two identical units of a dimer, Subunit I (yellow) has 
two heme groups, a and a 3 , near a single copper ion, Cu B (green 
sphere). Heme a 3 and Cu B form a binuclear Fe-Cu center. Subunit II 
(purple) contains two Cu ions complexed with the — SH groups of two 
Cys residues in a binuclear center, Cu A , that resembles the 2Fe-2S cen- 
ters of iron-sulfur proteins. This binuclear center and the cytochrome 



c-binding site are located in a domain of subunit II that protrudes from 
the p side of the inner membrane (into the intermembrane space). Sub- 
unit III (light blue) is essential for rapid proton movement through sub- 
unit II. The role of subunit IV (green) is not yet known, (b) The binuclear 
center of Cu A . The Cu ions share electrons equally. When the center is 
reduced, the ions have the formal charges Cu 1+ Cu 1+ ; when oxidized, 
Cu 1 5+ Cu 1 5+ . Six amino acids residues are ligands around the Cu ions: 
two His, two Cys, Glu, and Met. 
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binuclear center that accepts electrons from heme a 
and transfers them to 0 2 bound to heme a 3 . 

Electron transfer through Complex IV is from cy- 
tochrome c to the Cu A center, to heme a, to the heme 
a 3 -Cu B center, and finally to 0 2 (Fig. 19-14). For every 
four electrons passing through this complex, the en- 
zyme consumes four "substrate" H + from the matrix (n 
side) in converting 0 2 to 2H 2 0. It also uses the energy of 
this redox reaction to pump one proton outward into the 
intermembrane space (p side) for each electron that 
passes through, adding to the electrochemical potential 
produced by redox-driven proton transport through 
Complexes I and III. The overall reaction catalyzed by 
Complex IV is 

4 Cyt c (reduced) + 8Hn + 0 2 ► 

4 cyt c (oxidized) + 4H^ + 2H 2 0 (19-4) 

This four-electron reduction of 0 2 involves redox centers 
that carry only one electron at a time, and it must occur 



4Cytc 




(pumped) 

FIGURE 19-14 Path of electrons through Complex IV. The three pro- 
teins critical to electron flow are subunits I, II, and III. The larger green 
structure includes the other 10 proteins in the complex. Electron trans- 
fer through Complex IV begins with cytochrome c (top). Two mole- 
cules of reduced cytochrome c each donate an electron to the 
binuclear center Cu A . From here electrons pass through heme a to the 
Fe-Cu center (cytochrome a 3 and Cu B ). Oxygen now binds to heme a 3 
and is reduced to its peroxy derivative {0 2 2 ~; not shown here) by two 
electrons from the Fe-Cu center. Delivery of two more electrons from 
cytochrome c (top, making four electrons in all) converts the 0\~ to 
two molecules of water, with consumption of four "substrate" protons 
from the matrix. At the same time, four protons are pumped from the 
matrix by an as yet unknown mechanism. 



without the release of incompletely reduced intermedi- 
ates such as hydrogen peroxide or hydroxyl free 
radicals — very reactive species that would damage cellu- 
lar components. The intermediates remain tightly bound 
to the complex until completely converted to water. 

Mitochondrial Complexes May Associate in Respirasomes 

There is growing experimental evidence that in the in- 
tact mitochondrion, the respiratory complexes tightly 
associate with each other in the inner membrane to 
form respirasomes, functional combinations of two 
or more electron-transfer complexes. For example, 
when complex III is gently extracted from mitochon- 
drial membranes, it is found to be associated with 
Complex I and remains associated during gentle elec- 
trophoresis. Supercomplexes of Complex III and IV 
can also be isolated, and when viewed with the elec- 
tron microscope are of the right size and shape to ac- 
commodate the crystal structures of both complexes 
(Fig. 19-15). The kinetics of electron flow through 
the series of respiratory complexes would be very dif- 
ferent in the two extreme cases of tight versus no as- 
sociation: (1) if complexes were tightly associated, 
electron transfers would essentially occur through a 
solid state; and (2) if the complexes functioned sepa- 
rately, electrons would be carried between them by 
ubiquinone and cytochrome c. The kinetic evidence 
supports electron transfer through a solid state, and 
thus the respirasome model. 

Cardiolipin, the lipid that is especially abundant in 
the inner mitochondrial membrane (see Figs 10-9 and 
11-2), may be critical to the integrity of respirasomes; 
its removal with detergents, or its absence in certain 
yeast mutants, results in defective mitochondrial elec- 
tron transfer and a loss of affinity between the respira- 
tory complexes. 

The Energy of Electron Transfer Is Efficiently Conserved in a 
Proton Gradient 

The transfer of two electrons from NADH through the 
respiratory chain to molecular oxygen can be written as 

NADH + H + + ±0 2 > NAD + + H 2 0 (19-5) 

This net reaction is highly exergonic. For the redox pair 
NAD + /NADH, E'° is -0.320 V, and for the pair 0 2 /H 2 0, 
E'° is 0.816 V. The AE'° for this reaction is therefore 
1.14 V, and the standard free-energy change (see 
Eqn 13-7, p. 515) is 

AG'° = -n3AE'° (19-6) 

= -2(96.5 kJ/V • mol)(1.14 V) 

= -220 kJ/mol (of NADH) 

This standard free-energy change is based on the as- 
sumption of equal concentrations (1 m) of NADH and 
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(a) 

FIGURE 19-15 A putative respirasome composed of Complexes III 

and IV. (a) Purified supercomplexes containing Complexes III and IV, 
from yeast, visualized by electron microscopy after staining with 
uranyl acetate. The electron densities of hundreds of images were av- 
eraged to yield this composite view, (b) The x-ray structures of one 




(b) 

molecule of Complex III (red; from yeast) and two of Complex IV 
(green; from bovine heart) could be fitted to the electron-density map 
to suggest one possible mode of interaction of these complexes in a 
respirasome. This view is in the plane of the bilayer (yellow). 



NAD + . In actively respiring mitochondria, the actions of 
many dehydrogenases keep the actual [NADH]/[NAD + ] 
ratio well above unity, and the real free-energy change for 
the reaction shown in Equation 19-5 is therefore substan- 
tially greater (more negative) than -220 kJ/mol. A simi- 
lar calculation for the oxidation of succinate shows that 
electron transfer from succinate (E'° for fumarate/succi- 
nate = 0.031 V) to 0 2 has a smaller, but still negative, 
standard free-energy change of about - 150 kJ/mol. 



Much of this energy is used to pump protons out of 
the matrix. For each pair of electrons transferred to 0 2 , 
four protons are pumped out by Complex I, four by 
Complex III, and two by Complex IV (Fig. 19-16). The 
vectorial equation for the process is therefore 

NADH + 11HS + \0 2 >NAD + + 10H? + H 2 0 (19-7) 

The electrochemical energy inherent in this difference 
in proton concentration and separation of charge 
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FIGURE 19-16 Summary of the flow of electrons and protons through 
the four complexes of the respiratory chain. Electrons reach Q 
through Complexes I and II. The reduced Q (QH 2 ) serves as a mobile 
carrier of electrons and protons. It passes electrons to Complex III, 
which passes them to another mobile connecting link, cytochrome c. 
Complex IV then transfers electrons from reduced cytochrome c to 0 2 . 
Electron flow through Complexes I, III, and IV is accompanied by pro- 



ton flow from the matrix to the intermembrane space. Recall that elec- 
trons from p oxidation of fatty acids can also enter the respiratory 
chain through Q (see Fig. 19-8). The structures shown here are from 
several sources: Complex I, Thermus thermophiius (PDB ID 2FUG); 
Complex II, porcine heart (PDB ID 1ZOY); Complex III, bovine heart 
(PDB ID 1BGY); cytochrome c, equine heart (PDB ID 1HRC); Com- 
plex IV, bovine heart (PDB ID 10CC). 
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AG = RT In (Ca/Cj) + Z JA*A 
= 2.3i?TApH + jAtA 



FIGURE 19-17 Proton-motive force. The inner mitochondrial mem- 
brane separates two compartments of different [H + ], resulting in differ- 
ences in chemical concentration (ApH) and charge distribution {Aip) 
across the membrane The net effect is the proton-motive force (AC), 
which can be calculated as shown here. This is explained more fully in 
the text 



represents a temporary conservation of much of the 
energy of electron transfer. The energy stored in such 
a gradient, termed the proton-motive force, has two 
components: (1) the chemical potential energy due 
to the difference in concentration of a chemical 
species (H + ) in the two regions separated by the 
membrane, and (2) the electrical potential energy 
that results from the separation of charge when a pro- 
ton moves across the membrane without a counterion 
(Fig. 19-17). 

As we showed in Chapter 11, the free-energy 
change for the creation of an electrochemical gradient 
by an ion pump is 

AG - RT\n(^j + ZJAip (19-8) 

where C 2 and d are the concentrations of an ion in two 
regions, and C 2 > C x \ Z is the absolute value of its elec- 
trical charge (1 for a proton); and is the transmem- 
brane difference in electrical potential, measured in 
volts. 

For protons at 25 °C, 

ln(J^) = 2.3(log[H + ] P - log[H + ] N ) 

= 2.3(pH N - pH P ) = 2.3 ApH 

and Equation 19-8 reduces to 

AG = 2.3RT ApH + JA*p (19-9) 
= (5.70kJ/mol)ApH + (96.5 kJ/V-moMt// 

In actively respiring mitochondria, the measured Aip is 
0.15 to 0.20 V and the pH of the matrix is about 0.75 
units more alkaline than that of the intermembrane 
space. 



WORKED EXAMPLE 19-1 Energetics of Electron 

Transfer 

Calculate the amount of energy conserved in the proton 
gradient across the inner mitochondrial membrane per 
pair of electrons transferred through the respiratory 
chain from NADH to oxygen. Assume Aijj is 0.15 V and 
the pH difference is 0.75 units. 

Solution: Equation 19-9 gives the free-energy change 
when one mole of protons moves across the inner mem- 
brane. Substituting the measured values for ApH 
(0.75 units) and Ai/f (0.15 V) in this equation gives AG = 
19 kJ/mol (of protons). Because the transfer of two elec- 
trons from NADH to 0 2 is accompanied by the outward 
pumping of 10 protons (Eqn 19-7), roughly 200 kJ (of 
the 220 kJ released by oxidation of 1 mol of NADH) is 
conserved in the proton gradient. 



When protons flow spontaneously down their elec- 
trochemical gradient, energy is made available to do 
work. In mitochondria, chloroplasts, and aerobic bacte- 
ria, the electrochemical energy in the proton gradient 
drives the synthesis of ATP from ADP and Pi. We return 
to the energetics and stoichiometry of ATP synthesis 
driven by the electrochemical potential of the proton 
gradient in Section 19.2. 

Reactive Oxygen Species Are Generated during 
Oxidative Phosphorylation 

Several steps in the path of oxygen reduction in mito- 
chondria have the potential to produce highly reactive 
free radicals that can damage cells. The passage of elec- 
trons from QH 2 to Complex III, and passage of electrons 
from Complex I to QH 2 , involve the radical *Q~ as an in- 
termediate. The 'Q~ can, with a low probability, pass an 
electron to 0 2 in the reaction 

0 2 + e~ > '0 2 

The superoxide free radical thus generated is highly re- 
active; its formation also leads to production of the even 
more reactive hydroxyl free radical, 'OH (Fig. 19-18). 

These reactive oxygen species can wreak havoc, re- 
acting with and damaging enzymes, membrane lipids, 
and nucleic acids. In actively respiring mitochondria, 
0.1% to as much as 4% of the 0 2 used in respiration 
forms *0^" — more than enough to have lethal effects 
unless the free radical is quickly disposed of Factors 
that slow the flow of electrons through the respiratory 
chain increase the formation of superoxide, perhaps by 
prolonging the lifetime of 'O2 generated in the Q cycle. 

To prevent oxidative damage by "0^~ , cells have sev- 
eral forms of the enzyme superoxide dismutase, 
which catalyzes the reaction 

2 "Oa" + 2H + ► H 2 0 2 + 0 2 
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FIGURE 19-18 ROS formation in mitochondria and mitochondrial de- 
fenses. When the rate of electron entry into the respiratory chain and 
the rate of electron transfer through the chain are mismatched, super- 
oxide radical ("OJ) production increases at Complexes I and 111 as the 
partially reduced ubiquinone radical ("Q") donates an electron to 0 2 . 
Superoxide acts on aconitase, a 4Fe-4S protein, to release Fe 2 + . In the 
presence of Fe 2+ , the Fenton reaction leads to formation of the highly 
reactive hydroxyl free radical ("OH). The reactions shown in blue de- 
fend the cell against the damaging effects of superoxide. Reduced glu- 
tathione (GSH; see Fig, 22-27) donates electrons for the reduction of 
H 2 0 2 and of the oxidized Cys residues ( — S — S — ) of enzymes and 
other proteins, and GSH is regenerated from the oxidized form (GSSG) 
by reduction with NADPH. 

The hydrogen peroxide (H 2 0 2 ) thus generated is ren- 
dered harmless by the action of glutathione peroxi- 
dase (Fig. 19-18). Glutathione reductase recycles the 
oxidized glutathione to its reduced form, using electrons 
from the NADPH generated by nicotinamide nucleotide 
transhydrogenase (in the mitochondrion) or by the pen- 
tose phosphate pathway (in the cytosol; see Fig. 14-20). 
Reduced glutathione also serves to keep protein 
sulfhydryl groups in their reduced state, preventing 
some of the deleterious effects of oxidative stress 
(Fig. 19-18). Nicotinamide nucleotide transhydroge- 
nase is critical in this process: it produces the NADPH 
essential for glutathione reductase activity 

Piant Mitochondria Have Alternative Mechanisms for 
Oxidizing NADH 

Plant mitochondria supply the cell with ATP during 
periods of low illumination or darkness by mecha- 
nisms entirely analogous to those used by nonphoto- 
synthetic organisms. In the light, the principal source 
of mitochondrial NADH is a reaction in which glycine, 



produced by a process known as photorespiration, is 
converted to serine (see Fig. 20-21): 

2 Glycine + NAD + ► serine + C0 2 + NH 3 + NADH + H + 

For reasons discussed in Chapter 20, plants must carry 
out this reaction even when they do not need NADH for 
ATP production. To regenerate NAD + from unneeded 
NADH, plant mitochondria transfer electrons from NADH 
directly to ubiquinone and from ubiquinone directly to 
0 2 , bypassing Complexes III and IV and their proton 
pumps. In this process the energy in NADH is dissipated 
as heat, which can sometimes be of value to the plant 
(Box 19-1). Unlike cytochrome oxidase (Complex IV), 
the alternative QH 2 oxidase is not inhibited by cyanide. 
Cyanide-resistant NADH oxidation is therefore the hall- 
mark of this unique plant electron-transfer pathway. 



SUMMARY 19.1 



Electron-Transfer 
Reactions in Mitochondria 



Chemiosmotic theory provides the intellectual 
framework for understanding many biological 
energy transductions, including oxidative 
phosphorylation and photophosphorylation. The 
mechanism of energy coupling is similar in both 
cases: the energy of electron flow is conserved by 
the concomitant pumping of protons across the 
membrane, producing an electrochemical gradient, 
the proton-motive force. 

In mitochondria, hydride ions removed from 
substrates by NAD-linked dehydrogenases donate 
electrons to the respiratory (electron-transfer) 
chain, which transfers the electrons to molecular 
0 2 , reducing it to H 2 0. 

Shuttle systems convey reducing equivalents from 
cytosolic NADH to mitochondrial NADH. Reducing 
equivalents from all NAD-linked dehydrogenations 
are transferred to mitochondrial NADH 
dehydrogenase (Complex I). 

Reducing equivalents are then passed through a 
series of Fe-S centers to ubiquinone, which 
transfers the electrons to cytochrome b, the first 
carrier in Complex III. In this complex, electrons 
take two separate paths through two 6-type 
cytochromes and cytochrome Ci to an Fe-S center. 
The Fe-S center passes electrons, one at a time, 
through cytochrome c and into Complex IV, 
cytochrome oxidase. This copper-containing 
enzyme, which also contains cytochromes a and a 3 , 
accumulates electrons, then passes them to 0 2 , 
reducing it to H 2 0. 

Some electrons enter this chain of carriers through 
alternative paths. Succinate is oxidized by 
succinate dehydrogenase (Complex II), which 
contains a flavoprotein that passes electrons 
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through several Fe-S centers to ubiquinone. 
Electrons derived from the oxidation of fatty acids 
pass to ubiquinone via the electron-transferring 
flavoprotein. 

Potentially harmful reactive oxygen species 
produced in mitochondria are inactivated by a set 



of protective enzymes, including superoxide 
dismutase and glutathione peroxidase. 

Plants, fungi, and unicellular eukaryotes have, in 
addition to the typical cyanide-sensitive path for 
electron transfer, an alternative, cyanide-resistant 
NADH oxidation pathway. 



BOX 19-1 Hot, Stinking Plants and Alternative Respiratory Pathways 



Many flowering plants attract insect pollinators by re- 
leasing odorant molecules that mimic an insect's natural 
food sources or potential egg-laying sites. Plants polli- 
nated by flies or beetles that normally feed on or lay 
their eggs in dung or carrion sometimes use foul- 
smelling compounds to attract these insects. 

One family of stinking plants is the Araceae, which in- 
cludes philodendrons, arum lilies, and skunk cabbages. 
These plants have tiny flowers densely packed on an 
erect structure, the spadix, surrounded by a modified 
leaf, the spathe. The spadix releases odors of rotting flesh 
or dung. Before pollination the spadix also heats up, in 
some species to as much as 20 to 40 °C above the ambient 
temperature. Heat production (thermogenesis) helps 
evaporate odorant molecules for better dispersal, and be- 
cause rotting flesh and dung are usually warm from the 
hyperactive metabolism of scavenging microbes, the heat 
itself might also attract insects. In the case of the eastern 
skunk cabbage (Fig. 1), which flowers in late winter or 
early spring when snow still covers the ground, thermo- 
genesis allows the spadix to grow up through the snow. 

How does a skunk cabbage heat its spadix? The mito- 
chondria of plants, fungi, and unicellular eukaryotes have 
electron-transfer systems that are essentially the same as 
those in animals, but they also have an alternative respira- 
tory pathway. A cyanide-resistant QH 2 oxidase transfers 
electrons from the ubiquinone pool directly to oxygen, by- 
passing the two proton-translocating steps of Complexes 
III and IV (Fig. 2), Energy that might have been conserved 
as ATP is instead released as heat. Plant mitochondria also 




FIGURE 1 Eastern skunk cabbage. 

have an alternative NADH dehydrogenase , insensitive to 
the Complex I inhibitor rotenone (see Table 19-4), that 
transfers electrons from NADH in the matrix directly to 
ubiquinone, bypassing Complex I and its associated proton 
pumping. And plant mitochondria have yet another NADH 
dehydrogenase, on the external face of the inner mem- 
brane, that transfers electrons from NADPH or NADH in 
the intermembrane space to ubiquinone, again bypassing 
Complex I. Thus when electrons enter the alternative res- 
piratory pathway through the rotenone-insensitive NADH 
dehydrogenase, the external NADH dehydrogenase, or 
succinate dehydrogenase (Complex II), and pass to 0 2 via 
the cyanide-resistant alternative oxidase, energy is not 
conserved as ATP but is released as heat. A skunk cabbage 
can use the heat to melt snow, produce a foul stench, or at- 
tract beetles or flies. 




FIGURE 2 Electron carriers of the inner membrane of plant mitochondria. Electrons can flow through Complexes I, III, 
Matrix (n side) and IV, as in animal mitochondria, or through plant-specific alternative carriers by the paths shown with blue arrows. 
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19.2 ATP Synthesis 

How is a concentration gradient of protons transformed 
into ATP? We have seen that electron transfer releases, 
and the proton-motive force conserves, more than 
enough free energy (about 200 kJ) per "mole" of elec- 
tron pairs to drive the formation of a mole of ATP, which 
requires about 50 kJ (p. 503). Mitochondrial oxidative 
phosphorylation therefore poses no thermodynamic 
problem. But what is the chemical mechanism that cou- 
ples proton flux with phosphorylation? 

The chemiosmotic model, proposed by Peter 
Mitchell, is the paradigm for this mechanism. According 
to the model (Fig. 19-19), the electrochemical energy 
inherent in the difference in proton concentration and 
the separation of charge across the inner mitochondrial 
membrane — the proton-motive force — drives the syn- 
thesis of ATP as protons flow passively back into the 
matrix through a proton pore associated with ATP syn- 
thase. To emphasize this crucial role of the proton-mo- 
tive force, the equation for ATP synthesis is sometimes 
written 

ADP + Pi + nii^ ► ATP + H 2 0 + nK^ (19-10) 

Mitchell used "chemiosmotic" to describe enzymatic reac- 
tions that involve, simultaneously, a chemical reaction and 
a transport process. The operational definition of "cou- 
pling" is shown in Figure 19-20. When isolated mito- 
chondria are suspended in a buffer containing ADP, Pi, and 
an oxidizable substrate such as succinate, three easily 
measured processes occur: (1) the substrate is oxidized 



(succinate yields fumarate), (2) 
0 2 is consumed, and (3) ATP is 
synthesized. Oxygen consump- 
tion and ATP synthesis depend 
on the presence of an oxidizable 
substrate (succinate in this case) 
as well as ADP and Pi. 

Because the energy of sub- 
strate oxidation drives ATP 
synthesis in mitochondria, we 




would expect inhibitors of the p e ter Mitchell 
passage of electrons to 0 2 (such 1 920 _ 1 992 
as cyanide, carbon monoxide, 

and antimycin A) to block ATP synthesis (Fig. 19-20a). 
More surprising is the finding that the converse is also 
true: inhibition of ATP synthesis blocks electron transfer 
in intact mitochondria. This obligatory coupling can be 
demonstrated in isolated mitochondria by providing 0 2 
and oxidizable substrates, but not ADP (Fig. 19-20b). 
Under these conditions, no ATP synthesis can occur and 
electron transfer to 0 2 does not proceed. Coupling of ox- 
idation and phosphorylation can also be demonstrated 
using oligomycin or venturicidin, toxic antibiotics that 
bind to the ATP synthase in mitochondria. These com- 
pounds are potent inhibitors of both ATP synthesis and 
the transfer of electrons through the chain of carriers to 
0 2 (Fig. 19-20b). Because oligomycin is known to inter- 
act not directly with the electron carriers but with ATP 
synthase, it follows that electron transfer and ATP syn- 
thesis are obligately coupled: neither reaction occurs 
without the other. 




FIGURE 19-19 Chemiosmotic model. In this simple representation of 
the chemiosmotic theory applied to mitochondria, electrons from 
NADH and other oxidizable substrates pass through a chain of carriers 
arranged asymmetrically in the inner membrane. Electron flow is ac- 
companied by proton transfer across the membrane, producing both a 



chemical gradient (ApH) and an electrical gradient (At/r). The inner mito- 
chondrial membrane is impermeable to protons; protons can reenter the 
matrix only through proton-specific channels (F 0 ). The proton-motive 
force that drives protons back into the matrix provides the energy for ATP 
synthesis, catalyzed by the complex associated with F G . 
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FIGURE 19-20 Coupling of electron transfer and ATP synthesis in mi- 
tochondria. In experiments to demonstrate coupling, mitochondria are 
suspended in a buffered medium and an 0 2 electrode monitors 0 2 
consumption. At intervals, samples are removed and assayed for the 
presence of ATP. (a) Addition of ADP and Pj alone results in little or no 
increase in either respiration (0 2 consumption; black) or ATP synthesis 
(red). When succinate is added, respiration begins immediately and 
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ATP is synthesized. Addition of cyanide (CN~), which blocks electron 
transfer between cytochrome oxidase (Complex IV) and 0 2 , inhibits both 
respiration and ATP synthesis, (b) Mitochondria provided with succinate 
respire and synthesize ATP only when ADP and Pj are added. Subse- 
quent addition of venturicidin or oligomycin, inhibitors of ATP synthase, 
blocks both ATP synthesis and respiration. Dinitrophenol (DNP) is an un- 
couple^ allowing respiration to continue without ATP synthesis. 



Chemiosmotic theory readily explains the dependence 
of electron transfer on ATP synthesis in mitochondria. 
When the flow of protons into the matrix through the pro- 
ton channel of ATP synthase is blocked (with oligomycin, 
for example) , no path exists for the return of protons to the 
matrix, and the continued extrusion of protons driven by 
the activity of the respiratory chain generates a large pro- 
ton gradient. The proton-motive force builds up until the 
cost (free energy) of pumping protons out of the matrix 
against this gradient equals or exceeds the energy released 
by the transfer of electrons from NADH to 0 2 . At this point 
electron flow must stop; the free energy for the overall 
process of electron flow coupled to proton pumping be- 
comes zero, and the system is at equilibrium. 

Certain conditions and reagents, however, can un- 
couple oxidation from phosphorylation. When intact mi- 
tochondria are disrupted by treatment with detergent or 
by physical shear, the resulting membrane fragments 
can still catalyze electron transfer from succinate or 
NADH to 0 2? but no ATP synthesis is coupled to this res- 
piration. Certain chemical compounds cause uncoupling 
without disrupting mitochondrial structure. Chemical 
uncouplers include 2,4-dinitrophenol (DNP) and car- 
bonylcyanide-p-trifluoromethoxyphenylhydrazone 
(FCCP) (Table 19-4; Fig. 19-21), weak acids with hy- 
drophobic properties that permit them to diffuse readily 
across mitochondrial membranes. After entering the 
matrix in the protonated form, they can release a pro- 
ton, thus dissipating the proton gradient. Ionophores 
such as valinomycin (see Fig. 11-45) allow inorganic 
ions to pass easily through membranes. Ionophores un- 
couple electron transfer from oxidative phosphorylation by 
dissipating the electrical contribution to the electrochemi- 
cal gradient across the mitochondrial membrane. 

A prediction of the chemiosmotic theory is that, be- 
cause the role of electron transfer in mitochondrial ATP 
synthesis is simply to pump protons to create the elec- 
trochemical potential of the proton-motive force, an ar- 



tificially created proton gradient should be able to re- 
place electron transfer in driving ATP synthesis. This 
has been experimentally confirmed (Fig. 19-22). Mito- 
chondria manipulated so as to impose a difference of 
proton concentration and a separation of charge across 
the inner membrane synthesize ATP in the absence of 
an oxidizable substrate; the proton-motive force alone 
suffices to drive ATP synthesis. 
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FIGURE 19-21 Two chemical uncouplers of oxidative phosphorylation. 

Both DNP and FCCP have a dissociable proton and are very hydropho- 
bic. They carry protons across the inner mitochondrial membrane, dissi- 
pating the proton gradient. Both also uncouple photophosphorylation 
(see Fig. 19-63). 
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FIGURE 19-22 Evidence for the role of a proton gradient in ATP syn- 
thesis. An artificially imposed electrochemical gradient can drive ATP 
synthesis in the absence of an oxidizable substrate as electron donor. 
In this two-step experiment, (a) isolated mitochondria are first incu- 
bated in a pH 9 buffer containing 0.1 m KCI. Slow leakage of buffer 
and KCI into the mitochondria eventually brings the matrix into equi- 
librium with the surrounding medium. No oxidizable substrates are 
present, (b) Mitochondria are now separated from the pH 9 buffer and 
resuspended in pH 7 buffer containing valinomycin but no KCI. The 
change in buffer creates a difference of two pH units across the inner 
mitochondrial membrane. The outward flow of K + , carried (by valino- 
mycin) down its concentration gradient without a counterion, creates 
a charge imbalance across the membrane (matrix negative). The sum of 
the chemical potential provided by the pH difference and the electri- 
cal potential provided by the separation of charges is a proton-motive 
force large enough to support ATP synthesis in the absence of an oxi- 
dizable substrate. 



ATP Synthase Has Two Functional Domains, 

F 0 and h 

Mitochondrial ATP synthase is an F-type ATPase (see Fig. 
11-39) similar in structure and mechanism to the ATP 
synthases of chloroplasts and bacteria. This large enzyme 
complex of the inner mitochondrial membrane catalyzes 
the formation of ATP from ADP and P i? accompanied by 
the flow of protons from the p to the n side of the mem- 
brane (Eqn 19-10). ATP synthase, also called Complex V, 
has two distinct components: F l5 a peripheral membrane 
protein, and F 0 (o denoting otigomycin-sensitive), which 
is integral to the membrane. F ls the first factor recognized 




Efraim Racker, 1913-1991 



as essential for oxidative phos- 
phorylation, was identified and 
purified by Efraim Racker and 
his colleagues in the early 1960s. 

In the laboratory, small mem- 
brane vesicles formed from in- 
ner mitochondrial membranes 
carry out ATP synthesis coupled 
to electron transfer. When F x is 
gently extracted, the "stripped" 
vesicles still contain intact respi- 
ratory chains and the F 0 portion 
of ATP synthase. The vesicles 

can catalyze electron transfer from NADH to 0 2 but cannot 
produce a proton gradient: F 0 has a proton pore through 
which protons leak as fast as they are pumped by electron 
transfer, and without a proton gradient the F : -depleted 
vesicles cannot make ATP. Isolated F x catalyzes ATP hy- 
drolysis (the reversal of synthesis) and was therefore 
originally called FxATPase. When purified F x is added 
back to the depleted vesicles, it reassociates with F 0J 
plugging its proton pore and restoring the membrane's 
capacity to couple electron transfer and ATP synthesis. 

ATP Is Stabilized Relative to ADP on the 
Surface of F 1 

Isotope exchange experiments with purified F : reveal a 
remarkable fact about the enzyme's catalytic mecha- 
nism: on the enzyme surface, the reaction ADP + Pi 
ATP + H 2 0 is readily reversible — the free-energy 
change for ATP synthesis is close to zero! When ATP 
is hydrolyzed by F x in the presence of 18 0-labeled wa- 
ter, the Pi released contains an 18 0 atom. Careful meas- 
urement of the 18 0 content of Pi formed in vitro by 
Fx-catalyzed hydrolysis of ATP reveals that the Pi has 
not one, but three or four 18 0 atoms (Fig. 19-23). This 
indicates that the terminal pyrophosphate bond in ATP 
is cleaved and re-formed repeatedly before P { leaves the 
enzyme surface. With P { free to tumble in its binding 
site, each hydrolysis inserts 18 0 randomly at one of the 
four positions in the molecule. This exchange reaction 
occurs in unenergized F 0 Fx complexes (with no proton 
gradient) and with isolated F 1 — the exchange does not 
require the input of energy. 

Kinetic studies of the initial rates of ATP synthesis 
and hydrolysis confirm the conclusion that AG'° for ATP 
synthesis on the enzyme is near zero. From the mea- 
sured rates of hydrolysis (k x = 10 s" 1 ) and synthesis 
(/c_! = 24 s" 1 ), the calculated equilibrium constant for 
the reaction 



is 



Enz-ATP 



Enz-(ADP + Pi) 



24 s" 
10 s" 



= 2.4 



From this K' eq , the calculated apparent AG'° is close to 
zero. This is much different from the K' eq of about 10 5 
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FIGURE 19-23 Catalytic mechanism of F lt (a) 18 Oexchanee experiment, 
Ft solubilized from mitochondrial membranes is incubated with ATP in 
the presence of 18 0-labeled water. At intervals, a sample of the solution is 
withdrawn and analyzed for the incorporation of 1 s O into the Pj produced 
from ATP hydrolysis. In minutes, the Pj contains three or four 18 0 atoms, 
indicating that both ATP hydrolysis and ATP synthesis have occurred sev- 
eral times during the incubation, (b) The likely transition state complex for 

(AG'° = -30.5 kJ/mol) for the hydrolysis of ATP free in 
solution (not on the enzyme surface). 

What accounts for the huge difference? ATP syn- 
thase stabilizes ATP relative to ADP 4- P { by binding ATP 
more tightly, releasing enough energy to counterbalance 
the cost of making ATP. Careful measurements of the 
binding constants show that F Q F 1 binds ATP with very 
high affinity (K d < 10" 12 m) and ADP with much lower 
affinity (K d « 10" 5 m). The difference in K d corresponds 
to a difference of about 40 kJ/mol in binding energy, and 
this binding energy drives the equilibrium toward for- 
mation of the product ATP. 

The Proton Gradient Drives the Release of ATP from the 
Enzyme Surface 

Although ATP synthase equilibrates ATP with ADP 4- P i5 
in the absence of a proton gradient the newly synthesized 
ATP does not leave the surface of the enzyme. It is the 



ATP hydrolysis and synthesis by ATP synthase {derived from PDB ID 
1 BMF). The a subunit is shown in green, (3 in gray. The positively charged 
residues jS-Arg 182 and a-Arg 376 coordinate two oxygens of the pentavalent 
phosphate intermediate; /3-Lys 155 interacts with a third oxygen, and the 
Mg 2 + ion (green sphere) further stabilizes the intermediate. The blue 
sphere represents the leaving group (H 2 0). These interactions result in the 
ready equilibration of ATP and ADP + P, in the active site. 

proton gradient that causes the enzyme to release the 
ATP formed on its surface. The reaction coordinate dia- 
gram of the process (Fig. 19-24) illustrates the differ- 
ence between the mechanism of ATP synthase and that of 
many other enzymes that catalyze endergonic reactions. 

For the continued synthesis of ATP, the enzyme 
must cycle between a form that binds ATP very tightly 
and a form that releases ATP. Chemical and crystallo- 
graphic studies of the ATP synthase have revealed the 
structural basis for this alternation in function. 

Each p Subunit of ATP Synthase Can Assume Three 
Different Conformations 

Mitochondrial F x has nine subunits of five different 
types, with the composition a s /3 3 y8e. Each of the three 
/3 subunits has one catalytic site for ATP synthesis. The 
crystallographic determination of the F : structure by 
John E. Walker and colleagues revealed structural details 



FIGURE 19-24 Reaction coordinate di agrams for ATP 
synthase and for a more typical enzyme, in a typical 
enzyme-catalyzed reaction (left), reaching the transition 
state (*) between substrate and product is the major en- 
ergy barrier to overcome. In the reaction catalyzed by 
ATP synthase (right), release of ATP from the enzyme, 
not formation of ATP, is the major energy barrier. The 
free-energy change for the formation of ATP from ADP 
and P f in aqueous solution is large and positive, but on 
the enzyme surface, the very tight binding of ATP pro- 
vides sufficient binding energy to bring the free energy 
of the enzyme-bound ATP close to that of ADP + Pj, so 
the reaction is readily reversible. The equilibrium con- 
stant is near 1 . The free energy required for the release of 
ATP is provided by the proton-motive force. 
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very helpful in explaining the catalytic mechanism of the 
enzyme. The knoblike portion of F 1 is a flattened sphere, 
8 nm high and 10 run across, consisting of alternating a 
and [3 subunits arranged like the sections of an orange 
(Fig. 19-25a, b ? c). The polypeptides that make up the 
stalk in the F : crystal structure are asymmetrically 
arranged, with one domain of the single y subunit mak- 
ing up a central shaft that passes through F 1; and an- 
other domain of y associated primarily with one of the 
three /3 subunits, designated /3-empty (Fig. 19-25c). Al- 
though the amino acid sequences of the three /3 subunits 
are identical, their conformations differ, in part be- 
cause of the association of the y subunit with just one of 
the three. The structures of the 8 and e subunits are not 
revealed in these crystallographic studies. 

The conformational differences among /3 sub- 
units extend to differences in their ATP/ADP-binding 
sites. When researchers crystallized the protein in the 
presence of ADP and App(NH)p, a close structural ana- 
log of ATP that cannot be hydrolyzed by the ATPase ac- 
tivity of F 1; the binding site of one of the three (3 



subunits was filled with App(NH)p, the second was 
filled with ADP, and the third was empty. The correspon- 
ding jS subunit conformations are designated /3-ATP, /3- 
ADP, and £-empty (Fig. 19-25c). This difference in 
nucleotide binding among the three subunits is critical 
to the mechanism of the complex. 
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FIGURE 19-25 Mitochondrial ATP synthase complex, (a) Structure of the 
Ft complex, deduced from crystallographic and biochemical studies. In 
Ft, three a and three p subunits are arranged like the segments of an or- 
ange, with alternating a (shades of gray) and 0 (shades of purple) subunits 
around a central shaft, the y subunit (green), (b) Crystal structure of 
bovine Ft (PDB ID 1 BMF), viewed from the side. Two a subunits and one 
(3 subunit have been omitted to reveal the central shaft (y subunit) and the 
binding sites for ATP and ADP on the /3 subunits. The 8 and e subunits are 
not shown here, (c) Ft viewed from above (that is, from the n side of the 
membrane), showing the three j6 and three a subunits and the central 
shaft {y subunit, green). Each p subunit, near its interface with the neigh- 
boring a subunit, has a nucleotide-binding site critical to the catalytic ac- 
tivity. The single y subunit associates primarily with one of the three a/3 
pairs, forcing each of the three j8 subunits into slightly different conforma- 
tions, with different nucleotide-binding sites. In the crystalline enzyme, 
one subunit (/3-ADP) has ADP (yellow) in its binding site, the next (j8-ATP) 
has ATP (red), and the third (/3-empty) has no bound nucleotide. 

(figure continued on next page) 
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FIGURE 19-25 (continued) (d) Side view of the t o r, structure. This is a 
composite, in which the crystallography coordinates of bovine mito- 
chondrial Ft (shades of purple and gray) have been combined with 
those of yeast mitochondrial F 0 (shades of yellow and orange) (PDB ID 
1Q01). Subunits a, b, 8, and s were not part of the crystal structure 
shown here, (e) The structure, viewed end-on in the direction p 
side to n side. The major structures visible in this cross section are the 
two transmembrane helices of each of 10 c subunits arranged in con- 

polypeptide, consisting almost entirely of two transmem- 
brane helices, with a small loop extending from the matrix 
side of the membrane. The crystal structure of the yeast 
FoF^ solved in 1999, shows the arrangement of the c sub- 
units. The yeast complex has 10 c subunits, each with two 
transmembrane helices roughly perpendicular to the plane 
of the membrane and arranged in two concentric circles 
(Fig. 19-25d, e). The inner circle is made up of the amino- 
terminal helices of each c subunit; the outer circle, about 
55 A in diameter, is made up of the carboxyl-terminal he- 
lices. The e and y subunits of F x form a leg-and-foot that 
projects from the bottom (membrane) side of F x and 
stands firmly on the ring of c subunits. The schematic 




(f) H+ 

centric circles, (f) Diagram of the F 0 Ft complex, deduced from bio- 
chemical and crystallographic studies. The two b subunits (b 2 ) of F G as- 
sociate firmly with the a and (3 subunits of F 1? holding them fixed 
relative to the membrane. In F Q/ the membrane-embedded cylinder of 
c subunits (c 10 ) is attached to the shaft made up of F T subunits y and e. 
As protons flow through the membrane from the p side to the n side 
through F Q , the cylinder and shaft rotate, and the j8 subunits of Ft 
change conformation as the y subunit associates with each in turn. 

drawing in Figure 19-25f combines the structural informa- 
tion from studies of bovine F x and yeast F^. 

Rotational Catalysis Is Key to the Binding-Change 
Mechanism for ATP Synthesis 

On the basis of detailed kinetic and binding studies of 
the reactions catalyzed by F 0 F!, Paul Boyer proposed a 
rotational catalysis mechanism in which the three 
active sites of F x take turns catalyzing ATP synthesis 
(Fig. 19-26). A given jS subunit starts in the /3-ADP con- 
formation, which binds ADP and P^ from the surrounding 
medium. The subunit now changes conformation, as- 
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suming the /3-ATP form that 
tightly binds and stabilizes 
ATP, bringing about the ready 
equilibration of ADP + P t with 
ATP on the enzyme surface. Fi- 
nally, the subunit changes to 
the /3-empty conformation, 
which has very low affinity for 
ATP, and the newly synthe- 
sized ATP leaves the enzyme 



Paul Boyer surface. Another round of 

catalysis begins when this sub- 
unit again assumes the /3-ADP form and binds ADP and 
Pi- 

The conformational changes central to this mechanism 
are driven by the passage of protons through the F G portion 
of ATP synthase. The streaming of protons through the F Q 
"pore" causes the cylinder of c subunits and the attached y 




FIGURE 19-26 Binding-change model for ATP synthase. The com- 
plex has three nonequivalent adenine nucleotide-binding sites, one 
for each pair of a and j8 subunits. At any given moment, one of these 
sites is in the /3-ATP conformation (which binds ATP tightly), a second 
is in the /3-ADP (loose-binding) conformation, and a third is in the /3- 
empty (very-loose-binding) conformation. The proton-motive force 
causes rotation of the central shaft — the y subunit, shown as a green 
arrowhead — which comes into contact with each a/3 subunit pair in 
succession. This produces a cooperative conformational change in 
which the /3-ATP site is converted to the jS-empty conformation, and 
ATP dissociates; the jS-ADP site is converted to the /3-ATP conforma- 
tion, which promotes condensation of bound ADP + Pj to form ATP; 
and the /3-empty site becomes a jS-ADP site, which loosely binds ADP + 
Pi entering from the solvent. This model, based on experimental find- 
ings, requires that at least two of the three catalytic sites alternate in 
activity; ATP cannot be released from one site unless and until ADP 
and Pj are bound at the other. 



subunit to rotate about the long axis of y, which is perpendi- 
cular to the plane of the membrane. The y subunit passes 
through the center of the spheroid, which is held sta- 
tionary relative to the membrane surface by the b 2 and 8 
subunits (Fig. 19-25f). With each rotation of 120°, y comes 
into contact with a different /3 subunit, and the contact 
forces that /B subunit into the /3-empty conformation. 

The three /3 subunits interact in such a way that 
when one assumes the /3-empty conformation, its neigh- 
bor to one side must assume the /3-ADP form, and the 
other neighbor the /3-ATP form. Thus one complete 
rotation of the y subunit causes each /3 subunit to cycle 
through all three of its possible conformations, and for 
each rotation, three ATP are synthesized and released 
from the enzyme surface. 

One strong prediction of this binding-change model 
is that the y subunit should rotate in one direction when 
F 0 F : is synthesizing ATP and in the opposite direction 
when the enzyme is hydrolyzing ATP This prediction 
was confirmed in elegant experiments in the laborato- 
ries of Masasuke Yoshida and Kazuhiko Kinosita, Jr. The 
rotation of y in a single F x molecule was observed micro- 
scopically by attaching a long, thin, fluorescent actin 
polymer to y and watching it move relative to a 3 /3 3 im- 
mobilized on a microscope slide, as ATP was hydrolyzed. 
When the entire F 0 Fx complex (not just FO was used in 
a similar experiment, the entire ring of c subunits ro- 
tated with y (Fig. 19-27). The "shaft" rotated in the 
predicted direction through 360°. The rotation was not 
smooth, but occurred in three discrete steps of 120°. As 
calculated from the known rate of ATP hydrolysis by one 
F 2 molecule and from the frictional drag on the long 
actin polymer, the efficiency of this mechanism in con- 
verting chemical energy into motion is close to 100%. It 
is, in Boyer's words, "a splendid molecular machine!" 

Chemiosmotic Coupling Allows Nonintegral 
Stoichiometries of 0 2 Consumption and ATP Synthesis 

Before the general acceptance of the chemiosmotic 
model for oxidative phosphorylation, the assumption 
was that the overall reaction equation would take the 
following form: 

xADP + xP i + \0 2 + H + + NADH ► 

xATP + H 2 0 + NAD + (19-11) 

with the value oix — sometimes called the P/O ratio or 
the P/2e" ratio — always an integer. When intact mito- 
chondria are suspended in solution with an oxidizable 
substrate such as succinate or NADH and are provided 
with 0 2 , ATP synthesis is readily measurable, as is the 
decrease in 0 2 . Measurement of P/O, however, is compli- 
cated by the fact that intact mitochondria consume ATP 
in many reactions taking place in the matrix, and they 
consume 0 2 for purposes other than oxidative phosphor- 
ylation. Most experiments have yielded P/O (ATP to 
|0 2 ) ratios of between 2 and 3 when NADH was the elec- 
tron donor, and between 1 and 2 when succinate was the 
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FIGURE 19-27 Experimental demonstration of rotation of F G and 7. Ft genetically 
engineered to contain a run of His residues adheres tightly to a microscope slide 
coated with a Ni complex; biotin is covalently attached to a c subunit of F 0 . The pro- 
tein avidin, which binds biotin very tightly, is covalently attached to long filaments 
of actin labeled with a fluorescent probe, Biotin-avidin binding now attaches the 
actin filaments to the c subunit. When ATP is provided as substrate for the ATPase 
activity of F 1; the labeled filament is seen to rotate continuously in one direction, 
proving that the F Q cylinder of c subunits rotates. In another experiment, a fluores- 
cent actin filament was attached directly to the y subunit. The series of fluorescence 
micrographs {read left to right) shows the position of the actin filament at intervals of 
133 ms. Note that as the filament rotates, it makes a discrete jump about every 
eleventh frame. Presumably the cylinder and shaft move as one unit. 
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donor. Given the assumption that P/O should have an in- 
tegral value, most experimenters agreed that the P/O ra- 
tios must be 3 for NADH and 2 for succinate, and for 
years those values appeared in research papers and 
textbooks. 

With introduction of the chemiosmotic paradigm for 
coupling ATP synthesis to electron transfer, there was no 
theoretical requirement for P/O to be integral. The rele- 
vant questions about stoichiometry became, How many 
protons are pumped outward by electron transfer from 
one NADH to 0 2 , and how many protons must flow in- 
ward through the F 0 Fi complex to drive the synthesis of 
one ATP? The measurement of proton fluxes is techni- 
cally complicated; the investigator must take into account 
the buffering capacity of mitochondria, nonproductive 
leakage of protons across the inner membrane, and use of 
the proton gradient for functions other than ATP synthe- 
sis, such as driving the transport of substrates across the 
inner mitochondrial membrane (described below). The 
consensus values for number of protons pumped out per 
pair of electrons are 10 for NADH and 6 for succinate. The 
most widely accepted experimental value for number of 
protons required to drive the synthesis of an ATP mole- 
cule is 4, of which 1 is used in transporting P b ATP, and 
ADP across the mitochondrial membrane (see below). If 
10 protons are pumped out per NADH and 4 must flow in 
to produce 1 ATP, the proton-based P/O ratio is 2,5 for 
NADH as the electron donor and 1.5 (6/4) for succinate. 
We use the P/O values of 2.5 and 1.5 throughout this 
book, but the values 3.0 and 2.0 are still common in the 
biochemical literature. The final word on proton stoi- 
chiometry will probably not be written until we know the 
full details of the F 0 Fi reaction mechanism. 

The Proton-Motive Force Energizes Active Transport 

Although the primary role of the proton gradient in mi- 
tochondria is to furnish energy for the synthesis of ATP, 
the proton-motive force also drives several transport 



processes essential to oxidative phosphorylation. The 
inner mitochondrial membrane is generally imperme- 
able to charged species, but two specific systems trans- 
port ADP and P^ into the matrix and ATP out to the 
cytosol (Fig. 19-28). 
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FIGURE 19-28 Adenine nucleotide and phosphate translocases. Trans- 
port systems of the inner mitochondrial membrane carry ADP and P, 
into the matrix and newly synthesized ATP into the cytosol, The adenine 
nucleotide translocase is an antiporter; the same protein moves ADP 
into the matrix and ATP out. The effect of replacing ATP 4- with ADP 3- 
in the matrix is the net efflux of one negative charge, which is favored 
by the charge difference across the inner membrane (outside positive). 
At pH 7, Pj is present as both HPC>4~ and H 2 P0 4 " ; the phosphate 
translocase is specific for H 2 PO^. There is no net flow of charge during 
symport of H 2 PC>4~ and H + , but the relatively low proton concentration 
in the matrix favors the inward movement of H + . Thus the proton-mo- 
tive force is responsible both for providing the energy for ATP synthesis 
and for transporting substrates (ADP and Pj) into and product (ATP) out 
of the mitochondrial matrix. All three of these transport systems can be 
isolated as a single membrane-bound complex (ATP synthasome). 
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The adenine nucleotide translocase, integral to 
the inner membrane, binds ADP 3 " in the intermem- 
brane space and transports it into the matrix in exchange 
for an ATP 4- molecule simultaneously transported 
outward (see Fig. 13-11 for the ionic forms of ATP and 
ADP). Because this antiporter moves four negative 
charges out for every three moved in, its activity is fa- 
vored by the transmembrane electrochemical gradient, 
which gives the matrix a net negative charge; the pro- 
ton-motive force drives ATP -ADP exchange. Adenine 
nucleotide translocase is specifically inhibited by 
atractyloside, a toxic glycoside formed by a species of 
thistle. If the transport of ADP into and ATP out of mito- 
chondria is inhibited, cytosolic ATP cannot be regener- 
ated from ADP, explaining the toxicity of atractyloside. 

A second membrane transport system essential to 
oxidative phosphorylation is the phosphate translo- 
case, which promotes symport of one H2PO4 and one 
H + into the matrix. This transport process, too, is favored 
by the transmembrane proton gradient (Fig. 19-28). 
Notice that the process requires movement of one 
proton from the p to the n side of the inner membrane, 



consuming some of the energy of electron transfer. A 
complex of the ATP synthase and both translocases, the 
ATP synthasome, can be isolated from mitochondria 
by gentle dissection with detergents, suggesting that the 
functions of these three proteins are very tightly 
integrated. 

Shuttle Systems Indirectly Convey Cytosolic NADH into 
Mitochondria for Oxidation 

The NADH dehydrogenase of the inner mitochondrial 
membrane of animal cells can accept electrons only from 
NADH in the matrix. Given that the inner membrane is not 
permeable to NADH, how can the NADH generated by gly- 
colysis in the cytosol be reoxidized to NAD + by 0 2 via the 
respiratory chain? Special shuttle systems carry reducing 
equivalents from cytosolic NADH into mitochondria by an 
indirect route. The most active NADH shuttle, which func- 
tions in liver, kidney, and heart mitochondria, is the 
malate-aspartate shuttle (Fig. 19-29). The reducing 
equivalents of cytosolic NADH are first transferred to cy- 
tosolic oxaioacetate to yield malate, catalyzed by cytosolic 
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FIGURE 19-29 Malate-aspartate shuttle. This shuttle for transporting 
reducing equivalents from cytosolic NADH into the mitochondrial 
matrix is used in liver, kidney, and heart. 0 NADH in the cytosol 
(intermembrane space) passes two reducing equivalents to oxaioac- 
etate, producing malate. (2) Malate crosses the inner membrane via 
the malate-a-ketoglutarate transporter. (3) In the matrix, malate 



passes two reducing equivalents to NAD + , and the resulting NADH is 
oxidized by the respiratory chain; the oxaioacetate formed from 
malate cannot pass directly into the cytosol. @ Oxaioacetate is first 
transaminated to aspartate, and (5) aspartate can leave via the gluta- 
mate-aspartate transporter. © Oxaioacetate is regenerated in the 
cytosol, completing the cycle. 
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malate dehydrogenase. The malate thus formed passes 
through the inner membrane via the malate-a-ketoglu- 
tarate transporter. Within the matrix the reducing equiva- 
lents are passed to NAD + by the action of matrix malate 
dehydrogenase, forming NADH; this NADH can pass elec- 
trons directly to the respiratory chain. About 2.5 mole- 
cules of ATP are generated as this pair of electrons passes 
to 0 2 . Cytosolic oxaloacetate must be regenerated by 
transamination reactions and the activity of membrane 
transporters to start another cycle of the shuttle. 

Skeletal muscle and brain use a different NADH shut- 
tle, the glycerol 3-phosphate shuttle (Fig. 19-30). It 
differs from the malate-aspartate shuttle in that it deliv- 
ers the reducing equivalents from NADH to ubiquinone 
and thus into Complex III, not Complex I (Fig. 19-8), pro- 
viding only enough energy to synthesize 1.5 ATP mole- 
cules per pair of electrons. 

The mitochondria of plants have an externally ori- 
ented NADH dehydrogenase that can transfer electrons 
directly from cytosolic NADH into the respiratory chain 
at the level of ubiquinone. Because this pathway 
bypasses the NADH dehydrogenase of Complex I and 
the associated proton movement, the yield of ATP from 
cytosolic NADH is less than that from NADH generated 
in the matrix (Box 19-1). 

Intermembrane Glycolysis 
space (p side) 




FIGURE 19-30 Glycerol 3-phosphate shuttle. This alternative means of 
moving reducing equivalents from the cytosol to the mitochondrial ma- 
trix operates in skeletal muscle and the brain. In the cytosol, dihydrox- 
yacetone phosphate accepts two reducing equivalents from NADH in a 
reaction catalyzed by cytosolic glycerol 3-phosphate dehydrogenase. 
An isozyme of glycerol 3-phosphate dehydrogenase bound to the outer 
face of the inner membrane then transfers two reducing equivalents 
from glycerol 3-phosphate in the intermembrane space to ubiquinone. 
Note that this shuttle does not involve membrane transport systems. 



SUMMARY 19.2 ATP Synthesis 

■ The flow of electrons through Complexes I, III, and 
IV results in pumping of protons across the inner 
mitochondrial membrane, making the matrix 
alkaline relative to the intermembrane space. This 
proton gradient provides the energy (in the form of 
the proton-motive force) for ATP synthesis from 
ADP and by ATP synthase (F Q F 1 complex) in the 
inner membrane. 

■ ATP synthase carries out "rotational catalysis," in 
which the flow of protons through F G causes each 
of three nucleotide-binding sites in F : to cycle 
from (ADP + PJ-bound to ATP-bound to empty 
conformations. 

■ ATP formation on the enzyme requires little 
energy; the role of the proton-motive force is to 
push ATP from its binding site on the synthase. 

■ The ratio of ATP synthesized per \0 2 reduced to 
H 2 0 (the P/O ratio) is about 2.5 when electrons 
enter the respiratory chain at Complex I, and 1.5 
when electrons enter at ubiquinone. 

■ Energy conserved in a proton gradient can drive 
solute transport uphill across a membrane. 

■ The inner mitochondrial membrane is impermeable 
to NADH and NAD + , but NADH equivalents are 
moved from the cytosol to the matrix by either of 
two shuttles. NADH equivalents moved in by the 
malate-aspartate shuttle enter the respiratory chain 
at Complex I and yield a P/O ratio of 2.5; those 
moved in by the glycerol 3-phosphate shuttle enter 
at ubiquinone and give a P/O ratio of 1.5. 

19.3 Regulation of Oxidative 
Phosphorylation 

Oxidative phosphorylation produces most of the ATP 
made in aerobic cells. Complete oxidation of a molecule 
of glucose to C0 2 yields 30 or 32 ATP (Table 19-5). By 
comparison, glycolysis under anaerobic conditions (lac- 
tate fermentation) yields only 2 ATP per glucose. 
Clearly, the evolution of oxidative phosphorylation pro- 
vided a tremendous increase in the energy efficiency of 
catabolism. Complete oxidation to C0 2 of the coenzyme 
A derivative of palmitate (16:0) , which also occurs in the 
mitochondrial matrix, yields 108 ATP per palmitoyl-CoA 
(see Table 17-1). A similar calculation can be made for 
the ATP yield from oxidation of each of the amino acids 
(Chapter 18). Aerobic oxidative pathways that result in 
electron transfer to 0 2 accompanied by oxidative phos- 
phorylation therefore account for the vast majority of 
the ATP produced in catabolism, so the regulation 
of ATP production by oxidative phosphorylation to 
match the cell's fluctuating needs for ATP is absolutely 
essential. 
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TABLE 19-5 



ATP Yield from Complete Oxidation of Glucose 



Process 


Direct product 


Final ATP 


Glycolysis 


2 NADH (cytosolic) 


3 or 5* 




2 ATP 


2 


Pyruvate oxidation (two per glucose) 


2 NADH (mitochondrial matrix) 


5 


Acetyl-CoA oxidation in citric acid cycle 


6 NADH (mitochondrial matrix) 


15 


(two per glucose) 


2 FADH 2 


3 




2 ATP or 2 GTP 


2 


Total yield per glucose 




30 or 32 



*The number depends on which shuttle system transfers reducing equivalents into the mitochondrion. 



Oxidative Phosphorylation is Regulated by Cellular 
Energy Needs 

The rate of respiration (0 2 consumption) in mitochon- 
dria is tightly regulated; it is generally limited by the 
availability of ADP as a substrate for phosphorylation. 
Dependence of the rate of 0 2 consumption on the avail- 
ability of the ^ acceptor ADP (Fig. 19-20b), the accep- 
tor control of respiration, can be remarkable. In some 
animal tissues, the acceptor control ratio, the ratio of 
the maximal rate of ADP-induced 0 2 consumption to 
the basal rate in the absence of ADP, is at least 10. 

The intracellular concentration of ADP is one measure 
of the energy status of cells. Another, related measure is 
the mass-action ratio of the ATP-ADP system, 
[ATP]/([ADP][PJ). Normally this ratio is very high, so 
the ATP-ADP system is almost fully phosphorylated. 
When the rate of some energy-requiring process (pro- 
tein synthesis, for example) increases, the rate of break- 
down of ATP to ADP and Pj increases, lowering the 
mass-action ratio. With more ADP available for oxidative 
phosphorylation, the rate of respiration increases, caus- 
ing regeneration of ATP. This continues until the mass- 
action ratio returns to its normal high level, at which 
point respiration slows again. The rate of oxidation of 
cellular fuels is regulated with such sensitivity and pre- 
cision that the [ATP]/([ADP][PJ) ratio fluctuates only 
slightly in most tissues, even during extreme variations 
in energy demand. In short, ATP is formed only as fast as 
it is used in energy-requiring cellular activities. 

An Inhibitory Protein Prevents ATP Hydrolysis 
during Hypoxia 

We have already encountered ATP synthase as an ATP- 
driven proton pump (see Fig. 11-39), catalyzing the re- 
verse of ATP synthesis. When a cell is hypoxic (deprived 
of oxygen), as in a heart attack or stroke, electron trans- 
fer to oxygen slows, and so does the pumping of protons. 
The proton-motive force soon collapses. Under these 
conditions, the ATP synthase could operate in reverse, 
hydrolyzing ATP to pump protons outward and causing a 
disastrous drop in ATP levels. This is prevented by a small 




FIGURE 19-31 Structure of bovine FvATPase in a complex with its 
regulatory protein IF-,. (Derived from PDB ID 10HH) Two F A mole- 
cules are viewed here as in Figure 1 9-25c. The inhibitor IF-| (red) binds 
to the a/3 interface of the subunits in the diphosphate (ADP) conforma- 
tion (a-ADP and /3-ADP), freezing the two Ft complexes and thereby 
blocking ATP hydrolysis (and synthesis). (Parts of IFt that failed to re- 
solve in the crystals of Ft are shown in white outline as they occur in 
crystals of isolated .) This complex is stable only at the low cytoso- 
lic pH characteristic of cells that are producing ATP by glycolysis; 
when aerobic metabolism resumes, the cytosolic pH rises, the in- 
hibitor is destabilized, and ATP synthase becomes active. 

(84 amino acids) protein inhibitor, IF 1; which simultane- 
ously binds to two ATP synthase molecules, inhibiting 
their ATPase activity (Fig. 19-31). IF : is inhibitory only 
in its dimeric form, which is favored at pH lower than 6.5. 
In a cell starved for oxygen, the main source of ATP be- 
comes glycolysis, and the pyruvic or lactic acid thus 
formed lowers the pH in the cytosol and the mitochondr- 
ial matrix. This favors IF X dimerization, leading to inhibi- 
tion of the ATPase activity of ATP synthase and thereby 
preventing wasteful hydrolysis of ATP. When aerobic me- 
tabolism resumes, production of pyruvic acid slows, the 
pH of the cytosol rises, the IF 2 dimer is destabilized, and 
the inhibition of ATP synthase is lifted. 

Hypoxia Leads to ROS Production and Several Adaptive 
Responses 

In hypoxic cells there is an imbalance between the input 
of electrons from fuel oxidation in the mitochondrial 
matrix and transfer of electrons to molecular oxygen, 
leading to increased formation of reactive oxygen 
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FIGURE 19-32 Hypoxia-inducible factor (HIF-1) regulates gene ex- 
pression to reduce ROS formation. Under conditions of low oxygen 
(hypoxia), HIF-1 is synthesized in greater amounts and acts as a tran- 
scription factor, increasing the synthesis of glucose transporter, gly- 
colytic enzymes, pyruvate dehydrogenase kinase {PDH kinase), lactate 
dehydrogenase, a protease that degrades the cytochrome oxidase 

species. In addition to the glutathione peroxidase sys- 
tem (Fig. 19-18), cells have two other lines of defense 
against ROS (Fig. 19-32). One is regulation of pyruvate 
dehydrogenase (PDH), the enzyme that delivers acetyl- 
CoA to the citric acid cycle (Chapter 16). Under hypoxic 
conditions, PDH kinase phosphorylates mitochondrial 
PDH, inactivating it and slowing the delivery of FADH 2 
and NADH from the citric acid cycle to the respiratory 
chain. A second means of preventing ROS formation is 
the replacement of one subunit of Complex IV, known as 
COX4-1, with another subunit, COX4-2, that is better 
suited to hypoxic conditions. With COX4-1, the catalytic 
properties of Complex IV are optimal for respiration at 
normal oxygen concentrations; with COX4-2, Complex 
IV is optimized for operation under hypoxic conditions. 

The changes in PDH activity and the COX4-2 content 
of Complex IV are both mediated by HIF-1, the hypoxia- 
inducible factor. HIF-1 accumulates in hypoxic cells and, 
acting as a transcription factor, triggers increased synthe- 
sis of PDH kinase, COX4-2, and a protease that degrades 
COX4-1. Recall that HIF-1 also mediates the changes in 



subunit COX4-1, and cytochrome oxidase subunit COX4-2. These 
changes counter the formation of ROS by decreasing the supply of 
NADH and FADH 2 and making cytochrome oxidase of Complex IV 
more effective. Thick gray arrows signify reactions stimulated by HIF-1; 
thin, broken arrows show reactions slowed by HIF-1 . 



glucose transport and glycolytic enzymes that produce 
the Pasteur effect (see Box 14-1). 

BWhen these mechanisms for dealing with ROS are 
insufficient, due to genetic mutation affecting 
one of the protective proteins or under conditions of 
very high rates of ROS production, mitochondrial func- 
tion is compromised. Mitochondrial damage is thought 
to be involved in aging, heart failure, certain rare cases 
of diabetes (described below), and several maternally in- 
herited genetic diseases that affect the nervous system. ■ 

ATP-Producing Pathways Are Coordinately Regulated 

The major catabolic pathways have interlocking and 
concerted regulatory mechanisms that allow them to 
function together in an economical and self-regulating 
manner to produce ATP and biosynthetic precursors. 
The relative concentrations of ATP and ADP control 
not only the rates of electron transfer and oxidative 
phosphorylation but also the rates of the citric acid cy- 
cle, pyruvate oxidation, and glycolysis (Fig. 19-33). 
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FIGURE 19-33 Regulation of the ATP-producing pathways. This dia- 
gram shows the interlocking regulation of glycolysis, pyruvate oxida- 
tion, the citric acid cycle, and oxidative phosphorylation by the relative 
concentrations of ATP, ADP, and AMP, and by NADH. High [ATP] (or 
low [ADP] and [AMP]) produces low rates of glycolysis, pyruvate oxi- 
dation, acetate oxidation via the citric acid cycle, and oxidative phos- 
phorylation. All four pathways are accelerated when the use of ATP and 
the formation of ADP, AMP, and Pj increase. The interlocking of glycol- 
ysis and the citric acid cycle by citrate, which inhibits glycolysis, sup- 
plements the action of the adenine nucleotide system. In addition, 
increased levels of NADH and acetyl-CoA also inhibit the oxidation of 
pyruvate to acetyl-CoA, and a high [NADH]/[NAD + ] ratio inhibits the 
dehydrogenase reactions of the citric acid cycle (see Fig. 16-18). 



Whenever ATP consumption increases, the rate of 
electron transfer and oxidative phosphorylation in- 
creases. Simultaneously, the rate of pyruvate oxidation 
via the citric acid cycle increases, increasing the flow 
of electrons into the respiratory chain. These events 
can in turn evoke an increase in the rate of glycolysis, 
increasing the rate of pyruvate formation. When con- 
version of ADP to ATP lowers the ADP concentration, 
acceptor control slows electron transfer and thus ox- 
idative phosphorylation. Glycolysis and the citric acid 
cycle are also slowed, because ATP is an allosteric in- 
hibitor of the glycolytic enzyme phosphofructokinase-1 
(see Fig. 15-14) and of pyruvate dehydrogenase (see 
Fig. 16-18). 

Phosphofructokinase-1 is also inhibited by citrate, 
the first intermediate of the citric acid cycle. When the 
cycle is "idling," citrate accumulates within mitochon- 
dria, then is transported into the cytosol. When the con- 
centrations of both ATP and citrate rise, they produce a 
concerted allosteric inhibition of phosphofructokinase-1 
that is greater than the sum of their individual effects, 
slowing glycolysis. 
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Regulation of Oxidative 
Phosphorylation 



■ Oxidative phosphorylation is regulated by cellular 
energy demands. The intracellular [ADP] and the 
mass-action ratio [ATP]/([ADP][PJ) are measures of 
a cell's energy status. 

■ In hypoxic (oxygen-deprived) cells, a protein 
inhibitor blocks ATP hydrolysis by the reverse 
activity of ATP synthase, preventing a drastic drop 
in [ATP]. 

■ The adaptive responses to hypoxia, mediated by 
HIF-1, slow electron transfer into the respiratory 
chain and modify Complex IV to act more 
efficiently under low-oxygen conditions. 

■ ATP and ADP concentrations set the rate of 
electron transfer through the respiratory chain via 
a series of interlocking controls on respiration, 
glycolysis, and the citric acid cycle. 

1 9.4 Mitochondria in Thermogenesis, Steroid 
Synthesis, and Apoptosis 

Although ATP production is a central role for the mito- 
chondrion, this organelle has other functions that, in 
specific tissues or under specific circumstances, are also 
crucial. In adipose tissue, mitochondria generate heat to 
protect vital organs from low ambient temperature; in 
the adrenal glands and the gonads, mitochondria are 
the sites of steroid hormone synthesis; and in most or 
all tissues they are key participants in apoptosis 
(programmed cell death). 
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Uncoupled Mitochondria in Brown Adipose Tissue 
Produce Heat 

We noted above that respiration slows when the cell is 
adequately supplied with ATP. There is a remarkable 
and instructive exception to this general rule. Most new- 
born mammals, including humans, have a type of adi- 
pose tissue called brown adipose tissue (BAT; p. 917) 
in which fuel oxidation serves, not to produce ATP, but 
to generate heat to keep the newborn warm. This spe- 
cialized adipose tissue is brown because of the presence 
of large numbers of mitochondria and thus high concen- 
trations of cytochromes, with heme groups that are 
strong absorbers of visible light. 

The mitochondria of brown adipocytes are much 
like those of other mammalian cells, except in having a 
unique protein in their inner membrane. Thermogenin, 
also called the uncoupling protein (the product of the 
UCP1 gene), provides a path for protons to return to 
the matrix without passing through the F 0 Fx complex 
(Fig. 19-34). As a result of this short-circuiting of pro- 
tons, the energy of oxidation is not conserved by ATP 
formation but is dissipated as heat, which contributes to 
maintaining the body temperature (see Fig. 23-17). Hi- 
bernating animals also depend on the activity of uncou- 
pled BAT mitochondria to generate heat during their 
long dormancy (see Box 17-1). We will return to the 




FIGURE 19-34 Heat generation by uncoupled mitochondria. The un- 
coupling protein (thermogenin) in the mitochondria of brown adipose 
tissue, by providing an alternative route for protons to reenter the mi- 
tochondrial matrix, causes the energy conserved by proton pumping to 
be dissipated as heat. 



role of thermogenin when we discuss the regulation of 
body mass in Chapter 23 (pp. 931-932). 

Mitochondrial P-450 Oxygenases Catalyze Steroid 
Hydroxylations 

Mitochondria are the site of biosynthetic reactions that 
produce steroid hormones, including the sex hormones, 
glucocorticoids, mineralocorticoids, and vitamin D hor- 
mone. These compounds are synthesized from choles- 
terol or a related sterol in a series of hydroxylations 
catalyzed by enzymes of the cytochrome P-450 family, 
all of which have a critical heme group (its absorption at 
450 nm gives this family its name). In the hydroxylation 
reactions, one atom of molecular oxygen is incorporated 
into the substrate and the second is reduced to H 2 0: 

R — H + 0 2 + NADPH > R— OH + H 2 0 + NADP + 

There are dozens of P-450 enzymes, all situated in the 
inner mitochondrial membrane with their catalytic site 
exposed to the matrix. Steroidogenic cells are packed 
with mitochondria specialized for steroid synthesis; the 
mitochondria are generally larger than those in other 
tissues and have more extensive and highly convoluted 
inner membranes (Fig. 19-35). 

The path of electron flow in the mitochondrial P-450 
system is complex, involving a flavoprotein and an iron- 
sulfur protein that carry electrons from NADPH to the 
P-450 heme (Fig. 19-36). The detailed structure of a 
P-450 enzyme confers its substrate specificity, and its 
heme group, which interacts directly with 0 2 , anchors 
the catalytic activity shared by all P-450 enzymes. 

I Another large family of P-450 enzymes is found in 
BH the endoplasmic reticulum of hepatocytes. These 
enzymes catalyze reactions similar to the mitochondrial 
P-450 reactions, but their substrates include a wide va- 
riety of hydrophobic compounds, many of which are 
xenobiotics — compounds not found in nature but 
synthesized industrially The P-450 enzymes of the ER 
have very broad and overlapping substrate specificities. 




FIGURE 19-35 Mitochondria of adrenal gland, specialized for steroid 
synthesis. As seen in this electron micrograph of a thin section of adre- 
nal gland, the mitochondria are profuse and have extensive cristae, pro- 
viding a large surface for the P-450 enzymes of the inner membrane. 
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FIGURE 19-36 Path of electron flow in mitochondrial cytochrome P- 
450 reactions in adrenal gland. Two electrons are transferred from 
NADPH to the FAD-containing fiavoprotein adrenodoxin reductase, 
which passes the electrons, one at a time, to adrenodoxin, a small, 



soluble 2Fe-2S protein. Adrenodoxin passes single electrons to the cy- 
tochrome P-450 hydroxylase, which interacts directly with 0 2 and the 
substrate (R— H) to form the products, H 2 0 and R— OH. 



Hydroxylation of the hydrophobic compounds makes 
them more water soluble, and they can then be cleared 
by the kidneys and excreted in urine. Among the sub- 
strates for these P-450 oxygenases are many commonly 
used prescription drugs. Metabolism by P-450 enzymes 
limits the drugs' lifetime in the bloodstream and their 
therapeutic effects. Humans differ in their genetic com- 
plement of P-450 enzymes in the ER, and in the extent 
to which certain P-450 enzymes have been induced, 
such as by a history of ethanol ingestion. In principle, 
therefore, an individual's genetics and personal history 
could figure into determinations of therapeutic drug 
dose; in practice, this precise tailoring of dosage is not 
yet economically feasible, but it may become so. ■ 

Mitochondria Are Central to the Initiation of Apoptosis 

Apoptosis, also called programmed cell death, is a 

process in which individual cells die for the good of the 
organism (for example, in the course of normal embry- 
onic development), and the organism conserves the 
cells' molecular components (amino acids, nucleotides, 
and so forth) . Apoptosis may be triggered by an external 
signal, acting at a plasma membrane receptor, or by in- 
ternal events such as a DNA damage, viral infection, ox- 
idative stress from the accumulation of ROS, or another 
stress such as a heat shock. 

Mitochondria play a critical role in triggering apop- 
tosis. When a stressor gives the signal for cell death, one 
early consequence is an increase in the permeability of 
the outer mitochondrial membrane, allowing cy- 
tochrome c to escape from the intermembrane space 
into the cytosol (Fig. 19-37). The increased permeabil- 
ity is due to the opening of the permeability transi- 
tion pore complex (PTPC), a multisubunit protein in 
the outer membrane; its opening and closing are af- 
fected by several proteins that stimulate or suppress 
apoptosis. When released into the cytosol, cytochrome c 

FIGURE 19-37 Role of cytochrome c in apoptosis. Cytochrome c is a 
small, soluble, mitochondrial protein, located in the intermembrane 
space, that carries electrons between Complex III and Complex IV dur- 
ing respiration. In a completely separate role, as outlined here, it acts 
as a trigger for apoptosis by stimulating the activation of a family of 
proteases called caspases. 
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interacts with monomers of the protein Apaf-1 (apop- 
tosis protease activating factor-1), causing the for- 
mation of an apoptosome composed of seven Apaf-1 
and seven cytochrome c molecules. The apoptosome 
provides the platform on which the protease procaspase- 
9 is activated to caspase-9, a member of a family of 
highly specific proteases (the caspases) involved in 
apoptosis. They share a critical Cys residue at their ac- 
tive site, and all cleave proteins only on the carboxyl- 
terminal side of Asp residues, thus the name "caspases." 
Activated caspase-9 initiates a cascade of proteolytic 
activations, with one caspase activating a second, and it 
in turn activating a third, and so forth (see Fig. 12-51). 
(This role of cytochrome c in apoptosis is a clear case of 
"moonlighting," in that one protein plays two very differ- 
ent roles in the cell; see Box 16—1,) 
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Mitochondria in 
Thermogenesis, Steroid 
Synthesis, and Apoptosis 



In the brown adipose tissue of newborns, electron 
transfer is uncoupled from ATP synthesis and 
the energy of fuel oxidation is dissipated as 
metabolic heat. 

Hydroxylation reaction steps in the synthesis of 
steroid hormones in steroidogenic tissues (adrenal 
gland, gonads, liver, and kidney) take place in 
specialized mitochondria. 
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■ Mitochondrial cytochrome c, released into the 
cytosol, participates in activation of caspase-9, one 
of the proteases involved in apoptosis. 

19.5 Mitochondrial Genes: Their Origin and 
the Effects of Mutations 

Mitochondria contain their own genome, a circular, 
double-stranded DNA (mtDNA) molecule. Each of the 
hundreds or thousands of mitochondria in a typical cell 
has about five copies of this genome. The human mito- 
chondrial chromosome (Fig. 19-38) contains 37 
genes (16,569 bp), including 13 that encode subunits 
of proteins of the respiratory chain (Table 19-6); the 
remaining genes code for rRNA and tRNA molecules 
essential to the protein-synthesizing machinery of mi- 
tochondria. The great majority of mitochondrial pro- 
teins — about 900 different types — are encoded by 
nuclear genes, synthesized on cytoplasmic ribosomes, 
then imported into and assembled in the mitochondria 
(Chapter 27). 



FIGURE 19-38 Mitochondrial genes and mutations, (a) Map of 

human mitochondrial DNA, showing the genes that encode 
proteins of Complex I, the NADH dehydrogenase {ND1 to ND6); the 
cytochrome b of Complex III {Cyt b); the subunits of cytochrome oxi- 
dase (Complex IV) (CO/ to COM); and two subunits of ATP synthase 
{ATPase6 and ATPase8). The colors of the genes correspond to those of 
the complexes shown in Figure 1 9-7. Also included here are the genes 
for ribosomal RNAs (rRNA) and for some mitochondrion-specific trans- 
fer RNAs; tRNA specificity is indicated by the one-letter codes for 
amino acids. Arrows indicate the positions of mutations that cause 
Leber's hereditary optic neuropathy (LHON) and myoclonic epilepsy 
and ragged-red fiber disease (MERRF). Numbers in parentheses in- 
dicate the position of the altered nucleotides (nucleotide 1 is at 
the top of the circle and numbering proceeds counterclock- 
wise), (b) Electron micrograph of an abnormal mitochon- 
drion from the muscle of an individual with MERRF, 
showing the paracrystalline protein inclusions sometimes 
present in the mutant mitochondria. 
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Respiratory Proteins Encoded by Mitochondrial Genes in Humans 



Number Number of subpjfts encoded 
Complex of subunits by mitochondrial DNA 



I NADH dehydrogenase 


43 


7 


II Succinate dehydrogenase 


4 


0 


III Ubiquinone:cytochrome c oxidoreductase 


11 


1 


IV Cytochrome oxidase 


13 


3 


V ATP synthase 


8 


2 



Mitochondria Evolved from Endosymbiotic 
Bacteria 

The existence of mitochondrial DNA, ribosomes, and 
tRNAs supports the hypothesis of the endosymbiotic 
origin of mitochondria (see Fig. 1-36), which holds that 
the first organisms capable of aerobic metabolism, in- 
cluding respiration-linked ATP production, were bacte- 
ria. Primitive eukaryotes that lived anaerobically (by 
fermentation) acquired the ability to carry out oxidative 
phosphorylation when they established a symbiotic rela- 
tionship with bacteria living in their cytosol. After much 
evolution and the movement of many bacterial genes 
into the nucleus of the "host" eukaryote, the endosymbi- 
otic bacteria eventually became mitochondria. 

This hypothesis presumes that early free-living bac- 
teria had the enzymatic machinery for oxidative phos- 
phorylation and predicts that their modern bacterial 
descendants must have respiratory chains closely simi- 
lar to those of modern eukaryotes. They do. Aerobic 
bacteria carry out NAD-linked electron transfer from 
substrates to 0 2 , coupled to the phosphorylation of cy- 
tosolic ADP. The dehydrogenases are located in the bac- 
terial cytosol and the respiratory chain in the plasma 
membrane. The electron carriers translocate protons 
outward across the plasma membrane as electrons are 
transferred to 0 2 . Bacteria such as Escherichia coli 
have F 0 F X complexes in their plasma membranes; the F : 
portion protrudes into the cytosol and catalyzes ATP 
synthesis from ADP and Pi as protons flow back into the 
cell through the proton channel of F D . 

The respiration-linked extrusion of protons across 
the bacterial plasma membrane also provides the driving 
force for other processes. Certain bacterial transport 
systems bring about uptake of extracellular nutrients 
(lactose, for example) against a concentration gradient, 
in symport with protons (see Fig. 11-42). And the ro- 
tary motion of bacterial flagella is provided by "proton 
turbines," molecular rotary motors driven not by ATP 
but directly by the transmembrane electrochemical po- 
tential generated by respiration-linked proton pumping 
(Fig. 19-39). It seems likely that the chemiosmotic 
mechanism evolved early, before the emergence of eu- 
karyotes. 



Mutations in Mitochondrial DNA Accumulate throughout 
the Life of the Organism 

The respiratory chain is the major producer of reactive 
oxygen species in cells, so mitochondrial contents, in- 
cluding the mitochondrial genome, suffer the greatest 
exposure to, and damage by, ROS. Moreover, the mito- 
chondrial DNA replication system is less effective than 
the nuclear system at correcting mistakes made during 
replication and at repairing DNA damage. As a conse- 
quence of these two factors, defects in mtDNA accumu- 
late over time. One theory of aging is that this gradual 
accumulation of defects with increasing age is the pri- 
mary cause of many of the "symptoms" of aging, which 
include, for example, progressive weakening of skeletal 
and heart muscle. 

A unique feature of mitochondrial inheritance is 
the variation among individual cells, and between one 
individual organism and another, in the effects of a 
mtDNA mutation. A typical cell has hundreds or thou- 




FIGURE 19-39 Rotation of bacterial flagella by proton-motive force. 

The shaft and rings at the base of the flagellum make up a rotary mo- 
tor that has been called a "proton turbine." Protons ejected by electron 
transfer flow back into the cell through the turbine, causing rotation 
of the shaft of the flagellum. This motion differs fundamentally from the 
motion of muscle and of eukaryotic flagella and cilia, for which ATP 
hydrolysis is the energy source. 



740~j Oxidative Phosphorylation and Photophosphorylation 




FIGURE 19-40 Single cells contain many mitochondria. A typical ani- 
mal cell has hundreds or thousands of mitochondria, some fraction of 
which may contain genomes with mutations that affect mitochondrial 
function. This ovine (sheep) kidney epithelial cell was cultured in the 
laboratory, fixed, and then stained with fluorescent probes that show 
mitochondria as gold, microfilaments of actin red, and nuclei green 
when observed in the fluorescence microscope. 



and undergoes the many divisions of embryonic develop- 
ment, the resulting somatic cells differ in their propor- 
tion of mutant mitochondria (Fig. 19-41a). (Keep in 
mind that a developing embryo derives all of its mito- 
chondria from the egg, none from the sperm cell.) This 
heteroplasmy (in contrast to homoplasmy, in which 
every mitochondrial genome in every cell is the same) 
results in mutant phenotypes of varying degrees of 
severity. Cells (and tissues) containing mostly wild-type 
mitochondria have the wild-type phenotype; they are es- 
sentially normal. Other heteroplasmic cells will have in- 
termediate phenotypes, some almost normal, others 
(with a high proportion of mutant mitochondria) abnor- 
mal (Fig. 19-41b). If the abnormal phenotype is associ- 
ated with a disease (see below), individuals with the 
same mtDNA mutation may have disease symptoms of 
differing severity — depending on the number and dis- 
tribution of affected mitochondria. 



sands of mitochondria, each with its own genome copy 
(Fig. 19-40). Suppose that, in a female organism, 
damage to one mitochondrial genome occurs in a germ 
cell from which oocytes develop, such that the germ 
cell contains mainly mitochondria with wild-type genes 
but one mitochondrion with a mutant gene. During the 
course of oocyte maturation, as this germ cell and its 
descendants repeatedly divide, the defective mitochon- 
drion replicates and its progeny, all defective, are ran- 
domly distributed to daughter cells. Eventually, the 
mature egg cells contain different proportions of the 
defective mitochondria. When an egg cell is fertilized 



Some Mutations in Mitochondrial Genomes Cause Disease 

A growing number of human diseases have been 
attributed to mutations in mitochondrial genes 
that reduce the cell's capacity to produce ATP. Some tis- 
sues and cell types — neurons, myocytes of both skeletal 
and cardiac muscle, and /3 cells of the pancreas — are 
less able than others to tolerate lowered ATP production 
and are therefore more affected by mutations in mito- 
chondrial proteins. 

A group of genetic diseases known as the mito- 
chondrial encephalomyopathies affect primarily the 
brain and skeletal muscle. These diseases are invariably 
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FIGURE 19-41 Heteroplasmy in mitochondrial genomes, (a) When a 
mature egg cell is fertilized, all of the mitochondria in the resulting 
diploid cell (zygote) are maternal; none come from the sperm. If some 
fraction of the maternal mitochondria have a mutant gene, the random 
distribution of mitochondria during subsequent cell divisions yields 
some daughter cells with mostly mutant mitochondria, some with 
mostly wild-type mitochondria, and some in between; thus the daughter 
cells show varying degrees of heteroplasmy. (b) Different degrees of 
heteroplasmy produce different cellular phenotypes. This section of 
human muscle tissue is from an individual with defective cytochrome 
oxidase. The cells have been stained to make wild-type cells blue and 
cells with mutant cytochrome oxidase brown. As the micrograph 
shows, different cells in the same tissue are affected to different 
degrees by the mitochondrial mutation. 
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inherited from the mother, because, as noted above, a 
developing embryo derives all its mitochondria from the 
egg. The rare disease Leber's hereditary optic neu- 
ropathy (LHON) affects the central nervous system, 
including the optic nerves, causing bilateral loss of vi- 
sion in early adulthood. A single base change in the mi- 
tochondrial gene ND4 (Fig. 19-38a) changes an Arg 
residue to a His residue in a polypeptide of Complex I, 
and the result is mitochondria partially defective in elec- 
tron transfer from NADH to ubiquinone. Although these 
mitochondria can produce some ATP by electron trans- 
fer from succinate, they apparently cannot supply suffi- 
cient ATP to support the very active metabolism of 
neurons. One result is damage to the optic nerve, lead- 
ing to blindness. A single base change in the mitochon- 
drial gene for cytochrome 5, a component of Complex 
III, also produces LHON, demonstrating that the pathol- 
ogy results from a general reduction of mitochondrial 
function, not specifically from a defect in electron trans- 
fer through Complex I. 

A mutation (in ATP6) that affects the proton pore 
in ATP synthase leads to low rates of ATP synthesis 
while leaving the respiratory chain intact. The oxidative 
stress due to the continued supply of electrons from 
NADH increases the production of ROS, and the damage 
to mitochondria caused by ROS sets up a vicious cycle. 
Half of individuals with this mutant gene die within days 
or months of birth. 

Myoclonic epilepsy and ragged-red fiber dis- 
ease (MERRF) is caused by a mutation in the mito- 
chondrial gene that encodes a tRNA specific for lysine 
(tRNA Lys ). This disease, characterized by uncontrol- 
lable muscular jerking, apparently results from defec- 
tive production of several of the proteins that require 
mitochondrial tRNAs for their synthesis. Skeletal 
muscle fibers of individuals with MERRF have abnor- 
mally shaped mitochondria that sometimes contain 
paracrystalline structures (Fig. 19-38b). Other muta- 
tions in mitochondrial genes are believed to be re- 
sponsible for the progressive muscular weakness that 
characterizes mitochondrial myopathy and for en- 
largement and deterioration of the heart muscle in 
hypertrophic cardiomyopathy. According to one hy- 
pothesis on the progressive changes that accompany 
aging, the accumulation of mutations in mtDNA dur- 
ing a lifetime of exposure to DNA-damaging agents 
such as 'O2" results in mitochondria that cannot sup- 
ply sufficient ATP for normal cellular function. Mito- 
chondrial disease can also result from mutations in 
any of the 900 nuclear genes that encode mitochondr- 
ial proteins. ■ 

Diabetes Can Result from Defects in the Mitochondria of 
Pancreatic £5 Cells 

The mechanism that regulates the release of in- 
sulin from pancreatic /3 cells hinges on the ATP 
concentration in those cells. When blood glucose is high, 
/3 cells take up glucose and oxidize it by glycolysis and 



the citric acid cycle, raising [ATP] above a threshold 
level (Fig. 19-42). When [ATP] exceeds this threshold, 
an ATP-gated K + channel in the plasma membrane 
closes, depolarizing the membrane and triggering in- 
sulin release (see Fig. 23-28). Pancreatic /3 cells with 
defects in oxidative phosphorylation cannot increase 
[ATP] above this threshold, and the resulting failure of 
insulin release effectively produces diabetes. For exam- 
ple, defects in the gene for glucokinase, the hexokinase 
IV isozyme present in jS cells, lead to a rare form of dia- 
betes, MODY2 (see Box 15-3); low glucokinase activity 
prevents the generation of above-threshold [ATP], 
blocking insulin secretion. Mutations in the mitochon- 
drial tRNA Lys or tRNA Leu genes also compromise mito- 
chondrial ATP production, and type 2 diabetes mellitus 
is common among individuals with these defects (al- 
though these cases make up a very small fraction of all 
cases of diabetes). 

When nicotinamide nucleotide transhydrogenase, 
which is part of the mitochondrial defense against ROS 
(see Fig. 19-18), is genetically defective, the accumula- 
tion of ROS damages mitochondria, slowing ATP 
production and blocking insulin release by /3 cells 
(Fig. 19-42). Damage caused by ROS, including damage 
to mtDNA, may also underlie other human diseases; 
there is some evidence for its involvement in Alzheimer's, 
Parkinson's, and Huntington's diseases and in heart fail- 
ure, as well as in aging. ■ 
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FIGURE 19-42 Defective oxidative phosphorylation in pancreatic p 
cells blocks insulin secretion. Normally, when blood glucose rises, the 
production of ATP in (B cells increases. ATP, by blocking K + channels, 
depolarizes the plasma membrane and thus opens the voltage-gated 
Ca 2+ channels. The resulting influx of Ca 2+ triggers exocytosis of 
insulin-containing secretory vesicles, releasing insulin. When oxida- 
tive phosphorylation in p cells is defective, [ATP] is never sufficient to 
trigger this process, and insulin is not released. 
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SUMMARY 19.5 



Mitochondrial Genes: 
Their Origin and the 
Effects of Mutations 



■ A small proportion of human mitochondrial proteins, 
13 in all, are encoded by the mitochondrial 
genome and synthesized in mitochondria. About 
900 mitochondrial proteins are encoded in nuclear 
genes and imported into mitochondria after their 
synthesis. 

■ Mitochondria arose from aerobic bacteria that 
entered into an endosymbiotic relationship with 
ancestral eukaryotes. 

■ Mutations in the mitochondrial genome accumulate 
over the life of the organism. Mutations in the 
genes that encode components of the respiratory 
chain, ATP synthase, and the ROS-scavenging 
system, and even in tRNA genes, can cause a 
variety of human diseases, which often most 
severely affect muscle, heart, pancreatic cells, 
and brain. 

PHOTOSYNTHESIS: HARVESTING LIGHT ENERGY 

We now turn to another reaction sequence in which the 
flow of electrons is coupled to the synthesis of ATP: 
light-driven phosphorylation. The capture of solar en- 
ergy by photosynthetic organisms and its conversion to 
the chemical energy of reduced organic compounds is 
the ultimate source of nearly all biological energy. Pho- 
tosynthetic and heterotrophic organisms live in a bal- 
anced steady state in the biosphere (Fig. 19-43). 
Photosynthetic organisms trap solar energy and form 
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FIGURE 19-43 Solar energy as the ultimate source of all biological 
energy. Photosynthetic organisms use the energy of sunlight to manu- 
facture glucose and other organic products, which heterotrophic cells 
use as energy and carbon sources. 



ATP and NADPH, which they use as energy sources to 
make carbohydrates and other organic compounds from 
C0 2 and H 2 0; simultaneously, they release 0 2 into the 
atmosphere. Aerobic heterotrophs (humans, for exam- 
ple, as well as plants during dark periods) use the 0 2 so 
formed to degrade the energy-rich organic products of 
photosynthesis to C0 2 and H 2 0, generating ATP. The 
C0 2 returns to the atmosphere, to be used again by pho- 
tosynthetic organisms. Solar energy thus provides the 
driving force for the continuous cycling of C0 2 and 
0 2 through the biosphere and provides the reduced 
substrates — fuels, such as glucose — on which nonpho- 
tosynthetic organisms depend. 

Photosynthesis occurs in a variety of bacteria and in 
unicellular eukaryotes (algae) as well as in vascular 
plants. Although the process in these organisms differs 
in detail, the underlying mechanisms are remarkably 
similar, and much of our understanding of photosynthe- 
sis in vascular plants is derived from studies of simpler 
organisms. The overall equation for photosynthesis 
in vascular plants describes an oxidation-reduction 
reaction in which H 2 0 donates electrons (as hydrogen) 
for the reduction of C0 2 to carbohydrate (CH 2 0): 



CO, 



H 2 0 



light 



> 0 2 + (CH 2 0) 



19.6 General Features of 
Photophosphorylation 

Unlike NADH (the major electron donor in oxidative 
phosphorylation), H 2 0 is a poor donor of electrons; its 
standard reduction potential is 0.816 V, compared with 
-0.320 V for NADH. Photophosphorylation differs from 
oxidative phosphorylation in requiring the input of en- 
ergy in the form of light to create a good electron donor 
and a good electron acceptor. In photophosphorylation, 
electrons flow through a series of membrane-bound car- 
riers including cytochromes, quinones, and iron-sulfur 
proteins, while protons are pumped across a membrane 
to create an electrochemical potential. Electron transfer 
and proton pumping are catalyzed by membrane 
complexes homologous in structure and function to 
Complex III of mitochondria. The electrochemical po- 
tential they produce is the driving force for ATP synthe- 
sis from ADP and P i; catalyzed by a membrane-bound 
ATP synthase complex closely similar to that of oxidative 
phosphorylation. 

Photosynthesis in plants encompasses two processes: 
the light-dependent reactions, or light reactions, 
which occur only when plants are Hluminated, and the 
carbon-assimilation reactions (or carbon-fixation 
reactions), sometimes misleadingly called the dark reac- 
tions, which are driven by products of the light reactions 
(Fig. 19^4). In the light reactions, chlorophyll and other 
pigments of photosynthetic cells absorb light energy and 
conserve it as ATP and NADPH; simultaneously, 0 2 is 
evolved. In the carbon-assimilation reactions, ATP and 
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FIGURE 19-44 The light reactions of photosynthesis generate energy- 
rich NADPH and ATP at the expense of solar energy. NADPH and ATP 
are used in the carbon-assimilation reactions, which occur in light or 
darkness, to reduce C0 2 to form trioses and more complex com- 
pounds (such as glucose) derived from trioses. 



NADPH are used to reduce C0 2 to form triose phosphates, 
starch, and sucrose, and other products derived from 
them. In this chapter we are concerned only with the light- 
dependent reactions that lead to the synthesis of ATP and 
NADPH. The reduction of C0 2 is described in Chapter 20. 

Photosynthesis in Plants Takes Place in Chloroplasts 

In photosynthetic eukaryotic cells, both the light- 
dependent and the carbon-assimilation reactions take 
place in the chloroplasts (Fig. 19-45), intracellular or- 
ganelles that are variable in shape and generally a few 
micrometers in diameter. Like mitochondria, they are 
surrounded by two membranes, an outer membrane 
that is permeable to small molecules and ions, and an 



inner membrane that encloses the internal compart- 
ment. This compartment contains many flattened, mem- 
brane-surrounded vesicles or sacs, the thylakoids, 
usually arranged in stacks called grana (Fig. 19-45b). 
Embedded in the thylakoid membranes (commonly 
called lamellae) are the photosynthetic pigments and 
the enzyme complexes that carry out the light reactions 
and ATP synthesis. The stroma (the aqueous phase en- 
closed by the inner membrane) contains most of the en- 
zymes required for the carbon-assimilation reactions. 

Light Drives Electron Flow in Chloroplasts 

In 1937 Robert Hill found that when leaf extracts con- 
taining chloroplasts were illuminated, they (1) evolved 
0 2 and (2) reduced a nonbiological electron acceptor 
added to the medium, according to the Hill reaction: 



2H 2 0 + 2A 



light 



> 2AH 2 + 0 2 



where A is the artificial electron acceptor, or Hill 
reagent. One Hill reagent, the dye 2,6-dichlorophenolin- 
dophenol, is blue when oxidized (A) and colorless when 
reduced (AH 2 ), making the reaction easy to follow. 



OH 




OH 

Oxidized form 
(blue) 



OH 

Reduced form 
(colorless) 



Dichlorophenolindophenol 

When a leaf extract supplemented with the dye was 
illuminated, the blue dye became colorless and 0 2 was 
evolved. In the dark, neither 0 2 evolution nor dye reduc- 
tion took place. This was the first evidence that absorbed 
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light energy causes electrons to flow from H 2 0 to an elec- 
tron acceptor. Moreover, Hill found that C0 2 was neither 
required nor reduced to a stable form under these condi- 
tions; 0 2 evolution could be dissociated from C0 2 reduc- 
tion. Several years later Severo Ochoa showed that 
NADP + is the biological electron acceptor in chloro- 
plasts, according to the equation 

2H 2 0 + 2NADP+ 2NADPH + 2H + + 0 2 

To understand this photochemical process, we must 
first consider the more general topic of the effects of 
light absorption on molecular structure. 

SUMMARY 19.6 General Features of 

Photophosphorylation 

■ The light reactions of photosynthesis are those 
directly dependent on the absorption of light; the 
resulting photochemistry takes electrons from 
H 2 0 and drives them through a series of 
membrane-bound carriers, producing NADPH 
and ATP. 

■ The carbon-assimilation reactions of 
photosynthesis reduce C0 2 with electrons from 
NADPH and energy from ATP. 

19.7 Light Absorption 

Visible light is electromagnetic radiation of wavelengths 
400 to 700 nm, a small part of the electromagnetic spec- 
trum (Fig. 19-46), ranging from violet to red. The en- 
ergy of a single photon (a quantum of light) is greater at 
the violet end of the spectrum than at the red end; 
shorter wavelength (and higher frequency) corresponds 
to higher energy. The energy, E, in a single photon of vis- 
ible light is given by the Planck equation: 

E = hv = hc/X 

where h is Planck's constant (6.626 X 10" 34 J • s), v is 
the frequency of the light in cycles/s, c is the speed of 
light (3.00 X 10 8 m/s), and A is the wavelength in meters. 



The energy of a photon of visible light ranges from 
150 kJ/einstein for red light to —300 kJ/einstein for violet 
light. 



WORKED EXAMPLE 19-2 Energy of a Photon 

The light used by vascular plants for photosynthesis has 
a wavelength of about 700 nm. Calculate the energy in a 
"mole" of photons (an einstein) of light of this wave- 
length, and compare this with the energy needed to syn- 
thesize a mole of ATP 

Solution: The energy in a single photon is given by the 
Planck equation. At a wavelength of 700 X 1CT 9 m, the 
energy of a photon is 

E = hc/X 

= [(6.626 X 1(T 34 J • s)(3.00 X 10 8 m/s)]/(7.00 X l(T 7 m) 
= 2.84 X 10" 19 J 

An einstein of light is Avogadro's number (6.022 X 10 23 ) 
of photons, thus the energy of one einstein of photons at 
700 nm is given by 

(2.84 X 10" 19 J/photon)(6.022 X 10 23 photons/einstein) 

= 17.1 x 10 4 J/einstein 

= 171 kJ/einstein. 

So, a "mole" of photons of red light has about five times 
the energy needed to produce a mole of ATP from ADP 
and ^ (30.5kJ/mol). 



When a photon is absorbed, an electron in the ab- 
sorbing molecule (chromophore) is lifted to a higher en- 
ergy level. This is an all-or-nothing event; to be 
absorbed, the photon must contain a quantity of energy 
(a quantum) that exactly matches the energy of the 
electronic transition. A molecule that has absorbed a 
photon is in an excited state, which is generally unsta- 
ble. An electron lifted into a higher-energy orbital usu- 
ally returns rapidly to its lower-energy orbital; the 
excited molecule decays to the stable ground state, 
giving up the absorbed quantum as light or heat or using 
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FIGURE 19-46 Electromagnetic radiation. The 

spectrum of electromagnetic radiation, and the en- 
ergy of photons in the visible range. One einstein 
is 6.022 X 10 23 photons. 
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it to do chemical work. Light emission accompanying 
decay of excited molecules (called fluorescence) is al- 
ways at a longer wavelength (lower energy) than that of 
the absorbed light (see Box 12-3). An alternative mode 
of decay important in photosynthesis involves direct 
transfer of excitation energy from an excited molecule 
to a neighboring molecule. Just as the photon is a 
quantum of light energy, so the exciton is a quantum of 
energy passed from an excited molecule to another mol- 
ecule in a process called exciton transfer. 



Chlorophylls Absorb Light Energy for Photosynthesis 

The most important light-absorbing pigments in the 
thylakoid membranes are the chlorophylls, green 
pigments with polycyclic, planar structures resembling 
the protoporphyrin of hemoglobin (see Fig. 5-1), ex- 
cept that Mg 2+ , not Fe 2+ , occupies the central position 
(Fig. 19-47). The four inward-oriented nitrogen 
atoms of chlorophyll are coordinated with the Mg 2+ . All 
chlorophylls have a long phytol side chain, esterified to 
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FIGURE 19-47 Primary and secondary photopigments. (a) Chloro- Carotene (a carotenoid) and (d) lutein (a xanthophyll) are accessory 
phylls a and b and bacteriochlorophyll are the primary gatherers of pigments in plants. The areas shaded pink are the conjugated systems 
light energy, (b) Phycoerythrobilin and phycocyanobilin (phycobilins) (alternating single and double bonds) that largely account for the ab- 
are the antenna pigments in cyanobacteria and red algae, (c) /3- sorption of visible light. 
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FIGURE 19-48 Absorption of visible light by photopigments. Plants 
are green because their pigments absorb light from the red and blue re- 
gions of the spectrum, leaving primarily green light to be reflected or 
transmitted. Compare the absorption spectra of the pigments with the 
spectrum of sunlight reaching the earth's surface; the combination of 
chlorophylls (a and b) and accessory pigments enables plants to har- 
vest most of the energy available in sunlight. 



The relative amounts of chlorophylls and accessory pigments are 
characteristic of a particular plant species. Variation in the proportions 
of these pigments is responsible for the range of colors of photosyn- 
thetic organisms, from the deep blue-green of spruce needles, to the 
greener green of maple leaves, to the red, brown, or purple color of 
some species of multicellular algae and the leaves of some foliage 
plants favored by gardeners. 



a carboxyl-group substituent in ring IV, and chloro- 
phylls also have a fifth five-membered ring not present 
in heme. 

The heterocyclic five-ring system that surrounds 
the Mg 2+ has an extended polyene structure, with 
alternating single and double bonds. Such polyenes 
characteristically show strong absorption in the vis- 
ible region of the spectrum (Fig. 19-48); the 
chlorophylls have unusually high molar extinction 
coefficients (see Box 3-1) and are therefore partic- 
ularly well-suited for absorbing visible light during 
photosynthesis. 

Chloroplasts always contain both chlorophyll a and 
chlorophyll b (Fig. 19-4 7a). Although both are green, 
their absorption spectra are sufficiently different 
(Fig. 19-48) that they complement each other's range of 
light absorption in the visible region. Most plants con- 
tain about twice as much chlorophyll a as chlorophyll b. 
The pigments in algae and photosynthetic bacteria in- 
clude chlorophylls that differ only slightly from the plant 
pigments. 

Chlorophyll is always associated with specific bind- 
ing proteins, forming light-harvesting complexes 

(LHCs) in which chlorophyll molecules are fixed in 
relation to each other, to other protein complexes, and 
to the membrane. One light-harvesting complex (LHCII; 
Fig. 19-49) contains seven molecules of chlorophyll a, 
five of chlorophyll 6, and two of the accessory pigment 
lutein (see below). 

Cyanobacteria and red algae employ phycobilins 
such as phycoerythrobilin and phycocyanobilin (Fig. 
19-4 7b) as their light-harvesting pigments. These open- 



chain tetrapyrroles have the extended polyene system 
found in chlorophylls, but not their cyclic structure or 
central Mg 2+ . Phycobilins are covalently linked to spe- 
cific binding proteins, forming phycobiliproteins, 
which associate in highly ordered complexes called 




FIGURE 19-49 A light-harvesting complex, LHCII. (PDB ID 2BHW) 
The functional unit is an LHC trimer, with 36 chlorophyll and 6 lutein 
molecules. Shown here is a monomer, viewed in the plane of the mem- 
brane, with its three transmembrane a-helical segments, seven chloro- 
phyll a molecules (light green), five chlorophyll b molecules (dark 
green), and two molecules of the accessory pigment lutein (yellow), 
which form an internal cross-brace. 
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FIGURE 19-50 A phycobilisome. In these highly structured assemblies 
found in cyanobacteria and red algae, phycobilin pigments bound to 
specific proteins form complexes called phycoerythrin (PE), phyco- 
cyanin (PC), and allophycocyanin (AP), The energy of photons absorbed 
by PE or PC is conveyed through AP (a phycocyanobilin-binding 
protein) to chlorophyll a of the reaction center by exciton transfer, a 
process discussed in the text. 

phycobilisomes (Fig. 19-50) that constitute the primary 
light-harvesting structures in these microorganisms. 

Accessory Pigments Extend the Range of Light Absorption 

In addition to chlorophylls, thylakoid membranes con- 
tain secondary light-absorbing pigments, or accessory 
pigments, called carotenoids. Carotenoids may be 
yellow, red, or purple. The most important are /?- 
carotene, which is a red-orange isoprenoid, and the 
yellow carotenoid lutein (Fig. 19-47c, d). The 
carotenoid pigments absorb light at wavelengths not ab- 
sorbed by the chlorophylls (Fig. 19-48) and thus are 
supplementary light receptors. 

Experimental determination of the effectiveness of 
light of different colors in promoting photosynthesis 
yields an action spectrum (Fig. 19-51), often useful 
in identifying the pigment primarily responsible for a bi- 
ological effect of light. By capturing light in a region of 
the spectrum not used by other organisms, a photosyn- 
thetic organism can claim a unique ecological niche. For 
example, the phycobilins in red algae and cyanobacteria 
absorb light in the range 520 to 630 nm (Fig. 19-48), al- 
lowing them to occupy niches where light of lower or 
higher wavelength has been filtered out by the pigments 
of other organisms living in the water above them, or by 
the water itself. 

Chlorophyll Funnels the Absorbed Energy to Reaction 
Centers by Exciton Transfer 

The light-absorbing pigments of thylakoid or bacterial 
membranes are arranged in functional arrays called 
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FIGURE 19-51 Two ways to determine the action spectrum for pho- 
tosynthesis, (a) Results of a classic experiment performed by T. W. 
Englemann in 1882 to determine the wavelength of light that is most 
effective in supporting photosynthesis. Englemann placed cells of a 
filamentous photosynthetic alga on a microscope slide and illumi- 
nated them with light from a prism, so that one part of the filament re- 
ceived mainly blue light, another part yellow, another red. To 
determine which algal cells carried out photosynthesis most actively, 
Englemann also placed on the microscope slide bacteria known to 
migrate toward regions of high 0 2 concentration. After a period of il- 
lumination, the distribution of bacteria showed highest 0 2 levels (pro- 
duced by photosynthesis) in the regions illuminated with violet and 
red light. 

(b) Results of a similar experiment that used modern techniques 
(an oxygen electrode) for the measurement of 0 2 production. An ac- 
tion spectrum (as shown here) describes the relative rate of photosyn- 
thesis for illumination with a constant number of photons of different 
wavelengths. An action spectrum is useful because, by comparison 
with absorption spectra (such as those in Fig. 1 9-48), it suggests which 
pigments can channel energy into photosynthesis. 

photosystems. In spinach chloroplasts, for example, 
each photosystem contains about 200 chlorophyll and 
50 carotenoid molecules. All the pigment molecules in 
a photosystem can absorb photons, but only a few 
chlorophyll molecules associated with the photo- 
chemical reaction center are specialized to trans- 
duce light into chemical energy. The other pigment 
molecules in a photosystem are called light-harvest- 
ing or antenna molecules. They absorb light energy 
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FIGURE 19-52 Organization of photosystems in the thylakoid mem- 
brane. Photosystems are tightly packed in the thylakoid membrane, 
with several hundred antenna chlorophylls and accessory pigments sur- 
rounding a photoreaction center. Absorption of a photon by any of the 
antenna chlorophylls leads to excitation of the reaction center by exci- 
ton transfer (black arrow). Also embedded in the thylakoid membrane 
are the cytochrome b 6 f complex and ATP synthase (see Fig. 1 9-60), 



and transmit it rapidly and efficiently to the reaction 
center (Fig. 19-52). 

The chlorophyll molecules in light-harvesting com- 
plexes have light-absorption properties that are subtly 
different from those of free chlorophyll. When isolated 
chlorophyll molecules in vitro are excited by light, the 
absorbed energy is quickly released as fluorescence 
and heat, but when chlorophyll in intact leaves is excited 
by visible light (Fig. 19-53, step CD), very little fluores- 
cence is observed. Instead, the excited antenna chloro- 
phyll transfers energy directly to a neighboring 
chlorophyll molecule, which becomes excited as the first 



FIGURE 19-53 Exciton and electron transfer. This generalized scheme 
shows conversion of the energy of an absorbed photon into separation 
of charges at the reaction center. The steps are further described in the 
text. Note that step © may repeat between successive antenna mole- 
cules until the exciton reaches a reaction-center chlorophyll. The 
asterisk (*) represents the excited state of an antenna molecule. 
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molecule returns to its ground state (step (2)). This 
transfer of energy, exciton transfer, extends to a third, 
fourth, or subsequent neighbor, until one of a special pair 
of chlorophyll a molecules at the photochemical reaction 
center is excited (step ©). In this excited chlorophyll 
molecule, an electron is promoted to a higher-energy or- 
bital. This electron then passes to a nearby electron 
acceptor that is part of the electron-transfer chain, leav- 
ing the reaction-center chlorophyll with a missing elec- 
tron (an "electron hole," denoted by + in Fig. 19-53) 
(step®). The electron acceptor acquires a negative 
charge in this transaction. The electron lost by the 
reaction-center chlorophyll is replaced by an electron from 
a neighboring electron-donor molecule (step©), which 
thereby becomes positively charged. In this way, 
excitation by light causes electric charge separation 
and initiates an oxidation-reduction chain. 

SUMMARY 19.7 Light Absorption 

■ Photophosphorylation in the chloroplasts of green 
plants and in cyanobacteria involves electron flow 
through a series of membrane-bound carriers. 

■ In the light reactions of plants, absorption of a 
photon excites chlorophyll molecules and other 
(accessory) pigments, which funnel the energy into 
reaction centers in the thylakoid membranes. In the 
reaction centers, photoexcitation results in a 
charge separation that produces a strong electron 
donor (reducing agent) and a strong electron 
acceptor. 

19.8 The Central Photochemical Event: 
Light-Driven Electron Flow 

Light-driven electron transfer in plant chloroplasts dur- 
ing photosynthesis is accomplished by multienzyme sys- 
tems in the thylakoid membrane. Our current picture of 
photosynthetic mechanisms is a composite, drawn from 
studies of plant chloroplasts and a variety of bacteria 
and algae. Determination of the molecular structures of 
bacterial photosynthetic complexes (by x-ray crystal- 
lography) has given us a much improved understanding 
of the molecular events in photosynthesis in general. 

Bacteria Have One of Two Types of Single Photochemical 
Reaction Center 

One major insight from studies of photosynthetic bacte- 
ria came in 1952 when Louis Duysens found that illumi- 
nation of the photosynthetic membranes of the purple 
bacterium Rhodospirillum rubrum with a pulse of 
light of a specific wavelength (870 nm) caused a tempo- 
rary decrease in the absorption of light at that wave- 
length; a pigment was "bleached" by 870 nm light. Later 
studies by Bessel Kok and Horst Witt showed similar 



bleaching of plant chloroplast pigments by light of 680 
and 700 nm. Furthermore, addition of the (nonbiologi- 
cal) electron acceptor [Fe(CN) 6 ] 3 ^ (ferricyanide) caused 
bleaching at these wavelengths without illumination. 
These findings indicated that bleaching of the pigments 
was due to the loss of an electron from a photochemical 
reaction center. The pigments were named for the wave- 
length of maximum bleaching: P870, P680, and P700. 

Photosynthetic bacteria have relatively simple pho- 
totransduction machinery, with one of two general types 
of reaction center. One type (found in purple bacteria) 
passes electrons through pheophytin (chlorophyll lack- 
ing the central Mg 2+ ion) to a quinone. The other 
(in green sulfur bacteria) passes electrons through a 
quinone to an iron-sulfur center. Cyanobacteria and 
plants have two photosystems (PSI, PSII), one of each 
type, acting in tandem. Biochemical and biophysical 
studies have revealed many of the molecular details of 
reaction centers of bacteria, which therefore serve as 
prototypes for the more complex phototransduction sys- 
tems of plants. 

The Pheophytin-Quinone Reaction Center (Type 
II Reaction Center) The photosynthetic machinery 
in purple bacteria consists of three basic modules 
(Fig. 19-54a): a single reaction center (P870), a cy- 
tochrome be i electron-transfer complex similar to Com- 
plex III of the mitochondrial electron-transfer chain, and 
an ATP synthase, also similar to that of mitochondria. Il- 
lumination drives electrons through pheophytin and a 
quinone to the cytochrome bc x complex; after passing 
through the complex, electrons flow through cytochrome 
c 2 back to the reaction center, restoring its preillumina- 
tion state. This light-driven cyclic flow of electrons pro- 
vides the energy for proton pumping by the cytochrome 
bc x complex. Powered by the resulting proton gradient, 
ATP synthase produces ATP, exactly as in mitochondria. 

The three-dimensional structures of the reaction 
centers of purple bacteria (Rhodopseudomonas 
viridis and Rhodobacter sphaeroides) , deduced from 
x-ray crystallography, shed light on how phototrans- 
duction takes place in a pheophytin-quinone reaction 
center. The R. viridis reaction center (Fig. 19-55a) 
is a large protein complex containing four polypeptide 
sub units and 13 cofactors: two pairs of bacterial 
chlorophylls, a pair of pheophytins, two quinones, a 
nonheme iron, and four hemes in the associated c-type 
cytochrome. 

The extremely rapid sequence of electron transfers 
shown in Figure 19-55b has been deduced from physical 
studies of the bacterial pheophytin-quinone centers, using 
brief flashes of light to trigger phototransduction and a va- 
riety of spectroscopic techniques to follow the flow of 
electrons through several carriers. A pair of bacterio- 
chlorophylls — the "special pair," designated (Chl) 2 — is 
the site of the initial photochemistry in the bacterial reac- 
tion center. Energy from a photon absorbed by one of the 
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FIGURE 19-54 Functional modules of photosynthetic machinery in 
purple bacteria and green sulfur bacteria, (a) In purple bacteria, light 
energy drives electrons from the reaction-center P870 through pheo- 
phytin (Pheo), a quinone (Q), and the cytochrome be, complex, then 
through cytochrome c 2 and thus back to the reaction center. Electron 
flow through the cytochrome bc^ complex causes proton pumping, cre- 
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Green sulfur bacteria 
(Fe-S type) 
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atingan electrochemical potential that powers ATP synthesis, (b) Green 
sulfur bacteria have two routes for electrons driven by excitation of 
P840: a cyclic route through a quinone to the cytochrome be, complex 
and back to the reaction center via cytochrome c, and a noncyclic route 
from the reaction center through the iron-sulfur protein ferredoxin (Fd), 
then to NAD + in a reaction catalyzed by ferredoxin:NAD reductase. 



many antenna chlorophyll molecules surrounding the re- 
action center reaches (Chl) 2 by exciton transfer, When 
these two chlorophyll molecules — so close that their 
bonding orbitals overlap — absorb an exciton, the redox 
potential of (Chl) 2 is shifted, by an amount equivalent to 
the energy of the photon, converting the special pair to a 
very strong electron donor. The (Chl) 2 donates an elec- 
tron that passes through a neighboring chlorophyll 
monomer to pheophytin (Pheo). This produces two radi- 
cals, one positively charged (the special pair of chloro- 
phylls) and one negatively charged (the pheophytin): 

(Chl) 2 + 1 exciton > (Chi)? (excitation) 

(Chi)? + Pheo > "(Chl)J + 'Pheo" (charge separation) 

The pheophytin radical now passes its electron to a 
tightly bound molecule of quinone (Q A ), converting it to 
a semiquinone radical, which immediately donates its 
extra electron to a second, loosely bound quinone (Q B ). 
Two such electron transfers convert Q B to its fully re- 
duced form, Q B H 2 , which is free to diffuse in the mem- 
brane bilayer, away from the reaction center: 

2 *Pheo~ + 2H + + Q B ► 2 Pheo + Q B H 2 

(quinone reduction) 



The hydroquinone (Q B H 2 ), carrying in its chemical 
bonds some of the energy of the photons that originally 
excited P870, enters the pool of reduced quinone (QH 2 ) 
dissolved in the membrane and moves through the lipid 
phase of the bilayer to the cytochrome bc x complex. 

Like the homologous Complex III in mitochondria, 
the cytochrome bc 1 complex of purple bacteria carries 
electrons from a quinol donor (QH 2 ) to an electron ac- 
ceptor, using the energy of electron transfer to pump 
protons across the membrane, producing a proton-motive 
force. The path of electron flow through this complex is 
believed to be very similar to that through mitochon- 
drial Complex III, involving a Q cycle (Fig. 19-12) in 
which protons are consumed on one side of the mem- 
brane and released on the other. The ultimate electron 
acceptor in purple bacteria is the electron-depleted 
form of P870, "(Chl) 2 + (Fig. 19-54a). Electrons move 
from the cytochrome bci complex to P870 via a soluble 
otype cytochrome, cytochrome c 2 . The electron-trans- 
fer process completes the cycle, returning the reaction 
center to its unbleached state, ready to absorb another 
exciton from antenna chlorophyll. 

A remarkable feature of this system is that all the 
chemistry occurs in the solid state, with reacting species 
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FIGURE 19-55 Photoreaction center of the purple bacterium 
Rhodopseudomonas viridis. (PDB ID 1 PRC) (a) The system has four com- 
ponents: three subunits, H, M, and L (brown, blue, and gray, respectively), 
with a total of 1 1 transmembrane helical segments, and a fourth protein, 
cytochrome c (yellow), associated with the complex at the membrane sur- 
face. Subunits L and M are paired transmembrane proteins that together 
form a cylindrical structure with roughly bilateral symmetry about its long 
axis. Shown as space-filling models (and in (b) as ball-and-stick structures) 
are the prosthetic groups that participate in the photochemical events, 
Bound to the L and M chains are two pairs of bacteriochlorophyll mole- 
cules (green); one of the pairs — the "special pair," (Chl) 2 — is the site of the 
first photochemical changes after light absorption. Also incorporated in 
the system are a pair of pheophytin a (Pheo a) molecules (blue); two 
quinones, menaquinone (Q A ) and ubiquinone (Q B ) (orange and yellow), 
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also arranged with bilateral symmetry; and a single nonheme Fe (red) lo- 
cated approximately on the axis of symmetry between the quinones. 
Shown at the top of the figure are four heme groups (red) associated with 
the c-type cytochrome of the reaction center. The reaction center of an- 
other purple bacterium, Rhodobacter sphaeroides, is very similar, except 
that cytochrome c is not part of the crystalline complex. 

(b) Sequence of events following excitation of the special pair of 
bacteriochlorophylls (all components colored as in (a)), with the time 
scale of the electron transfers in parentheses. (T) The excited special 
pair passes an electron to pheophytin, © from which the electron 
moves rapidly to the tightly bound menaquinone, Q A . ©This quinone 
passes electrons much more slowly to the diffusible ubiquinone, Q B , 
through the nonheme Fe. Meanwhile, 0 the "electron hole" in the 
special pair is filled by an electron from a heme of cytochrome c. 



held close together in the right orientation for reaction. 
The result is a very fast and efficient series of reactions. 

The Fe-S Reaction Center (Type I Reaction Center) 

Photosynthesis in green sulfur bacteria involves the same 
three modules as in purple bacteria, but the process dif- 
fers in several respects and involves additional enzymatic 
reactions (Fig. 19-54b). Excitation causes an electron to 
move from the reaction center to the cytochrome be i 
complex via a quinone carrier. Electron transfer through 
this complex powers proton transport and creates the 
proton-motive force used for ATP synthesis, just as in pur- 
ple bacteria and in mitochondria. However, in contrast to 
the cyclic flow of electrons in purple bacteria, some elec- 
trons flow from the reaction center to an iron-sulfur pro- 
tein, ferredoxin, which then passes electrons via 
ferredoxin:NAD reductase to NAD + , producing NADH. 
The electrons taken from the reaction center to reduce 
NAD + are replaced by the oxidation of H 2 S to elemental S, 
then to SOl ~ , in the reaction that defines the green sulfur 



bacteria. This oxidation of H 2 S by bacteria is chemically 
analogous to the oxidation of H 2 0 by oxygenic plants. 

Kinetic and Thermodynamic Factors Prevent the 
Dissipation of Energy by Internal Conversion 

The complex construction of reaction centers is the prod- 
uct of evolutionary selection for efficiency in the photo- 
synthetic process. The excited state (Chi)* could in 
principle decay to its ground state by internal conversion, 
a very rapid process (10 picoseconds; 1 ps = 10" 12 s) in 
which the energy of the absorbed photon is converted to 
heat (molecular motion). Reaction centers are con- 
structed to prevent the inefficiency that would result from 
internal conversion. The proteins of the reaction center 
hold the bacteriochlorophylls, bacteriopheophytins, and 
quinones in a fixed orientation relative to each other, al- 
lowing the photochemical reactions to take place in a vir- 
tually solid state. This accounts for the high efficiency and 
rapidity of the reactions; nothing is left to chance collision 
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or random diffusion. Exciton transfer from antenna 
chlorophyll to the special pair of the reaction center is ac- 
complished in less than 100 ps with >90% efficiency. 
Within 3 ps of the excitation of P870, pheophytin has re- 
ceived an electron and become a negatively charged radi- 
cal; less than 200 ps later, the electron has reached the 
quinone Q B (Fig. 19-55b). The electron-transfer reactions 
not only are fast but are thermodynamically "downhill"; 
the excited special pair (Chi)* is a very good electron 
donor (E'° = - 1 V), and each successive electron transfer 
is to an acceptor of substantially less negative E'°. The 
standard free-energy change for the process is therefore 
negative and large; recall from Chapter 13 that AG'° = 
-nJAE ro ; here, A£"° is the difference between the stan- 
dard reduction potentials of the two half -reactions 

(1) (Chl)l — > -(Chl) 2 + +e~ E'° = -1.0 V 

(2) Q + 2H + + 2e" > QH 2 E'° = -0,045 V 

Thus 

&E ro = -0.045 V - (-1.0 V) - 0.95 V 



and 

AG' 0 = -2(96.5 kJ/V - mol)(0.95 V) = -180 kJ/mol 

The combination of fast kinetics and favorable thermody- 
namics makes the process virtually irreversible and highly 
efficient. The overall energy yield (the percentage of the 
photon's energy conserved in QH 2 ) is >30%, with the re- 
mainder of the energy dissipated as heat and entropy. 

In Plants, Two Reaction Centers Act in Tandem 

The photosynthetic apparatus of modern cyanobacte- 
ria, algae, and vascular plants is more complex than the 
one-center bacterial systems, and it seems to have 
evolved through the combination of two simpler bacte- 
rial photocenters. The thylakoid membranes of chloro- 
plasts have two different kinds of photosystems, each 
with its own type of photochemical reaction center and 
set of antenna molecules. The two systems have dis- 
tinct and complementary functions (Fig. 19-50). Pho- 
tosystem II (PSII) is a pheophytin-quinone type of 
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FIGURE 19-56 Integration of photosystems I and II in chloroplasts.This 

"Z scheme" shows the pathway of electron transfer from H 2 0 (lower left) 
to NADP + (far right) in noncyclic photosynthesis The position on the 
vertical scale of each electron carrier reflects its standard reduction po- 
tential. To raise the energy of electrons derived from H 2 0 to the energy 
level required to reduce NADP + to NADPH, each electron must be 
"lifted" twice (heavy arrows) by photons absorbed in PSII and PSI. One 
photon is required per electron in each photosystem. After excitation, 



the high-energy electrons flow "downhill" through the carrier chains 
shown. Protons move across the thylakoid membrane during the water- 
splitting reaction and during electron transfer through the cytochrome 
b 6 f complex, producing the proton gradient that is essential to ATP for- 
mation. An alternative path of electrons is cyclic electron transfer, in 
which electrons move from ferredoxin back to the cytochrome b 6 f com- 
plex, instead of reducing NADP + to NADPH. The cyclic pathway pro- 
duces more ATP and less NADPH than the noncyclic. 
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system (like the single photosystem of purple bacte- 
ria) containing roughly equal amounts of chlorophylls 
a and b. Excitation of its reaction-center P680 drives 
electrons through the cytochrome b&f complex with 
concomitant movement of protons across the thy- 
lakoid membrane. Photosystem I (PSI) is struc- 
turally and functionally related to the type I reaction 
center of green sulfur bacteria. It has a reaction center 
designated P700 and a high ratio of chlorophyll a to 
chlorophyll b. Excited P700 passes electrons to the 
Fe-S protein ferredoxin, then to NADP + , producing 
NADPH. The thylakoid membranes of a single spinach 
chloroplast have many hundreds of each kind of pho- 
tosystem. 

These two reaction centers in plants act in tandem 
to catalyze the light-driven movement of electrons from 
H 2 0 to NADP + (Fig. 19-56). Electrons are carried be- 
tween the two photosystems by the soluble protein 
plastocyanin, a one-electron carrier functionally simi- 
lar to cytochrome c of mitochondria. To replace the elec- 
trons that move from PSII through PSI to NADP + , 
cyanobacteria and plants oxidize H 2 0 (as green sulfur 
bacteria oxidize H 2 S), producing 0 2 (Fig. 19-56, bottom 
left). This process is called oxygenic photosynthesis 
to distinguish it from the anoxygenic photosynthesis of 
purple and green sulfur bacteria. All 0 2 -evolving photo- 
synthetic cells— those of plants, algae, and cyanobacte- 
ria — contain both PSI and PSII; organisms with only one 
photosystem do not evolve 0 2 . The diagram in Fig- 
ure 19-56, often called the Z scheme because of its 
overall form, outlines the pathway of electron flow be- 
tween the two photosystems and the energy relation- 
ships in the light reactions. The Z scheme thus describes 
the complete route by which electrons flow from H 2 0 to 
NADP + , according to the equation 

2H 2 0 + 2NADP + + 8 photons > 0 2 + 2NADPH + 2H + 

For every two photons absorbed (one by each photosys- 
tem) , one electron is transferred from H 2 0 to NADP + . To 
form one molecule of 0 2j which requires transfer of four 
electrons from two H 2 0 to two NADP + , a total of eight 
photons must be absorbed, four by each photosystem. 

The mechanistic details of the photochemical reac- 
tions in PSII and PSI are essentially similar to those of 
the two bacterial photosystems, with several important 
additions. In PSII, two very similar proteins, Dl and D2, 
form an almost symmetric dimer, to which all the 
electron-carrying cofactors are bound (Fig. 19-57). 
Excitation of P680 in PSII produces P680*, an excellent 
electron donor that, within picoseconds, transfers an 
electron to pheophytin, giving it a negative charge 
('Pheo~). With the loss of its electron, P680* is trans- 
formed into a radical cation, P680 + . 'Pheo~ very rapidly 
passes its extra electron to a protein-bound plasto- 
quinone, PQ A (or Q A ), which in turn passes its electron 
to another, more loosely bound plastoquinone, PQ B (or 
Q B ). When PQ B has acquired two electrons in two such 
transfers from PQ A and two protons from the solvent 




FIGURE 19-57 Photosystem II of the cyanobacterium Synechococcus 
elongates. The monomeric form of the complex shown here has two 
major transmembrane proteins, D1 and D2, each with its set of cofac- 
tors Although the two subunits are nearly symmetric, electron flow oc- 
curs through only one of the two branches of cofactors, that on the 
right (on D1 ). The arrows show the path of electron flow from the Mn 
ion cluster (Mn 4 ) of the water-splitting enzyme to the quinone PQ B . 
The photochemical events occur in the sequence indicated by the step 
numbers. Notice the close similarity between the positions of cofactors 
here and the positions in the bacterial photoreaction center shown in 
Figure 19-55. The role of theTyr residues is discussed later in the text. 

water, it is in its fully reduced quinol form, PQ B H 2 . The 
overall reaction initiated by light in PSII is 

4 P680 + 4H + +2 PQ B + 4 photons > 

4P680 + + 2PQ B H 2 (19-12) 

Eventually, the electrons in PQ B H 2 pass through the cy- 
tochrome b 6 f complex (Fig. 19-56). The electron ini- 
tially removed from P680 is replaced with an electron 
obtained from the oxidation of water, as described below. 
The binding site for plastoquinone is the point of action 
of many commercial herbicides that kill plants by block- 
ing electron transfer through the cytochrome b 6 f com- 
plex and preventing photosynthetic ATP production. 

The photochemical events that follow excitation of 
PSI at the reaction-center P700 are formally similar to 
those in PSII. The excited reaction-center P700* loses 
an electron to an acceptor, A 0 (believed to be a special 
form of chlorophyll, functionally homologous to the 
pheophytin of PSII), creating A 0 ~ andP700 + (Fig. 19-56, 
right side); again, excitation results in charge separation 
at the photochemical reaction center. P700 + is a strong 
oxidizing agent, which quickly acquires an electron from 
plastocyanin, a soluble Cu-containing electron-transfer 
protein. A 0 ~ is an exceptionally strong reducing agent 
that passes its electron through a chain of carriers that 
leads to NADP + , First, phylloquinone (AO accepts an 
electron and passes it to an iron-sulfur protein (through 
three Fe-S centers in PSI). From here, the electron 
moves to ferredoxin (Fd), another iron-sulfur protein 
loosely associated with the thylakoid membrane. 
Spinach ferredoxin (M r 10,700) contains a 2Fe-2S cen- 
ter (Fig. 19-5) that undergoes one-electron oxidation 
and reduction reactions. The fourth electron carrier in 
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the chain is the flavoprotein ferredoxin:NADP + oxi- 

doreductase, which transfers electrons from reduced 
ferredoxin (Fd recl ) to NADP + : 



2Fd red + 2H + + NADP' 



2Fd nx + NADPH + H 



This enzyme is homologous to the ferredoxin:NAD re- 
ductase of green sulfur bacteria (Fig. 19-54b). 



Antenna Chlorophylls Are Tightly Integrated with 

Electron Carriers 

The electron-carrying cofactors of PSI and the light- 
harvesting complexes are part of a supramolecular com- 
plex (Fig. 19-58a), the structure of which has been 
solved crystallographically. The protein consists of three 




FIGURE 19-58 The supramolecular complex of PSI and its associated 
antenna chlorophylls, (a) Schematic drawing of the essential proteins 
and cofactors in a single unit of PSI. A large number of antenna chloro- 
phylls surround the reaction center and convey to it (red arrows) the 
energy of absorbed photons. The result is excitation of the pair of 
chlorophyll molecules that constitute P700, greatly decreasing its re- 
duction potential; P700 then passes an electron through two nearby 
chlorophylls to phylloquinone (Q K ; also called A^. Reduced phyllo- 
quinone is reoxidized as it passes two electrons, one at a time (blue 
arrows), to an Fe-S protein (F x ) near the n side of the membrane. From 
F X/ electrons move through two more Fe-S centers (F A and F B ) to the 
Fe-S protein ferredoxin in the stroma. Ferredoxin then donates electrons 



to NADP + (not shown), reducing it to NADPH, one of the forms in 
which the energy of photons is trapped in chloroplasts. 

(b) The trimeric structure (derived from PDB ID 1JBO), viewed 
from the thylakoid lumen perpendicular to the membrane, showing 
all protein subunits (gray) and cofactors. (c) A monomer of PSI with 
all the proteins omitted, revealing the antenna and reaction-center 
chlorophylls (green with dark green Mg 2+ ions in the center), 
carotenoids (yellow), and Fe-S centers of the reaction center (space- 
filling red and orange structures). The proteins in the complex hold 
the components rigidly in orientations that maximize efficient exci- 
ton transfers between excited antenna molecules and the reaction 
center. 
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identical complexes, each composed of 11 different pro- 
teins (Fig. 19-58b). In this remarkable structure the many 
antenna chlorophyll and carotenoid molecules are pre- 
cisely arrayed around the reaction center (Fig. 19-58c). 
The reaction center's electron-carrying cofactors are 
therefore tightly integrated with antenna chlorophylls. 
This arrangement allows very rapid and efficient exciton 
transfer from antenna chlorophylls to the reaction 
center. In contrast to the single path of electrons in PSII, 
the electron flow initiated by absorption of a photon is be- 
lieved to occur through both branches of carriers in PSI. 



The Cytochrome b 6 f Complex Links Photosystems II and I 

Electrons temporarily stored in plastoquinol as a result of 
the excitation of P680 in PSII are carried to P700 of PSI 
via the cytochrome b 6 f complex and the soluble protein 
plastocyanin (Fig. 19-56, center). Like Complex III of mi- 
tochondria, the cytochrome b 6 f complex (Fig. 19-59) 
contains a 6-type cytochrome with two heme groups 
(designated b H and 5 L ), a Rieske iron-sulfur protein 
(M r 20,000), and cytochrome / (named for the Latin 
Jrons, "leaf). Electrons flow through the cytochrome b 6 f 





Subunit IV 2H + Stroma (n side) 



FIGURE 19-59 Electron and proton flow through the cy- 
tochrome b 6 f complex, (a) The crystal structure of the complex 
(PDB ID 1 UM3) reveals the positions of the cofactors involved 
in electron transfers. In addition to the hemes of cytochrome b 
(heme b H and b L ; also called heme b N and b P , respectively, be- 
cause of their proximity to the n and p sides of the bilayer) and 
cytochrome /'(heme f), there is a fourth (heme x) near heme b H ; 
there is also a /3-carotene of unknown function. Two sites bind 
plastoquinone: the PQH 2 site near the p side of the bilayer, and 
the PQ site near the n side. The Fe-S center of the Rieske protein 
lies just outside the bilayer on the p side, and the heme /"site is 
on a protein domain that extends well into the thylakoid lumen, 
(b) The complex is a homodimer arranged to create a cavern 
connecting the PQH 2 and PQ sites (compare this with the struc- 
ture of mitochondrial Complex 111 in Fig. 1 9-1 1 ). This cavern al- 
lows plastoquinone to move between the sites of its oxidation 
and reduction. 

(c) Plastoquinol (PQH 2 ) formed in PSII is oxidized by the 
cytochrome b e f complex in a series of steps like those of the 
Q cycle in the cytochrome bc^ complex (Complex III) of mito- 
chondria (see Fig. 19-12). One electron from PQH 2 passes to 
the Fe-S center of the Rieske protein, the other to heme b L of 
cytochrome b e . The net effect is passage of electrons from PQH 2 
to the soluble protein plastocyanin, which carries them to PSI. 



75 6j Oxidative Phosphorylation and Photophosphorylation 



complex from PQ B H 2 to cytochrome /, then to plasto- 
cyanin, and finally to P700, thereby reducing it. 

Like Complex III of mitochondria, cytochrome b Q f 
conveys electrons from a reduced quinone — a mobile, 
lipid-soluble carrier of two electrons (Q in mitochondria, 
PQ B in chloroplasts) — to a water-soluble protein that 
carries one electron (cytochrome c in mitochondria, 
plastocyanin in chloroplasts). As in mitochondria, the 
function of this complex involves a Q cycle (Fig. 19-12) 
in which electrons pass, one at a time, from PQ B H 2 to cy- 
tochrome b 6 . This cycle results in the pumping of pro- 
tons across the membrane; in chloroplasts, the direction 
of proton movement is from the stromal compartment to 
the thylakoid lumen, up to four protons moving for each 
pair of electrons. The result is production of a proton 
gradient across the thylakoid membrane as electrons 
pass from PSII to PSI. Because the volume of the flat- 
tened thylakoid lumen is small, the influx of a small 
number of protons has a relatively large effect on lume- 
nal pH. The measured difference in pH between the 
stroma (pH 8) and the thylakoid lumen (pH 5) repre- 
sents a 1,000-fold difference in proton concentration — a 
powerful driving force for ATP synthesis. 

Cyclic Electron Flow between PSI and the Cytochrome b 6 f 
Complex Increases the Production of ATP Relative to 
NADPH 

Electron flow from PSII through the cytochrome b 6 f 
complex, then through PSI to NADP + , is sometimes 
called noncyclic electron flow, to distinguish it from 
cyclic electron flow, which occurs to varying degrees 
depending primarily on the light conditions. The non- 
cyclic path produces a proton gradient, which is used to 
drive ATP synthesis, and NADPH, which is used in re- 
ductive biosynthetic processes. Cyclic electron flow in- 
volves only PSI, not PSII (Fig. 19-56). Electrons passing 
from P700 to ferredoxin do not continue to NADP + , but 
move back through the cytochrome b 6 /complex to plas- 
tocyanin. (This electron path parallels that in green 
sulfur bacteria, shown in Fig. 19-54b.) Plastocyanin 
then donates electrons to P700, which transfers them to 
ferredoxin. In this way, electrons are repeatedly recy- 
cled through the cytochrome b 6 f complex and the reac- 
tion center of PSI, each electron propelled around the 
cycle by the energy of one photon. Cyclic electron flow 
is not accompanied by net formation of NADPH or evo- 
lution of 0 2 . However, it is accompanied by proton 
pumping by the cytochrome b 6 f complex and by phos- 
phorylation of ADP to ATP, referred to as cyclic pho- 
tophosphorylation. The overall equation for cyclic 
electron flow and photophosphorylation is simply 

ADP + Pi ATP + H 2 0 

By regulating the partitioning of electrons between 
NADP + reduction and cyclic photophosphorylation, a 
plant adjusts the ratio of ATP to NADPH produced in the 
light-dependent reactions to match its needs for these 
products in the carbon-assimilation reactions and other 



biosynthetic processes. As we shall see in Chapter 20, 
the carbon-assimilation reactions require ATP and 
NADPH in the ratio 3:2. 

This regulation of electron-transfer pathways is part of 
a short-term adaptation to changes in light color (wave- 
length) and quantity (intensity), as further described below. 

State Transitions Change the Distribution of LHCII between 
the Two Photosystems 

The energy required to excite PSI (P700) is less (light of 
longer wavelength, lower energy) than that needed to ex- 
cite PSII (P680). If PSI and PSII were physically contigu- 
ous, excitons originating in the antenna system of PSII 
would migrate to the reaction center of PSI, leaving PSII 
chronically underexcited and interfering with the opera- 
tion of the two-center system. This imbalance in the sup- 
ply of excitons is prevented by separation of the two 
photosystems in the thylakoid membrane (Fig. 19-60). 
PSII is located almost exclusively in the tightly appressed 
membrane stacks of thylakoid grana; its associated light- 
harvesting complex (LHCII) mediates the tight associa- 
tion of adjacent membranes in the grana. PSI and the ATP 
synthase complex are located almost exclusively in the 
nonappressed thylakoid membranes (the stromal lamel- 
lae), where they have access to the contents of the 
stroma, including ADP and NADP + . The cytochrome b 6 f 
complex is present primarily in the grana. 

The association of LHCII with PSI and PSII depends 
on light intensity and wavelength, which can change in 
the short term, leading to state transitions in the 
chloroplast. In state 1, a critical Ser residue in LCHII is 
not phosphorylated, and LHCII associates with PSII. 
Under conditions of intense or blue light, which favor 
absorption by PSII, that photosystem reduces plasto- 
quinone to plastoquinol (PQH 2 ) faster than PSI can oxi- 
dize it. The resulting accumulation of PQH 2 activates a 
protein kinase that triggers the transition to state 2 by 
phosphorylating a Thr residue on LHCII (Fig. 19-61). 
Phosphorylation weakens the interaction of LHCII with 
PSII, and some LHCII dissociates and moves to the stro- 
mal lamellae; here it captures photons (excitons) for 
PSI, speeding the oxidation of PQH 2 and reversing the 
imbalance between electron flow in PSI and PSII. In less 
intense light (in the shade, with more red light), PSI ox- 
idizes PQH 2 faster than PSII can make it, and the result- 
ing increase in [PQ] triggers dephosphorylation of 
LHCII, reversing the effect of phosphorylation. 

The state transition in LCHII localization is mutually 
regulated with the transition from cyclic to noncyclic 
photophosphorylation, described above; the path of 
electrons is primarily noncyclic in state 1 and primarily 
cyclic in state 2. 

Water Is Split by the Oxygen-Evolving Complex 

The ultimate source of the electrons passed to NADPH in 
plant (oxygenic) photosynthesis is water. Having given up 
an electron to pheophytin, P680 + (of PSII) must acquire 
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FIGURE 19-60 Localization of PSI and PSN in thylakoid membranes. 

(a) The structures of the complexes and soluble proteins of the pho- 
tosynthetic apparatus of a vascular plant or alga, drawn to the same 
scale. The PDB ID for the structure of PSII is 2AXT; for PSI, 1QZV; for 
cytochrome b 6 f, 2E74; for plastocyanin, 1 AG6; for ferredoxin, 1A70; 
for ferredoxin:NADP + reductase, 1QG0. The ATP synthase structure 
shown is a composite of that from yeast mitochondria (PDB ID 
1Q01) and bovine mitochondria (PDB ID 1 BMF). (b) Light-harvest- 



ing complex LHCII and ATP synthase are located both in appressed 
regions of the thylakoid membrane (granal lamellae, in which several 
membranes are in contact) and in nonappressed regions (stromal 
lamellae), and have ready access to ADP and NADP + in the stroma. 
PSII is present almost exclusively in the appressed regions, and PSI 
almost exclusively in nonappressed regions, exposed to the stroma. 
LHCII is the "adhesive" that holds appressed lamellae together (see 
Fig. 19-61). 



FIGURE 19-61 Balancing of electron flow in PSI and PSII by state 
transition. A hydrophobic domain of LHCII in one thylakoid lamella 
inserts into the neighboring lamella and closely appresses the two 
membranes (state 1 ). Accumulation of plastoquinol (not shown) stimu- 
lates a protein kinase that phosphorylates a Thr residue in the hy- 
drophobic domain of LHCII, which reduces its affinity for the 
neighboring thylakoid membrane and converts appressed regions to 
nonappressed regions (state 2). A specific protein phosphatase reverses 
this regulatory phosphorylation when the [PQ]/[PQH 2 ] ratio increases. 
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an electron to return to its ground state in preparation for 
capture of another photon. In principle , the required elec- 
tron might come from any number of organic or inorganic 
compounds. Photosynthetic bacteria use a variety of elec- 
tron donors for this purpose — acetate, succinate, malate, 



or sulfide — depending on what is available in a particular 
ecological niche. About 3 billion years ago, evolution of 
primitive photosynthetic bacteria (the progenitors of the 
modern cyanobacteria) produced a photosystem capable 
of taking electrons from a donor that is always available — 
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FIGURE 19-62 Water-splitting activity of the oxygen-evolving complex. 

Shown here is the process that produces a four-electron oxidizing 
agent — a multinuclear center with several Mn ions — in the water-splitting 
complex of PS1I. The sequential absorption of four photons (excitons), 

water. Two water molecules are split, yielding four elec- 
trons, four protons, and molecular oxygen: 

2H 2 0 ► 4H + + 4e~ + 0 2 

A single photon of visible light does not have enough en- 
ergy to break the bonds in water; four photons are re- 
quired in this photolytic cleavage reaction. 

The four electrons abstracted from water do not 
pass directly to P680 + , which can accept only one elec- 
tron at a time. Instead, a remarkable molecular device, 
the oxygen-evolving complex (also called the water- 
splitting complex), passes four electrons one at a 
time to P680 + (Fig. 19-62). The immediate electron 
donor to P680 + is a Tyr residue (often designated Z or 
Tyr z ) in subunit Dl of the PSII reaction center. The Tyr 
residue loses both a proton and an electron, generating 
the electrically neutral Tyr free radical, 'Tyr: 

4P680 + + 4 Tyr ► 4P680 + 4 'Tyr (19-13) 

The Tyr radical regains its missing electron and proton by 
oxidizing a cluster of four manganese ions in the water- 
splitting complex. With each single-electron transfer, the Mn 
cluster becomes more oxidized; four single-electron trans- 
fers, each corresponding to the absorption of one photon, 
produce a charge of 4+ on the Mn complex (Fig. 19-62): 

4 *Tyr 4- [Mn complex] 0 ► 4 Tyr + [Mncomplex] 4+ (19-14) 

In this state, the Mn complex can take four electrons 
from a pair of water molecules, releasing 4 H + and 0 2 : 

[Mn complex] 4 + + 2H 2 0 > 

[Mn complex] 0 + 4H + + 0 2 (19-15) 

Because the four protons produced in this reaction are 
released into the thylakoid lumen, the oxygen-evolving 
complex acts as a proton pump, driven by electron 
transfer. The sum of Equations 19-12 through 19-15 is 



2H 2 0 + 2PQ B + 4 photons 



0 2 + 2PQ B H 2 (19-16) 



The oxygen-evolving complex is associated with a 
peripheral membrane protein (M r 33,000) on the lume- 
nal side of the thylakoid membrane that stabilizes the 
cluster of four Mn ions (in various oxidation states), 
one Ca 2+ ion, five 0 atoms, and a CI" ion, with precise 
geometry. The chemical changes that take place in this 
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each absorption causing the loss of one electron from the Mn center, pro- 
duces an oxidizing agent that can remove four electrons from two mole- 
cules of water, producing 0 2 . The electrons lost from the Mn center pass 
one at a time to an oxidized Tyr residue in a PSII protein, then to P680 + . 

cluster are not fully understood, but this chemistry is 
essential to life on Earth and of great interest both for 
its biological significance and as a challenge in bioinor- 
ganic chemistry. Manganese can exist in stable oxida- 
tion states from 2+ to 7 +, so a cluster of Mn ions can 
certainly donate or accept four electrons. Determina- 
tion of the structure of the polymetallic center has in- 
spired several reasonable and testable hypotheses. 
Stay tuned. 
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The Central Photochemical 
Event: Light-Driven 
Electron Flow 



Bacteria have a single reaction center; in purple 
bacteria, it is of the pheophytin-quinone type, and 
in green sulfur bacteria, the Fe-S type. 

Structural studies of the reaction center of a purple 
bacterium have provided information about 
light-driven electron flow from an excited special 
pair of chlorophyll molecules, through pheophytin, 
to quinones. Electrons then pass from quinones 
through the cytochrome bc x complex, and back to 
the photoreaction center. 

An alternative path, in green sulfur bacteria, sends 
electrons from reduced quinones to NAD + . 

Cyanobacteria and plants have two different 
photoreaction centers, arranged in tandem. 

Plant photosystem I passes electrons from its 
excited reaction center, P700, through a series of 
carriers to ferredoxin, which then reduces NADP + 
to NADPH. 

The reaction center of plant photosystem II, P680, 
passes electrons to plastoquinone, and the 
electrons lost from P680 are replaced by electrons 
from H 2 0 (electron donors other than H 2 0 are used 
in other organisms). 

Flow of electrons through the photosystems 
produces NADPH and ATP, in the ratio of about 
2:3. A second type of electron flow (cyclic flow) 
produces ATP only and allows variability in the 
proportions of NADPH and ATP formed. 
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■ The localization of PSI and PSII between the granal 
and stromal lamellae can change and is indirectly 
controlled by light intensity, optimizing the 
distribution of excitons between PSI and PSII 

for efficient energy capture. 

■ The light-driven splitting of H 2 0 is catalyzed by a 
Mn-containing protein complex; 0 2 is produced. 
The reduced plastoquinone carries electrons to the 
cytochrome b G f complex; from here they pass to 
plastocyanin, and then to P700 to replace those 
lost during its photoexcitation. 

■ Electron flow through the cytochrome b 6 f complex 
drives protons across the plasma membrane, 
creating a proton-motive force that provides the 
energy for ATP synthesis by an ATP synthase. 

1 9.9 ATP Synthesis by Photophosphorylation 

The combined activities of the two plant photosystems 
move electrons from water to NADP + , conserving some of 
the energy of absorbed light as NADPH (Fig. 19-56). Si- 
multaneously, protons are pumped across the thylakoid 
membrane and energy is conserved as an electrochemical 
potential. We turn now to the process by which this proton 
gradient drives the synthesis of ATP, the other energy- 
conserving product of the light-dependent reactions. 

In 1954 Daniel Arnon and his colleagues discovered 
that ATP is generated from ADP and P^ during photo- 
synthetic electron transfer in illuminated spinach 
chloroplasts. Support for these findings came from the 
work of Albert Frenkel, who detected light-dependent 
ATP production in pigment-containing membranous 




Light 



structures called chromatophores, 

derived from photosynthetic bacte- 
ria. Investigators concluded that 
some of the light energy captured by 
the photosynthetic systems of these 
organisms is transformed into the 
phosphate bond energy of ATP. This 
process is called photophosphory- 
lation, to distinguish it from oxida- 
tive phosphorylation in respiring 
mitochondria. 




Daniel Arnon, 
1910-1 994 



A Proton Gradient Couples Electron Flow and 
Phosphorylation 

Several properties of photosynthetic electron transfer 
and photophosphorylation in chloroplasts indicate that 
a proton gradient plays the same role as in mitochond- 
rial oxidative phosphorylation. (1) The reaction centers, 
electron carriers, and ATP-forming enzymes are located 
in a proton-impermeable membrane — the thylakoid 
membrane — which must be intact to support pho- 
tophosphorylation. (2) Photophosphorylation can be 
uncoupled from electron flow by reagents that promote 
the passage of protons through the thylakoid mem- 
brane. (3) Photophosphorylation can be blocked by ven- 
turicidin and similar agents that inhibit the formation of 
ATP from ADP and Pi by the mitochondrial ATP syn- 
thase (Table 19-4). (4) ATP synthesis is catalyzed by 
F 0 F! complexes, located on the outer surface of the thy- 
lakoid membranes, that are very similar in structure and 
function to the F 0 F! complexes of mitochondria. 

Electron-transferring molecules in the chain of carriers 
connecting PSII and PSI are oriented asymmetrically in the 
thylakoid membrane, so photoinduced electron flow results 
in the net movement of protons across the membrane, from 
the stromal side to the thylakoid lumen (Fig. 19-63). In 
1966 Andre Jagendorf showed that a pH gradient across the 
thylakoid membrane (alkaline outside) could furnish the 
driving force to generate ATP. His early observations pro- 
vided some of the most important experimental evidence in 
support of Mitchell's chemiosmotic hypothesis. 
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FIGURE 19-63 Proton and electron circuits in thy- 
lakoids. Electrons (blue arrows) move from H 2 0 
through PSII, through the intermediate chain of carri- 
ers, through PSI, and finally to NADP + . Protons (red 
arrows) are pumped into the thylakoid lumen by the 
flow of electrons through the carriers linking PSII and 
PSI, and reenter the stroma through proton channels 
formed by the F 0 (designated CF G ) of ATP synthase. 
The Ft subunit (CF-,) catalyzes synthesis of ATP. 
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Jagendorf incubated chloro 
plasts, in the dark, in a pH 4 
buffer; the buffer slowly pene- 
trated into the inner compart- 
ment of the thylakoids, lowering 
their internal pH. He added ADP 
::: ^ ^ and Pi to the dark suspension of 

wk ^tete*. chloroplasts and then suddenly 

■ raised the pH of the outer 
medium to 8, momentarily creat- 

Andre Jagendorf ■, TT v , 

b mg a large pH gradient across the 

membrane. As protons moved out of the thylakoids into the 
medium, ATP was generated from ADP and P t . Because the 
formation of ATP occurred in the dark (with no input of en- 
ergy from light), this experiment showed that a pH gradient 
across the membrane is a high-energy state that, as in mito- 
chondrial oxidative phosphorylation, can mediate the trans- 
duction of energy from electron transfer into the chemical 
energy of ATP 

The Approximate Stoichiometry of Photophosphorylation 
Has Been Established 

As electrons move from water to NADP + in plant chloro- 
plasts, about 12 H + move from the stroma to the thy- 
lakoid lumen per four electrons passed (that is, per 0 2 
formed). Four of these protons are moved by the oxygen- 
evolving complex, and up to eight by the cytochrome b 6 f 
complex. The measurable result is a 1,000-fold difference 
in proton concentration across the thylakoid membrane 
(ApH = 3). Recall that the energy stored in a proton gra- 
dient (the electrochemical potential) has two compo- 
nents: a proton concentration difference (ApH) and an 
electrical potential (At//) due to charge separation. In 
chloroplasts, ApH is the dominant component; counter- 
ion movement apparently dissipates most of the electrical 
potential. In illuminated chloroplasts, the energy stored 
in the proton gradient per mole of protons is 

AG = 2.3i?TApH + ZJAijj = -17 kJ/mol 

so the movement of 12 mol of protons across the thylakoid 
membrane represents conservation of about 200 kJ of 



energy — enough energy to drive the synthesis of several 
moles of ATP (AG'° = 30.5 kJ/mol). Experimental meas- 
urements yield values of about 3 ATP per 0 2 produced. 

At least eight photons must be absorbed to drive 
four electrons from H 2 0 to NADPH (one photon per 
electron at each reaction center). The energy in eight 
photons of visible light is more than enough for the syn- 
thesis of three molecules of ATP. 

ATP synthesis is not the only energy-conserving re- 
action of photosynthesis in plants; the NADPH formed 
in the final electron transfer is also energetically rich. 
The overall equation for noncyclic photophosphoryla- 
tion (a term explained below) is 

2H 2 0 + 8 photons + 2NADP + + —3 ADP + ~3P, > 

0 2 + -3 ATP + 2NADPH (19-17) 

The ATP Synthase of Chloroplasts Is Like 
That of Mitochondria 

The enzyme responsible for ATP synthesis in chloro- 
plasts is a large complex with two functional compo- 
nents, CF 0 and GFi (C denoting its location in 
chloroplasts). CF G is a transmembrane proton pore com- 
posed of several integral membrane proteins and is ho- 
mologous to mitochondrial F Q . GF 1 is a peripheral 
membrane protein complex very similar in subunit com- 
position, structure, and function to mitochondrial F x . 

Electron microscopy of sectioned chloroplasts 
shows ATP synthase complexes as knoblike projections 
on the outside (stromal, or n) surface of thylakoid mem- 
branes; these complexes correspond to the ATP syn- 
thase complexes seen to project on the inside (matrix, 
or n) surface of the inner mitochondrial membrane. 
Thus the relationship between the orientation of the 
ATP synthase and the direction of proton pumping is the 
same in chloroplasts and mitochondria. In both cases, 
the F x portion of ATP synthase is located on the more al- 
kaline (n) side of the membrane through which protons 
flow down their concentration gradient; the direction of 
proton flow relative to F x is the same in both cases: p to N 
(Fig. 19-64). 
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FIGURE 19-64 Comparison of the topology of pro- 
ton movement and ATP synthase orientation in the 
membranes of mitochondria, chloroplasts, and the 
bacterium E. coli. In each case, orientation of the 
proton gradient relative to ATP synthase activity is 
the same, 
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The mechanism of chloroplast ATP synthase is also 
believed to be essentially identical to that of its mito- 
chondrial analog; ADP and P 4 readily condense to form 
ATP on the enzyme surface, and the release of this 
enzyme-bound ATP requires a proton-motive force. 
Rotational catalysis sequentially engages each of the 
three subunits of the ATP synthase in ATP synthesis, 
ATP release, and ADP + P 4 binding (Figs 19-26, 19-27). 

SUMMARY 19.9 ATP Synthesis by 

Photophosphorylation 

■ In plants, both the water-splitting reaction and 
electron flow through the cytochrome b 6 f complex 
are accompanied by proton pumping across the 
thylakoid membrane. The proton-motive force thus 
created drives ATP synthesis by a CF 0 CF! complex 
similar to the mitochondrial F^ complex. 

■ The ATP synthase of chloroplasts (CF^F^ is very 
similar in both structure and catalytic mechanism 
to the ATP synthases of mitochondria and bacteria. 
Physical rotation driven by the proton gradient is 
accompanied by ATP synthesis at sites that cycle 
through three conformations, one with high affinity 
for ATP, one with high affinity for ADP 4- P b and 
one with low affinity for both nucleotides. 

19.10 The Evolution of Oxygenic 
Photosynthesis 

The appearance of oxygenic photosynthesis on Earth 
about 2.5 billion years ago was a crucial event in the 
evolution of the biosphere. Until then, the earth had 
been essentially devoid of molecular oxygen and lacked 
the ozone layer that protects living organisms from solar 
UV radiation. Oxygenic photosynthesis made available a 
nearly limitless supply of reducing agent to drive the pro- 
duction of organic compounds by reductive biosynthetic 
reactions. And mechanisms evolved that allowed organ- 
isms to use 0 2 as a terminal electron acceptor in highly 
energetic electron transfers from organic substrates, em- 
ploying the energy of oxidation to support their metabo- 
lism. The complex photosynthetic apparatus of a modern 
vascular plant is the culmination of a series of evolution- 
ary events, the most recent of which was the acquisition 
by eukaryotic cells of a cyanobacterial endosymbiont. 

Chloroplasts Evolved from Ancient Photosynthetic Bacteria 

Chloroplasts in modern organisms resemble mito- 
chondria in several properties, and are believed to 
have originated by the same mechanism that gave rise 
to mitochondria: endosymbiosis. Like mitochondria, 
chloroplasts contain their own DNA and protein-syn- 
thesizing machinery. Some of the polypeptides of 
chloroplast proteins are encoded by chloroplast genes 
and synthesized in the chloroplast; others are encoded 
by nuclear genes, synthesized outside the chloroplast, 



and imported (Chapter 27). When plant cells grow and 
divide, chloroplasts give rise to new chloroplasts by di- 
vision, during which their DNA is replicated and di- 
vided between daughter chloroplasts. The machinery 
and mechanism for light capture, electron flow, and 
ATP synthesis in modern cyanobacteria are similar in 
many respects to those in plant chloroplasts. These 
observations led to the now widely accepted hypothe- 
sis that the evolutionary progenitors of modern plant 
cells were primitive eukaryotes that engulfed photo- 
synthetic cyanobacteria and established stable en- 
dosymbiotic relationships with them (see Fig. 1-36). 

At least half of the photosynthetic activity on Earth 
now occurs in microorganisms — algae, other photosyn- 
thetic eukaryotes, and photosynthetic bacteria. 
Cyanobacteria have PSII and PSI in tandem, and the 
PSII has an associated water-splitting activity resem- 
bling that of plants. However, the other groups of photo- 
synthetic bacteria have single reaction centers and do 
not split H 2 0 or produce 0 2 . Many are obligate anaer- 
obes and cannot tolerate 0 2 ; they must use some com- 
pound other than H 2 0 as electron donor. Some 
photosynthetic bacteria use inorganic compounds as 
electron (and hydrogen) donors. For example, green 
sulfur bacteria use hydrogen sulfide: 

2H 2 S + C0 2 (CH 2 0) + H 2 0 + 2S 

These bacteria, instead of producing molecular 0 2 , form 
elemental sulfur as the oxidation product of H 2 S. (They 
further oxidize the S to SOl~.) Other photosynthetic 
bacteria use organic compounds such as lactate as elec- 
tron donors: 

2 Lactate + C0 2 (CH 2 0) + H 2 0 + 2 pyruvate 

The fundamental similarity of photosynthesis in plants 
and bacteria, despite the differences in the electron 
donors they employ, becomes more obvious when the 
equation of photosynthesis is written in the more gen- 
eral form 

2H 2 D + C0 2 (CH 2 0) + H 2 0 + 2D 

in which H 2 D is an electron (and hydrogen) donor and D 
is its oxidized form. H 2 D may be water, hydrogen sulfide, 
lactate, or some other organic compound, depending on 
the species. Most likely, the bacteria that first developed 
photosynthetic ability used H 2 S as their electron source. 

The ancient relatives of modern cyanobacteria 
probably arose by the combination of genetic material 
from two types of photosynthetic bacteria, with systems 
of the type seen in modern purple bacteria (with a PSII- 
like electron path) and green sulfur bacteria (with an 
electron path resembling that in PSI). The bacterium 
with two independent photosystems may have used one 
in one set of conditions, the other in different condi- 
tions. Over time, a mechanism to connect the two pho- 
tosystems for simultaneous use evolved, and the 
PSII-like system acquired the water-splitting capacity 
found in modern cyanobacteria. 
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Modern cyanobacteria can synthesize ATP by oxida- 
tive phosphorylation or by photophosphorylation, al- 
though they have neither mitochondria nor chloroplasts. 
The enzymatic machinery for both processes is in a highly 
convoluted plasma membrane (see Fig. 1-6). Three pro- 
tein components function in both processes, giving evi- 
dence that the processes have a common evolutionary 
origin (Fig. 19-65). First, the proton-pumping cy- 
tochrome b 6 f complex carries electrons from plasto- 
quinone to cytochrome c 6 in photosynthesis, and also 
carries electrons from ubiquinone to cytochrome c 6 in 
oxidative phosphorylation — the role played by cy- 
tochrome be i in mitochondria. Second, cytochrome c 6 , 
homologous to mitochondrial cytochrome c, carries elec- 
trons from Complex III to Complex IV in cyanobacteria; 
it can also carry electrons from the cytochrome fr 6 /com- 
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FIGURE 19-65 Dual roles of cytochrome b 6 f and cytochrome c 6 in 
cyanobacteria reflect evolutionary origins. Cyanobacteria use cy- 
tochrome b e f f cytochrome c 6/ and plastoquinone for both oxidative 
phosphorylation and photophosphorylation. (a) In photophosphoryla- 
tion, electrons flow (top to bottom) from water to NADP + . (b) In oxida- 
tive phosphorylation, electrons flow from NADH to 0 2 . Both 
processes are accompanied by proton movement across the mem- 
brane, accomplished by a Q cycle. 



plex to PSI — a role performed in plants by plastocyanin. 
We therefore see the functional homology between the 
cyanobacterial cytochrome b 6 f complex and the mito- 
chondrial cytochrome bc Y complex, and between 
cyanobacterial cytochrome c 6 and plant plastocyanin. 
The third conserved component is the ATP synthase, 
which functions in oxidative phosphorylation and pho- 
tophosphorylation in cyanobacteria, and in the mito- 
chondria and chloroplasts of photosynthetic eukaryotes. 
The structure and remarkable mechanism of this enzyme 
have been strongly conserved throughout evolution. 

In Halobacterium, a Single Protein Absorbs Light and 
Pumps Protons to Drive ATP Synthesis 

In some modern archaea, a quite different mechanism 
for converting the energy of light into an electrochemical 
gradient has evolved. The halophilic ("salt-loving") ar- 
chaean Halobacterium salinarum is descended from 
ancient evolutionary progenitors. This archaean (com- 
monly referred to as a halobacterium) lives only in brine 
ponds and salt lakes (Great Salt Lake and the Dead Sea, 
for example) , where the high salt concentration — which 
can exceed 4 m — results from water loss by evaporation; 
indeed, halobacteria cannot live in NaCl concentrations 
lower than 3 m. These organisms are aerobes and nor- 
mally use 0 2 to oxidize organic fuel molecules. However, 
the solubility of 0 2 is so low in brine ponds that some- 
times oxidative metabolism must be supplemented by 
sunlight as an alternative source of energy. 

The plasma membrane of H. salinarum contains 
patches of the light-absorbing pigment bacteri- 
orhodopsin, which contains retinal (the aldehyde deriv- 
ative of vitamin A; see Fig. 10-21) as a light-harvesting 
prosthetic group. When the cells are illuminated, all- 
£ra?zs-retinal bound to the bacteriorhodopsin absorbs 
a photon and undergoes photoisomerization to 13-cis- 
retinal, forcing a conformational change in the protein. 
The restoration of all-£rans-retinal is accompanied by 
the outward movement of protons through the plasma 
membrane. Bacteriorhodopsin, with only 247 amino acid 
residues, is the simplest light-driven proton pump 
known. The difference in the three-dimensional struc- 
ture of bacteriorhodopsin in the dark and after illumina- 
tion (Fig. 19»66a) suggests a pathway by which a 
concerted series of proton "hops" could effectively move 
a proton across the membrane. The chromophore retinal 
is bound through a Schiff-base linkage to the e-amino 
group of a Lys residue. In the dark, the nitrogen of this 
Schiff base is protonated; in the light, photoisomerization 
of retinal lowers the piT a of this group and it releases its 
proton to a nearby Asp residue, triggering a series of pro- 
ton hops that ultimately result in the release of a proton 
at the outer surface of the membrane (Fig. 19-66b). 

The electrochemical potential across the membrane 
drives protons back into the cell through a membrane 
ATP synthase complex very similar to that of mitochon- 
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FIGURE 19-66 Evolution of a second mechanism for light-driven proton 
pumping in a halophilic archaean. (a) Bacteriorhodopsin (M T 26,000) of 
Halobacterium halobium has seven membrane-spanning a helices (PDB 
ID 1C8R). The chromophore all-trans-retinal (purple) is covalently at- 
tached via a Schiff base to the s-amino group of a Lys residue deep in the 
membrane interior. Running through the protein are a series of Asp and 
Glu residues and a series of closely associated water molecules that to- 
gether provide the transmembrane path for protons (red arrows). Steps ® 
through (5) indicate proton movements, described below. 

(b) In the dark (left panel), the Schiff base is protonated. Illumina- 
tion (right panel) photoisomerizes the retinal, forcing subtle conforma- 
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tional changes in the protein that alter the distance between the Schiff 
base and its neighboring amino acid residues. Interaction with these 
neighbors lowers the pK a of the protonated Schiff base, and the base 
gives up its proton to a nearby carboxyl group on Asp 85 (step © in (a)). 
This initiates a series of concerted proton hops between water molecules 
(see Fig. 2-13) in the interior of the protein, which ends with (2) the 
release of a proton that was shared by Glu 194 and Glu 204 near the 
extracellular surface. (3) The Schiff base reacquires a proton from 
Asp 96 , which 0 takes up a proton from the cytosol. (5) Finally, Asp 85 
gives up its proton, leading to reprotonation of the Glu 204 -Glu 194 pair. 
The system is now ready for another round of proton pumping. 
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dria and chloroplasts. Thus, when 0 2 is limited, halobac- 
teria can use light to supplement the ATP synthesized 
by oxidative phosphorylation. Halobacteria do not evolve 
0 2 , nor do they carry out photoreduction of NADP + ; 
their phototransducing machinery is therefore much 
simpler than that of cyanobacteria or plants. Neverthe- 
less, its proton-pumping mechanism may prove to be 
prototypical for the many other, more complex, ion 
pumps. S Bacteriorhodopsin 
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The Evolution of 
Oxygenic Photosynthesis 



Modern cyanobacteria are derived from an ancient 
organism that acquired two photosystems, one of 
the type now found in purple bacteria, the other 
of the type found in green sulfur bacteria. 

Many photosynthetic microorganisms obtain 
electrons for photosynthesis not from water but 
from donors such as H 2 S. 

Cyanobacteria with the tandem photosystems and a 
water-splitting activity that released oxygen into 
the atmosphere appeared on Earth about 2.5 billion 
years ago. 

Chloroplasts, like mitochondria, evolved from 
bacteria living endosymbiotically in early 
eukaryotic cells. The ATP synthases of bacteria, 
cyanobacteria, mitochondria, and chloroplasts 
share a common evolutionary precursor and a 
common enzymatic mechanism. 

An entirely different mechanism for converting 
light energy to a proton gradient has evolved in the 
modern archaea, in which the light-harvesting 
pigment is retinal. 
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2. All Parts of Ubiquinone Have a Function In electron 
transfer, only the quinone portion of ubiquinone undergoes 
oxidation-reduction; the isoprenoid side chain remains un- 
changed. What is the function of this chain? 

3. Use of FAD Rather Than NAD + in Succinate Oxida- 
tion All the dehydrogenases of glycolysis and the citric acid 
cycle use NAD + (E'° for NAD+/NADH is -0.32 V) as electron 
acceptor except succinate dehydrogenase, which uses cova- 
lently bound FAD (E'° for FAD/FADH 2 in this enzyme is 0.050 V) . 
Suggest why FAD is a more appropriate electron acceptor than 
NAD + in the dehydrogenation of succinate, based on the E'° 
values of fumarate/succinate (E'° = 0.031), NAD + /NADH, and 
the succinate dehydrogenase FAD/FADH 2 . 

4. Degree of Reduction of Electron Carriers in the Res- 
piratory Chain The degree of reduction of each carrier in 
the respiratory chain is determined by conditions in the mito- 
chondrion. For example, when NADH and 0 2 are abundant, 
the steady-state degree of reduction of the carriers decreases 
as electrons pass from the substrate to 0 2 . When electron 
transfer is blocked, the carriers before the block become more 
reduced and those beyond the block become more oxidized 
(see Fig. 19-6). For each of the conditions below, predict the 



state of oxidation of ubiquinone and cytochromes 6, c ls c, and 
a + a 3 . 

(a) Abundant NADH and 0 2 , but cyanide added 

(b) Abundant NADH, but 0 2 exhausted 

(c) Abundant 0 2 , but NADH exhausted 

(d) Abundant NADH and 0 2 

5. Effect of Rotenone and Antimycin A on Electron 
Transfer Rotenone, a toxic natural product from plants, 
strongly inhibits NADH dehydrogenase of insect and fish mito- 
chondria. Antimycin A, a toxic antibiotic, strongly inhibits the 
oxidation of ubiquinol. 

(a) Explain why rotenone ingestion is lethal to some in- 
sect and fish species. 

(b) Explain why antimycin A is a poison. 

(c) Given that rotenone and antimycin A are equally effec- 
tive in blocking their respective sites in the electron-transfer 
chain, which would be a more potent poison? Explain. 

6. Uncouplers of Oxidative Phosphorylation In normal 
mitochondria the rate of electron transfer is tightly coupled to 
the demand for ATP. When the rate of use of ATP is relatively 
low, the rate of electron transfer is low; when demand for ATP 
increases, electron-transfer rate increases. Under these condi- 
tions of tight coupling, the number of ATP molecules produced 
per atom of oxygen consumed when NADH is the electron 
donor — the P/O ratio — is about 2.5. 

(a) Predict the effect of a relatively low and a relatively 
high concentration of uncoupling agent on the rate of electron 
transfer and the P/O ratio. 

(b) Ingestion of uncouplers causes profuse sweating and 
an increase in body temperature. Explain this phenomenon in 
molecular terms. What happens to the P/O ratio in the pres- 
ence of uncouplers? 

(c) The uncoupler 2,4-dinitrophenol was once prescribed 
as a weight-reducing drug. How could this agent, in principle, 
serve as a weight-reducing aid? Uncoupling agents are no 
longer prescribed, because some deaths occurred following 
their use. Why might the ingestion of uncouplers lead to death? 

7. Effects of Valinomycin on Oxidative Phosphorylation 

When the antibiotic valinomycin is added to actively respiring 
mitochondria, several things happen: the yield of ATP 
decreases, the rate of 0 2 consumption increases, heat is re- 
leased, and the pH gradient across the inner mitochondrial 
membrane increases. Does valinomycin act as an uncoupler or 
as an inhibitor of oxidative phosphorylation? Explain the ex- 
perimental observations in terms of the antibiotic's ability to 
transfer K + ions across the inner mitochondrial membrane. 

8. Mode of Action of Dicyclohexylcarbodiimide (DCCD) 

When DCCD is added to a suspension of tightly coupled, ac- 
tively respiring mitochondria, the rate of electron transfer 
(measured by 0 2 consumption) and the rate of ATP production 
dramatically decrease. If a solution of 2,4-dinitrophenol is now 
added to the preparation, 0 2 consumption returns to normal 
but ATP production remains inhibited. 

(a) What process in electron transfer or oxidative 
phosphorylation is affected by DCCD? 
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(b) Why does DCCD affect the 0 2 consumption of 
mitochondria? Explain the effect of 2,4-dinitrophenol on the 
inhibited mitochondrial preparation. 

(c) Which of the following inhibitors does DCCD most re- 
semble in its action: antimycin A, rotenone, or oligomycin? 

9. Compartmentalization of Citric Acid Cycle Compo- 
nents Isocitrate dehydrogenase is found only in the mito- 
chondrion, but malate dehydrogenase is found in both the 
cytosol and mitochondrion. What is the role of cytosolic malate 
dehydrogenase? 

10. The Malate-a-Ketoglutarate Transport System The 

transport system that conveys malate and a-ketoglutarate 
across the inner mitochondrial membrane (see Fig. 19-29) is 
inhibited by ?z-butylmalonate. Suppose n-butylmalonate is 
added to an aerobic suspension of kidney cells using glucose 
exclusively as fuel Predict the effect of this inhibitor on 
(a) glycolysis, (b) oxygen consumption, (c) lactate formation, 
and (d) ATP synthesis. 

11. Cellular ADP Concentration Controls ATP Forma- 
tion Although both ADP and P { are required for the synthesis 
of ATP, the rate of synthesis depends mainly on the concentra- 
tion of ADP, not Pi. Why? 

12. Time Scales of Regulatory Events in Mitochondria 

Compare the likely time scales for the adjustments in respira- 
tory rate caused by (a) increased [ADP] and (b) reduced p0 2 . 
What accounts for the difference? 

13. The Pasteur Effect When 0 2 is added to an anaerobic 
suspension of cells consuming glucose at a high rate, the rate 
of glucose consumption declines greatly as the 0 2 is used up, 
and accumulation of lactate ceases. This effect, first observed 
by Louis Pasteur in the 1860s, is characteristic of most cells 
capable of both aerobic and anaerobic glucose catabolism. 

(a) Why does the accumulation of lactate cease after 0 2 is 
added? 

(b) Why does the presence of 0 2 decrease the rate of glu- 
cose consumption? 

(c) How does the onset of 0 2 consumption slow down the 
rate of glucose consumption? Explain in terms of specific 
enzymes. 

14. Respiration-Deficient Yeast Mutants and Ethanol 
Production Respiration-deficient yeast mutants (p~; "pe- 
tites") can be produced from wild-type parents by treatment 
with mutagenic agents. The mutants lack cytochrome oxidase, 
a deficit that markedly affects their metabolic behavior. One 
striking effect is that fermentation is not suppressed by 0 2 — 
that is, the mutants lack the Pasteur effect (see Problem 13). 
Some companies are very interested in using these mutants to 
ferment wood chips to ethanol for energy use. Explain the ad- 
vantages of using these mutants rather than wild-type yeast 
for large-scale ethanol production. Why does the absence of 
cytochrome oxidase eliminate the Pasteur effect? 

15. Advantages of Supercomplexes for Electron Trans- 
fer There is growing evidence that mitochondrial Complexes 
I, II, III, and IV are part of a larger supercomplex. What might 



be the advantage of having all four complexes within a super- 
complex? 

16. How Many Protons in a Mitochondrion? Electron 
transfer translocates protons from the mitochondrial matrix to 
the external medium, establishing a pH gradient across the in- 
ner membrane (outside more acidic than inside). The ten- 
dency of protons to diffuse back into the matrix is the driving 
force for ATP synthesis by ATP synthase. During oxidative 
phosphorylation by a suspension of mitochondria in a medium 
of pH 7.4, the pH of the matrix has been measured as 7.7. 

(a) Calculate [H + ] in the external medium and in the ma- 
trix under these conditions. 

(b) What is the outside-to-inside ratio of [H + ]? Comment 
on the energy inherent in this concentration difference. (Hint: 
See Eqnll^, p. 396.) 

(c) Calculate the number of protons in a respiring liver mi- 
tochondrion, assuming its inner matrix compartment is a 
sphere of diameter 1.5 jam. 

(d) From these data, is the pH gradient alone sufficient to 
generate ATP? 

(e) If not, suggest how the necessary energy for synthesis 
of ATP arises. 

17. Rate of ATP Turnover in Rat Heart Muscle Rat heart 
muscle operating aerobically fills more than 90% of its ATP needs 
by oxidative phosphorylation. Each gram of tissue consumes 0 2 
at the rate of 10.0 jumol/min, with glucose as the fuel source. 

(a) Calculate the rate at which the heart muscle consumes 
glucose and produces ATP. 

(b) For a steady-state concentration of ATP of 5.0 /^mol/g 
of heart muscle tissue, calculate the time required (in seconds) 
to completely turn over the cellular pool of ATP. What does this 
result indicate about the need for tight regulation of ATP pro- 
duction? (Note: Concentrations are expressed as micromoles 
per gram of muscle tissue because the tissue is mostly water.) 

18. Rate of ATP Breakdown in Insect Flight Muscle 

ATP production in the flight muscle of the fly Lucilia sericata 
results almost exclusively from oxidative phosphorylation. 
During flight, 187 mL of 0 2 /hr • g of body weight is needed to 
maintain an ATP concentration of 7.0 /imol/g of flight muscle. 
Assuming that flight muscle makes up 20% of the weight of the 
fly, calculate the rate at which the flight-muscle ATP pool turns 
over. How long would the reservoir of ATP last in the absence 
of oxidative phosphorylation? Assume that reducing equiva- 
lents are transferred by the glycerol 3-phosphate shuttle and 
that 0 2 is at 25 °C and 101.3 kPa (1 atm). 

ER| 19. Mitochondrial Disease and Cancer Mutations 
(jH in the genes that encode certain mitochondrial pro- 
teins are associated with a high incidence of some types of 
cancer. How might defective mitochondria lead to cancer? 

20. Variable Severity of a Mitochondrial Disease 

Kfl Individuals with a disease caused by a specific defect in 
the mitochondrial genome may have symptoms ranging from 
mild to severe. Explain why. 

21. Transmembrane Movement of Reducing Equiva- 
lents Under aerobic conditions, extramitochondrial NADH 
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must be oxidized by the mitochondrial electron-transfer chain. 
Consider a preparation of rat hepatocytes containing mito- 
chondria and all the cytosolic enzymes. If [4- 3 H]NADH is intro- 
duced, radioactivity soon appears in the mitochondrial matrix. 
However, if [7- 14 C]NADH is introduced, no radioactivity ap- 
pears in the matrix. What do these observations reveal about 
the oxidation of extramitochondrial NADH by the electron- 
transfer chain? 

O O 




[4- 3 H] NADH [7- 14 C] NADH 

I 22. High Blood Alanine Level Associated with 
Kfl Defects in Oxidative Phosphorylation Most indi- 
viduals with genetic defects in oxidative phosphorylation are 
found to have relatively high concentrations of alanine in their 
blood. Explain this in biochemical terms. 

23. NAD Pools and Dehydrogenase Activities Although 
both pyruvate dehydrogenase and glyceraldehyde 3-phosphate 
dehydrogenase use NAD + as their electron acceptor, the two 
enzymes do not compete for the same cellular NAD pool. Why? 

I 24. Diabetes as a Consequence of Mitochondrial 
HH Defects Glucokinase is essential in the metabolism of 
glucose in pancreatic /3 cells Humans with two defective 
copies of the glucokinase gene exhibit a severe, neonatal dia- 
betes, whereas those with only one defective copy of the gene 
have a much milder form of the disease (mature onset dia- 
betes of the young, MODY2), Explain this difference in terms 
of the biology of the f3 cell 

B9 25. Effects of Mutations in Mitochondrial Com- 
MlM plex II Single nucleotide changes in the gene for suc- 
cinate dehydrogenase (Complex II) are associated with 
midgut carcinoid tumors. Suggest a mechanism to explain this 
observation. 

26. Photochemical Efficiency of Light at Different 
Wavelengths The rate of photosynthesis, measured by 0 2 
production, is higher when a green plant is illuminated with 
light of wavelength 680 nm than with light of 700 nm. However, 
Olumination by a combination of light of 680 nm and 700 nm 
gives a higher rate of photosynthesis than light of either wave- 
length alone. Explain. 

27. Balance Sheet for Photosynthesis In 1804 Theodore 
de Saussure observed that the total weight of oxygen and dry 
organic matter produced by plants is greater than the weight 
of carbon dioxide consumed during photosynthesis. Where 
does the extra weight come from? 

28. Role of H 2 S in Some Photosynthetic Bacteria Illumi- 
nated purple sulfur bacteria carry out photosynthesis in the 
presence of H 2 0 and 14 C0 2 , but only if H 2 S is added and 0 2 is 
absent. During the course of photosynthesis, measured by for- 



mation of [ CJcarbohydrate, H 2 S is converted to elemental 
sulfur, but no 0 2 is evolved. What is the role of the conversion 
of H 2 S to sulfur? Why is no 0 2 evolved? 

29. Boosting the Reducing Power of Photosystem I by 
Light Absorption When photosystem I absorbs red light at 
700 nm, the standard reduction potential of P700 changes 
from 0.40 V to about -1.2 V. What fraction of the absorbed 
light is trapped in the form of reducing power? 

30. Electron Flow through Photosystems I and II Predict 
how an inhibitor of electron passage through pheophytin would 
affect electron flow through (a) photosystem II and (b) photo- 
system I. Explain your reasoning. 

31. Limited ATP Synthesis in the Dark In a laboratory ex- 
periment, spinach chloroplasts are illuminated in the absence 
of ADP and P i} then the light is turned off and ADP and Pi are 
added. ATP is synthesized for a short time in the dark. Explain 
this finding. 

32. Mode of Action of the Herbicide DCMU When 
chloroplasts are treated with 3-(3,4-dichlorophenyl)-l,l- 
dimethylurea (DCMU, or diuron), a potent herbicide, 0 2 evo- 
lution and photophosphorylation cease. Oxygen evolution, but 
not photophosphorylation, can be restored by addition of an 
external electron acceptor, or Hill reagent. How does DCMU 
act as a weed killer? Suggest a location for the inhibitory action 
of this herbicide in the scheme shown in Figure 19-56. 
Explain. 

33. Effect of Venturicidin on Oxygen Evolution Venturi- 
cidin is a powerful inhibitor of the chloroplast ATP synthase, 
interacting with the CF 0 part of the enzyme and blocking 
proton passage through the CF 0 CF! complex. How would 
venturicidin affect oxygen evolution in a suspension of well- 
illuminated chloroplasts? Would your answer change if the 
experiment were done in the presence of an uncoupling 
reagent such as 2,4-dinitrophenol (DNP)? Explain. 

34. Bioenergetics of Photophosphorylation The steady- 
state concentrations of ATP, ADP, and P : in isolated spinach 
chloroplasts under full illumination at pH 7.0 are 120.0, 6.0, 
and 700.0 /am, respectively 

(a) What is the free-energy requirement for the synthesis 
of 1 mol of ATP under these conditions? 

(b) The energy for ATP synthesis is furnished by light- 
induced electron transfer in the chloroplasts. What is the 
minimum voltage drop necessary (during transfer of a pair 
of electrons) to synthesize ATP under these conditions? 
(You may need to refer to Eqn 13-7, p, 515.) 

35. Light Energy for a Redox Reaction Suppose you have 
isolated a new photosynthetic microorganism that oxidizes 
H 2 S and passes the electrons to NAD + . What wavelength of 
light would provide enough energy for H 2 S to reduce NAD + 
under standard conditions? Assume 100% efficiency in the 
photochemical event, and use E'° of -243 mV for H 2 S and 
-320 mV for NAD + . See Figure 19-46 for the energy equiva- 
lents of wavelengths of light. 



36. Equilibrium Constant for Water-Splitting Reactions 

The coenzyme NADP + is the terminal electron acceptor in 
chloroplasts, according to the reaction 

2H 2 0 + 2NADP+ > 2NADPH + 2H + + 0 2 

Use the information in Table 19-2 to calculate the equilibrium 
constant for this reaction at 25 °C. (The relationship between 
ifeq and AG'° is discussed on p, 492.) How can the chloroplast 
overcome this unfavorable equilibrium? 

37. Energetics of Phototransduction During photosyn- 
thesis, eight photons must be absorbed (four by each photo- 
system) for every 0 2 molecule produced: 

2H 2 0 + 2NADP + + 8 photons > 2NADPH + 2H + + 0 2 

Assuming that these photons have a wavelength of 700 nm 
(red) and that the light absorption and use of light energy are 
100% efficient, calculate the free-energy change for the 
process. 

38. Electron Transfer to a Hill Reagent Isolated spinach 
chloroplasts evolve 0 2 when illuminated in the presence of 
potassium ferricyanide (a Hill reagent), according to the 
equation 

2H 2 0 + 4Fe 3+ ► 0 2 + 4H + + 4Fe 2+ 

where Fe 3+ represents ferricyanide and Fe 2+ , ferrocyanide. Is 
NADPH produced in this process? Explain. 

39. How Often Does a Chlorophyll Molecule Absorb a 
Photon? The amount of chlorophyll a (M T 892) in a spinach leaf 
is about 20 /xg/cm 2 of leaf surface. In noonday sunlight (average 
energy reaching the leaf is 5.4 J/cm 2 • min), the leaf absorbs 
about 50% of the radiation. How often does a single chlorophyll 
molecule absorb a photon? Given that the average lifetime of an 
excited chlorophyll molecule in vivo is 1 ns, what fraction of the 
chlorophyll molecules are excited at any one time? 

40. Effect of Monochromatic Light on Electron Flow 

The extent to which an electron carrier is oxidized or reduced 
during photosynthetic electron transfer can sometimes be ob- 
served directly with a spectrophotometer. When chloroplasts 
are illuminated with 700 nm light, cytochrome/, plastocyanin, 
and plastoquinone are oxidized. When chloroplasts are illumi- 
nated with 680 nm light, however, these electron carriers are 
reduced. Explain. 

41. Function of Cyclic Photophosphorylation When the 
[NADPH]/[NADP + ] ratio in chloroplasts is high, photophos- 
phorylation is predominantly cyclic (see Fig. 19-56). Is 0 2 
evolved during cyclic photophosphorylation? Is NADPH pro- 
duced? Explain. What is the main function of cyclic pho- 
tophosphorylation? 

Data Analysis Problem 

42. Photophosphorylation: Discovery, Rejection, and 
Rediscovery In the 1930s and 1940s, researchers were begin- 
ning to make progress toward understanding the mechanism of 
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photosynthesis. At the time, the role of "energy-rich phosphate 
bonds" (today, "ATP") in glycolysis and cellular respiration was 
just becoming known. There were many theories about the 
mechanism of photosynthesis, especially about the role of light. 
This problem focuses on what was then called the "primary 
photochemical process" — that is, on what it is, exactly, that the 
energy from captured light produces in the photosynthetic cell. 
Interestingly, one important part of the modern model of pho- 
tosynthesis was proposed early on, only to be rejected, ignored 
for several years, then finally revived and accepted. 

In 1944, Emerson, Stauffer, and Umbreit proposed that 
"the function of light energy in photosynthesis is the formation 
of 'energy-rich' phosphate bonds" (p. 107). In their model 
(hereafter, the "Emerson model"), the free energy necessary 
to drive both C0 2 fixation and reduction came from these 
"energy-rich phosphate bonds" (i.e., ATP), produced as a 
result of light absorption by a chlorophyll-containing protein. 

This model was explicitly rejected by Rabinowitch 
(1945). After summarizing Emerson and coauthors' findings, 
Rabinowitch stated: "Until more positive evidence is provided, 
we are inclined to consider as more convincing a general argu- 
ment against this hypothesis, which can be derived from energy 
considerations. Photosynthesis is eminently a problem of energy 
accumulation. What good can be served, then, by converting 
light quanta (even those of red light, which amount to about 43 
kcal per Einstein) into 'phosphate quanta' of only 10 kcal per 
mole? This appears to be a start in the wrong direction — toward 
dissipation rather than toward accumulation of energy" (Vol. I, 
p. 228). This argument, along with other evidence, led to the 
abandonment of the Emerson model until the 1950s, when it was 
found to be correct — albeit in a modified form. 

For each piece of information from Emerson and coau- 
thors' article presented in (a) through (d) below, answer the 
following three questions: 

1 . How does this information support the Emerson model, 
in which light energy is used directly by chlorophyll to 
make ATP, and the ATP then provides the energy to 
drive C0 2 fixation and reduction? 

2 . How would Rabinowitch explain this information, based on 
his model (and most other models of the day), in which light 
energy is used directly by chlorophyll to make reducing 
compounds? Rabinowitch wrote: "Theoretically, there is no 
reason why all electronic energy contained in molecules ex- 
cited by the absorption of light should not be available for oxi- 
dation-reduction" (Vol. I, p. 152). In this model, the reducing 
compounds are then used to fix and reduce C0 2 , and the 
energy for these reactions comes from the large amounts of 
free energy released by the reduction reactions. 

3. How is this information explained by our modern under- 
standing of photosynthesis? 

(a) Chlorophyll contains a Mg 2+ ion, which is known to be 
an essential cofactor for many enzymes that catalyze phospho- 
rylation and dephosphorylation reactions. 

(b) A crude "chlorophyll protein" isolated from photosyn- 
thetic cells showed phosphorylating activity. 



\j72_\ Oxidative Phosphorylation and Photophosphorylation 



(c) The phosphorylating activity of the "chlorophyll pro- 
tein" was inhibited by light. 

(d) The levels of several different phosphorylated com- 
pounds in photosynthetic cells changed dramatically in re- 
sponse to light exposure. (Emerson and coworkers were not 
able to identify the specific compounds involved.) 

As it turned out, the Emerson and Rabinowitch models 
were both partly correct and partly incorrect. 

(e) Explain how the two models relate to our current 
model of photosynthesis. 

In his rejection of the Emerson model, Rabinowitch 
went on to say: "The difficulty of the phosphate storage 
theory appears most clearly when one considers the fact 
that, in weak light, eight or ten quanta of light are sufficient 
to reduce one molecule of carbon dioxide. If each quantum 
should produce one molecule of high-energy phosphate, 



the accumulated energy would be only 80-100 kcal per 
Einstein — while photosynthesis requires at least 112 kcal 
per mole, and probably more, because of losses in irre- 
versible partial reactions" (Vol. 1, p. 228). 

(f) How does Rabinowitch's value of 8 to 10 photons per 
molecule of C0 2 reduced compare with the value accepted to- 
day? You need to consult Chapter 20 for some of the informa- 
tion required here. 

(g) How would you rebut Rabinowitch's argument, based 
on our current knowledge about photosynthesis? 
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... the discovery of the long-lived isotope of carbon, carbon-14, by 
Samuel Ruben and Martin Kamen in 1940 provided the ideal tool for 
the tracing of the route along which carbon dioxide travels on its way 
to carbohydrate. 

— Me!vin Calvin, Nobel Address, 1961 
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We have now reached a turning point in our study 
of cellular metabolism. Thus far in Part II we 
have described how the major metabolic fuels — 
carbohydrates, fatty acids, and amino acids — are de- 
graded through converging catabolic pathways to enter 
the citric acid cycle and yield their electrons to the res- 
piratory chain, and how this exergonic flow of electrons 
to oxygen is coupled to the endergonic synthesis of ATP. 
We now turn to anabolic pathways, which use chemical 
energy in the form of ATP and NADH or NADPH to syn- 
thesize cellular components from simple precursor mol- 
ecules. Anabolic pathways are generally reductive 
rather than oxidative. Catabolism and anabolism pro- 
ceed simultaneously in a dynamic steady state, so the 
energy-yielding degradation of cellular components is 
counterbalanced by biosynthetic processes, which cre- 
ate and maintain the intricate orderliness of living cells. 

Plants must be especially versatile in their handling 
of carbohydrates, for several reasons. First, plants are au- 
totrophs, able to convert inorganic carbon (as C0 2 ) into 
organic compounds. Second, biosynthesis occurs prima- 
rily in plastids, membrane-bounded organelles unique to 
photosynthetic organisms, and the movement of interme- 
diates between cellular compartments is an important as- 
pect of metabolism. Third, plants are not motile: they 
cannot move to find better supplies of water, sunlight, or 



nutrients. They must have sufficient metabolic flexibility 
to allow them to adapt to changing conditions in the place 
where they are rooted. Finally, plants have thick cell walls 
made of carbohydrate polymers, which must be assem- 
bled outside the plasma membrane and which constitute 
a significant proportion of the cell's carbohydrate. 

The chapter begins with a description of the process 
by which C0 2 is assimilated into trioses and hexoses, 
then considers photorespiration, an important side re- 
action during C0 2 fixation, and the ways in which cer- 
tain plants avoid this side reaction. We then look at how 
the biosynthesis of sucrose (for sugar transport) and 
starch (for energy storage) is accomplished by mech- 
anisms analogous to those employed by animal cells to 
make glycogen. The next topic is the synthesis of the 
cellulose of plant cell walls and the peptidoglycan of 
bacterial cell walls, illustrating the problems of energy- 
dependent biosynthesis outside the plasma membrane. 
Finally, we discuss how the various pathways that share 
pools of common intermediates are segregated within 
organelles yet integrated with one another. 

20.1 Photosynthetic Carbohydrate Synthesis 

The synthesis of carbohydrates in animal cells always 
employs precursors having at least three carbons, all of 
which are less oxidized than the carbon in C0 2 . Plants 
and photosynthetic microorganisms, by contrast, can 
synthesize carbohydrates from C0 2 and water, reducing 
C0 2 at the expense of the energy and reducing power fur- 
nished by the ATP and NADPH that are generated by the 
light-dependent reactions of photosynthesis (Fig. 20-1). 
Plants (and other autotrophs) can use C0 2 as the sole 
source of the carbon atoms required for the biosynthesis 
of cellulose and starch, lipids and proteins, and the 
many other organic components of plant cells. By con- 
trast, heterotrophs cannot bring about the net reduction 
of C0 2 to achieve a net synthesis of glucose. 

[773] 
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FIGURE 20-1 Assimilation of C0 2 into biomass in plants. The light-driven 
synthesis of ATP and NADPH, described in Chapter 19, provides energy and 
reducing power for the fixation of C0 2 into trioses, from which all the carbon- 
containing compounds of the plant cell are synthesized. The processes shown 
with red arrows are the focus of this chapter. 



Green plants contain in their chloroplasts unique en- 
zymatic machinery that catalyzes the conversion of C0 2 
to simple (reduced) organic compounds, a process 
called C0 2 assimilation. This process has also been 
called C0 2 fixation or carbon fixation, but we reserve 
these terms for the specific reaction in which C0 2 is in- 
corporated (fixed) into a three-carbon organic com- 
pound, the triose phosphate 3-phosphoglycerate. This 
simple product of photosynthesis is the precursor of 
more complex biomolecules, including sugars, polysac- 
charides, and the metabolites derived from them, all of 
which are synthesized by meta- 
JP bolic pathways similar to those 
of animal tissues. Carbon diox- 
ide is assimilated via a cyclic 
pathway, its key intermediates 
constantly regenerated. The 
pathway was elucidated in the 
early 1950s by Melvin Calvin, 
Andrew Benson, and James A. 
Bassham, and is often called the 
Calvin cycle or, more descrip- 
tively, the photosynthetic car- 
bon reduction cycle. 
Carbohydrate metabolism is more complex in plant 
cells than in animal cells or in nonphotosynthetic mi- 
croorganisms. In addition to the universal pathways of 
glycolysis and gluconeogenesis, plants have the unique 
reaction sequences for reduction of C0 2 to triose phos- 
phates and the associated reductive pentose phosphate 
pathway — all of which must be coordinately regulated 
to ensure proper allocation of carbon to energy produc- 
tion and synthesis of starch and sucrose. Key enzymes 
are regulated, as we shall see, by (1) reduction of disul- 
fide bonds by electrons flowing from photosystem I and 
(2) changes in pH and Mg 2+ concentration that result 
from illumination. When we look at other aspects of 
plant carbohydrate metabolism, we also find enzymes 
that are modulated by (3) conventional allosteric regu- 
lation by one or more metabolic intermediates and 
(4) covalent modification (phosphorylation). 




Melvin Calvin, 1911-1997 



Plastids Are Organelles Unique to Plant Cells and Algae 

Most of the biosynthetic activities in plants (including 
C0 2 assimilation) occur in plastids, a family of self- 
reproducing organelles bounded by a double membrane 
and containing a small genome that encodes some of 
their proteins. Most proteins destined for plastids are 
encoded in nuclear genes, which are transcribed and 
translated like other nuclear genes; then the proteins 
are imported into plastids. Plastids reproduce by binary 
fission, replicating their genome (a single circular DNA 
molecule) and using their own enzymes and ribosomes 
to synthesize the proteins encoded by that genome. 
Chloroplasts (see Fig. 19-45) are the sites of C0 2 
assimilation. The enzymes for this process are contained 
in the stroma, the soluble phase bounded by the inner 
chloroplast membrane. Amyloplasts are colorless plas- 
tids (that is, they lack chlorophyll and other pigments 
found in chloroplasts). They have no internal mem- 
branes analogous to the photosynthetic membranes 
(thylakoids) of chloroplasts, and in plant tissues rich 
in starch these plastids are packed with starch gran- 
ules (Fig. 20-2). Chloroplasts can be converted to 




FIGURE 20-2 Amyloplasts filled with starch (dark granules) are 
stained with iodine in this section of Ranunculus root cells. Starch 
granules in various tissues range from 1 to 100 ,am in diameter. 
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proplastids by the loss of their internal membranes 
and chlorophyll, and proplastids are interconvertible 
with amyloplasts (Fig. 20-3). In turn, both amyloplasts 
and proplastids can develop into chloroplasts. The rela- 
tive proportions of the plastid types depend on the type 
of plant tissue and on the intensity of illumination. Cells 
of green leaves are rich in chloroplasts, whereas amylo- 
plasts dominate in nonphotosynthetic tissues that store 
starch in large quantities, such as potato tubers. 

The inner membranes of all types of plastids are im- 
permeable to polar and charged molecules. Traffic 
across these membranes is mediated by sets of specific 
transporters. 

Carbon Dioxide Assimilation Occurs in Three Stages 

The first stage in the assimilation of C0 2 into biomole- 
cules (Fig. 20-4) is the carbon-fixation reaction: 

condensation of C0 2 with a five-carbon acceptor, 
ribulose 1 ,5-bisphosphate , to form two molecules 
of 3-phosphoglycerate. In the second stage, the 
3-phosphoglycerate is reduced to triose phosphates. 
Overall, three molecules of C0 2 are fixed to three mole- 
cules of ribulose 1 ,5-bisphosphate to form six mole- 
cules of glyceraldehyde 3-phosphate (18 carbons) in 
equilibrium with dihydroxyacetone phosphate. In the 
third stage, five of the six molecules of triose phosphate 
(15 carbons) are used to regenerate three molecules of 
ribulose 1 ,5-bisphosphate (15 carbons), the starting 
material. The sixth molecule of triose phosphate, the 
net product of photosynthesis, can be used to make 
hexoses for fuel and building materials, sucrose for 
transport to nonphotosynthetic tissues, or starch for 
storage. Thus the overall process is cyclical, with the 
continuous conversion of C0 2 to triose and hexose 
phosphates. Fructose 6-phosphate is a key intermedi- 
ate in stage 3 of C0 2 assimilation; it stands at a branch 
point, leading either to regeneration of ribu- 
lose 1 ,5-bisphosphate or to synthesis of 
starch. The pathway from hexose phosphate 
to pentose bisphosphate involves many of the 
same reactions used in animal cells for the 
conversion of pentose phosphates to hexose 
phosphates during the nonoxidative phase of 
the pentose phosphate pathway (see Fig. 
14-22). In the photosynthetic assimilation of 
C0 2 , essentially the same set of reactions op- 
erates in the other direction, converting hex- 
ose phosphates to pentose phosphates. This 




FIGURE 20-3 Plastids: their origins and interconversions. All types of 
plastids are bounded by a double membrane, and some (notably the 
mature chloroplast) have extensive internal membranes. The internal 
membranes can be lost (when a mature chloroplast becomes a pro- 
plastid) and resynthesized (as a proplastid gives rise to a pregranal 
plastid and then a mature chloroplast). Proplastids in nonphotosyn- 
thetic tissues (such as root) give rise to amyloplasts, which contain 
large quantities of starch. All plant cells have plastids, and these or- 
ganelles are the site of other important processes, including the synthe- 
sis of essential amino acids, thiamine, pyridoxal phosphate, flavins, 
and vitamins A, C, E, and K. 

reductive pentose phosphate cycle uses the same 
enzymes as the oxidative pathway, and several more en- 
zymes that make the reductive cycle irreversible. All 13 
enzymes of the pathway are in the chloroplast stroma. 
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FIGURE 20-4 The three stages of C0 2 assimilation in pho- 
tosynthetic organisms. Stoichiometries of three key inter- 
mediates (numbers in parentheses) reveal the fate of carbon 
atoms entering and leaving the cycle. As shown here, three 
C0 2 are fixed for the net synthesis of one molecule of glyc- 
eraldehyde 3-phosphate. This cycle is the photosynthetic 
carbon reduction cycle, or the Calvin cycle. 
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Stage 1: Fixation of CO a into 3-Phosphoglycerate 

An important clue to the nature of the C0 2 -assimilation 
mechanisms in photosynthetic organisms came in the 
late 1940s. Calvin and his associates illuminated a sus- 
pension of green algae in the presence of radioactive 
carbon dioxide ( 14 C0 2 ) for just a few seconds, then 
quickly killed the cells, extracted their contents, and 
with the help of chromatographic methods searched for 
the metabolites in which the labeled carbon first ap- 
peared. The first compound that became labeled was 
3-phosphoglycerate, with the 14 C predominantly 
located in the carboxyl carbon atom. These experiments 
strongly suggested that 3-phosphoglycerate is an early 
intermediate in photosynthesis. The many plants in 
which this three-carbon compound is the first interme- 
diate are called C 3 plants, in contrast to the C 4 plants 
described below. 

The enzyme that catalyzes incorporation of C0 2 
into an organic form is ribulose 1,5-bisphosphate 
carboxylase/oxygenase, a name mercifully shortened 
to rubisco. As a carboxylase, rubisco catalyzes the 
covalent attachment of C0 2 to the five-carbon sugar 
ribulose 1,5-bisphosphate and cleavage of the unstable 
six-carbon intermediate to form two molecules of 
3-phosphoglycerate, one of which bears the carbon 



introduced as C0 2 in its carboxyl group (Fig. 20-4). The 
enzyme's oxygenase activity is discussed in Section 20.2. 

There are two distinct forms of rubisco. Form I is 
found in vascular plants, algae, and cyanobacteria; form II 
is confined to certain photosynthetic bacteria. Plant ru- 
bisco, the crucial enzyme in the production of biomass 
from C0 2 , has a complex form I structure (Fig. 20-5a), 
with eight identical large subunits (M T 53,000; encoded 
in the chloroplast genome, or plastome), each containing 
a catalytic site, and eight identical small subunits (M T 
14,000; encoded in the nuclear genome) of uncertain 
function. The form II rubisco of photosynthetic bacteria 
is simpler in structure, having two subunits that in many 
respects resemble the large subunits of the plant enzyme 
(Fig. 20-5b). This similarity is consistent with the en- 
dosymbiont hypothesis for the origin of chloroplasts (p. 
33). The plant enzyme has an exceptionally low turnover 
number; only three molecules of C0 2 are fixed per sec- 
ond per molecule of rubisco at 25 °C. To achieve high 
rates of C0 2 fixation, plants therefore need large 
amounts of this enzyme. In fact, rubisco makes up almost 
50% of soluble protein in chloroplasts and is probably 
one of the most abundant enzymes in the biosphere. 

Central to the proposed mechanism for plant ru- 
bisco is a carbamoylated Lys side chain with a bound 




Top view 



FIGURE 20-5 Structure of ribulose 1,5-bisphosphate carboxy- 
lase (rubisco). (a) Top and side view of a ribbon model of form I 
rubisco from spinach (PDB ID 8RUC). The enzyme has eight large 
subunits (blue) and eight small ones (gray), tightly packed into a 
structure of M T 550,000. Rubisco is present at a concentration of 
about 250 mg/mL in the chloroplast stroma, corresponding to an 
extraordinarily high concentration of active sites (~4 mM). Amino 
acid residues of the active site are shown in yellow, Mg 2+ in green, 
(b) Ribbon model of form II rubisco from the bacterium Rhodospir- 
iilum rubrum (PDB ID 9RUB). The subunits are in gray and blue. A 
Lys residue at the active site that is carboxyiated to a carbamate in 
the active enzyme is shown in red. The substrate, ribulose 1,5-bis- 
phosphate, is yellow; Mg 2+ is green. 
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FIGURE 20-6 Central role of Mg 2+ in the catalytic mechanism of ru- 
bisco. (Derived from PDB ID 1 RXO) Mg 2+ is coordinated in a roughly 
octahedral complex with six oxygen atoms: one oxygen in the carba- 
mate on Lys 201 ; two in the carboxyl groups of Glu 204 and Asp 203 ; two at 
C-2 and C-3 of the substrate, ribulose 1,5-bisphosphate; and one in the 
other substrate, C0 2 . A water molecule occupies the C0 2 -binding site 
in this crystal structure. In this figure a C0 2 molecule is modeled in its 
place. (Residue numbers refer to the spinach enzyme.) 

Mg 2+ ion. The Mg 2+ ion brings together and orients 
the reactants at the active site (Fig. 20-6) and polar- 
izes the C0 2 , opening it to nucleophilic attack by the 
five-carbon enediolate reaction intermediate formed 
on the enzyme (Fig. 20-7). The resulting six-carbon 
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MECHANISM FIGURE 20-7 First stage of C0 2 assimilation: rubisco's 
carboxylase activity. The C0 2 -fixation reaction is catalyzed by ribu- 
lose 1,5-bisphosphate carboxylase/oxygenase (rubisco). The overall 
reaction accomplishes the combination of one C0 2 and one ribulose 



1,5-bisphosphate to form two molecules of 3-phosphoglycerate, one 
of which contains the carbon atom from C0 2 (red). Additional proton 
transfers (not shown), involving Lys 201 , Lys 175 , and His 294 , occur in 
several of these steps. |§ Rubisco Mechanism; Rubisco Tutorial 
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intermediate breaks down to yield two molecules of 
3-phosphoglycerate. 

As the catalyst for the first step of photosynthetic C0 2 
assimilation, rubisco is a prime target for regulation. The 
enzyme is inactive until carbamoylated on the e amino 
group of Lys 201 (Fig. 20-8). Ribulose 1,5-bisphosphate 
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FIGURE 20-8 Role of rubisco activase in the carbamoylation of Lys 201 
of rubisco. When the substrate ribulose 1,5-bisphosphate is bound to 
the active site, Lys 201 is not accessible. Rubisco activase couples ATP hy- 
drolysis to expulsion of the bound sugar bisphosphate, exposing Lys 201 ; 
this Lys residue can now be carbamoylated with C0 2 in a reaction that 
is apparently not enzyme-mediated. Mg 2+ is attracted to and binds to 
the negatively charged carbamoyl-Lys, and the enzyme is thus activated. 



inhibits carbamoylation by binding tightly to the active site 
and locking the enzyme in the "closed" conformation, in 
which Lys 201 is inaccessible. Rubisco activase over- 
comes the inhibition by promoting ATP-dependent re- 
lease of the ribulose 1,5-bisphosphate, exposing the Lys 
amino group to nonenzymatic carbamoylation by C0 2 ; this 
is followed by Mg 2+ binding, which activates the rubisco. 
Rubisco activase in some species is activated by light 
through a redox mechanism (see Fig. 20-19). 

Another regulatory mechanism involves the "noc- 
turnal inhibitor" 2-carboxyarabinitol 1 -phosphate, a nat- 
urally occurring transition-state analog (see Box 6-3) 
with a structure similar to that of the /3-keto acid inter- 
mediate of the rubisco reaction (Fig. 20-7; see also 
Fig. 20-20). This compound, synthesized in the dark in 
some plants, is a potent inhibitor of carbamoylated 
rubisco. It is either broken down when light returns or is 
expelled by rubisco activase, activating the rubisco. 
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Stage 2: Conversion of 3-Phosphoglycerate to 
Glyceraldehyde 3-Phosphate The 3-phosphoglyc- 
erate formed in stage 1 is converted to glyceraldehyde 
3-phosphate in two steps that are essentially the rever- 
sal of the corresponding steps in glycolysis, with one 
exception: the nucleotide cofactor for the reduction of 
1,3-bisphosphoglycerate is NADPH rather than NADH 
(Fig. 20-9). The chloroplast stroma contains all the 
glycolytic enzymes except phosphoglycerate mutase. 
The stromal and cytosolic enzymes are isozymes; both 
sets of enzymes catalyze the same reactions, but they 
are the products of different genes. 

In the first step of stage 2, the stromal 3-phospho- 
glycerate kinase catalyzes the transfer of a phosphoryl 
group from ATP to 3-phosphoglycerate, yielding 1,3- 
bisphosphoglycerate. Next, NADPH donates electrons in 
a reduction catalyzed by the chloroplast-specific isozyme 
of glyceraldehyde 3-phosphate dehydrogenase, pro- 
ducing glyceraldehyde 3-phosphate and Pi. Triose phos- 
phate isomerase then interconverts glyceraldehyde 
3-phosphate and dihydroxyacetone phosphate. Most of 
the triose phosphate thus produced is used to regenerate 
ribulose 1,5-bisphosphate; the rest is either converted to 
starch in the chloroplast and stored for later use or imme- 
diately exported to the cytosol and converted to sucrose 
for transport to growing regions of the plant. In develop- 
ing leaves, a significant portion of the triose phosphate 
may be degraded by glycolysis to provide energy. 

Stage 3: Regeneration of Ribulose 1,5-Bisphos- 
phate from TViose Phosphates The first reaction in 
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FIGURE 20-9 Second stage of C0 2 assimilation. 3-Phosphoglycerate 
is converted to glyceraldehyde 3-phosphate {red arrows). Also shown 
are the alternative fates of the fixed carbon of glyceraldehyde 3- 
phosphate (blue arrows). Most of the glyceraldehyde 3-phosphate is 
recycled to ribulose 1,5-bisphosphate as shown in Figure 20-10. A 
small fraction of the "extra" glyceraldehyde 3-phosphate may be used 
immediately as a source of energy, but most is converted to sucrose for 
transport or is stored in the chloroplast as starch. In the latter case, 



glyceraldehyde 3-phosphate condenses with dihydroxyacetone phos- 
phate in the stroma to form fructose 1,6-bisphosphate, a precursor of 
starch. In other situations the glyceraldehyde 3-phosphate is converted 
to dihydroxyacetone phosphate, which leaves the chloroplast via a 
specific transporter (see Fig. 20-15) and, in the cytosol, can be de- 
graded glycolytically to provide energy or used to form fructose 
6-phosphate and hence sucrose. 



the assimilation of C0 2 into triose phosphates consumes 
ribulose 1,5-bisphosphate and, for continuous flow of 
C0 2 into carbohydrate, ribulose 1,5-bisphosphate must 
be constantly regenerated. This is accomplished in a 
series of reactions (Fig. 20-10) that, together with 
stages 1 and 2, constitute the cyclic pathway shown in 
Figure 20-4. The product of the first assimilation reac- 
tion (3-phosphoglycerate) thus undergoes transforma- 
tions that regenerate ribulose 1,5-bisphosphate. The 
intermediates in this pathway include three-, four-, five-, 
six-, and seven-carbon sugars. In the following discus- 
sion, all step numbers refer to Figure 20-10. 

Steps CD and (4) are catalyzed by the same enzyme, 
aldolase. It first catalyzes the reversible condensation 



of glyceraldehyde 3-phosphate with dihydroxyacetone 
phosphate, yielding fructose 1,6-bisphosphate (step ®); 
this is cleaved to fructose 6-phosphate and Pi by fruc- 
tose 1,6-bisphosphatase (FBPase-1) in step (2). The re- 
action is strongly exergonic and essentially irreversible. 
Step (3) is catalyzed by transketolase, which contains 
thiamine pyrophosphate (TPP) as its prosthetic group 
(see Fig. 14-14a) and requires Mg 2+ . Transketolase cat- 
alyzes the reversible transfer of a 2-carbon ketol group 
(CH 2 OH— CO— ) from a ketose phosphate donor, fruc- 
tose 6-phosphate, to an aldose phosphate acceptor, 
glyceraldehyde 3-phosphate (Fig. 20-1 la, b), forming 
the pentose xylulose 5-phosphate and the tetrose ery- 
throse 4-phosphate. In step (4), aldolase acts again, 
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FIGURE 20-10 Third stage of C0 2 assimilation. This schematic dia- 
gram shows the interconversions of triose phosphates and pentose 
phosphates. Black dots represent the number of carbons in each com- 
pound. The starting materials are glyceraldehyde 3-phosphate and di- 
hydroxyacetone phosphate. Reactions catalyzed by aldolase (® and 
@) and transketolase (© and ©) produce pentose phosphates that are 
converted to ribulose 1,5-bisphosphate — ribose 5-phosphate by ribose 



5-phosphate isomerase (©) and xylulose 5-phosphate by ribulose 
5-phosphate epimerase ((g)). In step (9), ribulose 5-phosphate is phos- 
phorylated, regenerating ribulose 1,5-bisphosphate. The steps with 
blue arrows are exergonic and make the whole process irreversible: 
steps (2) fructose 1 ,6-bisphosphatase, (5) sedoheptulose bisphos- 
phatase, and (9) ribulose 5-phosphate kinase. 



combining erythrose 4-phosphate with dihydroxyace- 
tone phosphate to form the seven-carbon sedoheptu- 
lose 1,7-bisphosphate. An enzyme unique to plastids, 
sedoheptulose 1,7-bisphosphatase, converts the bisphos- 
phate to sedoheptulose 7-phosphate (step®); this is 
the second irreversible reaction in the pathway. Trans- 
ketolase now acts again, converting sedoheptulose 
7-phosphate and glyceraldehyde 3-phosphate to two 
pentose phosphates in step© (Fig. 20-llc). Figure 
20-12 shows how a two-carbon fragment is temporarily 
carried on the transketolase cofactor TPP and con- 



densed with the three carbons of glyceraldehyde 
3-phosphate in step (6). 

The pentose phosphates formed in the transketo- 
lase reactions — ribose 5-phosphate and xylulose 5- 
phosphate — are converted to ribulose 5-phosphate 
(steps ® and (§)) , which in the final step ((g)) of the cy- 
cle is phosphorylated to ribulose 1,5-bisphosphate by 
ribulose 5-phosphate kinase (Fig. 20-13). This is the 
third very exergonic reaction of the pathway, as the 
phosphate anhydride bond in ATP is swapped for a 
phosphate ester in ribulose 1,5-bisphosphate. 
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FIGURE 20-11 Transketolase-catalyzed reac- 
tions of the Calvin cycle, (a) General reaction 
catalyzed by transketolase: the transfer of a 
two-carbon group, carried temporarily on en- 
zyme-bound TPP, from a ketose donor to an al- 
dose acceptor, (b) Conversion of a hexose and 
a triose to a four-carbon and a five-carbon 
sugar (step (3) of Fig. 20-1 0). (c) Conversion of 
seven-carbon and three-carbon sugars to two 
pentoses (step (6) of Fig. 20-1 0). 
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FIGURE 20-12 TPP as a cofactor for transketolase. 

Transketolase transfers a two-carbon group from 
sedoheptulose 7-phosphate to glyceraldehyde 
3-phosphate, producing two pentose phosphates 
(step (6) in Fig. 20-10). Thiamine pyrophosphate 
serves as a temporary carrier of the two-carbon 
unit and as an electron sink (see Fig. 14-14) to 
facilitate the reactions, 




Ribose 5-phosphate 

o, H 

v 

H— C— OH 
H— C— OH 
H— C— OH 
CH 2 0^© 



Sedoheptulose 
7-phosphate 
(ketose donor) 



CH 2 OH 
i 

HO— C— H 
H— C— OH 
H— C— OH 
H — C — OH 
CH 2 0— © 



OH OH OH H OH 

I I I 

-C—C—C i 

I I 



< HI 

"ft 

CH 3 R 



782 J Carbohydrate Biosynthesis in Plants and Bacteria 



°v H 

I 

H-C-OH 

I 

H-C-OH 
H— i— < 



I 



-OH 

CH 2 0-© 
Ribose 5-phosphate 



CH.OH 

C=0 

HO-i-H 

H-C-OH 

CH 2 0-(P) 
Xylulose 5-phosphate 



FIGURE 20-13 Regeneration of ribulose 1,5-bisphosphate. The starting material for the Calvin 
cycle, ribulose 1,5-bisphosphate, is regenerated from two pentose phosphates produced in the 
cycle. This pathway involves the action of an isomerase and an epimerase, then phosphorylation 
by a kinase, with ATP as phosphate group donor (steps (7), ®, and (9) of Fig. 20-1 0). 
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Synthesis of Each Triose Phosphate from C0 2 
Requires Six NADPH and Nine ATP 

The net result of three turns of the Calvin cycle is the 
conversion of three molecules of C0 2 and one molecule 
of phosphate to a molecule of triose phosphate. The sto- 
ichiometry of the overall path from C0 2 to triose phos- 
phate, with regeneration of ribulose 1,5-bisphosphate, is 
shown in Figure 20-14. Three molecules of ribulose 
1,5-bisphosphate (a total of 15 carbons) condense 
with three C0 2 (3 carbons) to form six molecules of 



3-phosphoglycerate (18 carbons). These six molecules 
of 3-phosphoglycerate are reduced to six molecules of 
glyceraldehyde 3-phosphate (which is in equilibrium 
with dihydroxyacetone phosphate), with the expendi- 
ture of six ATP (in the synthesis of 1,3-bisphosphoglyc- 
erate) and six NADPH (in the reduction of 1,3-bisphos- 
phoglycerate to glyceraldehyde 3-phosphate). The 
isozyme of glyceraldehyde 3-phosphate dehydro- 
genase present in chloroplasts can use NADP as its 
electron carrier and normally functions in the direction 
of 1,3-bisphosphoglycerate reduction. The cytosolic 



FIGURE 20-14 Stoichiometry of C0 2 assimila- 
tion in the Calvin cycle. For every three C0 2 
molecules fixed, one molecule of triose phos- 
phate (glyceraldehyde 3-phosphate) is produced 
and nine ATP and six NADPH are consumed. 
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isozyme uses NAD, as does the glycolytic enzyme of 
animals and other eukaryotes, and in the dark this 
isozyme acts in glycolysis to oxidize glyceraldehyde 
3-phosphate. Both glyceraldehyde 3-phosphate dehy- 
drogenase isozymes, like all enzymes, catalyze the reac- 
tion in both directions. 

One molecule of glyceraldehyde 3-phosphate is the 
net product of the carbon assimilation pathway. The 
other five triose phosphate molecules (15 carbons) are 
rearranged in steps 0 to (9) of Figure 20-10 to form 
three molecules of ribulose 1,5-bisphosphate (15 car- 
bons). The last step in this conversion requires one ATP 
per ribulose 1,5-bisphosphate, or a total of three ATP. 
Thus, in summary, for every molecule of triose phos- 
phate produced by photosynthetic C0 2 assimilation, six 
NADPH and nine ATP are required. 

NADPH and ATP are produced in the light-dependent 
reactions of photosynthesis in about the same ratio 
(2:3) as they are consumed in the Calvin cycle. Nine 
ATP molecules are converted to ADP and phosphate in 
the generation of a molecule of triose phosphate; eight 
of the phosphates are released as Pi and combined with 
eight ADP to regenerate ATP. The ninth phosphate is in- 
corporated into the triose phosphate itself. To convert 
the ninth ADP to ATP, a molecule of Pi must be imported 
from the cytosol, as we shall see. 

In the dark, the production of ATP and NADPH by 
photophosphorylation, and the incorporation of C0 2 
into triose phosphate (by the so-called dark reactions), 
cease. The "dark reactions" of photosynthesis were so 
named to distinguish them from the primary light- 
driven reactions of electron transfer to NADP + and syn- 
thesis of ATP, described in Chapter 19. They do not, in 
fact, occur at significant rates in the dark and are thus 
more appropriately called the carbon-assimilation 
reactions. Later in this section we describe the regula- 
tory mechanisms that turn on carbon assimilation in the 
light and turn it off in the dark. 



The chloroplast stroma contains all the enzymes 
necessary to convert the triose phosphates produced 
by C0 2 assimilation (glyceraldehyde 3-phosphate and 
dihydroxyacetone phosphate) to starch, which is 
temporarily stored in the chloroplast as insoluble 
granules. Aldolase condenses the trioses to fructose 
1,6-bisphosphate; fructose 1,6-bisphosphatase produces 
fructose 6-phosphate; phosphohexose isomerase yields 
glucose 6-phosphate; and phosphoglucomutase pro- 
duces glucose 1 -phosphate, the starting material for 
starch synthesis (see Section 20.3). 

All the reactions of the Calvin cycle except those 
catalyzed by rubisco, sedoheptulose 1,7-bisphosphatase, 
and ribulose 5-phosphate kinase also take place in ani- 
mal tissues. Lacking these three enzymes, animals 
cannot carry out net conversion of C0 2 to glucose. 

A Transport System Exports Triose Phosphates 
from the Chloroplast and Imports Phosphate 

The inner chloroplast membrane is impermeable to 
most phosphorylated compounds, including fructose 
6-phosphate, glucose 6-phosphate, and fructose 1,6- 
bisphosphate. It does, however, have a specific anti- 
porter that catalyzes the one-for-one exchange of Pi 
with a triose phosphate, either dihydroxyacetone phos- 
phate or 3-phosphoglycerate (Fig. 20-15; see also 
Fig. 20-9). This antiporter simultaneously moves Pi into 
the chloroplast, where it is used in photophosphoryla- 
tion, and moves triose phosphate into the cytosol, where 
it can be used to synthesize sucrose, the form in which 
the fixed carbon is transported to distant plant tissues. 

Sucrose synthesis in the cytosol and starch synthe- 
sis in the chloroplast are the major pathways by which 
the excess triose phosphate from photosynthesis is 
"harvested." Sucrose synthesis (described below) re- 
leases four Pi molecules from the four triose phosphates 
required to make sucrose. For every molecule of triose 




Chloroplast 
inner membrane 



Stroma 



9ATP 

1 

9ATP 



9 ADP + 9Pi 





9 ADP + 8Pi + Pi 4 



Dihydroxy- 
acetone 
phosphate 




Cytosol 



^ « — 

Dihydroxy- 
^ acetone 
phosphate 



Sucrose 



FIGURE 20-15 The P r triose phosphate antiport system of the inner 
chloroplast membrane. This transporter facilitates the exchange of cy- 
tosolic Pj for stromal dihydroxyacetone phosphate. The products of 
photosynthetic carbon assimilation are thus moved into the cytosol, 



where they serve as a starting point for sucrose biosynthesis, and Pj re- 
quired for photophosphorylation is moved into the stroma. This same 
antiporter can transport 3-phosphoglycerate and acts in the shuttle for 
exporting ATP and reducing equivalents (see Fig. 20-1 6). 
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phosphate removed from the chloroplast, one P 4 is 
transported into the chloroplast, providing the ninth ^ 
mentioned above, to be used in regenerating ATP. If this 
exchange were blocked, triose phosphate synthesis 
would quickly deplete the available P x in the chloroplast, 
slowing ATP synthesis and suppressing assimilation of 
C0 2 into starch. 

The P-triose phosphate antiport system serves one 
additional function. ATP and reducing power are 
needed in the cytosol for a variety of synthetic and 
energy-requiring reactions. These requirements are met 
to an as yet undetermined degree by mitochondria, but 
a second potential source of energy is the ATP and 
NADPH generated in the chloroplast stroma during the 
light reactions. However, neither ATP nor NADPH can 
cross the chloroplast membrane. The P-triose phos- 
phate antiport system has the indirect effect of moving 
ATP equivalents and reducing equivalents from the 
chloroplast to the cytosol (Fig. 20-16). Dihydroxyace- 
tone phosphate formed in the stroma is transported to 
the cytosol, where it is converted by glycolytic enzymes 
to 3-phosphoglycerate, generating ATP and NADH. 



3-Phosphoglycerate reenters the chloroplast, complet- 
ing the cycle. 

Four Enzymes of the Calvin Cycle Are Indirectly 
Activated by Light 

The reductive assimilation of C0 2 requires a lot of ATP 
and NADPH, and their stromal concentrations increase 
when chloroplasts are illuminated (Fig. 20-17). The 
light-induced transport of protons across the thylakoid 
membrane (Chapter 19) also increases the stromal pH 
from about 7 to about 8, and it is accompanied by a flow 
of Mg 2+ from the thylakoid compartment into the 
stroma, raising the [Mg 2+ ] from 1 to 3 irm to 3 to 6 mM. 
Several stromal enzymes have evolved to take advan- 
tage of these light-induced conditions, which signal the 
availability of ATP and NADPH: the enzymes are more 
active in an alkaline environment and at high [Mg 2+ ]. 
For example, activation of rubisco by formation of the 
carbamoyllysine is faster at alkaline pH, and high stro- 



mal [Mg + ] favors formation of the enzyme's active Mg : 
complex. Fructose 1,6-bisphosphatase requires Mg : 
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FIGURE 20-16 Role of the Pi-triose phosphate antiporter in the trans- 
port of ATP and reducing equivalents. Dihydroxyacetone phosphate 
leaves the chloroplast and is converted to glyceraldehyde 3-phosphate 
in the cytosol. The cytosolic glyceraldehyde 3-phosphate dehydroge- 
nase and phosphoglycerate kinase reactions then produce NADH, 



ATP, and 3-phosphoglycerate. The latter reenters the chloroplast and is 
reduced to dihydroxyacetone phosphate, completing a cycle that ef- 
fectively moves ATP and reducing equivalents (NAD(P)H) from chloro- 
plast to cytosol. 
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FIGURE 20-17 Source of ATP and NADPH. ATP and NADPH pro- 
duced by the light reactions are essential substrates for the reduction of 
C0 2 . The photosynthetic reactions that produce ATP and NADPH are 
accompanied by movement of protons (red) from the stroma into the 
thyiakoid, creating alkaline conditions in the stroma. Magnesium ions 
pass from the thylakoid into the stroma, increasing the stromal [Mg 2+ ]. 

and is very dependent on pH (Fig. 20-18); its activity 
increases more than 100-fold when pH and [Mg 2+ ] rise 
during chloroplast illumination. 

Four Calvin cycle enzymes are subject to a special 
type of regulation by light. Ribulose 5-phosphate kinase, 
fructose 1,6-bisphosphatase, sedoheptulose 1,7-bisphos- 
phatase, and glyceraldehyde 3-phosphate dehydroge- 
nase are activated by light-driven reduction of disulfide 




FIGURE 20-19 Light activation of several enzymes of the Calvin cycle. 

The light activation is mediated by thioredoxin, a small, disulfide- 
containing protein. In the light, thioredoxin is reduced by electrons 
moving from photosystem I through ferredoxin (Fd) (blue arrows), then 
thioredoxin reduces critical disulfide bonds in each of the enzymes 
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FIGURE 20-18 Activation of chloroplast fructose 1,6-bisphosphatase. 

Reduced fructose 1,6-bisphosphatase (FBPase-1) is activated by light 
and by the combination of high pH and high [Mg 2+ ] in the stroma, 
both of which are produced by illumination. 

bonds between two Cys residues critical to their cat- 
alytic activities. When these Cys residues are disulfide- 
bonded (oxidized), the enzymes are inactive; this is the 
normal situation in the dark. With illumination, elec- 
trons flow from photosystem I to ferredoxin (see 
Fig. 19-56), which passes electrons to a small, soluble, 
disulfide-containing protein called thioredoxin 
(Fig. 20-19), in a reaction catalyzed by ferredoxin- 
thioredoxin reductase. Reduced thioredoxin donates 
electrons for the reduction of the disulfide bonds of the 
light-activated enzymes, and these reductive cleavage 
reactions are accompanied by conformational changes 
that increase enzyme activities. At nightfall, the Cys 
residues in the four enzymes are reoxidized to their 
disulfide forms, the enzymes are inactivated, and ATP is 
not expended in C0 2 assimilation. Instead, starch syn- 
thesized and stored during the daytime is degraded to 
fuel glycolysis at night. 

Glucose 6-phosphate dehydrogenase, the first en- 
zyme in the oxidative pentose phosphate pathway, is 
also regulated by this light-driven reduction mechanism, 




sedoheptulose 1 ,7-bisphosphatase, fructose 1,6-bisphosphatase, ribu- 
lose 5-phosphate kinase, and glyceraldehyde 3-phosphate dehydroge- 
nase, activating these enzymes. In the dark, the — SH groups undergo 
reoxidation to disulfides, inactivating the enzymes. 
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but in the opposite sense. During the day, when photo- 
synthesis produces plenty of NADPH, this enzyme is not 
needed for NADPH production. Reduction of a critical 
disulfide bond by electrons from ferredoxin inactivates 
the enzyme. 

SUMMARY 20.1 Photosynthetic 

Carbohydrate Synthesis 

■ Photosynthesis in vascular plants takes place in 
chloroplasts. In the C0 2 -assimilating reactions (the 
Calvin cycle), ATP and NADPH are used to reduce 
C0 2 to those phosphates. These reactions occur in 
three stages: the fixation reaction itself, catalyzed 
by rubisco; reduction of the resulting 3-phospho- 
glycerate to glyceraldehyde 3-phosphate; and 
regeneration of ribulose 1,5-bisphosphate from 
triose phosphates. 

■ Rubisco condenses C0 2 with ribulose 1,5-bisphos- 
phate, forming an unstable hexose bisphosphate 
that splits into two molecules of 3-phosphoglycer- 
ate. Rubisco is activated by covalent modification 
(carbamoylation of Lys 201 ) catalyzed by rubisco 
activase and is inhibited by a natural transition- 
state analog, whose concentration rises in the dark 
and falls during daylight. 

■ Stromal isozymes of the glycolytic enzymes cat- 
alyze reduction of 3-phosphoglycerate to glycer- 
aldehyde 3-phosphate; each molecule reduced 
requires one ATP and one NADPH. 

■ Stromal enzymes, including transketolase and 
aldolase, rearrange the carbon skeletons of triose 
phosphates, generating intermediates of three, 
four, five, six, and seven carbons and eventually 
yielding pentose phosphates. The pentose phos- 
phates are converted to ribulose 5-phosphate, then 
phosphorylated to ribulose 1,5-bisphosphate to 
complete the Calvin cycle. 

■ The cost of fixing three C0 2 into one triose 
phosphate is nine ATP and six NADPH, which are 
provided by the light-dependent reactions of 
photosynthesis. 

■ An antiporter in the inner chloroplast membrane 
exchanges P A in the cytosol for 3-phosphoglycerate 
or dihydroxyacetone phosphate produced by 

C0 2 assimilation in the stroma. Oxidation of 
dihydroxyacetone phosphate in the cytosol 
generates ATP and NADH, thus moving ATP and 
reducing equivalents from the chloroplast to the 
cytosol. 

■ Four enzymes of the Calvin cycle are activated 
indirectly by light and are inactive in the dark, so 
that hexose synthesis does not compete with 
glycolysis — which is required to provide energy in 
the dark. 



20.2 Photorespiration and the C 4 
and CAM Pathways 

As we have seen, photosynthetic cells produce 0 2 (by 
the splitting of H 2 0) during the light-driven reactions 
(Chapter 19) and use C0 2 during the light-independent 
processes (described above), so the net gaseous change 
during photosynthesis is the uptake of C0 2 and release 
of0 2 : 

C0 2 + H 2 0 ► 0 2 + (CH 2 0) 

In the dark, plants also carry out mitochondrial respi- 
ration, the oxidation of substrates to C0 2 and the con- 
version of 0 2 to H 2 0. And there is another process in 
plants that, like mitochondrial respiration, consumes 0 2 
and produces C0 2 and, like photosynthesis, is driven by 
light. This process, photorespiration, is a costly side 
reaction of photosynthesis, a result of the lack of speci- 
ficity of the enzyme rubisco. In this section we describe 
this side reaction and the strategies plants use to mini- 
mize its metabolic consequences. 

Photorespiration Results from Rubisco's 
Oxygenase Activity 

Rubisco is not absolutely specific for C0 2 as a substrate. 
Molecular oxygen (0 2 ) competes with C0 2 at the active 
site, and about once in every three or four turnovers, 
rubisco catalyzes the condensation of 0 2 with ribulose 
1,5-bisphosphate to form 3-phosphoglycerate and 
2-phosphoglycolate (Fig. 20-20), a metabolically 
useless product. This is the oxygenase activity referred 
to in the full name of the enzyme: ribulose 1,5-bisphos- 
phate carboxylase/oxygenase. The reaction with 0 2 
results in no fixation of carbon and seems to be a net 
liability to the cell; salvaging the carbons from 2- 
phosphoglycolate (by the pathway outlined below) con- 
sumes significant amounts of cellular energy and 
releases some previously fixed C0 2 . 

Given that the reaction with oxygen is deleterious to 
the organism, why did the evolution of rubisco produce 
an active site unable to discriminate well between C0 2 
and 0 2 ? Perhaps much of this evolution occurred before 
the time, about 2.5 billion years ago, when production of 
0 2 by photosynthetic organisms started to raise the oxy- 
gen content of the atmosphere. Before that time, there 
was no selective pressure for rubisco to discriminate be- 
tween C0 2 and 0 2 . The K m for C0 2 is about 9 /zm, and 
that for 0 2 is about 350 ^m. The modern atmosphere 
contains about 20% 0 2 and only 0.04% C0 2 , so an aque- 
ous solution in equilibrium with air at room temperature 
contains about 250 jjlm 0 2 and 11 C0 2 — concentra- 
tions that allow significant 0 2 "fixation" by rubisco and 
thus a significant waste of energy. The temperature de- 
pendence of the solubilities of 0 2 and C0 2 is such that at 
higher temperatures, the ratio of 0 2 to C0 2 in solution 
increases. In addition, the affinity of rubisco for C0 2 
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FIGURE 20-20 Oxygenase activity of rubisco. Rubisco can incorpo- 
rate 0 2 rather than C0 2 into ribuiose 1,5-bisphosphate. The unstable 
intermediate thus formed splits into 2-phosphoglycolate (recycled as 
described in Fig. 20-21) and 3-phosphoglycerate, which can reenter 
the Calvin cycle. 



decreases with increasing temperature, exacerbating its 
tendency to catalyze the wasteful oxygenase reaction. 
And as C0 2 is consumed in the assimilation reactions, 
the ratio of 0 2 to C0 2 in the air spaces of a leaf in- 
creases, further favoring the oxygenase reaction. 

The Salvage of Phosphoglycolate Is Costly 

The glycolate pathway converts two molecules of 
2-phosphoglycolate to a molecule of serine (three car- 
bons) and a molecule of C0 2 (Fig. 20-21). In the 
chloroplast, a phosphatase converts 2-phosphoglycolate 
to glycolate, which is exported to the peroxisome. There, 
glycolate is oxidized by molecular oxygen, and the re- 
sulting aldehyde (glyoxylate) undergoes transamination 
to glycine. The hydrogen peroxide formed as a side prod- 
uct of glycolate oxidation is rendered harmless by perox- 
idases in the peroxisome. Glycine passes from the 
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FIGURE 20-21 Glycolate pathway. This pathway, which salvages 
2-phosphoglycolate (shaded pink) by its conversion to serine and 
eventually 3-phosphoglycerate, involves three cellular compartments. 
Glycolate formed by dephosphorylation of 2-phosphoglycolate in 
chloroplasts is oxidized to glyoxylate in peroxisomes and then 
transaminated to glycine. In mitochondria, two glycine molecules con- 
dense to form serine and the C0 2 released during photorespiration 
(shaded green). This reaction is catalyzed by glycine decarboxylase, an 
enzyme present at very high levels in the mitochondria of C 3 plants 
(see text). The serine is converted to hydroxypyruvate and then to glyc- 
erate in peroxisomes; glycerate reenters the chloroplasts to be phos- 
phorylated, rejoining the Calvin cycle. Oxygen (shaded blue) is 
consumed at two steps during photorespiration. 

peroxisome to the mitochondrial matrix, where it under- 
goes oxidative decarboxylation by the glycine decar- 
boxylase complex, an enzyme similar in structure and 
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mechanism to two mitochondrial complexes we have al- 
ready encountered: the pyruvate dehydrogenase com- 
plex and the a-ketoglutarate dehydrogenase complex 
(Chapter 16). The glycine decarboxylase complex 
oxidizes glycine to C0 2 and NH 3 , with the concomitant 
reduction of NAD + to NADH and transfer of the remain- 
ing carbon from glycine to the cofactor tetrahydrofolate 
(Fig. 20-22). The one-carbon unit carried on tetrahy- 
drofolate is then transferred to a second glycine by ser- 
ine hydroxymethyltransferase, producing serine. The 
net reaction catalyzed by the glycine decarboxylase com- 
plex and serine hydroxymethyltransferase is 

2 Glycine + NAD + + H 2 0 ► 

serine + C0 2 + NH 3 + NADH + H + 

The serine is converted to hydroxypyruvate, to glycer- 
ate, and finally to 3-phosphoglycerate, which is used to 



regenerate ribulose 1,5-bisphosphate, completing the 
long, expensive cycle (Fig. 20-21). 

In bright sunlight, the flux through the glycolate sal- 
vage pathway can be very high, producing about five 
times more C0 2 than is typically produced by all the ox- 
idations of the citric acid cycle. To generate this large 
flux, mitochondria contain prodigious amounts of the 
glycine decarboxylase complex: the four proteins of the 
complex make up half of all the protein in the mitochon- 
drial matrix in the leaves of pea and spinach plants! In 
nonphotosynthetic parts of a plant, such as potato tu- 
bers, mitochondria have very low concentrations of the 
glycine decarboxylase complex. 

The combined activity of the rubisco oxygenase and 
the glycolate salvage pathway consumes 0 2 and pro- 
duces C0 2 — hence the name photorespiration. This 
pathway is perhaps better called the oxidative photo- 
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FIGURE 20-22 The glycine decarboxylase system. Glycine decarboxyl- 
ase in plant mitochondria is a complex of four types of subunits, with 
the stoichiometry P 4 H 27 T 9 L 2 . Protein H has a covalently attached 
lipoic acid residue that can undergo reversible oxidation. Step© is 
formation of a Schiff base between pyridoxal phosphate (PLP) and 
glycine, catalyzed by protein P (named for its bound PLP). In step©, 
protein P catalyzes oxidative decarboxylation of glycine, releasing 
C0 2 ; the remaining methylamine group is attached to one of the— SH 
groups of reduced lipoic acid. © Protein T (which uses tetrahydrofo- 
late (H 4 folate) as cofactor) now releases NH 3 from the methylamine 



H 4 folate 



iV 5 , iV 10 -methylene H 4 folate 

moiety and transfers the remaining one-carbon fragment to tetrahydro- 
folate, producing N 5 ,N 10 -methy!enetetrahydrofoIate. @ Protein L oxi- 
dizes the two — SH groups of lipoic acid to a disulfide, passing 
electrons through FAD to NAD + ©, thus completing the cycle. The 
N 5 ,N 10 -methylenetetrahydrofolate formed in this process is used by 
serine hydroxymethyltransferase to convert a molecule of glycine to 
serine, regenerating the tetrahydrofolate that is essential for the reac- 
tion catalyzed by protein T. The L subunit of glycine decarboxylase is 
identical to the dihydrolipoyl dehydrogenase (E 3 ) of pyruvate dehydro- 
genase and a-ketoglutarate dehydrogenase (see Fig. 16-6). 
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synthetic carbon cycle or C 2 cycle, names that do not 
invite comparison with respiration in mitochondria. Un- 
like mitochondrial respiration, "photorespiration" does 
not conserve energy and may actually inhibit net bio- 
mass formation as much as 50%. This inefficiency has led 
to evolutionary adaptations in the carbon-assimilation 
processes, particularly in plants that have evolved in 
warm climates. 

In C 4 Plants, C0 2 Fixation and Rubisco Activity Are 
Spatially Separated 

In many plants that grow in the tropics (and in temperate- 
zone crop plants native to the tropics, such as maize, 
sugarcane, and sorghum) a mechanism has evolved to 
circumvent the problem of wasteful photorespiration. 
The step in which C0 2 is fixed into a three-carbon prod- 
uct, 3-phosphoglycerate, is preceded by several steps, 
one of which is temporary fixation of C0 2 into a four- 
carbon compound. Plants that use this process are 
referred to as C 4 plants, and the assimilation process as 
C 4 metabolism or the C 4 pathway. Plants that use the 
carbon-assimilation method we have described thus far, 
in which the first step is reaction of C0 2 with ribulose 
1,5-bisphosphate to form 3-phosphoglycerate, are called 
C 3 plants. 

The C 4 plants, which typically grow at high light in- 
tensity and high temperatures, have several important 
characteristics: high photosynthetic rates, high growth 
rates, low photorespiration rates, low rates of water loss, 
and a specialized leaf structure. Photosynthesis in the 
leaves of C 4 plants involves two cell types: mesophyll 
and bundle-sheath cells (Fig. 20-23a). There are three 
variants of C 4 metabolism, worked out in the 1960s by 
Marshall Hatch and Rodger Slack (Fig. 20-23b). 

In plants of tropical origin, the first intermediate 
into which 14 C0 2 is fixed is oxaloacetate, a four-carbon 
compound. This reaction, which occurs in the cytosol of 
leaf mesophyll cells, is catalyzed by phosphoenolpyru- 
vate carboxylase, for which the substrate is HCO3 , 
not C0 2 . The oxaloacetate thus formed is either reduced 
to malate at the expense of NADPH (as shown in Fig. 
20-23b) or converted to aspartate by transamination: 

Oxaloacetate + a-amino acid > L-aspartate + a-keto acid 

The malate or aspartate formed in the mesophyll cells 
then passes into neighboring bundle-sheath cells through 
plasmodesmata, protein-lined channels that connect 
two plant cells and provide a path for movement of 



FIGURE 20-23 Carbon assimilation in C 4 plants. The C 4 pathway, in- 
volving mesophyll cells and bundle-sheath cells, predominates in 
plants of tropical origin, (a) Electron micrograph showing chloroplasts 
of adjacent mesophyll and bundle-sheath cells. The bundle-sheath cell 
contains starch granules. Plasmodesmata connecting the two cells are 
visible, (b) The C 4 pathway of C0 2 assimilation, which occurs through 
a four-carbon intermediate. 
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metabolites and even small proteins between cells. In 
the bundle-sheath cells, malate is oxidized and decar- 
boxylated to yield pyruvate and C0 2 by the action of 
malic enzyme, reducing NADP + . In plants that use 
aspartate as the C0 2 carrier, aspartate arriving in 
bundle-sheath cells is transaminated to form oxaloac- 
etate and reduced to malate, then the C0 2 is released by 
malic enzyme or PEP carboxykinase. As labeling exper- 
iments show, the free C0 2 released in the bundle-sheath 
cells is the same C0 2 molecule originally fixed into ox- 
aloacetate in the mesophyll cells. This C0 2 is now fixed 
again, this time by rubisco, in exactly the same reaction 
that occurs in C 3 plants: incorporation of C0 2 into C-l of 
3-phosphogly cerate . 

The pyruvate formed by decarboxylation of malate 
in bundle-sheath cells is transferred back to the meso- 
phyll cells, where it is converted to PEP by an unusual 
enzymatic reaction catalyzed by pyruvate phosphate 
dikinase (Fig. 20-23b). This enzyme is called a diki- 
nase because two different molecules are simultane- 
ously phosphorylated by one molecule of ATP: pyruvate 
to PEP, and phosphate to pyrophosphate. The py- 
rophosphate is subsequently hydrolyzed to phosphate, 
so two high-energy phosphate groups of ATP are used in 
regenerating PEP. The PEP is now ready to receive an- 
other molecule of C0 2 in the mesophyll cell. 

The PEP carboxylase of mesophyll cells has a high 
affinity for HCO3 (which is favored relative to C0 2 in 
aqueous solution and can fix C0 2 more efficiently than 
can rubisco). Unlike rubisco, it does not use 0 2 as an al- 
ternative substrate, so there is no competition between 
C0 2 and 0 2 . The PEP carboxylase reaction, then, serves 
to fix and concentrate C0 2 in the form of malate. Re- 
lease of C0 2 from malate in the bundle-sheath cells 
yields a sufficiently high local concentration of C0 2 for 
rubisco to function near its maximal rate, and for sup- 
pression of the enzyme's oxygenase activity. 

Once C0 2 is fixed into 3-phosphoglycerate in the 
bundle-sheath cells, the other reactions of the Calvin 
cycle take place exactly as described earlier. Thus in 
C 4 plants, mesophyll cells carry out C0 2 assimilation by 
the C 4 pathway and bundle-sheath cells synthesize 
starch and sucrose by the C 3 pathway 

Three enzymes of the C 4 pathway are regulated by 
light, becoming more active in daylight. Malate dehydro- 
genase is activated by the thioredoxin-dependent re- 
duction mechanism shown in Figure 20-19; PEP 
carboxylase is activated by phosphorylation of a Ser 
residue; and pyruvate phosphate dikinase is activated 
by dephosphorylation. In the latter two cases, the de- 
tails of how light effects phosphorylation or dephospho- 
rylation are not known. 

The pathway of C0 2 assimilation has a greater en- 
ergy cost in C 4 plants than in C 3 plants. For each mole- 
cule of C0 2 assimilated in the C 4 pathway, a molecule of 
PEP must be regenerated at the expense of two high- 
energy phosphate groups of ATP Thus C 4 plants need 
five ATP molecules to assimilate one molecule of C0 2 , 



whereas C 3 plants need only three (nine per triose phos- 
phate). As the temperature increases (and the affinity of 
rubisco for C0 2 decreases, as noted above), a point is 
reached (at about 28 to 30 °C) at which the gain in effi- 
ciency from the elimination of photorespiration more 
than compensates for this energetic cost. C 4 plants 
(crabgrass, for example) outgrow most C 3 plants during 
the summer, as any experienced gardener can attest. 

In CAM Plants, C0 2 Capture and Rubisco Action Are 
Temporally Separated 

Succulent plants such as cactus and pineapple, which 
are native to very hot, very dry environments, have an- 
other variation on photosynthetic C0 2 fixation, which 
reduces loss of water vapor through the pores (stomata) 
by which C0 2 and 0 2 must enter leaf tissue. Instead of 
separating the initial trapping of C0 2 and its fixation by 
rubisco across space (as do the C 4 plants), they separate 
these two events over time. At night, when the air is 
cooler and moister, the stomata open to allow entry of 
C0 2 , which is then fixed into oxaloacetate by PEP car- 
boxylase. The oxaloacetate is reduced to malate and 
stored in the vacuoles, to protect cytosolic and plastid 
enzymes from the low pH produced by malic acid disso- 
ciation. During the day the stomata close, preventing 
the water loss that would result from high daytime tem- 
peratures, and the C0 2 trapped overnight in malate is 
released as C0 2 by the NADP-linked malic enzyme. This 
C0 2 is now assimilated by the action of rubisco and the 
Calvin cycle enzymes. Because this method of C0 2 fixa- 
tion was first discovered in stonecrops, perennial flower- 
ing plants of the family Crassulaceae, it is called 
crassulacean acid metabolism, and the plants are called 
CAM plants. 

SUMMARY 20.2 Photorespiration 

and the C 4 and CAM 
Pathways 

■ When rubisco uses 0 2 rather than C0 2 as substrate, 
the 2-phosphoglycolate so formed is disposed of in 
an oxygen-dependent pathway. The result is 
increased consumption of 0 2 — photorespiration or, 
more accurately, the oxidative photosynthetic car- 
bon cycle or C 2 cycle. The 2-phosphoglycolate is 
converted to glyoxylate, to glycine, and then to 
serine in a pathway that involves enzymes in the 
chloroplast stroma, the peroxisome, and the 
mitochondrion. 

■ In C 4 plants, the carbon-assimilation pathway 
minimizes photorespiration: C0 2 is first fixed in 
mesophyll cells into a four-carbon compound, 
which passes into bundle-sheath cells and releases 
C0 2 in high concentrations. The released C0 2 is 
fixed by rubisco, and the remaining reactions of the 
Calvin cycle occur as in C 3 plants. 
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■ In CAM plants, C0 2 is fixed into malate in the dark 
and stored in vacuoles until daylight, when the 
stomata are closed (minimizing water loss) and 
malate serves as a source of C0 2 for rubisco. 

20.3 Biosynthesis of Starch and Sucrose 

During active photosynthesis in bright light, a plant leaf 
produces more carbohydrate (as triose phosphates) 
than it needs for generating energy or synthesizing pre- 
cursors. The excess is converted to sucrose and trans- 
ported to other parts of the plant, to be used as fuel or 
stored. In most plants, starch is the main storage form, 
but in a few plants, such as sugar beet and sugarcane, 
sucrose is the primary storage form. The synthesis of su- 
crose and starch occurs in different cellular compart- 
ments (cytosol and plastids, respectively), and these 
processes are coordinated by a variety of regulatory 
mechanisms that respond to changes in light level and 
photosynthetic rate. 

ADP-Glucose is the Substrate for Starch Synthesis in Plant 
Plastids and for Glycogen Synthesis in Bacteria 

Starch, like glycogen, is a high molecular weight poly- 
mer of D-glucose in (cd->4) linkage. It is synthesized in 
chloroplasts for temporary storage as one of the stable 
end products of photosynthesis, and for long-term stor- 
age it is synthesized in the amyioplasts of the nonphoto- 
synthetic parts of plants — seeds, roots, and tubers 
(underground stems). 

The mechanism of glucose activation in starch syn- 
thesis is similar to that in glycogen synthesis. An acti- 
vated nucleotide sugar, in this case ADP-glucose, is 
formed by condensation of glucose 1 -phosphate with 
ATP in a reaction made essentially irreversible by the 
presence in plastids of inorganic pyrophosphatase 
(p. 508). Starch synthase then transfers glucose 
residues from ADP-glucose to preexisting starch mole- 
cules. Although it has generally been assumed that 
glucose is added to the nonreducing end of starch, as in 
glycogen synthesis (see Fig. 15-30), evidence now sug- 
gests that starch synthase has two equivalent active sites 
that alternate in inserting a glucosyl residue onto the re- 
ducing end of the growing chain. This end remains cova- 
lently attached to the enzyme, first at one active site, 
then at the other (Fig. 20-24). Attachment to one ac- 
tive site effectively activates the reducing end of the 
growing chain for nucleophilic displacement of the en- 
zyme by the attacking C-4 hydroxyl of a glucosyl moiety 



FIGURE 20-24 A model for starch synthesis. According to this model/ 
starch synthesis proceeds by a two-site insertion mechanism, with 
ADP-glucose as the initial glucosyl donor. The two identical active sites 
on starch synthase alternate in displacing the growing chain from each 
other, and new glucosyl units are inserted at the reducing end of the 
growing chain. 
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attack on ADP-glucose, displacing ADP and forming 
a covalent attachment to C-l of the glucose unit. 



P- 

A TV 



<3)^AD^ 



OH 



1 



The bond holding glucose residue 1 to X a 
undergoes nucleophilic attack by the — OH 
at C-4 of glucose residue 2 on X b , forming an 
(al^4)-disaccharide of residues 2 and 1. This 
remains attached through glucose 2 to X b . X a , 
now free, displaces ADP from another ADP- 
glucose and becomes attached to glucose 3. 



OH 




{4T iADF 



The hydroxyl at C-4 of glucose 3 displaces X b 
from the disaccharide, forming a trisaccharide 
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Many repetitions of this sequence extend the 
oligosaccharide, adding glucose residues at its 
reducing end, with X a and X b alternately 
carrying the growing starch chain. When the chain 
reaches an appropriate length, it is separated 
from starch synthase. 
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bound to the other active site, forming the (al-»4) link- 
age characteristic of starch. 

The amylose of starch is unbranched, but amy- 
lopectin has numerous (al^6)-linked branches (see 
Fig. 7-14). Chloroplasts contain a branching enzyme, 
similar to glycogen-branching enzyme (see Fig. 15-31), 
that introduces the (al-»6) branches of amylopectin. 
Taking into account the hydrolysis by inorganic py- 
rophosphatase of the PPi produced during ADP-glucose 
synthesis, the overall reaction for starch formation from 
glucose 1 -phosphate is 

Starchy + glucose 1-phosphate + ATP > 

starchy + 1 + ADP + 2Pi 

AG'° = -50kJ/mol 

Starch synthesis is regulated at the level of ADP-glucose 
formation, as discussed below. 

Many types of bacteria store carbohydrate in the 
form of glycogen (essentially highly branched starch), 
which they synthesize in a reaction analogous to that 
catalyzed by glycogen synthase in animals. Bacteria, like 
plant plastids, use ADP-glucose as the activated form of 
glucose, whereas animal cells use UDP-glucose. Again, 
the similarity between plastid and bacterial metabolism 
is consistent with the endosymbiont hypothesis for the 
origin of organelles (p. 33). 

UDP-Glucose Is the Substrate for Sucrose Synthesis 
intheCytosol of Leaf Cells 

Most of the triose phosphate generated by C0 2 fixation 
in plants is converted to sucrose (Fig. 20-25) or starch. 
In the course of evolution, sucrose may have been se- 
lected as the transport form of carbon because of its un- 
usual linkage between the anomeric C-l of glucose and 
the anomeric C-2 of fructose. This bond is not hy- 
drolyzed by amylases or other common carbohydrate- 
cleaving enzymes, and the unavailability of the anomeric 
carbons prevents sucrose from reacting nonenzymati- 
cally (as does glucose) with amino acids and proteins. 

Sucrose is synthesized in the cytosol, beginning with 
dihydroxyacetone phosphate and glyceraldehyde 3-phos- 
phate exported from the chloroplast. After condensation 
of two triose phosphates to form fructose 1,6-bisphos- 
phate (catalyzed by aldolase), hydrolysis by fructose 1,6- 
bisphosphatase yields fructose 6-phosphate. Sucrose 
6-phosphate synthase then catalyzes the reaction of 
fructose 6-phosphate with UDP-glucose to form su- 
crose 6-phosphate (Fig. 20-25). Finally, sucrose 6- 
phosphate phosphatase removes the phosphate group, 
making sucrose available for export to other tissues. The 
reaction catalyzed by sucrose 6-phosphate synthase is a 
low-energy process (AG'° = -5.7 kJ/mol), but the hy- 
drolysis of sucrose 6-phosphate to sucrose is sufficiently 
exergonic (AG'° = -16.5 kJ/mol) to make the overall 
synthesis of sucrose essentially irreversible. Sucrose syn- 
thesis is regulated and closely coordinated with starch 
synthesis, as we shall see. 
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FIGURE 20-25 Sucrose synthesis. Sucrose is synthesized from UDP- 
glucose and fructose 6-phosphate, which are synthesized from triose 
phosphates in the plant cell cytosol by pathways shown in Figures 
15-29 and 20-9. The sucrose 6-phosphate synthase of most plant 
species is allosterically regulated by glucose 6-phosphate and Pj. 



One remarkable difference between the cells of 
plants and animals is the absence in the plant cell cy- 
tosol of the enzyme inorganic pyrophosphatase, which 
catalyzes the reaction 



PPi + H 2 0 



2R 



-19.2kJ/mol 



For many biosynthetic reactions that liberate PPi, py- 
rophosphatase activity makes the process more favor- 
able energetically, tending to make these reactions 
irreversible. In plants, this enzyme is present in plastids 
but absent from the cytosol. As a result, the cytosol of 
leaf cells contains a substantial concentration of PPi — 
enough (-0.3 ium) to make reactions such as that 
catalyzed by UDP-glucose pyrophosphorylase (see 
Fig. 15-29) readily reversible. Recall from Chapter 14 
(p. 533) that the cytosolic isozyme of phosphofructoki- 
nase in plants uses PPi, not ATP, as the phosphoryl donor. 

Conversion of Triose Phosphates to Sucrose 
and Starch Is Tightly Regulated 

Triose phosphates produced by the Calvin cycle in bright 
sunlight, as we have noted, may be stored temporarily 
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in the chloroplast as starch, or converted to sucrose and 
exported to nonphotosynthetic parts of the plant, or 
both. The balance between the two processes is tightly 
regulated, and both must be coordinated with the rate of 
carbon fixation. Five-sixths of the triose phosphate 
formed in the Calvin cycle must be recycled to ribulose 
1,5-bisphosphate (Fig. 20-14); if more than one-sixth of 
the triose phosphate is drawn out of the cycle to make 
sucrose and starch, the cycle will slow or stop. However, 
insufficient conversion of triose phosphate to starch or 
sucrose would tie up phosphate, leaving a chloroplast de- 
ficient in P i? which is also essential for operation of the 
Calvin cycle. 

The flow of triose phosphates into sucrose is regu- 
lated by the activity of fructose 1,6-bisphosphatase 
(FBPase-1) and the enzyme that effectively reverses its 
action, PP r dependent phosphofructokinase (PP-PFK-1; 
p. 533). These enzymes are therefore critical points 
for determining the fate of triose phosphates produced 
by photosynthesis. Both enzymes are regulated by 
fructose 2,6-bisphosphate (F26BP), which inhibits 
FBPase-1 and stimulates PP-PFK-1. In vascular plants, 
the concentration of F26BP varies inversely with the 
rate of photosynthesis (Fig. 20-26). Phosphofructoki- 
nase-2, responsible for F26BP synthesis, is inhibited by 
dihydroxyacetone phosphate or 3-phosphoglycerate 
and stimulated by fructose 6-phosphate and Pi. During 
active photosynthesis, dihydroxyacetone phosphate is 
produced and Pi is consumed, resulting in inhibition of 
PFK-2 and lowered concentrations of F26BP. This 
favors greater flux of triose phosphate into fructose 
6-phosphate formation and sucrose synthesis. With this 
regulatory system, sucrose synthesis occurs when the 
level of triose phosphate produced by the Calvin cycle 
exceeds that needed to maintain the operation of the 
cycle. 

Sucrose synthesis is also regulated at the level of 
sucrose 6-phosphate synthase, which is allosterically 
activated by glucose 6-phosphate and inhibited by P t . 
This enzyme is further regulated by phosphorylation 
and dephosphorylation; a protein kinase phosphory- 
lates the enzyme on a specific Ser residue, making it 
less active, and a phosphatase reverses this inactiva- 
tion by removing the phosphate (Fig. 20-27). Inhibi- 
tion of the kinase by glucose 6-phosphate, and of the 
phosphatase by P i; enhances the effects of these two 
compounds on sucrose synthesis. When hexose phos- 
phates are abundant, sucrose 6-phosphate synthase 
is activated by glucose 6-phosphate; when P^ is ele- 
vated (as when photosynthesis is slow), sucrose syn- 
thesis is slowed. During active photosynthesis, triose 
phosphates are converted to fructose 6-phosphate, 
which is rapidly equilibrated with glucose 6-phosphate 
by phosphohexose isomerase. Because the equilib- 
rium lies far toward glucose 6-phosphate, as soon as 
fructose 6-phosphate accumulates, the level of 
glucose 6-phosphate rises and sucrose synthesis is 
stimulated. 
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FIGURE 20-26 Fructose 2,6-bisphosphate as regulator of sucrose 
synthesis. The concentration of the allosteric regulator fructose 2,6- 
bisphosphate in plant cells is regulated by the products of photosyn- 
thetic carbon assimilation and by Pj. Dihydroxyacetone phosphate and 
3-phosphoglycerate produced by C0 2 assimilation inhibit phospho- 
fructokinase-2 (PFK-2), the enzyme that synthesizes the regulator; P| 
stimulates PFK-2. The concentration of the regulator is therefore 
inversely proportional to the rate of photosynthesis. In the dark, 
the concentration of fructose 2,6-bisphosphate increases and stimu- 
lates the glycolytic enzyme PP r dependent phosphofructokinase-1 
(PP-PFK-1), while inhibiting the gluconeogenic enzyme fructose 
1,6-bisphosphatase (FBPase-1). When photosynthesis is active (in the 
light), the concentration of the regulator drops and the synthesis of 
fructose 6-phosphate and sucrose is favored. 



The key regulatory enzyme in starch synthesis is 
ADP-glucose pyrophosphorylase (Fig. 20-28); it is 

activated by 3-phosphoglycerate (which accumulates 
during active photosynthesis) and inhibited by 
(which accumulates when light-driven condensation of 
ADP and Pi slows). When sucrose synthesis slows, 
3-phosphoglycerate formed by C0 2 fixation accumu- 
lates, activating this enzyme and stimulating the synthe- 
sis of starch. 
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FIGURE 20-27 Regulation of sucrose phosphate synthase by phospho- 
rylation. A protein kinase (SPS kinase) specific for sucrose phosphate 
synthase (SPS) phosphorylates a Ser residue in SPS, inactivating it; a 
specific phosphatase (SPS phosphatase) reverses this inhibition. The ki- 
nase is inhibited allostericaliy by glucose 6-phosphate, which also 
activates SPS allostericaliy. The phosphatase is inhibited by ? u which 
also inhibits SPS directly. Thus when the concentration of glucose 6- 
phosphate is high as a result of active photosynthesis, SPS is activated 
and produces sucrose phosphate. A high Pj concentration, which 
occurs when photosynthetic conversion of ADP to ATP is slow, inhibits 
sucrose phosphate synthesis. 
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FIGURE 20-28 Regulation of ADP-glucose phosphorylase by 3- 
phosphoglycerate and P it This enzyme, which produces the precursor 
for starch synthesis, is rate-limiting in starch production. The enzyme is 
stimulated allostericaliy by 3-phosphoglycerate (3-PGA) and inhibited 
by P^ in effect, the ratio [3-PGA]/[P;], which rises with increasing rates 
of photosynthesis, controls starch synthesis at this step. 

SUMMARY 20.3 Biosynthesis of Starch 
and Sucrose 

■ Starch synthase in chloroplasts and amyloplasts 
catalyzes the addition of single glucose residues, 
donated by ADP-glucose, to the reducing end of 
a starch molecule by a two-step insertion 
mechanism. Branches in amylopectin are 
introduced by a second enzyme. 



a Sucrose is synthesized in the cytosol in two steps 
from UDP-glucose and fructose 1-phosphate. 

m The partitioning of triose phosphates between 
sucrose synthesis and starch synthesis is regulated 
by fructose 2,6-bisphosphate (F26BP), an allosteric 
effector of the enzymes that determine the level of 
fructose 6-phosphate. F26BP concentration varies 
inversely with the rate of photosynthesis, and 
F26BP inhibits the synthesis of fructose 6-phosphate, 
the precursor to sucrose. 

20.4 Synthesis of Cell Wall Polysaccharides: 
Plant Cellulose and Bacterial Peptidoglycan 

Cellulose is a major constituent of plant cell walls, provid- 
ing strength and rigidity and preventing the swelling of 
the cell and rupture of the plasma membrane that might 
result when osmotic conditions favor water entry into the 
cell. Each year, worldwide, plants synthesize more than 
10 11 metric tons of cellulose, making this simple polymer 
one of the most abundant compounds in the biosphere. 
The structure of cellulose is simple: linear polymers of 
thousands of (j81 -»4) -linked D-glucose units, assembled 
into bundles of about 36 chains, which aggregate side by 
side to form a microfibril (Fig. 20-29). 
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FIGURE 20-29 Cellulose structure. The plant cell wall is made up in 
part of cellulose molecules arranged side by side to form paracrys- 
talline arrays — cellulose microfibrils. Many microfibrils combine to 
form a cellulose fiber, seen in the scanning electron microscope as a 
structure 5 to 12 nm in diameter, laid down on the cell surface in sev- 
eral layers distinguishable by the different orientations of their fibers. 



20.4 Synthesis of Cell Wall Polysaccharides: Plant Cellulose and Bacterial Peptidoglycan 795 



The biosynthesis of cellulose is less well understood 
than that of glycogen or starch. As a major component of 
the plant cell wall, cellulose must be synthesized from 
intracellular precursors but deposited and assembled 
outside the plasma membrane. The enzymatic machin- 
ery for initiation, elongation, and export of cellulose 
chains is more complicated than that needed to synthe- 
size starch or glycogen (which are not exported). Bacte- 
ria face a similar set of problems when they synthesize 
the complex polysaccharides that make up their cell 
walls, and they may employ some of the same mecha- 
nisms to solve these problems. 

Cellulose Is Synthesized by Supramolecular Structures in 
the Plasma Membrane 

The complex enzymatic machinery that assembles cel- 
lulose chains spans the plasma membrane, with one part 
positioned to bind the substrate, UDP-glucose, in the cy- 
tosol and another part extending to the outside, respon- 
sible for elongating and crystallizing cellulose molecules 
in the extracellular space. Freeze-fracture electron mi- 
croscopy shows these terminal complexes, also called 
rosettes, to be composed of six large particles arranged 
in a regular hexagon with a diameter of about 30 nm 
(Fig. 20-30). Several proteins, including the catalytic 
subunit of cellulose synthase, make up the terminal 
complex. Much of the recent progress in understanding 
cellulose synthesis stems from genetic and molecular 
genetic studies of the plant Arabidopsis thaliana, 
which is especially amenable to genetic dissection and 
whose genome has been sequenced. The gene family 
that encodes this cellulose-synthesizing activity has 



been cloned and found to encode proteins with eight 
transmembrane segments and a central domain on the 
cytosolic side of the plasma membrane that includes se- 
quences expected in a glycosyltransferase (Fig. 20-30). 

In one working model of cellulose synthesis, cellu- 
lose chains are initiated by the formation of a lipid- 
linked intermediate unlike anything involved in starch 
or glycogen synthesis. As shown in step (T) of Figure 
20-30, glucose is transferred from UDP-glucose to a 
membrane lipid, probably the plant sterol sitosterol, on 
the inner face of the plasma membrane. Here, intracellu- 
lar cellulose synthase adds several more glucose 
residues to the first one, in (/31^4) linkage, forming a 
short oligosaccharide chain attached to the sitosterol 
(sitosterol dextrin). Next, the whole sitosterol dextrin 
flips across to the outer face of the plasma membrane, 
where it now associates with another form of cellulose 
synthase. 

(2) The UDP-glucose used for cellulose synthesis is 
generated from sucrose produced during photosynthe- 
sis, by the reaction catalyzed by sucrose synthase 
(named for the reverse reaction): 

Sucrose + UDP > UDP-glucose + fructose 

Cellulose synthase spans the plasma membrane and 
uses cytosolic UDP-glucose as the precursor for extra- 
cellular cellulose synthesis. A membrane-bound form of 
sucrose synthase forms a complex with cellulose syn- 
thase, feeding UDP-glucose from sucrose directly into 
cell wall synthesis. 

(3) A second form of cellulose synthase extends the 
polymer to 500 to 15,000 glucose units, extruding it onto 
the outer surface of the cell. The action of the enzyme is 
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FIGURE 20-30 One proposed model for synthesis of cellulose in a vascu- 
lar plant. This schematic is derived from a combination of genetic and 
biochemical studies of Arabidopsis thaliana and of vascular plants. 
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processive: one enzyme molecule adds many glucose 
units before releasing the growing cellulose chain. The 
direction of chain growth (whether addition occurs at 
the reducing end or at the nonreducing end) has not 
been established. 

Each of the six globules of the rosette consists of 
multiple protein subunits that together synthesize six 
cellulose chains. (4) The large enzyme complex that cat- 
alyzes this process actually moves along the plasma 
membrane, following the course of microtubules in the 
cortex, the cytoplasmic layer just under the membrane. 
Because these microtubules lie perpendicular to the 
axis of the plant's growth, the cellulose microfibrils are 
laid down across the axis of growth. The motion of the 
cellulose synthase complexes is believed to be driven by 
energy released in the polymerization reaction, not by a 
molecular motor such as kinesin. 

The finished cellulose is in the form of crystalline 
microfibrils (Fig. 20-29), each consisting of 36 separate 
cellulose chains lying side by side, all with the same 
(parallel) orientation of nonreducing and reducing ends. 
It seems likely that the 36 separate polymers synthe- 
sized at one rosette arrive together on the outer surface 
of the cell, already aligned and ©ready to crystallize 
as a microfibril of the cell wall. When the 36 polymers 
reach some critical length, their synthesis is terminated 
by an unknown mechanism; crystallization into a mi- 
crofibril follows. 

In the activated precursor of cellulose (UDP- 
glucose), the glucose is a-linked to the nucleotide, but in 
the product (cellulose), glucose residues are Q31->4)- 
linked, so there is an inversion of configuration at the 
anomeric carbon (C-l) as the glycosidic bond forms. 
Glycosyltransferases that invert configuration are gen- 
erally assumed to use a single-displacement mechanism, 
with nucleophilic attack by the acceptor species at the 
anomeric carbon of the donor sugar (UDP-glucose). 

Certain bacteria (Acetobacter, Agrobacteria, Rhi- 
zobia, and Sarcina) and many simple eukaryotes also 
carry out cellulose synthesis, apparently by a mechanism 
similar to that in plants. If the bacteria use a membrane 
lipid to initiate new chains, it cannot be a sterol — 
bacteria do not contain sterols. 

Lipid-Linked Oligosaccharides Are Precursors 
for Bacterial Cell Wall Synthesis 

Like plants, many bacteria have thick, rigid extracellular 
walls that protect them from osmotic lysis. The peptido- 
glycan that gives bacterial envelopes their strength 
and rigidity is an alternating linear copolymer of N- 
acetylglucosamine (GlcNAc) and AA-acetylmuramic acid 
(Mur2Ac), linked by 031^4) glycosidic bonds and cross- 
linked by short peptides attached to the Mur2Ac (Fig. 
20-31). During assembly of the polysaccharide back- 
bone of this complex macromolecule, both GlcNAc and 
Mur2Ac are activated by attachment of a uridine 
nucleotide at their anomeric carbons. First, GlcNAc 




Pentaglycine 
cross-link 



FIGURE 20-31 Peptidoglycan structure. This is the peptidoglycan of 
the cell wall of Staphylococcus aureus, a gram-positive bacterium. 
Peptides (strings of colored spheres) covalently link N-acetylmuramic 
acid residues in neighboring polysaccharide chains. Note the mixture 
of l and d amino acids in the peptides. Gram-positive bacteria such as 
5. aureus have a pentaglycine chain in the cross-link. Gram-negative 
bacteria, such as E. coli, lack the pentaglycine; instead, the terminal 
D-Ala residue of one tetrapeptide is attached directly to a neighboring 
tetrapeptide through either L-Lys or a lysine-like amino acid, di- 
aminopimelic acid. The peptide bond of glutamate is unusual here; it 
involves the carboxyl group of the glutamate side chain. 

1 -phosphate condenses with UTP to form UDP-GlcNAc 
(Fig. 20-32, step®), which reacts with phospho- 
enolpyruvate to form UDP-Mur2Ac (step (2)); five amino 
acids are then added (step (3)). The Mur2Ac-penta- 
peptide moiety is transferred from the uridine nucleotide 
to the membrane lipid dolichol, a long-chain isoprenoid 
alcohol (see Fig. 10-22f) (step®), and a GlcNAc residue 
is donated by UDP-GlcNAc (step (§)). In many bacteria, 
five glycines are added in peptide linkage to the amino 
group of the Lys residue of the pentapeptide (step (6)). 
Finally, this disaccharide decapeptide is added to the 
nonreducing end of an existing peptidoglycan molecule 
(step ©). A transpeptidation reaction cross-links adja- 
cent polysaccharide chains (step (§)), contributing to 
a huge, strong, macromolecular wall around the bacter- 
ial cell. Many of the most effective antibiotics in use 
today act by inhibiting reactions in the synthesis of the 
peptidoglycan. 
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FIGURE 20-32 Synthesis of bacterial peptidoglycan. In 

the early steps of this pathway (© through©), N-acetyl- 
glucosamine (GlcNAc) and N-acetylmuramic acid 
(Mur2Ac) are activated by attachment of a uridine nu- 
cleotide (UDP) to their anomeric carbons and, in the case 
of Mur2Ac, of a long-chain isoprenoid alcohol (dolichol) 
through a phosphodiester bond. These activating groups 
participate in the formation of glycosidic linkages; they 
serve as excellent leaving groups. After steps © and ©, 
assembly of a disaccharide with a peptide side chain (10 
amino acid residues), © this precursor is transferred to the 
nonreducing end of an existing peptidoglycan chain, 
which serves as a primer for the polymerization reaction. 
Finally, ® in a transpeptidation reaction between the pep- 
tide side chains on two different peptidoglycan molecules, 
a Gly residue at the end of one chain displaces a terminal 
D-Ala in the other chain, forming a cross-link. This 
transpeptidation reaction is inhibited by the penicillins, 
which kill bacteria by weakening their cell walls 
(see Fig. 6-27). 
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Many other oligosaccharides and polysaccharides 
are synthesized by similar routes in which sugars are ac- 
tivated for subsequent reactions by attachment to nu- 
cleotides. In the glycosylation of proteins, for example 
(see Fig. 27-39), the precursors of the carbohydrate 
moieties include sugar nucleotides and lipid-linked 
oligosaccharides. 

SUMMARY 20.4 Synthesis of Cell Wall 
Polysaccharides: Plant 
Cellulose and Bacterial 
Peptidoglycan 

■ Cellulose synthesis takes place in terminal 
complexes (rosettes) in the plasma membrane. 
Each cellulose chain begins as a sitosterol dextrin 
formed inside the cell. It then flips to the outside, 
where the oligosaccharide portion is transferred to 
cellulose synthase in the rosette and is then 



extended. Each rosette produces 36 separate 
cellulose chains simultaneously and in parallel. The 
chains crystallize into one of the microfibrils that 
form the cell wall. 

■ Synthesis of the bacterial cell wall peptidoglycan 
also involves lipid-linked oligosaccharides formed 
inside the cell and flipped to the outside for 
assembly. 

20.5 Integration of Carbohydrate 
Metabolism in the Plant Cell 

Carbohydrate metabolism in a typical plant cell is more 
complex in several ways than that in a typical animal cell. 
The plant cell carries out the same processes that gener- 
ate energy in animal cells (glycolysis, citric acid cycle, and 
oxidative phosphorylation); it can generate hexoses from 
three- or four-carbon compounds by gluconeogenesis; it 



L 
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can oxidize hexose phosphates to pentose phosphates 
with the generation of NADPH (the oxidative pentose 
phosphate pathway); and it can produce a polymer of 
(al->4)-linked glucose (starch) and degrade it to gener- 
ate hexoses. But besides these carbohydrate transforma- 
tions that it shares with animal cells, the photosynthetic 
plant cell can fix C0 2 into organic compounds (the rubisco 
reaction); use the products of fixation to generate trioses, 
hexoses, and pentoses (the Calvin cycle); and convert 
acetyl-CoA generated from fatty acid breakdown to four- 
carbon compounds (the glyoxylate cycle) and the four- 
carbon compounds to hexoses (gluconeogenesis). These 
processes, unique to the plant cell, are segregated in sev- 
eral compartments not found in animal cells: the glyoxy- 
late cycle in glyoxysomes, the Calvin cycle in chloroplasts, 
starch synthesis in amyloplasts, and organic acid storage 
in vacuoles. The integration of events among these various 
compartments requires specific transporters in the mem- 
branes of each organelle, to move products from one or- 
ganelle to another or into the cytosol. 

Gluconeogenesis Converts Fats and Proteins 
to Glucose in Germinating Seeds 

Many plants store lipids and proteins in their seeds, to 
be used as sources of energy and as biosynthetic precur- 
sors during germination, before photosynthetic mecha- 
nisms have developed. Active gluconeogenesis in 
germinating seeds provides glucose for the synthesis of 
sucrose, polysaccharides, and many metabolites derived 
from hexoses. In plant seedlings, sucrose provides much 
of the chemical energy needed for initial growth. 

We noted earlier (Chapter 14) that animal cells can 
carry out gluconeogenesis from three- and four-carbon 
precursors, but not from the two acetyl carbons of acetyl- 
CoA. Because the pyruvate dehydrogenase reaction is ef- 
fectively irreversible (pp. 616-617), animal cells have no 
way to convert acetyl-CoA to pyruvate or oxaloacetate. 
Unlike animals, plants and some microorganisms can 
convert acetyl-CoA derived from fatty acid oxidation to 
glucose (Fig. 20-33). Some of the enzymes essential to 
this conversion are sequestered in glyoxysomes, where 
glyoxysome-specific isozymes of oxidation break down 
fatty acids to acetyl-CoA (see Fig. 16-22). The physical 
separation of the glyoxylate cycle and /3-oxidation en- 
zymes from the mitochondrial citric acid cycle enzymes 
prevents further oxidation of acetyl-CoA to C0 2 . Instead, 
the acetyl-CoA is converted to succinate in the glyoxylate 
cycle (see Fig. 16-20). The succinate passes into the mi- 
tochondrial matrix, where it is converted by citric acid cy- 
cle enzymes to oxaloacetate, which moves into the 
cytosol. Cytosolic oxaloacetate is converted by gluconeo- 
genesis to fructose 6-phosphate, the precursor of su- 
crose. Thus the integration of reaction sequences in three 
subcellular compartments is required for the production 
of fructose 6-phosphate or sucrose from stored lipids. Be- 
cause only three of the four carbons in each molecule of 
oxaloacetate are converted to hexose in the cytosol, 
about 75% of the carbon in the fatty acids stored as 



seed lipids is converted to carbohydrate by the combined 
pathways of Figure 20-33. The other 25% is lost as C0 2 in 
the conversion of oxaloacetate to phosphoenolpyruvate. 
Hydrolysis of storage triacylglycerols also produces 
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FIGURE 20-33 Conversion of stored fatty acids to sucrose in germi- 
nating seeds. This pathway begins in glyoxysomes. Succinate is pro- 
duced and exported to mitochondria, where it is converted to 
oxaloacetate by enzymes of the citric acid cycle. Oxaloacetate enters 
the cytosol and serves as the starting material for gluconeogenesis and 
for the synthesis of sucrose, the transport form of carbon in plants. 
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neogenic pathway at the triose phosphate isomerase reaction. 



glycerol 3-phosphate, which can enter the gluconeogenic 
pathway after its oxidation to dihydroxyacetone phos- 
phate (Fig. 20-34). 

Glucogenic amino acids (see Table 14-4) derived 
from the breakdown of stored seed proteins also yield 
precursors for gluconeogenesis, following transamina- 
tion and oxidation to succinyl-CoA, pyruvate, oxaloac- 
etate, fumarate, and a-ketoglutarate (Chapter 18) — all 
good starting materials for gluconeogenesis. 

Pools of Common Intermediates Link Pathways 
in Different Organelles 

Although we have described metabolic transformations 
in plant cells in terms of individual pathways, these 
pathways interconnect so completely that we should in- 
stead consider pools of metabolic intermediates shared 
among these pathways and connected by readily re- 
versible reactions (Fig. 20-35). One such metabolite 
pool includes the hexose phosphates glucose 1 -phos- 
phate, glucose 6-phosphate, and fructose 6-phosphate; 
a second includes the 5-phosphates of the pentoses 
ribose, ribulose, and xylulose; a third includes the triose 
phosphates dihydroxyacetone phosphate and glycer- 
aldehyde 3-phosphate. Metabolite fluxes through these 
pools change in magnitude and direction in response to 
changes in the circumstances of the plant, and they vary 
with tissue type. Transporters in the membranes of each 
organelle move specific compounds in and out, and the 
regulation of these transporters presumably influences 
the degree to which the pools mix. 

During daylight hours, triose phosphates produced in 
leaf tissue by the Calvin cycle move out of the chloroplast 
and into the cytosolic hexose phosphate pool, where they 
are converted to sucrose for transport to nonphotosyn- 
thetic tissues, In these tissues, sucrose is converted to 
starch for storage or is used as an energy source via 
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FIGURE 20-35 Pools of hexose phosphates, pentose phosphates, and 
triose phosphates. The compounds in each pool are readily intercon- 
vertible by reactions that have small standard free-energy changes. 
When one component of the pool is temporarily depleted, a new equi- 



librium is quickly established to replenish it. Movement of the sugar 
phosphates between intracellular compartments is limited; specific 
transporters must be present in an organelle membrane. 
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glycolysis. In growing plants, hexose phosphates are also 
withdrawn from the pool for the synthesis of cell walls. At 
night, starch is metabolized by glycolysis to provide en- 
ergy, essentially as in non-photosynthetic organisms, and 
NADPH and ribose 5-phosphate are obtained through the 
oxidative pentose phosphate pathway 



Further Reading 



SUMMARY 20.5 



Integration of 
Carbohydrate 
Metabolism in the 
Plant Cell 



Plants can synthesize sugars from acetyl-CoA, 
the product of fatty acid breakdown, by the com- 
bined actions of the glyoxylate cycle and gluconeo- 
genesis. 

The individual pathways of carbohydrate 
metabolism in plants overlap extensively; they 
share pools of common intermediates, including 
hexose phosphates, pentose phosphates, and triose 
phosphates. Transporters in the membranes of 
chloroplasts, mitochondria, and amyloplasts 
mediate the movement of sugar phosphates 
between organelles. The direction of metabolite 
flow through the pools changes from day to night. 
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Problems 



1. Segregation of Metabolism in Organelles What are 
the advantages to the plant cell of having different organelles 
to carry out different reaction sequences that share 
intermediates? 



2. Phases of Photosynthesis When a suspension of green 
algae is illuminated in the absence of C0 2 and then incu- 
bated with 14 C0 2 in the dark, 14 C0 2 is converted to [ 14 C]glu- 
cose for a brief time What is the significance of this 
observation with regard to the C0 2 -assimilation process, and 
how is it related to the light reactions of photosynthesis? 



Why does the conversion of 14 C0 2 
a brief time? 



to [ C]glucose stop after 



3. Identification of Key Intermediates in CO a Assimi- 
lation Calvin and his colleagues used the unicellular green 
alga Chlorella to study the carbon-assimilation reactions of 
photosynthesis. They incubated 14 C0 2 with illuminated sus- 
pensions of algae and followed the time course of appearance 
of 14 C in two compounds, X and Y, under two sets of condi- 
tions. Suggest the identities of X and Y, based on your under- 
standing of the Calvin cycle. 

(a) Illuminated Chlorella were grown with unlabeled 
C0 2 , then the light was turned off and 14 C0 2 was added (ver- 
tical dashed line in the graph below). Under these conditions, 
X was the first compound to become labeled with 14 C; Y was 
unlabeled. 



PS 




Time 



(b) Illuminated Chlorella cells were grown with 14 C0 2 . 
Illumination was continued until all the 14 C0 2 had disappeared 
(vertical dashed line in the graph below) . Under these condi- 
tions, X became labeled quickly but lost its radioactivity with 
time, whereas Y became more radioactive with time. 




4. Regulation of the Calvin Cycle Iodoacetate reacts irre- 
versibly with the free — SH groups of Cys residues in proteins. 



O 



NAD+S 



Time 



CH 2 — C x 

NAD F SH n nafi + s CT 

0" 

Iodoacetate Inactive enzyme 

Predict which Calvin cycle enzyme (s) would be inhibited by 
iodoacetate, and explain why. 

5. Thioredoxin in Regulation of Calvin Cycle Enzymes 

Motohashi and colleagues* used thioredoxin as a hook to fish out 
from plant extracts the proteins that are activated by thiore- 
doxin. To do this, they prepared a mutant thioredoxin in which 
one of the reactive Cys residues was replaced with a Ser. Explain 
why this modification was necessary for their experiments. 

6. Comparison of the Reductive and Oxidative Pentose 
Phosphate Pathways The reductive pentose phosphate 
pathway generates a number of intermediates identical to those 
of the oxidative pentose phosphate pathway (Chapter 14). 
What role does each pathway play in cells where it is active? 

7. Photorespiration and Mitochondrial Respiration Com- 
pare the oxidative photosynthetic carbon cycle (C 2 cycle), also 
called photorespiration, with the mitochondrial respiration 
that drives ATP synthesis. Why are both processes referred to 
as respiration? Where in the cell do they occur, and under what 
circumstances? What is the path of electron flow in each? 

8. Rubisco and the Composition of the Atmosphere 

N. E. Tolbert has argued that the dual specificity of rubisco 
for C0 2 and 0 2 is not simply a leftover from evolution in a low- 
oxygen environment. He suggests that the relative activities of 
the carboxylase and oxygenase activities of rubisco actually 
have set, and now maintain, the ratio of C0 2 to 0 2 in the 
earth's atmosphere. Discuss the pros and cons of this hypoth- 
esis, in molecular terms and in global terms. How does the ex- 
istence of C 4 organisms bear on the hypothesis? 

9. Role of Sedoheptulose 1 ,7-Bisphosphatase What ef- 
fect on the cell and the organism might result from a defect in 
sedoheptulose 1,7-bisphosphatase in (a) a human hepatocyte 
and (b) the leaf cell of a green plant? 

10. Pathway of C0 2 Assimilation in Maize If a maize 
(corn) plant is illuminated in the presence of 14 C0 2 , after 
about 1 second, more than 90% of all the radioactivity incorpo- 
rated in the leaves is found at C-4 of malate, aspartate, and ox- 
aloacetate. Only after 60 seconds does 14 C appear at C-l of 
3-phosphogly cerate. Explain. 

*Motohashi, K„ Kondoh, A., Stumpp, M.T., & Hisabori, T. (2001) Com- 
prehensive survey of proteins targeted by chloroplast thioredoxin. Proc. 
Natl Acad Sci USA 98, 11,224-11,229. 

'Tolbert, N.E. (1994) The role of photosynthesis and photorespiration 
in regulating atmospheric C0 2 and 0 2 . In Regulation of Atmospheric 
CO s and 0 2 by Photosynthetic Carbon Metabolism (Tolbert, N.E. & 
Preiss, J., eds), pp. 8-33, Oxford University Press, New York. 



11. Identifying CAM Plants Given some 14 C0 2 and all the 
tools typically present in a biochemistry research lab, how 
would you design a simple experiment to determine whether a 
plant was a typical C 4 plant or a CAM plant? 

12. Chemistry of Malic Enzyme: Variation on a Theme 

Malic enzyme, found in the bundle-sheath cells of C 4 plants, 
carries out a reaction that has a counterpart in the citric acid 
cycle. What is the analogous reaction? Explain your choice. 

13. The Cost of Storing Glucose as Starch Write the 
sequence of steps and the net reaction required to calculate 
the cost, in ATP molecules, of converting a molecule of cytoso- 
lic glucose 6-phosphate to starch and back to glucose 6- 
phosphate. What fraction of the maximum number of ATP 
molecules available from complete catabolism of glucose 
6-phosphate to C0 2 and H 2 0 does this cost represent? 

14. Inorganic Pyrophosphatase The enzyme inorganic py- 
rophosphatase contributes to making many biosynthetic reac- 
tions that generate inorganic pyrophosphate essentially 
irreversible in cells. By keeping the concentration of PP L very 
low, the enzyme "pulls" these reactions in the direction of PPi 
formation. The synthesis of ADP-glucose in chloroplasts is one 
reaction that is pulled in the forward direction by this mecha- 
nism. However, the synthesis of UDP-glucose in the plant 
cytosol, which produces PP i3 is readily reversible in vivo. How 
do you reconcile these two facts? 

15. Regulation of Starch and Sucrose Synthesis Sucrose 
synthesis occurs in the cytosol and starch synthesis in the 
chloroplast stroma, yet the two processes are intricately bal- 
anced. What factors shift the reactions in favor of (a) starch 
synthesis and (b) sucrose synthesis? 

16. Regulation of Sucrose Synthesis In the regulation of 
sucrose synthesis from the those phosphates produced during 
photosynthesis, 3-phosphoglycerate and P t play critical roles 
(see Fig. 20-26). Explain why the concentrations of these two 
regulators reflect the rate of photosynthesis. 

17. Sucrose and Dental Caries The most prevalent infec- 
tion in humans worldwide is dental caries, which stems from 
the colonization and destruction of tooth enamel by a variety 
of acidifying microorganisms. These organisms synthesize and 
live within a water-insoluble network of dextrans, called dental 
plaque, composed of (a 1— »6) -linked polymers of glucose with 
many (cd— >3) branch points. Polymerization of dextran re- 
quires dietary sucrose, and the reaction is catalyzed by a bac- 
terial enzyme, dextran-sucrose glucosyltransferase. 

(a) Write the overall reaction for dextran polymerization. 

(b) In addition to providing a substrate for the formation 
of dental plaque, how does dietary sucrose also provide oral 
bacteria with an abundant source of metabolic energy? 

18. Differences between C 3 and C 4 Plants The plant 
genus Atriplex includes some C 3 and some C 4 species. From 
the data in the following plots (species 1, black curve; species 
2, red curve), identify which is a C 3 plant and which is a C 4 
plant. Justify your answer in molecular terms that account for 
the data in all three plots. 




Light intensity 




Leaf temperature 




[C0 2 1 in intracellular space 



19. C 4 Pathway in a Single Cell In typical C 4 plants, the 
initial capture of C0 2 occurs in one cell type, and the Calvin 
cycle reactions occur in another (see Fig. 20-23), Voznesen- 
skaya and colleagues* have described a plant, Bienertia 
cycloptera — which grows in salty depressions of semidesert 
in Central Asia — that shows the biochemical properties of a C 4 
plant but unlike typical C 4 plants does not segregate the reac- 
tions of C0 2 fixation into two cell types. PEP carboxylase and 
rubisco are present in the same cell. However, the cells have 
two types of chloroplasts, which are localized differently. One 
type, relatively poor in grana (thylakoids) , is confined to the 
periphery; the more typical chloroplasts are clustered in the 
center of the cell, separated from the peripheral chloroplasts 

*Voznesenskaya, E.V., Fraceschi, V.R., Kiirats, 0., Artyusheva, E.G., 
Freitag, H., & Edwards, G E (2002) Proof of C 4 photosynthesis with- 
out Kranz anatomy in Bienertia cycloptera (Chenopodiaceae) Plant 
J. 31, 649-662. 
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by large vacuoles. Thin cytosolic bridges pass through the vac- 
uoles, connecting the peripheral and central cytosol. A micro- 
graph of a B cycloptera cell, with arrows pointing to peripheral 
chloroplasts, is shown below. 




In this plant, where would you expect to find (a) PEP car- 
boxylase, (b) rubisco, and (c) starch granules? Explain your 
answers with a model for C0 2 fixation in these C 4 cells. 

Data Analysis Problem 

20. Rubisco of Bacterial Endosymbionts of Hydrother- 
mal Vent Animals Undersea hydrothermal vents support re- 
markable ecosystems. At these extreme depths there is no 
light to support photosynthesis, yet thriving vent communities 
are found. Much of their primary productivity occurs through 
chemosynthesis carried out by bacterial symbionts that live in 
specialized organs (trophosomes) of certain vent animals. 

Chemosynthesis in these bacteria involves a process that 
is virtually identical to photosynthesis. Carbon dioxide is fixed 
by rubisco and reduced to glucose, and the necessary ATP and 
NADPH are produced by electron-transfer processes similar to 
those of the light- dependent reactions of photosynthesis. The 
key difference is that in chemosynthesis, the energy driving 
electron transfer comes from a highly exergonic chemical 
reaction rather than from light. Different chemosynthetic 
bacteria use different reactions for this purpose. The bacteria 
found in hydrothermal vent animals typically use the oxidation 
of H 2 S (abundant in the vent water) by 0 2 , producing elemen- 
tal sulfur. These bacteria also use the conversion of H 2 S to sul- 
fur as a source of electrons for chemosynthetic C0 2 reduction. 

(a) What is the overall reaction for chemosynthesis in 
these bacteria? You do not need to write a balanced equation; 
just give the starting materials and products. 

(b) Ultimately, these endosymbiotic bacteria obtain their 
energy from sunlight. Explain how this occurs. 

Robinson and colleagues (2003) explored the properties 
of rubisco from the bacterial endosymbiont of the giant tube 
worm Riftia pachyptila. Rubisco, from any source, catalyzes 
the reaction of either C0 2 (Fig. 20-7) or 0 2 (Fig. 20-20) with 
ribulose 1,5-bisphosphate. In general, rubisco reacts more 



readily with C0 2 than 0 2 . The degree of selectivity (ft) can be 
expressed as 

^carboxylation ^ [CO2] 
^oxygenation 

[0 2 ] 

where Vis the reaction velocity. 

Robinson and coworkers measured the ft value for the ru- 
bisco of the bacterial endosymbionts. They purified rubisco 
from tube-worm trophosomes, reacted it with mixtures of dif- 
ferent ratios of 0 2 and C0 2 in the presence of [l- 3 H]ribulose 
1,5-bisphosphate, and measured the ratio of [ 3 H]phosphoglyc- 
erate to [ 3 H]phosphoglycolate. 

(c) The measured ratio of [ 3 H]phosphoglycerate to 
[ 3 H]phosphoglycolate is equal to the ratio ^carboxyiation/ 
Oxygenation- Explain why. 

(d) Why would [5- 3 H]ribulose 1,5-bisphosphate not be a 
suitable substrate for this assay? 

The ft for the endosymbiont rubisco had a value of 8.6 ± 0.9. 

(e) The atmospheric (molar) concentration of 0 2 is 20% and 
that of C0 2 is about 380 parts per million. If the endosymbiont 
were to carry out chemosynthesis under these atmospheric con- 
ditions, What WOUld be the Value Of ^carboxylation/^oxygenation? 

(f) Based on your answer to (e), would you expect ft for 
the rubisco of a terrestrial plant to be higher than, the same as, 
or lower than 8.6? Explain your reasoning. 

Two stable isotopes of carbon are commonly found in the 
environment: the more abundant 12 C and the rare 13 C. All ru- 
bisco enzymes catalyze the fixation of 12 C0 2 faster than that of 
13 C0 2 . As a result, the carbon in glucose is slightly enriched in 
12 C compared with the isotopic composition of C0 2 in the en- 
vironment. Several factors are involved in this "preferential" 
use of 12 C0 2 , but one factor is the fundamental physics of 
gases. The temperature of a gas is related to the kinetic energy 
of its molecules. Kinetic energy is given by 1 Amv z , where m is 
molecular mass and v is velocity. Thus, at the same tempera- 
ture (same kinetic energy), the molecules of a lighter gas will 
be moving faster than those of a heavier gas. 

(g) How could this contribute to rubisco's "preference" 
for 12 C0 2 over 13 C0 2 ? 

Some of the first convincing evidence that the tube-worm 
hosts were obtaining their fixed carbon from the endosym- 
bionts was that the 13 C/ 12 C ratio in the animals was much closer 
to that of the bacteria than that of nonvent marine animals. 

(h) Why is this more convincing evidence for a symbiotic 
relationship than earlier studies that simply showed the pres- 
ence of rubisco in the bacteria found in trophosomes? 

Reference 

Robinson, J.J., Scott, K.M., Swanson, S.T., O'Leary, M.H., 
Horken, K., Tabita, F.R., & Cavanaugh, CM. (2003) Kinetic 
isotope effect and characterization of form II RubisCO from the 
chemoautotrophic endosymbionts of the hydrothermal vent 
tubeworm Riftia pachyptila. Limnol. Oceanogr. 48, 48-54. 



How the division of "spoils" came about I do not recall— it may have 
been by drawing lots. At any rate, David Shemin "drew" amino acid 
metabolism, which led to his classical work on heme biosynthesis. 
David Rittenburg was to continue his interest in protein synthesis and 
turnover, and lipids were to be my territory. 

— Konrad Block, on how his career turned to problems of lipid 
metabolism after the death of his mentor, Rudolf Schoenheimer; 

article in Annual Review of Biochemistry, 1987 
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Lipids play a variety of cellular roles, some only 
recently recognized. They are the principal form of 
stored energy in most organisms and major 
constituents of cellular membranes. Specialized lipids 
serve as pigments (retinal, carotene), cofactors (vitamin 
K), detergents (bile salts), transporters (dolichols), 
hormones (vitamin D derivatives, sex hormones), extra- 
cellular and intracellular messengers (eicosanoids, 
phosphatidylinositol derivatives), and anchors for mem- 
brane proteins (covalently attached fatty acids, prenyl 
groups, and phosphatidylinositol). The ability to synthe- 
size a variety of lipids is essential to all organisms. This 
chapter describes the biosynthetic pathways for some of 
the most common cellular lipids, illustrating the strate- 
gies employed in assembling these water-insoluble prod- 
ucts from water-soluble precursors such as acetate. Like 
other biosynthetic pathways, these reaction sequences 
are endergonic and reductive. They use ATP as a source 
of metabolic energy and a reduced electron carrier 
(usually NADPH) as a reductant. 

We first describe the biosynthesis of fatty acids, the 
primary components of both triacylglycerols and phos- 
pholipids, then examine the assembly of fatty acids into 
triacylglycerols and the simpler membrane phospho- 
lipids. Finally, we consider the synthesis of cholesterol, 
a component of some membranes and the precursor of 
steroids such as the bile acids, sex hormones, and 
adrenocortical hormones. 



21.1 Biosynthesis of Fatty Acids 
and Eicosanoids 

After the discovery that fatty acid oxidation takes 
place by the oxidative removal of successive two-car- 
bon (acetyl-CoA) units (see Fig. 17-8), biochemists 
thought the biosynthesis of fatty acids might proceed 
by a simple reversal of the same enzymatic steps. How- 
ever, as they were to find out, fatty acid biosynthesis 
and breakdown occur by different pathways, are cat- 
alyzed by different sets of enzymes, and take place in 
different parts of the cell. Moreover, biosynthesis re- 
quires the participation of a three-carbon intermedi- 
ate, malonyl-CoA, that is not involved in fatty acid 
breakdown. 

X # 
C-CH 2 -C x 

"O S-CoA 

Malonyl-CoA 

We focus first on the pathway of fatty acid synthesis, 
then turn our attention to regulation of the pathway and 
to the biosynthesis of longer-chain fatty acids, unsatu- 
rated fatty acids, and their eicosanoid derivatives. 

Malonyl-CoA Is Formed from Acetyl-CoA 
and Bicarbonate 

The formation of malonyl-CoA from acetyl-CoA is an 
irreversible process, catalyzed by acetyl-CoA car- 
boxylase. The bacterial enzyme has three separate 
polypeptide subunits (Fig. 21-1); in animal cells, all 
three activities are part of a single multifunctional 
polypeptide. Plant cells contain both types of acetyl- 
CoA carboxylase. In all cases, the enzyme contains a 
biotin prosthetic group covalently bound in amide 
linkage to the s-amino group of a Lys residue in one 

[805] 
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of the three polypeptides or domains of the enzyme 
molecule. The two-step reaction catalyzed by this en- 
zyme is very similar to other biotin-dependent car- 
boxylation reactions, such as those catalyzed by 
pyruvate carboxylase (see Fig. 16-16) and propionyl- 
CoA carboxylase (see Fig. 17-11). A carboxyl group, 
derived from bicarbonate (HCCQ, is first transferred 
to biotin in an ATP-dependent reaction. The biotinyl 



FIGURE 21-1 The acetyl-CoA carboxylase reaction. Acetyl-CoA car- 
boxylase has three functional regions: biotin carrier protein (gray); 
biotin carboxylase, which activates C0 2 by attaching it to a nitrogen 
in the biotin ring in an ATP-dependent reaction (see Fig. 16-16); and 
transcarboxylase, which transfers activated C0 2 (shaded green) from 
biotin to acetyl-CoA, producing malonyl-CoA. The long, flexible 
biotin arm carries the activated C0 2 from the biotin carboxylase region 
to the transcarboxylase active site. The active enzyme in each step is 
shaded blue. 



group serves as a temporary carrier of C0 2 , transfer- 
ring it to acetyl-CoA in the second step to yield 
malonyl-CoA. 



Fatty Acid Synthesis Proceeds in a Repeating 
Reaction Sequence 

In all organisms, the long carbon chains of fatty acids are 
assembled in a repeating four-step sequence (Fig. 21-2), 
catalyzed by a system collectively referred to as fatty 
acid synthase. A saturated acyl group produced by 
each four-step series of reactions becomes the substrate 
for subsequent condensation with an activated malonyl 
group. With each passage through the cycle, the fatty 
acyl chain is extended by two carbons. 

Both the electron-carrying cofactor and the acti- 
vating groups in the reductive anabolic sequence dif- 
fer from those in the oxidative catabolic process. 
Recall that in /3 oxidation, NAD + and FAD serve as 
electron acceptors and the activating group is the thiol 
( — SH) group of coenzyme A (see Fig. 17-8). By con- 
trast, the reducing agent in the synthetic sequence is 
NADPH and the activating groups are two different 
enzyme-bound — SH groups, as described in the fol- 
lowing section. 

There are two major variants of fatty acid synthase: 
fatty acid synthase I (FAS I), found in vertebrates and 
fungi, and fatty acid synthase II (FAS II), found in 
plants and bacteria. The FAS I found in vertebrates 
consists of a single multifunctional polypeptide chain 
(M r 240,000). The mammalian FAS I is the prototype. 
Seven active sites for different reactions lie in separate 
domains (Fig. 21-3a). The mammalian polypeptide 
functions as a homodimer (M r 480,000). The subunits 
appear to function independently. When all the active 
sites in one subunit are inactivated by mutation, fatty 
acid synthesis is only modestly reduced. A somewhat 
different FAS I is found in yeast and other fungi, and is 
made up of two multifunctional polypeptides that form 
a complex with an architecture distinct from the verte- 
brate systems (Fig. 21-3b). Three of the seven required 
active sites are found on the a subunit and four on the 
jS subunit. 
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FIGURE 21-2 Addition of two carbons to a growing fatty acyl chain: a 
four-step sequence. Each tnalonyl group and acetyl (or longer acyl) group 
is activated by a thioester that links it to fatty acid synthase, a multienzyme 
system described later in the text. (T) Condensation of an activated acyl 
group (an acetyl group from acetyl-CoA is the first acyl group) and two car- 
bons derived from malonyl-CoA, with elimination of C0 2 from the mal- 
onyl group, extends the acyl chain by two carbons. The mechanism of the 
first step of this reaction is given to illustrate the role of decarboxylation in 
facilitating condensation. The £-keto product of this condensation is then 
reduced in three more steps nearly identical to the reactions of /3 oxidation, 
but in the reverse sequence: @ the jS-keto group is reduced to an alcohol, 
(5) elimination of H 2 0 creates a double bond, and (?) the double bond is 
reduced to form the corresponding saturated fatty acyl group. 
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FIGURE 21-3 The structure of fatty acid synthase type I systems. The 

low-resolution structures of (a) the mammalian (porcine; derived from 
PDB ID 2CF2) and (b) fungal enzyme systems (derived from PDB IDs 
2UV9, 2UVA, 2UVB, and 2UVC) are shown, (a) All of the active sites in 
the mammalian system are located in different domains within a single 
large polypeptide chain. The different enzymatic activites are: /3-ketoacyl- 
ACP synthase (KS), malonyl/acetyl-CoA-ACP transferase (MAT), /3- 
hydroxyacyl-ACP dehydratase (DH), enoyl-ACP reductase (ER), and 
£-ketoacyl-ACP reductase (KR), ACP is the acyl carrier protein. The linear 
arrangement of the domains in the polypeptide is shown in the lower 
panel. The seventh domain (TE) is a thioesterase that releases the palmi- 
tate product from ACP when the synthesis is completed. The ACP and TE 
domains are disordered in the crystal and are therefore not shown in the 
structure, (b) In the structure of the FAS I from the fungus Thermomyces 
lanuginosus, the same active sites are divided between two multifunc- 
tional polypeptide chains that function together. Six copies of each 
polypeptide are found in the heterododecameric complex. A wheel of six 
a subunits, which include ACP as well as the KS and KR active sites, is 
found at the center of the complex. In the wheel three subunits are found 
on one face, three on the other. On either side of the wheel are domes 
formed by trimers of the j0 subunits (containing the ER and DH active 
sites, as well as two domains with active sites analogous to MAT in the 
mammalian enzyme). The domains of one of each type of subunit are col- 
ored according to the active site colors of the mammalian enzyme in (a). 
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FIGURE 21-4 The overall process of palmitate synthesis. The fatty acyl 
chain grows by two-carbon units donated by activated malonate, with 
loss of C0 2 at each step. The initial acetyl group is shaded yellow, C-1 
and C-2 of malonate are shaded pink, and the carbon released as C0 2 
is shaded green. After each two-carbon addition, reductions convert 
the growing chain to a saturated fatty acid of four, then six, then eight 
carbons, and so on. The final product is palmitate (1 6:0). 
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With FAS I systems, fatty acid synthesis leads to a 
single product, and no intermediates are released. 
When the chain length reaches 16 carbons, that prod- 
uct (palmitate, 16:0; see Table 10-1) leaves the cycle. 
Carbons C-1 6 and C-1 5 of the palmitate are derived from 
the methyl and carboxyl carbon atoms, respectively, 
of an acetyl-CoA used directly to prime the system 
at the outset (Fig. 21-4); the rest of the carbon 
atoms in the chain are derived from acetyl-CoA via 
malonyl-CoA. 

FAS II, in plants and bacteria, is a dissociated sys- 
tem; each step in the synthesis is catalyzed by a sepa- 
rate and freely diffusible enzyme. Intermediates are also 
diffusible and may be diverted into other pathways 
(such as lipoic acid synthesis). Unlike FAS I, FAS II gen- 
erates a variety of products, including saturated fatty 
acids of several lengths, as well as unsaturated, 
branched, and hydroxy fatty acids. An FAS II system is 
also found in vertebrate mitochondria. The discussion to 
follow will focus on the mammalian FAS I. 

The Mammalian Fatty Acid Synthase 
Has Multiple Active Sites 

The multiple domains of mammalian FAS I function as 
distinct but linked enzymes. The active site for each en- 
zyme is found in a separate domain within the larger 
polypeptide. Throughout the process of fatty acid syn- 
thesis, the intermediates remain covalently attached as 
thioesters to one of two thiol groups. One point of 



attachment is the — SH group of a Cys residue in one of 
the synthase domains (/3-ketoacyl-ACP synthase; KS); 
the other is the — SH group of acyl carrier protein, a sep- 
arate domain of the same polypeptide. Hydrolysis of 
thioesters is highly exergonic, and the energy released 
helps to make two different steps ((D and © in 
Fig. 21-6) in fatty acid synthesis (condensation) ther- 
modynamically favorable. 

Acyl carrier protein (ACP) is the shuttle that 
holds the system together. The Escherichia coli ACP 
is a small protein (M r 8,860) containing the prosthetic 
group 4 -phosphopantetheine (Fig. 21-5; compare 
this with the panthothenic acid and /3-mercaptoeth- 
ylamine moiety of coenzyme A in Fig. 8-38). The 
4'-phosphopantetheine prosthetic group of E. coli 
ACP is believed to serve as a flexible arm, tethering 
the growing fatty acyl chain to the surface of the fatty 
acid synthase complex while carrying the reaction in- 
termediates from one enzyme active site to the next. 
The ACP of mammals has a similar function and the 
same prosthetic group; as we have seen, however, it is 
embedded as a domain in a much larger multifunc- 
tional polypeptide. 

Fatty Acid Synthase Receives the Acetyl 
and Malonyl Groups 

Before the condensation reactions that build up the 
fatty acid chain can begin, the two thiol groups on the 
enzyme complex must be charged with the correct acyl 
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FIGURE 21-5 Acyl carrier protein (ACP). The prosthetic group is 4'- 
phosphopantetheine, which is covalently attached to the hydroxyl 
group of a Ser residue in ACP. Phosphopantetheine contains the B vita- 
min pantothenic acid, also found in the coenzyme A molecule. Its — SH 
group is the site of entry of malonyl groups during fatty acid synthesis. 



groups (Fig. 21-6, top). First, the acetyl group of 
acetyl-CoA is transferred to ACP in a reaction catalyzed 
by the malonyl /acetyl-CoA-ACP transferase (MAT 
in Fig. 21-6) domain of the multifunctional polypeptide. 
The acetyl group is then transferred to the Cys — SH 
group of the /3-ketoacyl-ACP synthase (KS). The sec- 
ond reaction, transfer of the malonyl group from mal- 
onyl-CoA to the — SH group of ACP, is also catalyzed by 
malonyl/acetyl-CoA-ACP transferase. In the charged 
synthase complex, the acetyl and malonyl groups are ac- 
tivated for the chain-lengthening process. The first four 
steps of this process are now considered in some detail; 
all step numbers refer to Figure 21-6. 

Step (J) Condensation The first reaction in the for- 
mation of a fatty acid chain is a formal Claisen conden- 
sation involving the activated acetyl and malonyl groups 
to form acetoacetyl-ACP, an acetoacetyl group bound 
to ACP through the phosphopantetheine — SH group; 
simultaneously, a molecule of C0 2 is produced. In this 
reaction, catalyzed by /3-ketoacyl-ACP synthase, the 
acetyl group is transferred from the Cys — SH group of 
the enzyme to the malonyl group on the — SH of ACP, 
becoming the methyl-terminal two-carbon unit of the 
new acetoacetyl group. 



The carbon atom of the C0 2 formed in this reaction 
is the same carbon originally introduced into malonyl- 
CoA from HCO3 by the acetyl-CoA carboxylase reaction 
(Fig. 21-1). Thus C0 2 is only transiently in covalent 
linkage during fatty acid biosynthesis; it is removed as 
each two-carbon unit is added. 

Why do cells go to the trouble of adding C0 2 to make 
a malonyl group from an acetyl group, only to lose the 
C0 2 during the formation of acetoacetate? Recall that in 
the /3 oxidation of fatty acids (see Fig. 17-8), cleavage 
of the bond between two acyl groups (cleavage of an 
acetyl unit from the acyl chain) is highly exergonic, so 
the simple condensation of two acyl groups (two acetyl- 
CoA molecules, for example) is highly endergonic. The 
use of activated malonyl groups rather than acetyl 
groups is what makes the condensation reactions ther- 
modynamically favorable. The methylene carbon (C-2) 
of the malonyl group, sandwiched between carbonyl and 
carboxyl carbons, is chemically situated to act as a good 
nucleophile. In the condensation step (step ®), decar- 
boxylation of the malonyl group facilitates the nucle- 
ophilic attack of the methylene carbon on the thioester 
linking the acetyl group to 0-ketoacyl-ACP synthase, dis- 
placing the enzyme's — SH group. Coupling the conden- 
sation to the decarboxylation of the malonyl group 
renders the overall process highly exergonic. A similar 
carboxylation-decarboxylation sequence facilitates the 
formation of phosphoenolpyruvate from pyruvate in glu- 
coneogenesis (see Fig. 14-17). 

By using activated malonyl groups in the synthesis 
of fatty acids and activated acetate in their degradation, 
the cell makes both processes energetically favorable, 
although one is effectively the reversal of the other. The 
extra energy required to make fatty acid synthesis fa- 
vorable is provided by the ATP used to synthesize mal- 
onyl-CoA from acetyl-CoA and HCO3 (Fig. 21-1). 

Step (2) Reduction of the Carbonyl Group The ace- 
toacetyl-ACP formed in the condensation step now under- 
goes reduction of the carbonyl group at C-3 to form 
D-/3-hydroxybutyryl-ACP This reaction is catalyzed by 
/S-ketoacyl-ACP reductase (KR) and the electron donor 
is NADPH. Notice that the D-/3-hydroxybutyryl group does 
not have the same stereoisomeric form as the L-/3-hydrox- 
yacyl intermediate in fatty acid oxidation (see Fig. 17-8). 

Step (D Dehydration The elements of water are now 
removed from C-2 and C-3 of D-/3-hydroxybutyryl-ACP 
to yield a double bond in the product, trans-A 2 - 
butenoyl-ACP. The enzyme that catalyzes this dehy- 
dration is /J-hydroxyacyl-ACP dehydratase (DH). 

Step ® Reduction of the Double Bond Finally, the 
double bond of £rcms-A 2 -butenoyl-ACP is reduced 
(saturated) to form butyryl-ACP by the action of 
enoyl-ACP reductase (ER); again, NADPH is the elec- 
tron donor. 
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FIGURE 21-6 Sequence of events during synthesis of a fatty acid. The 

mammalian FAS I complex is shown schematically, with catalytic do- 
mains colored as in Figure 21-3, Each domain of the larger polypep- 
tide represents one of the six enzymatic activities of the complex, 
arranged in a large, tight "S" shape. The acyl carrier protein (ACP) is not 
resolved in the crystal structure shown in Figure 21-3, but is attached 
to the KS domain. The phosphopantetheine arm of ACP ends in an — SH. 
After the first panel, the enzyme shown in color is the one that will act 
in the next step. As in Figure 21-4, the initial acetyl group is shaded yel- 
low, C-1 and C-2 of malonate are shaded pink, and the carbon released 
as C0 2 is shaded green. Steps © to @ are described in the text. 
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The Fatty Acid Synthase Reactions Are 
Repeated to Form Palmitate 

Production of the four-carbon, saturated fatty acyl-ACP 
marks completion of one pass through the fatty acid syn- 
thase complex. The butyryl group is now transferred 
from the phosphopantetheine — SH group of ACP to the 
Cys — SH group of /3-ketoacyl-ACP synthase, which ini- 
tially bore the acetyl group (Fig. 21-6). To start the next 
cycle of four reactions that lengthens the chain by two 
more carbons, another malonyl group is linked to the 
now unoccupied phosphopantetheine — SH group of 
ACP (Fig. 21-7). Condensation occurs as the butyryl 
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FIGURE 21-7 Beginning of the second round of the fatty acid syn- 
thesis cycle. The butyryl group is on the Cys — SH group. The incom- 
ing malonyl group is first attached to the phosphopantetheine — SH 
group. Then, in the condensation step, the entire butyryl group on the 
Cys — SH is exchanged for the carboxyl group of the malonyl residue, 
which is lost as C0 2 (green). This step is analogous to step (J) in Figure 
21—6 The product, a six-carbon j6-ketoacyl group, now contains four 
carbons derived from malonyl-CoA and two derived from the acetyl- 
CoA that started the reaction. The /3-ketoacyl group then undergoes 
steps (5) through (7), as in Figure 21-6. 



group, acting like the acetyl group in the first cycle, is 
linked to two carbons of the malonyl-ACP group with 
concurrent loss of C0 2 . The product of this condensation 
is a six-carbon acyl group, covalently bound to the phos- 
phopantetheine — SH group. Its /3-keto group is reduced 
in the next three steps of the synthase cycle to yield the 
saturated acyl group, exactly as in the first round of re- 
actions — in this case forming the six-carbon product. 

Seven cycles of condensation and reduction pro- 
duce the 16-carbon saturated palmitoyl group, still 
bound to ACP. For reasons not well understood, chain 
elongation by the synthase complex generally stops at 
this point and free palmitate is released from the ACP by 
a hydrolytic activity (thioesterase; TE) in the multifunc- 
tional protein. 

We can consider the overall reaction for the synthe- 
sis of palmitate from acetyl-CoA in two parts. First, the 
formation of seven malonyl-CoA molecules: 

7 Acetyl-CoA + 7C0 2 + 7ATP ^ 

7 malonyl-CoA + 7ADP + IV, (21-1) 

then seven cycles of condensation and reduction: 

Acetyl-CoA + 7 malonyl-CoA + 14NADPH + 14H + -> 

palmitate + 7C0 2 + 8 CoA + 14NADP + + 6H 2 0 (21-2) 

Note that only six net water molecules are produced, 
because one is used to hydrolyze the thioester linking 
the palmitate product to the enzyme. The overall 
process (the sum of Eqns 21-1 and 21-2) is 

8 Acetyl-CoA + 7 ATP + 14NADPH + 14H + ^ 
palmitate + 8 CoA + 7ADP + 7P t + 14NADP + + 6H 2 0 (21-3) 

The biosynthesis of fatty acids such as palmitate thus re- 
quires acetyl-CoA and the input of chemical energy in 
two forms: the group transfer potential of ATP and the 
reducing power of NADPH. The ATP is required to 
attach C0 2 to acetyl-CoA to make malonyl-CoA; the 
NADPH is required to reduce the double bonds. 

In nonphotosynthetic eukaryotes there is an addi- 
tional cost to fatty acid synthesis, because acetyl-CoA is 
generated in the mitochondria and must be transported 
to the cytosol. As we will see, this extra step consumes 
two ATPs per molecule of acetyl-CoA transported, in- 
creasing the energetic cost of fatty acid synthesis to 
three ATPs per two-carbon unit. 

Fatty Acid Synthesis Occurs in the Cytosol of Many 
Organisms but in the Chioroplasts of Plants 

In most higher eukaryotes, the fatty acid synthase com- 
plex is found exclusively in the cytosol (Fig. 21-8), as 
are the biosynthetic enzymes for nucleotides, amino 
acids, and glucose. This location segregates synthetic 
processes from degradative reactions, many of which 
take place in the mitochondrial matrix. There is a corre- 
sponding segregation of the electron-carrying cofactors 
used in anabolism (generally a reductive process) and 
those used in catabolism (generally oxidative). 
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FIGURE 21-8 Subcellular localization of lipid metabolism. Yeast and 
vertebrate cells differ from higher plant cells in the compartmentation 
of lipid metabolism. Fatty acid synthesis takes place in the compart- 
ment in which NADPH is available for reductive synthesis (i.e., where 
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the [NADPH]/[NADP + ] ratio is high); this is the cytosol in animals and 
yeast, and the chloroplast in plants. Processes in red type are covered 
in this chapter. 



Usually, NADPH is the electron carrier for ana- 
bolic reactions, and NAD + serves in catabolic reac- 
tions. In hepatocytes, the [NADPH]/[NADP + ] ratio is 
very high (about 75) in the cytosol, furnishing a 
strongly reducing environment for the reductive syn- 
thesis of fatty acids and other biomolecules. The cy- 
tosolic [NADH]/[NAD + ] ratio is much smaller (only 
about 8 X 10" 4 ), so the NAD + -dependent oxidative 
catabolism of glucose can take place in the same com- 
partment, and at the same time, as fatty acid synthe- 
sis. The [NADH]/[NAD + ] ratio in the mitochondrion is 
much higher than in the cytosol, because of the flow of 
electrons to NAD + from the oxidation of fatty acids, 
amino acids, pyruvate, and acetyl-CoA. This high mi- 
tochondrial [NADH]/[NAD + ] ratio favors the reduction 
of oxygen via the respiratory chain. 

In hepatocytes and adipocytes, cytosolic NADPH is 
largely generated by the pentose phosphate pathway 
(see Fig. 14-21) and by malic enzyme (Fig. 21-9a). 
The NADP-linked malic enzyme that operates in the car- 
bon-assimilation pathway of C 4 plants (see Fig. 20-23) 
is unrelated in function. The pyruvate produced in the 
reaction shown in Figure 21-9a reenters the mitochon- 
drion. In hepatocytes and in the mammary gland of lac- 
tating animals, the NADPH required for fatty acid 
biosynthesis is supplied primarily by the pentose phos- 
phate pathway (Fig. 21 -9b). 
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FIGURE 21-9 Production of NADPH. Two routes to NADPH, cat- 
alyzed by (a) malic enzyme and (b) the pentose phosphate pathway. 

In the photosynthetic cells of plants, fatty acid syn- 
thesis occurs not in the cytosol but in the chloroplast 
stroma (Fig. 21-8). This makes sense, given that 
NADPH is produced in chloroplasts by the light reac- 
tions of photosynthesis: 



H 2 0 + NADP + 



light 



-> i0 2 + NADPH 



H" 
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Acetate Is Shuttled out of Mitochondria as Citrate 

In nonphotosynthetic eukaryotes, nearly all the acetyl- 
CoA used in fatty acid synthesis is formed in mitochon- 
dria from pyruvate oxidation and from the catabolism of 
the carbon skeletons of amino acids, Acetyl-CoA arising 
from the oxidation of fatty acids is not a significant 
source of acetyl-CoA for fatty acid biosynthesis in ani- 
mals, because the two pathways are reciprocally regu- 
lated, as described below. 

The mitochondrial inner membrane is imperme- 
able to acetyl-CoA, so an indirect shuttle transfers 
acetyl group equivalents across the inner membrane 
(Fig. 21-10). Intramitochondrial acetyl-CoA first re- 
acts with oxaloacetate to form citrate, in the citric 
acid cycle reaction catalyzed by citrate synthase 
(see Fig. 16-7). Citrate then passes through the in- 
ner membrane on the citrate transporter. In the cy- 



tosol, citrate cleavage by citrate lyase regenerates 
acetyl-CoA and oxaloacetate in an ATP-dependent re- 
action. Oxaloacetate cannot return to the mitochon- 
drial matrix directly, as there is no oxaloacetate 
transporter. Instead, cytosolic malate dehydrogenase 
reduces the oxaloacetate to malate, which can return 
to the mitochondrial matrix on the malate-a-ketoglu- 
tarate transporter in exchange for citrate. In the ma- 
trix, malate is reoxidized to oxaloacetate to complete 
the shuttle. However, most of the malate produced in 
the cytosol is used to generate cytosolic NADPH 
through the activity of malic enzyme (Fig. 21-9a). The 
pyruvate produced is transported to the mitochondria 
by the pyruvate transporter (Fig. 21-10), and con- 
verted back into oxaloacetate by pyruvate carboxylase 
in the matrix. The resulting cycle results in the con- 
sumption of two ATPs (by citrate lyase and pyruvate 
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FIGURE 21-10 Shuttle for transfer of acetyl groups from mitochon- 
dria to the cytosol. The mitochondrial outer membrane is freely per- 
meable to all these compounds. Pyruvate derived from amino acid 
catabolism in the mitochondrial matrix, or from glucose by glycolysis 
in the cytosol, is converted to acetyl-CoA in the matrix. Acetyl groups 
pass out of the mitochondrion as citrate; in the cytosol they are 



delivered as acetyl-CoA for fatty acid synthesis. Oxaloacetate is re- 
duced to malate, which can return to the mitochondrial matrix and is 
converted to oxaloacetate. The major fate for cytosolic malate is oxi- 
dation by malic enzyme to generate cytosolic NADPH; the pyruvate 
produced returns to the mitochondrial matrix. 
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carboxylase) for every molecule of acetyl-CoA deliv- 
ered to fatty acid synthesis. After citrate cleavage to 
generate acetyl-CoA, conversion of the four remaining 
carbons to pyruvate and C0 2 via malic enzyme gener- 
ates about half the NADPH required for fatty acid syn- 
thesis. The pentose phosphate pathway contributes 
the rest of the needed NADPH. 

Fatty Acid Biosynthesis Is Tightly Regulated 

When a cell or organism has more than enough meta- 
bolic fuel to meet its energy needs, the excess is gener- 
ally converted to fatty acids and stored as lipids such as 
triacylglycerols. The reaction catalyzed by acetyl-CoA 
carboxylase is the rate-limiting step in the biosynthesis 
of fatty acids, and this enzyme is an important site of 
regulation. In vertebrates, palmitoyl-CoA ; the principal 
product of fatty acid synthesis, is a feedback inhibitor 
of the enzyme; citrate is an allosteric activator (Fig. 
21-lla), increasing V miiX . Citrate plays a central role in 
diverting cellular metabolism from the consumption 
(oxidation) of metabolic fuel to the storage of fuel as 
fatty acids. When the concentrations of mitochondrial 
acetyl-CoA and ATP increase, citrate is transported out 
of mitochondria; it then becomes both the precursor of 
cytosolic acetyl-CoA and an allosteric signal for the ac- 
tivation of acetyl-CoA carboxylase. At the same time, 
citrate inhibits the activity of phosphofructokinase-1 
(see Fig. 15-14), reducing the flow of carbon through 
glycolysis. 

Acetyl-CoA carboxylase is also regulated by cova- 
lent modification. Phosphorylation, triggered by the 
hormones glucagon and epinephrine, inactivates the 
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FIGURE 21-11 Regulation of fatty acid synthesis, (a) In the cells of 
vertebrates, both allosteric regulation and hormone-dependent cova- 
lent modification influence the flow of precursors into malonyl-CoA, 
In plants, acetyl-CoA carboxylase is activated by the changes in 
[Mg 2+ ] and pH that accompany illumination (not shown here), (b) Fil- 
aments of acetyl-CoA carboxylase {the active, dephosphorylated form) 
as seen with the electron microscope. 



enzyme and reduces its sensitivity to activation by cit- 
rate, thereby slowing fatty acid synthesis. In its active 
(dephosphorylated) form, acetyl-CoA carboxylase poly- 
merizes into long filaments (Fig. 21-1 lb); phosphoryla- 
tion is accompanied by dissociation into monomeric 
subunits and loss of activity. 

The acetyl-CoA carboxylase of plants and bacteria 
is not regulated by citrate or by a phosphorylation- 
dephosphorylation cycle. The plant enzyme is activated 
by an increase in stromal pH and [Mg 2+ ], which occurs 
on illumination of the plant (see Fig. 20-17). Bacteria do 
not use triacylglycerols as energy stores. InE. coli, the 
primary role of fatty acid synthesis is to provide precur- 
sors for membrane lipids; the regulation of this process 
is complex, involving guanine nucleotides (such as 
ppGpp) that coordinate cell growth with membrane for- 
mation (see Figs 8-39, 28-24). 

In addition to the moment-by-moment regulation of 
enzymatic activity, these pathways are regulated at the 
level of gene expression. For example, when animals in- 
gest an excess of certain polyunsaturated fatty acids, 
the expression of genes encoding a wide range of li- 
pogenic enzymes in the liver is suppressed. The detailed 
mechanism by which these genes are regulated is not 
yet clear. 

If fatty acid synthesis and /3 oxidation were to 
proceed simultaneously, the two processes would 
constitute a futile cycle, wasting energy. We noted 
earlier (see Fig. 17-12) that /3 oxidation is blocked by 
malonyl-CoA, which inhibits carnitine acyltrans- 
ferase I. Thus during fatty acid synthesis, the pro- 
duction of the first intermediate, malonyl-CoA, shuts 
down jS oxidation at the level of a transport system in 
the mitochondrial inner membrane. This control 
mechanism illustrates another advantage of segre- 
gating synthetic and degradative pathways in differ- 
ent cellular compartments. 

Long-Chain Saturated Fatty Acids Are 
Synthesized from Palmitate 

Palmitate, the principal product of the fatty acid syn- 
thase system in animal cells, is the precursor of other 
long-chain fatty acids (Fig. 21-12). It may be length- 
ened to form stearate (18:0) or even longer saturated 
fatty acids by further additions of acetyl groups, through 
the action of fatty acid elongation systems present in 
the smooth endoplasmic reticulum and in mitochondria. 
The more active elongation system of the ER extends 
the 16-carbon chain of palmitoyl-CoA by two carbons, 
forming stearoyl-CoA. Although different enzyme sys- 
tems are involved, and coenzyme A rather than ACP is 
the acyl carrier in the reaction, the mechanism of elon- 
gation in the ER is otherwise identical to that in palmi- 
tate synthesis: donation of two carbons by malonyl-CoA, 
followed by reduction, dehydration, and reduction to 
the saturated 18-carbon product, stearoyl-CoA. 
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FIGURE 21-12 Routes of synthesis of other fatty acids. Palmitate is 
the precursor of stearate and longer-chain saturated fatty acids, as well 
as the monounsaturated acids palmitoleate and oleate. Mammals can- 
not convert oleate to linoleate or a-linolenate {shaded pink), which 
are therefore required in the diet as essential fatty acids. Conversion of 
linoleate to other polyunsaturated fatty acids and eicosanoids is out- 
lined. Unsaturated fatty acids are symbolized by indicating the num- 
ber of carbons and the number and position of the double bonds, as 
in Table 10-1. 



Desaturation of Fatty Acids Requires a 
Mixed-Function Oxidase 

Palmitate and stearate serve as precursors of the two 
most common monounsaturated fatty acids of animal 
tissues: palmitoleate, 16:1 (A 9 ), and oleate, 18:1 (A 9 ); 
both of these fatty acids have a single cis double bond 
between C-9 and C-10 (see Table 10-1). The double 
bond is introduced into the fatty acid chain by an ox- 
idative reaction catalyzed by fatty acyl-CoA desat- 
urase (Fig. 21-13), a mixed-function oxidase 
(Box 21-1). Two different substrates, the fatty acid 
and NADH or NADPH, simultaneously undergo two- 
electron oxidations. The path of electron flow includes 
a cytochrome (cytochrome b 5 ) and a flavoprotein (cy- 
tochrome b b reductase), both of which, like fatty 
acyl-CoA desaturase, are in the smooth ER. Bacteria 
have two cytochrome b 5 reductases, one NADH- 
dependent and the other NADPH-dependent; which of 
these is the main electron donor in vivo is unclear. In 
plants, oleate is produced by a stearoyl-ACP desat- 
urase in the chloroplast stroma that uses reduced 
ferredoxin as the electron donor. 

Mammalian hepatocytes can readily introduce dou- 
ble bonds at the A 9 position of fatty acids but cannot in- 
troduce additional double bonds between C-10 and the 
methyl-terminal end. Thus mammals cannot synthesize 
linoleate, 18:2(A 9 ' 12 ), or a-linolenate, 18:3(A 9 ' 12 ' 15 ). 
Plants, however, can synthesize both; the desaturases 



that introduce double bonds at the A and A 1& positions 
are located in the ER and the chloroplast. The ER en- 
zymes act not on free fatty acids but on a phospholipid, 
phosphatidylcholine, that contains at least one oleate 
linked to the glycerol (Fig. 21-14). Both plants and 
bacteria must synthesize polyunsaturated fatty acids to 
ensure membrane fluidity at reduced temperatures. 

Because they are necessary precursors for the syn- 
thesis of other products, linoleate and a-linolenate are 
essential fatty acids for mammals; they must be ob- 
tained from dietary plant material. Once ingested, 
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FIGURE 21-13 Electron transfer in the desaturation of fatty acids in 
vertebrates. Blue arrows show the path of electrons as two sub- 
strates — a fatty acyl-CoA and NADPH — undergo oxidation by molec- 
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ular oxygen. These reactions take place on the lumenal face of the 
smooth ER. A similar pathway, but with different electron carriers, oc- 
curs in plants. 
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BOX 21-1 Mixed-Function Oxidases, Oxygenases, and Cytochrome P450 



In this chapter we encounter several enzymes that carry- 
out oxidation-reduction reactions in which molecular 
oxygen is a participant. The reaction that introduces a 
double bond into a fatty acyl chain (see Fig. 21-13) is 
one such reaction. 

The nomenclature for enzymes that catalyze reac- 
tions of this general type is often confusing to students, as 
is the mechanism of the reactions. Oxidase is the general 
name for enzymes that catalyze oxidations in which mo- 
lecular oxygen is the electron acceptor but oxygen atoms 
do not appear in the oxidized product (however, there is 
an exception to this "rule," as we shall see!). The enzyme 
that creates a double bond in fatty acyl-CoA during the 
oxidation of fatty acids in peroxisomes (see Fig. 17-13) is 
an oxidase of this type; a second example is the cy- 
tochrome oxidase of the mitochondrial electron-transfer 
chain (see Fig. 19-14). In the first case, the transfer of 
two electrons to H 2 0 produces hydrogen peroxide, H 2 0 2 ; 
in the second, two electrons reduce \0 2 to H 2 0. Many, 
but not all, oxidases are flavoproteins. 

Oxygenases catalyze oxidative reactions in which 
oxygen atoms are directly incorporated into the sub- 
strate molecule, forming a new hydroxyl or carboxyl 
group, for example. Dioxygenases catalyze reactions 
in which both oxygen atoms of 0 2 are incorporated into 
the organic substrate molecule. An example of a dioxy- 
genase is tryptophan 2,3-dioxygenase, which catalyzes 
the opening of the five-membered ring of tryptophan in 
the catabolism of this amino acid. When this reaction 
takes place in the presence of 18 0 2j the isotopic oxygen 
atoms are found in the two carbonyl groups of the prod- 
uct (shown in red). 
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Monooxygenases, more abundant and more com- 
plex in their action, catalyze reactions in which only one 
of the two oxygen atoms of 0 2 is incorporated into the 
organic substrate, the other being reduced to H 2 0. 
Monooxygenases require two substrates to serve as 



reductants of the two oxygen atoms of 0 2 . The main 
substrate accepts one of the two oxygen atoms, and a 
cosubstrate furnishes hydrogen atoms to reduce the 
other oxygen atom to H 2 0. The general reaction equa- 
tion for monooxygenases is 

AH + BH 2 + O — O^A — OH + B + H 2 0 

where AH is the main substrate and BH 2 the cosubstrate. 
Because most monooxygenases catalyze reactions in 
which the main substrate becomes hydroxylated, they 
are also called hydroxylases. They are also sometimes 
called mixed-function oxidases or mixed-function 
oxygenases, to indicate that they oxidize two differ- 
ent substrates simultaneously. (Note here the use of 
"oxidase" — a deviation from the general meaning of this 
term noted above.) 

There are different classes of monooxygenases, de- 
pending on the nature of the cosubstrate. Some use re- 
duced flavin nucleotides (FMNH 2 or FADH 2 ), others use 
NADH or NADPH, and still others use a-ketoglutarate 
as the cosubstrate. The enzyme that hydroxylates the 
phenyl ring of phenylalanine to form tyrosine is a monooxy- 
genase for which tetrahydrobiopterin serves as cosubstrate 
(see Fig. 18-23). This is the enzyme that is defective in the 
human genetic disease phenylketonuria. 

The most numerous and most complex monooxy- 
genation reactions are those employing a type of heme 
protein called cytochrome P-450. This cytochrome is 
usually present in the smooth ER rather than the mito- 
chondria. Like mitochondrial cytochrome oxidase, cy- 
tochrome P-450 can react with 0 2 and bind carbon 
monoxide, but it can be differentiated from cytochrome 
oxidase because the carbon monoxide complex of its re- 
duced form absorbs light strongly at 450 nm — thus the 
name P-450. 

Cytochrome P-450 catalyzes hydroxylation reac- 
tions in which an organic substrate, RH, is hydroxylated 
to R — OH, incorporating one oxygen atom of 0 2 ; the 
other oxygen atom is reduced to H 2 0 by reducing equiv- 
alents that are furnished by NADH or NADPH but are 
usually passed to cytochrome P-450 by an iron-sulfur 
protein. Figure 1 shows a simplified outline of the action 
of cytochrome P-450, which has intermediate steps not 
yet fully understood. 
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Cytochrome P-450 is actually a family of similar pro- 
teins; several hundred members of this protein family are 
known, each with a different substrate specificity. In the 
adrenal cortex, for example, a specific cytochrome 
P-450 participates in the hydroxylation of steroids to 
yield the adrenocortical hormones (see Fig. 21-46). 
Cytochrome P-450 is also important in the hydroxyla- 
tion of many different drugs, such as barbiturates and 
other xenobiotics (substances foreign to the organism), 
particularly if they are hydrophobic and relatively insol- 
uble. The environmental carcinogen benzo[a]pyrene 
(found in cigarette smoke) undergoes cytochrome 



P-450-dependent hydroxylation during detoxification. 
Hydroxylation of xenobiotics makes them more soluble 
in water and allows their excretion in the urine. Unfor- 
tunately, hydroxylation of some compounds converts 
them to toxic substances, subverting the detoxification 
system. 

Reactions described in this chapter that are catalyzed 
by mixed-function oxidases are those involved in fatty 
acyl-CoA desaturation (Fig. 21-13), leukotriene synthe- 
sis (Fig. 21-16), plasmalogen synthesis (Fig. 21-30), 
conversion of squalene to cholesterol (Fig. 21-37), and 
steroid hormone synthesis (Fig. 21-46). 
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FIGURE 21-14 Action of plant desaturases. Desaturases in plants 
oxidize phosphatidylcholine-bound oleate to polyunsaturated fatty 
acids. Some of the products are released from the phosphatidylcholine 
by hydrolysis. 



linoleate may be converted to certain other polyunsatu- 
rated acids, particularly y-linolenate, eicosatrienoate, 
and arachidonate (eicosatetraenoate), all of which can 
be made only from linoleate (Fig. 21-12). Arachidonate, 



20:4(A 



5,8,11,14- 



), is an essential precursor of regulatory 



lipids, the eicosanoids. The 20-carbon fatty acids are 
synthesized from linoleate (and a-linolenate) by fatty 
acid elongation reactions analogous to those described 
on page 814. 

Eicosanoids Are Formed from 20-Carbon 
Polyunsaturated Fatty Acids 

Eicosanoids are a family of very potent biological sig- 
naling molecules that act as short-range messengers, 
affecting tissues near the cells that produce them. In 
response to hormonal or other stimuli, phospholipase 
A 2 , present in most types of mammalian cells, attacks 
membrane phospholipids, releasing arachidonate from 
the middle carbon of glycerol. Enzymes of the smooth 
ER then convert arachidonate to prostaglandins, 
beginning with the formation of prostaglandin H 2 
(PGH 2 ), the immediate precursor of many other 
prostaglandins and of thromboxanes (Fig. 21-15a). 
The two reactions that lead to PGH 2 are catalyzed by a 
bifunctional enzyme, cyclooxygenase (COX), also 
called prostaglandin H 2 synthase. In the first of two 
steps, the cyclooxygenase activity introduces molecu- 
lar oxygen to convert arachidonate to PGG 2 . The sec- 
ond step, catalyzed by the peroxidase activity of COX, 
converts PGG 2 to PGH 2 . 

Mammals have two isozymes of prostaglandin H 2 
synthase, COX-1 and COX-2. These have different func- 
tions but closely similar amino acid sequences (60% to 
65% sequence identity) and similar reaction mecha- 
nisms at both of their catalytic centers. COX-1 is re- 
sponsible for the synthesis of the prostaglandins that 
regulate the secretion of gastric mucin, and COX-2 for 
the prostaglandins that mediate inflammation, pain, 
and fever. 
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FIGURE 21-15 The "cyclic" pathway from arachidonate to 
prostaglandins and thromboxanes, (a) After arachidonate is re- 
leased from phospholipids by the action of phospholipase A 2 , the cy- 
clooxygenase and peroxidase activities of COX (also called prostaglandin 
H 2 synthase) catalyze the production of PGH 2 , the precursor of other 
prostaglandins and thromboxanes, (b) Aspirin inhibits the first reaction 
by acetylating an essential Ser residue on the enzyme, lbuprofen and 
naproxen inhibit the same step, probably by mimicking the structure of 
the substrate or an intermediate in the reaction, (c) COX-2-specific 
cyclooxygenase inhibitors used as pain relievers (see text). 

Pain can be relieved by inhibiting COX-2. The first 
drug widely marketed for this purpose was aspirin 
(acetylsalicylate; Fig. 21-15b). The name aspirin (from a 
for acetyl and spir for Spirsaiire, the German word for 
the salicylates prepared from the plant Spiraea 
ulmaria) appeared in 1899 when the drug was intro- 
duced by the Bayer company Aspirin irreversibly inacti- 
vates the cyclooxygenase activity of both COX isozymes, 
by acetylating a Ser residue and blocking each enzyme's 
active site. The synthesis of prostaglandins and throm- 
boxanes is thereby inhibited. lbuprofen, another widely 
used nonsteroidal antiinflammatory drug (NSAID; Fig. 
2 1-1 5b), inhibits the same pair of enzymes. However, the 
inhibition of COX-1 can result in undesired side effects 
including stomach irritation and more serious condi- 
tions. In the 1990s, following discovery of the crystal 
structures of COX-1 and COX-2, NSAID compounds that 
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had a greater specificity for COX-2 were developed as 
advanced therapies for severe pain. Three of these drugs 
were approved for use worldwide: rofecoxib (Vioxx), 
valdecoxib (Bextra), and celecoxib (Celebrex) (Fig. 
21-15c). Launched in the late 1990s, the new com- 
pounds were initially a success for the pharmaceutical 
firms that produced them. However, enthusiasm turned 
to concern as field reports and clinical studies connected 
the drugs with an increased risk of heart attack and 
stroke. The reasons for the problems are still not clear, 
but some researchers speculated that the COX-2 in- 
hibitors were altering the fine balance maintained be- 
tween the hormone prostacyclin, which dilates blood 
vessels, prevents blood clotting, and is reduced by COX- 
2 inhibitors, and the thromboxanes, produced on the 
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pathway involving COX-1, that help to form blood clots. 
Vioxx was withdrawn from the market in 2004, and Bex- 
tra was withdrawn soon after. As of early 2007, Celebrex 
is still on the market but used with increased caution. 

Thromboxane synthase, present in blood platelets 
(thrombocytes), converts PGH 2 to thromboxane A 2 , from 
which other thromboxanes are derived (Fig. 21-15a). 
Thromboxanes induce constriction of blood vessels 
and platelet aggregation, early steps in blood clotting. 
Low doses of aspirin, taken regularly, reduce the prob- 
ability of heart attacks and strokes by reducing throm- 
boxane production. ■ 

Thromboxanes, like prostaglandins, contain a ring of 
five or six atoms; the pathway from arachidonate to these 
two classes of compounds is sometimes called the 
"cyclic" pathway, to distinguish it from the "linear" path- 
way that leads from arachidonate to the leukotrienes, 
which are linear compounds (Fig. 21-16). Leukotriene 
synthesis begins with the action of several lipoxygenases 
that catalyze the incorporation of molecular oxygen into 
arachidonate. These enzymes, found in leukocytes and in 
heart, brain, lung, and spleen, are mixed-function oxi- 
dases that use cytochrome P-450 (Box 21-1). The vari- 
ous leukotrienes differ in the position of the peroxide 
group introduced by the lipoxygenases. This linear path- 
way from arachidonate, unlike the cyclic pathway, is not 
inhibited by aspirin or other NSAIDs. 

Plants also derive important signaling molecules from 
fatty acids. As in animals, a key step in the initiation of 
signaling involves activation of a specific phospholipase. 
In plants, the fatty acid substrate that is released is a- 
linolenate. A lipoxygenase then catalyzes the first step in 
a pathway that converts linolenate to jasmonate, a sub- 
stance known to have signaling roles in insect defense, re- 
sistance to fungal pathogens, and pollen maturation. 
Jasmonate (see Fig. 12-32) also affects seed germination, 
root growth, and fruit and seed development. 



SUMMARY 21.1 Biosynthesis of Fatty Acids 
and Eicosanoids 

■ Long-chain saturated fatty acids are synthesized 
from acetyl-CoA by a cytosolic system of six 
enzymatic activities plus acyl carrier protein 
(ACP). There are two types of fatty acid synthase. 
FAS I, found in vertebrates and fungi, consists of 
multifunctional polypeptides. FAS II is a dissociated 
system found in bacteria and plants. Both contain 
two types of — SH groups (one furnished by 

the phosphopantetheine of ACP, the other by 
a Cys residue of /3-ketoacyl-ACP synthase) 
that function as carriers of the fatty acyl 
intermediates. 

■ Malonyl-ACP, formed from acetyl-CoA (shuttled out 
of mitochondria) and C0 2 , condenses with an acetyl 
bound to the Cys — SH to yield acetoacetyl-ACP, 
with release of C0 2 . This is followed by reduction to 
the D-jS-hydroxy derivative, dehydration to the 
£ra?zs-A 2 -unsaturated acyl- ACP, and reduction to 
butyryl-ACP. NADPH is the electron donor for both 
reductions. Fatty acid synthesis is regulated at the 
level of malonyl-CoA formation. 

■ Six more molecules of malonyl-ACP react 
successively at the carboxyl end of the growing 
fatty acid chain to form palmitoyl-ACP — the end 
product of the fatty acid synthase reaction. Free 
palmitate is released by hydrolysis. 

■ Palmitate may be elongated to the 18-carbon 
stearate. Palmitate and stearate can be desaturated 
to yield palmitoleate and oleate, respectively, by 
the action of mixed-function oxidases. 

■ Mammals cannot make linoleate and must obtain it 
from plant sources; they convert exogenous 
linoleate to arachidonate, the parent compound of 
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eicosanoids (prostaglandins, thromboxanes, and 
leukotrienes), a family of very potent signaling 
molecules. The synthesis of prostaglandins and 
thromboxanes is inhibited by NSAIDs that act on 
the cyclooxygenase activity of prostaglandin H 2 
synthase. 

21.2 Biosynthesis of Triacylglycerols 

Most of the fatty acids synthesized or ingested by an or- 
ganism have one of two fates: incorporation into triacyl- 
glycerols for the storage of metabolic energy or 
incorporation into the phospholipid components of 
membranes. The partitioning between these alternative 
fates depends on the organism's current needs. During 
rapid growth, synthesis of new membranes requires the 
production of membrane phospholipids; when an organ- 
ism has a plentiful food supply but is not actively grow- 
ing, it shunts most of its fatty acids into storage fats. 
Both pathways begin at the same point: the formation of 
fatty acyl esters of glycerol. In this section we examine 
the route to triacylglycerols and its regulation, and the 
production of glycerol 3-phosphate in the process of 
glyceroneogenesis. 

Triacylglycerols and Glycerophospholipids Are 
Synthesized from the Same Precursors 

Animals can synthesize and store large quantities of tri- 
acylglycerols, to be used later as fuel (see Box 17-1). 
Humans can store only a few hundred grams of glycogen 
in liver and muscle, barely enough to supply the body's 
energy needs for 12 hours. In contrast, the total amount 
of stored triacylglycerol in a 70-kg man of average build 
is about 15 kg, enough to support basal energy needs for 
as long as 12 weeks (see Table 23-5). Triacylglycerols 
have the highest energy content of all stored nutri- 
ents — more than 38 kJ/g, Whenever carbohydrate is in- 
gested in excess of the organism's capacity to store 
glycogen, the excess is converted to triacylglycerols and 
stored in adipose tissue. Plants also manufacture tria- 
cylglycerols as an energy-rich fuel, mainly stored in 
fruits, nuts, and seeds. 

In animal tissues, triacylglycerols and glycerophos- 
pholipids such as phosphatidylethanolamine share two 
precursors (fatty acyl-CoA and L-glycerol 3-phosphate) 
and several biosynthetic steps. The vast majority of the 
glycerol 3-phosphate is derived from the glycolytic in- 
termediate dihydroxyacetone phosphate (DHAP) by 
the action of the cytosolic NAD-linked glycerol 3-phos- 
phate dehydrogenase; in liver and kidney, a small 
amount of glycerol 3-phosphate is also formed from 
glycerol by the action of glycerol kinase (Fig. 21-17). 
The other precursors of triacylglycerols are fatty 
acyl-CoAs, formed from fatty acids by acyl-CoA syn- 
thetases, the same enzymes responsible for the activa- 
tion of fatty acids for /3 oxidation (see Fig. 17-5). 
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FIGURE 21-17 Biosynthesis of phosphatidic acid. A fatty acyl group is 
activated by formation of the fatty acyl-CoA, then transferred to ester 
linkage with L-glycerol 3-phosphate, formed in either of the two ways 
shown. Phosphatidic acid is shown here with the correct stereochem- 
istry at C-2 of the glycerol molecule. To conserve space in subsequent 
figures (and in Fig. 21-14), both fatty acyl groups of glycerophospho- 
lipids, and all three acyl groups of triacylglycerols, are shown project- 
ing to the right. 
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FIGURE 21-18 Phosphatidic acid in lipid biosynthesis. Phosphatidic 
acid is the precursor of both triacylglycerols and glycerophospho- 
lipids. The mechanisms for head-group attachment in phospholipid 
synthesis are described later in this section. 

The first stage in the biosynthesis of triacylglycerols 
is the acylation of the two free hydroxyl groups of l- 
glycerol 3-phosphate by two molecules of fatty acyl-CoA 
to yield diacylglycerol 3-phosphate, more commonly 
called phosphatidic acid or phosphatidate (Fig. 21-17). 
Phosphatidic acid is present in only trace amounts in 
cells but is a central intermediate in lipid biosynthesis; 
it can be converted either to a triacylglycerol or to a 
glycerophospholipid. In the pathway to triacylglycerols, 
phosphatidic acid is hydrolyzed by phosphatidic acid 
phosphatase to form a 1,2-diacylglycerol (Fig. 21-18). 
Diacylglycerols are then converted to triacylglycerols by 
transesterification with a third fatty acyl-CoA. 

Triacylglycerol Biosynthesis in Animals 
Is Regulated by Hormones 

In humans, the amount of body fat stays relatively con- 
stant over long periods, although there may be minor 
short-term changes as caloric intake fluctuates. Carbo- 
hydrate, fat, or protein consumed in excess of energy 



needs is stored in the form of triacylglycerols that can be 
drawn upon for energy, enabling the body to withstand 
periods of fasting. 

Biosynthesis and degradation of triacylglycerols 
are regulated such that the favored path depends 
on the metabolic resources and requirements of the mo- 
ment. The rate of triacylglycerol biosynthesis is pro- 
foundly altered by the action of several hormones. 
Insulin, for example, promotes the conversion of carbo- 
hydrate to triacylglycerols (Fig. 21-19). People with 
severe diabetes mellitus, due to failure of insulin secre- 
tion or action, not only are unable to use glucose prop- 
erly but also fail to synthesize fatty acids from 
carbohydrates or amino acids. If the diabetes is un- 
treated, these individuals have increased rates of fat 
oxidation and ketone body formation (Chapter 17) and 
therefore lose weight. ■ 

An additional factor in the balance between biosyn- 
thesis and degradation of triacylglycerols is that approx- 
imately 75% of all fatty acids released by lipolysis are 
reesterified to form triacylglycerols rather than used for 
fuel. This ratio persists even under starvation condi- 
tions, when energy metabolism is shunted from the use 
of carbohydrate to the oxidation of fatty acids. Some of 
this fatty acid recycling takes place in adipose tissue, 
with the reesterification occurring before release into 
the bloodstream; some takes place via a systemic cycle 
in which free fatty acids are transported to liver, recy- 
cled to triacylglycerol, exported again to the blood 
(transport of lipids in the blood is discussed in Section 
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FIGURE 21-19 Regulation of triacylglycerol synthesis by in- 
sulin. Insulin stimulates conversion of dietary carbohydrates 
and proteins to fat. Individuals with diabetes mellitus lack insulin; in 
uncontrolled disease, this results in diminished fatty acid synthesis, 
and the acetyl-CoA arising from catabolism of carbohydrates and pro- 
teins is shunted instead to ketone body production. People in severe 
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FIGURE 21-20 The triacylglycerol cycle. In mammals, triacylglycerol 
molecules are broken down and resynthesized in a triacylglycerol cycle 
during starvation. Some of the fatty acids released by lipolysis of triacyl- 
glycerol in adipose tissue pass into the bloodstream, and the remainder 
are used for resynthesis of triacylglycerol. Some of the fatty acids re- 
leased into the blood are used for energy (in muscle, for example), and 
some are taken up by the liver and used in triacylglycerol synthesis. The 
triacylglycerol formed in the liver is transported in the blood back to adi- 
pose tissue, where the fatty acid is released by extracellular lipoprotein 
lipase, taken up by adipocytes, and reesterified into triacylglycerol. 

21.4), and taken up again by adipose tissue after release 
from triacylglycerol by extracellular lipoprotein lipase 
(Fig. 21-20; see also Fig. 17-1). Flux through this 
triacylglycerol cycle between adipose tissue and liver 
may be quite low when other fuels are available and the 
release of fatty acids from adipose tissue is limited, but 
as noted above, the proportion of released fatty acids 
that are reesterified remains roughly constant at 75% 
under all metabolic conditions. The level of free fatty 
acids in the blood thus reflects both the rate of release of 
fatty acids and the balance between the synthesis and 
breakdown of triacylglycerols in adipose tissue and liver. 

When the mobilization of fatty acids is required to 
meet energy needs, release from adipose tissue is stimu- 
lated by the hormones glucagon and epinephrine (see 
Figs 17-3, 17-12). Simultaneously, these hormonal sig- 
nals decrease the rate of glycolysis and increase the rate 
of gluconeogenesis in the liver (providing glucose for the 
brain, as further elaborated in Chapter 23). The released 
fatty acid is taken up by a number of tissues, including 
muscle, where it is oxidized to provide energy. Much of 
the fatty acid taken up by liver is not oxidized but is re- 
cycled to triacylglycerol and returned to adipose tissue. 

The function of the apparently futile triacylglycerol 
cycle (futile cycles are discussed in Chapter 15) is not 
well understood. However, as we learn more about how 
the triacylglycerol cycle is sustained via metabolism in 
two separate organs and is coordinately regulated, some 
possibilities emerge. For example, the excess capacity 
in the triacylglycerol cycle (the fatty acid that is eventu- 
ally reconverted to triacylglycerol rather than oxidized 



as fuel) could represent an energy reserve in the blood- 
stream during fasting, one that would be more rapidly 
mobilized in a "fight or flight" emergency than would 
stored triacylglycerol. 

The constant recycling of triacylglycerols in adipose 
tissue even during starvation raises a second question: 
what is the source of the glycerol 3-phosphate required for 
this process? As noted above, glycolysis is suppressed in 
these conditions by the action of glucagon and epineph- 
rine, so little DHAP is available, and glycerol released dur- 
ing lipolysis cannot be converted directly to glycerol 
3-phosphate in adipose tissue, because these cells lack 
glycerol kinase (Fig. 21-17). So, how is sufficient glycerol 
3-phosphate produced? The answer lies in a pathway dis- 
covered more than three decades ago and given little at- 
tention until recently, a pathway intimately linked to the 
triacylglycerol cycle and, in a larger sense, to the balance 
between fatty acid and carbohydrate metabolism. 

Adipose Tissue Generates Glycerol 3-phosphate 
byGlyceroneogenesis 

Glyceroneogenesis is a shortened version of gluco- 
neogenesis, from pyruvate to DHAP (see Fig. 14-16), 
followed by conversion of the DHAP to glycerol 3-phos- 
phate by cytosolic NAD-linked glycerol 3-phosphate de- 
hydrogenase (Fig. 21-21). Glycerol 3-phosphate is 
subsequently used in triacylglycerol synthesis. Glycero- 
neogenesis was discovered in the 1960s by Lea Reshef, 
Richard Hanson, and John Ballard, and simultaneously 
by Eleazar Shafrir and his coworkers, who were in- 
trigued by the presence of two gluconeogenic enzymes, 
pyruvate carboxylase and phosphoenolpyruvate (PEP) 
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FIGURE 21-21 Glyceroneo- 
genesis. The pathway is essen- 
tially an abbreviated version 
of gluconeogenesis, from 
pyruvate to dihydroxyacetone 
phosphate (DHAP), followed 
by conversion of DHAP to 
glycerol 3-phosphate, which 
is used for the synthesis of tri- 
acylglycerol. 



21.2 Biosynthesis of Triacylglycerols 823 



carboxykinase, in adipose tissue, where glucose is not 
synthesized. After a long period of inattention, interest 
in this pathway has been renewed by the demonstration 
of a link between glyceroneogenesis and late-onset 
(type 2) diabetes, as we shall see. 

Glyceroneogenesis has multiple roles. In adipose 
tissue, glyceroneogenesis coupled with reesterification 
of free fatty acids controls the rate of fatty acid release 
to the blood. In brown adipose tissue, the same pathway 
may control the rate at which free fatty acids are deliv- 
ered to mitochondria for use in thermogenesis (see Fig. 
19-34). And in fasting humans, glyceroneogenesis in the 
liver alone supports the synthesis of enough glycerol 3- 
phosphate to account for up to 65% of fatty acids 
reesterified to triacylglycerol. 

Flux through the triacylglycerol cycle between liver 
and adipose tissue is controlled to a large degree by 
the activity of PEP carboxykinase, which limits the rate 
of both gluconeogenesis and glyceroneogenesis. Gluco- 
corticoid hormones such as Cortisol (a biological steroid 
derived from cholesterol; see Fig. 21-45) and dexa- 
methasone (a synthetic glucocorticoid) regulate the 
levels of PEP carboxykinase reciprocally in the liver 
and adipose tissue. Acting through the glucocorticoid 
receptor, these steroid hormones increase the expression 
of the gene encoding PEP carboxykinase in the liver, 
thus increasing gluconeogenesis and glyceroneogenesis 
(Fig. 21-22). 
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Stimulation of glyceroneogenesis leads to an in- 
crease in the synthesis of triacylglycerol molecules in 
the liver and their release into the blood. At the same 
time, glucocorticoids suppress the expression of the 
gene encoding PEP carboxykinase in adipose tissue. 
This results in a decrease in glyceroneogenesis in adi- 
pose tissue; recycling of fatty acids declines as a result, 
and more free fatty acids are released into the blood. 
Thus glyceroneogenesis is regulated reciprocally in the 
liver and adipose tissue, affecting lipid metabolism in 
opposite ways: a lower rate of glyceroneogenesis in adi- 
pose tissue leads to more fatty acid release (rather than 
recycling), whereas a higher rate in the liver leads to 
more synthesis and export of triacylglycerols. The net 
result is an increase in flux through the triacylglycerol 
cycle. When the glucocorticoids are no longer present, 
flux through the cycle declines as the expression of PEP 
carboxykinase increases in adipose tissue and decreases 
in the liver. 
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FIGURE 21-22 Regulation of glyceroneogenesis. (a) Gluco- 
corticoid hormones stimulate glyceroneogenesis and gluco- 
neogenesis in the liver, while suppressing glyceroneogenesis in the 
adipose tissue (by reciprocal regulation of the gene expressing PEP 
carboxykinase (PEPCK) in the two tissues); this increases the flux 
through the triacylglycerol cycle. The glycerol freed by the breakdown 
of triacylglycerol in adipose tissue is released to the blood and trans- 
ported to the liver, where it is primarily converted to glucose, although 
some is converted to glycerol 3-phosphate by glycerol kinase, (b) A 
class of drugs called thiazolidinediones are now used to treat type 2 di- 
abetes. In this disease, high levels of free fatty acids in the blood inter- 
fere with glucose utilization in muscle and promote insulin resistance. 
Thiazolidinediones activate a nuclear receptor called peroxisome 
proliferator-activated receptor y (PPAR7), which induces the activity of 
PEP carboxykinase. Therapeutically, thiazolidinediones increase the 
rate of glyceroneogenesis, thus increasing the resynthesis of triacyl- 
glycerol in adipose tissue and reducing the amount of free fatty acid in 
the blood. 
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Thiazolidinediones Treat Type 2 Diabetes 
by Increasing Glyceroneogenesis 

B3 The recent attention to glyceroneogenesis has 
■ufl arisen in part from the connection between this 
pathway and diabetes. High levels of free fatty acids in 
the blood interfere with glucose utilization in muscle 
and promote the insulin resistance that leads to 
type 2 diabetes. A new class of drugs called thiazol- 
idinediones reduce the levels of fatty acids circulat- 
ing in the blood and increase sensitivity to insulin. 
Thiazolidinediones promote the induction in adipose 
tissue of PEP carboxykinase (Fig. 21-22), leading to 
increased synthesis of the precursors of glyceroneoge- 
nesis. The therapeutic effect of thiazolidinediones is 
thus due, at least in part, to the increase in glycero- 
neogenesis, which in turn increases the resynthesis of 
triacylglycerol in adipose tissue and reduces the re- 
lease of free fatty acid from adipose tissue into the 
blood. The benefits of one such drug, rosiglitazone 
(Avandia), are countered in part by an increased risk 
of heart attack, for reasons not yet clear. Assessment 
of this drug is continuing. ■ 
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SUMMARY 21.2 Biosynthesis of 
Triacylglycerols 

■ Triacylglycerols are formed by reaction of two 
molecules of fatty acyl-CoA with glycerol 
3-phosphate to form phosphatidic acid; this 
product is dephosphorylated to a diacylglycerol, 
then acylated by a third molecule of fatty 
acyl-CoA to yield a triacylglycerol. 

■ The synthesis and degradation of triacylglycerol 
are hormonally regulated. 

■ Mobilization and recycling of triacylglycerol 
molecules results in a triacylglycerol cycle. 
Triacylglycerols are resynthesized from free fatty 
acids and glycerol 3-phosphate even during 
starvation. The dihydroxyacetone phosphate 
precursor of glycerol 3-phosphate is derived from 
pyruvate via glyceroneogenesis. 



21.3 Biosynthesis of 
Membrane Phospholipids 

In Chapter 10 we introduced two major classes of mem- 
brane phospholipids: glycerophospholipids and sphin- 
golipids. Many different phospholipid species can be 
constructed by combining various fatty acids and polar 
head groups with the glycerol or sphingosine backbone 
(see Figs 10-9, 10-13). All the biosynthetic pathways 
follow a few basic patterns. In general, the assembly of 
phospholipids from simple precursors requires (1) syn- 
thesis of the backbone molecule (glycerol or sphingo- 
sine); (2) attachment of fatty acid(s) to the backbone 
through an ester or amide linkage; (3) addition of a 
hydrophilic head group to the backbone through a phos- 
phodiester linkage; and, in some cases, (4) alteration or 
exchange of the head group to yield the final phospho- 
lipid product. 

In eukaryotic cells, phospholipid synthesis occurs 
primarily on the surfaces of the smooth endoplasmic 
reticulum and the mitochondrial inner membrane. Some 
newly formed phospholipids remain at the site of syn- 
thesis, but most are destined for other cellular locations. 
The process by which water-insoluble phospholipids 
move from the site of synthesis to the point of their 
eventual function is not fully understood, but we con- 
clude this section by discussing some mechanisms that 
have emerged in recent years. 

Cells Have Two Strategies for Attaching 
Phospholipid Head Groups 

The first steps of glycerophospholipid synthesis are 
shared with the pathway to triacylglycerols (Fig. 21-17): 
two fatty acyl groups are esterified to C-l and C-2 of 
L-glycerol 3-phosphate to form phosphatidic acid. Com- 
monly but not invariably, the fatty acid at C-l is saturated 
and that at C-2 is unsaturated. A second route to phos- 
phatidic acid is the phosphorylation of a diacylglycerol 
by a specific kinase. 

The polar head group of glycerophospholipids is at- 
tached through a phosphodiester bond, in which each of 
two alcohol hydroxyls (one on the polar head group and 
one on C-3 of glycerol) forms an ester with phosphoric 
acid (Fig. 21-23). In the biosynthetic process, one of the 
hydroxyls is first activated by attachment of a nucleotide, 
cytidine diphosphate (CDP). Cytidine monophosphate 
(CMP) is then displaced in a nucleophilic attack by the 
other hydroxyl (Fig. 21-24). The CDP is attached either 
to the diacylglycerol, forming the activated phosphatidic 
acid CDP-diacylglycerol (strategy 1), or to the hy- 
droxyl of the head group (strategy 2). Eukaryotic cells 
employ both strategies, whereas bacteria use only the 
first. The central importance of cytidine nucleotides in 
lipid biosynthesis was discovered by Eugene P. Kennedy 
in the early 1960s. 
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is attached to a diacylglycerol by a phosphodiester bond, formed when 
phosphoric acid condenses with two alcohols, eliminating two mole- 
cules of H 2 0. 



Phospholipid Synthesis in £ coli Employs CDP- 
Diacylglycerol 

The first strategy for head-group attachment is illustrated 
by the synthesis of phosphatidylserine, phosphatidyl- 
ethanolamine, and phosphatidylglycerol in coli The di- 
acylglycerol is activated by condensation of phosphatide 
acid with cytidine triphosphate (CTP) to form CDP-dia- 
cylglycerol, with the elimination of pyrophosphate (Fig. 
21-25). Displacement of CMP through nucleophilic at- 
tack by the hydroxyl group of serine or by the C-l hy- 
droxyl of glycerol 3-phosphate yields phosphatidyl- 
serine or phosphatidylglycerol 3-phosphate, respec- 
tively The latter is processed further by cleavage of the 
phosphate monoester (with release of Pj) to yield phos- 
phatidylglycerol. 

Phosphatidylserine and phosphatidylglycerol can 
serve as precursors of other membrane lipids in bacteria 
(Fig. 21-25). Decarboxylation of the serine moiety in 
phosphatidylserine, catalyzed by phosphatidylserine 
decarboxylase, yields phosphatidylethanolamine. In 
E. coli, condensation of two molecules of phosphati- 
dylglycerol, with elimination of one glycerol, yields 
cardiolipin, in which two diacylglycerols are joined 
through a common head group. 
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FIGURE 21 -24 Two general strategies for forming the phosphodiester bond of phospholipids. In both cases, CDP 
supplies the phosphate group of the phosphodiester bond. 
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FIGURE 21-25 Origin of the polar head groups of 
phospholipids in f. co/i. Initially, a head group (either 
serine or glycerol 3-phosphate) is attached via a CDP- 
diacylglycerol intermediate (strategy 1 in Fig. 21-24). For 
phospholipids other than phosphatidylserine, the head 
group is further modified, as shown here. In the enzyme 
names, PG represents phosphatidylglycerol; PS, phos- 
phatidylserine. 
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FIGURE 21-26 Synthesis of cardiolipin and phos- 
phatidylinositol in eukaryotes. These glycerophospho 
lipids are synthesized using strategy 1 in Figure 21-24. 
Phosphatidylglycerol is synthesized as in bacteria (see 
Fig. 21-25). PI represents phosphatidylinositol. 
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Eukaryotes Synthesize Anionic Phospholipids 
from CDP-Diacylglycerol 

In eukaryotes, phosphatidylglycerol, cardiolipin, and the 
phosphatidylinositols (all anionic phospholipids; see Fig. 
10-9) are synthesized by the same strategy used for 
phospholipid synthesis in bacteria. Phosphatidylglycerol 
is made exactly as in bacteria. Cardiolipin synthesis in eu- 
karyotes differs slightly: phosphatidylglycerol condenses 
with CDP-diacylglycerol (Fig. 21-26), not another mol- 
ecule of phosphatidylglycerol as in E. coli (Fig. 21-25). 

Phosphatidylinositol is synthesized by condensation 
of CDP-diacylglycerol with inositol (Fig. 21-26). Specific 
phosphatidylinositol kinases then convert phos- 
phatidylinositol to its phosphorylated derivatives. Phos- 
phatidylinositol and its phosphorylated products in the 
plasma membrane play a central role in signal transduc- 
tion in eukaryotes (see Figs 12-10, 12-16). 

Eukaryotic Pathways to Phosphatidylserine, 
Phosphatidylethanolamine, and Phosphatidylcholine 
Are Interrelated 

Yeast, like bacteria, can produce phosphatidylserine by 
condensation of CDP-diacylglycerol and serine, and can 



synthesize phosphatidylethanolamine from phosphatidyl- 
serine in the reaction catalyzed by phosphatidylserine 
decarboxylase (Fig. 21-27). Phosphatidylethanolamine 
may be converted to phosphatidylcholine (lecithin) 
by the addition of three methyl groups to its amino 
group; 5-adenosylmethionine is the methyl group 
donor (see Fig. 18-18) for all three methylation reac- 
tions. These paths are the major sources of phos- 
phatidylethanolamine and phosphatidylcholine in all 
eukaryotic cells. 

In mammals, phosphatidylserine is not synthe- 
sized from CDP-diacylglycerol; instead, it is derived 
from phosphatidylethanolamine or phosphatidyl- 
choline via one of two head-group exchange reac- 
tions carried out in the endoplasmic reticulum (Fig. 
2l-2Sa). Synthesis of phosphatidylethanolamine 
and phosphatidylcholine in mammals occurs by 
strategy 2 of Figure 21-24: phosphorylation and ac- 
tivation of the head group ; followed by condensation 
with diacylglycerol. For example, choline is reused 
("salvaged") by being phosphorylated then con- 
verted to CDP-choline by condensation with CTP. 
A diacylglycerol displaces CMP from CDP-choline, 
producing phosphatidylcholine (Fig. 21-28b). An 
analogous salvage pathway converts ethanolamine 
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FIGURE 21-27 The major path from phos- 
phatidylserine to phosphatidylethanolamine and 
phosphatidylcholine in all eukaryotes. AdoMet is 
5-adenosylmethionine; adoHcy, 5-adenosylhomo- 
cysteine. 

FIGURE 21-28 Pathways for phosphatidylserine and phosphatidyl- 
choline synthesis in mammals, (a) Phosphatidylserine is synthesized 
by Ca 2+ -dependent head-group exchange reactions promoted by 
phosphatidylserine synthase 1 (PSS1) or phosphatidylserine synthase 
2 (PSS2). PSS1 can use either phosphatidylethanolamine or phos- 
phatidylcholine as a substrate. The pathways used by bacteria and 
yeast correspond to those shown in Figure 21-27. (b) The same 
strategy shown here for phosphatidylcholine synthesis (strategy 2 in 
Fig. 21-24) is also used for salvaging ethanolamine in phos- 
phatidylethanolamine synthesis. 
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obtained in the diet to phosphatidylethanolamine. In 
the liver, phosphatidylcholine is also produced by 
methylation of phosphatidylethanolamine (with S- 
adenosylmethionine, as described above), but in all 
other tissues phosphatidylcholine is produced only 
by condensation of diacylglycerol and CDP-choline. 
The pathways to phosphatidylcholine and phos- 
phatidylethanolamine in various organisms are sum- 
marized in Figure 21-29. 

Although the role of lipid composition in membrane 
function is not entirely understood, changes in composi- 
tion can produce dramatic effects. Researchers have 
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FIGURE 21 -29 Summary of the pathways for synthesis of major phos- 
pholipids. The pathways vary in different classes of organisms. In this 
diagram, pathways used by mammals are highlighted in yellow; those 
used by bacteria and yeast are highlighted in pink. Orange highlighting 
shows where the paths overlap. In mammals, phosphatidylethanola- 
mine and phosphatidylcholine are synthesized by a pathway employ- 
ing diacylglycerol and the CDP-derivative of the appropriate head 
group. Conversion of phosphatidylethanolamine to phosphatidyl- 
choline in mammals takes place only in the liver. The pathways for 
phosphatidylserine synthesis for various classes of organisms are de- 
tailed in Figures 21-27 and 21-28. 



isolated fruit flies with mutations in the gene that en- 
codes ethanolamine kinase (analogous to choline ki- 
nase; Fig. 21 -28b). Lack of this enzyme eliminates one 
pathway for phosphatidylethanolamine synthesis, 
thereby reducing the amount of this lipid in cellular 
membranes. Flies with this mutation — those with the 
genotype easily shocked — exhibit transient paralysis 
following electrical stimulation or mechanical shock that 
would not affect wild-type flies. 

Plasmalogen Synthesis Requires Formation of 
an Ether-Linked Fatty Alcohol 

The biosynthetic pathway to ether lipids, including 
plasmalogens and the platelet-activating factor 

(see Fig. 10-10), involves the displacement of an es- 
terified fatty acyl group by a long-chain alcohol to form 
the ether linkage (Fig. 21-30). Head-group attach- 
ment follows, by mechanisms essentially like those 
used in formation of the common ester-linked phos- 
pholipids. Finally, the characteristic double bond of 
plasmalogens (shaded blue in Fig. 21-30) is introduced 
by the action of a mixed-function oxidase similar to 
that responsible for desaturation of fatty acids (Fig. 
21-13). The peroxisome is the primary site of plas- 
malogen synthesis. 

Sphingolipid and Glycerophospholipid Synthesis 
Share Precursors and Some Mechanisms 

The biosynthesis of sphingolipids takes place in four 
stages: (1) synthesis of the 18-carbon amine sphinga- 
nine from palmitoyl-CoA and serine; (2) attachment 
of a fatty acid in amide linkage to yield iV-acylsphin- 
ganine; (3) desaturation of the sphinganine moiety to 
form iV-acylsphingosine (ceramide); and (4) attach- 
ment of a head group to produce a sphingolipid such 
as a cerebroside or sphingomyelin (Fig. 21-31). 
The first few steps of this pathway occur in the endo- 
plasmic reticulum, while the attachment of head 
groups in stage 4 occurs in the Golgi complex. The 
pathway shares several features with the pathways 
leading to glycerophospholipids: NADPH provides re- 
ducing power, and fatty acids enter as their activated 
CoA derivatives. In cerebroside formation, sugars en- 
ter as their activated nucleotide derivatives. Head- 
group attachment in sphingolipid synthesis has several 
novel aspects. Phosphatidylcholine, rather than CDP- 
choline, serves as the donor of phosphocholine in the 
synthesis of sphingomyelin. 

In glycolipids — the cerebrosides and gangliosides 
(see Fig. 10-13) — the head-group sugar is attached di- 
rectly to the C-l hydroxyl of sphingosine in glycosidic 
linkage rather than through a phosphodiester bond. 
The sugar donor is a UDP-sugar (UDP-glucose or UDP- 
galactose). 



830 J Lipid Biosynthesis 



O 



Ri-COCT 



R 1 -^ 

X S-CoA 

Fatty acyl-CoA 

+ 

CH 2 OH 
I 

c=o o 

I II 

CH 2 — O— P-O" 
I 

O" 

Dihydroxyacetone 
phosphate 



CH 2 — O— C— R l 



0 



~? fatty ;jl vI 
group 



CoA-SH 



> CH 2 — O— P— CT 
O" 

1 - Acyldihydroxy acetone 
3-phosphate 



R 2 — CH 2 — CH 2 — OH 
Saturated fatty alcohol 



0 



R 2 ~ CH 2 -C 



S CoA 



2NADP - CoA-SH 



2H + 

FIGURE 21-30 Synthesis of ether lipids and plasmalogens. The 

newly formed ether linkage is shaded pink. The intermediate 1- 
alkyl-2-acylglyceroI 3-phosphate is the ether analog of phospha- 
tide acid. Mechanisms for attaching head groups to ether lipids are 
essentially the same as for their ester-linked analogs. The character- 
istic double bond of plasmalogens {shaded blue) is introduced in a 
final step by a mixed-function oxidase system similar to that shown 
in Figure21-13. 



Polar Lipids Are Targeted to Specific 
Cellular Membranes 

After synthesis on the smooth ER, the polar lipids, in- 
cluding the glycerophospholipids, sphingolipids, and 
glycolipids, are inserted into specific cellular mem- 
branes in specific proportions, by mechanisms not yet 
understood. Membrane lipids are insoluble in water, so 
they cannot simply diffuse from their point of synthesis 
(the ER) to their point of insertion. Instead, they are 
transported from the ER to the Golgi complex, where 
additional synthesis can take place. They are then deliv- 
ered in membrane vesicles that bud from the Golgi com- 
plex then move to and fuse with the target membrane 
(see Fig. 11-22). These pathways are not completely 
understood, but progress is being made. A 68 kDa 
protein called CERT (for ceramide transport) trans- 
ports ceramide from the ER to the Golgi complex. 
Cytosolic proteins also bind phospholipids and sterols 
and transport them between cellular membranes. These 
mechanisms contribute to the establishment of the 
characteristic lipid compositions of organelle mem- 
branes (see Fig. 11-2). 
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FIGURE 21-31 Biosynthesis of sphingolipids. Condensation of 
palmitoyl-CoA and serine (forming 0-ketosphinganine) followed by 
reduction with NADPH yields sphinganine, which is then acylated to 
N-acylsphinganine (a ceramide). In animals, a double bond (shaded 
pink) is created by a mixed-function oxidase before the final addition 
of a head group: phosphatidylcholine, to form sphingomyelin, or 
glucose, to form a cerebroside. 



Diacylglycerols are the principal precursors of 
glycerophospholipids. 

In bacteria, phosphatidylserine is formed by the 
condensation of serine with CDP-diacylglycerol; 
decarboxylation of phosphatidylserine produces 
phosphatidylethanolamine. Phosphatidylglycerol is 
formed by condensation of CDP-diacylglycerol with 
glycerol 3-phosphate, followed by removal of the 
phosphate in monoester linkage. 

Yeast pathways for the synthesis of 
phosphatidylserine, phosphatidylethanolamine, 
and phosphatidylglycerol are similar to those in 
bacteria; phosphatidylcholine is formed by 
methylation of phosphatidylethanolamine. 

Mammalian cells have some pathways similar to 
those in bacteria, but somewhat different routes 
for synthesizing phosphatidylcholine and 
phosphatidylethanolamine. The head-group alcohol 
(choline or ethanolamine) is activated as the CDP 
derivative, then condensed with diacylglycerol. 
Phosphatidylserine is derived only from 
phosphatidylethanolamine . 

Synthesis of plasmalogens involves formation of 
their characteristic double bond by a mixed-function 
oxidase. The head groups of sphingolipids are 
attached by unique mechanisms. 

Phospholipids travel to their intracellular 
destinations via transport vesicles or specific 
proteins. 
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21.4 Biosynthesis of Cholesterol, 
Steroids, and Isoprenoids 

Cholesterol is doubtless the most publicized lipid, notori- 
ous because of the strong correlation between high levels 
of cholesterol in the blood and the incidence of human 
cardiovascular diseases. Less well advertised is choles- 
terol's crucial role as a component of cellular membranes 
and as a precursor of steroid hormones and bile acids. 
Cholesterol is an essential molecule in many animals, in- 
cluding humans, but is not required in the mammalian 
diet — all cells can synthesize it from simple precursors. 

The structure of this 27-carbon compound suggests 
a complex biosynthetic pathway, but all of its carbon 
atoms are provided by a single precursor — acetate. The 
isoprene units that are the essential intermediates in 
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the pathway from acetate to cholesterol are also precur- 
sors to many other natural lipids, and the mechanisms 
by which isoprene units are polymerized are similar in 
all these pathways. 

CH 3 

CH2 = C — CH=CH'2 
Isoprene 

We begin with an account of the main steps in the 
biosynthesis of cholesterol from acetate, then discuss 
the transport of cholesterol in the blood, its uptake by 
cells, the normal regulation of cholesterol synthesis, and 
its regulation in those with defects in cholesterol uptake 
or transport. We next consider other cellular compo- 
nents derived from cholesterol, such as bile acids and 
steroid hormones. Finally, an outline of the biosynthetic 
pathways to some of the many compounds derived from 
isoprene units, which share early steps with the path- 
way to cholesterol, illustrates the extraordinary versatil- 
ity of isoprenoid condensations in biosynthesis. 

Cholesterol Is Made from Acetyi-CoA 
in Four Stages 

Cholesterol, like long-chain fatty acids, is made from 
acetyl-CoA, but the assembly plan is quite different. In 
early experiments, animals were fed acetate labeled 
with 14 C in either the methyl carbon or the carboxyl car- 
bon. The pattern of labeling in the cholesterol isolated 
from the two groups of animals (Fig. 21-32) provided 
the blueprint for working out the enzymatic steps in 
cholesterol biosynthesis. 

Synthesis takes place in four stages, as shown in 
Figure 21-33: ® condensation of three acetate units 
to form a six-carbon intermediate, mevalonate; (2) con- 
version of mevalonate to activated isoprene units; (3) 
polymerization of six 5-carbon isoprene units to form 
the 30-carbon linear squalene; and ® cyclization of 
squalene to form the four rings of the steroid nucleus, 
with a further series of changes (oxidations, removal or 
migration of methyl groups) to produce cholesterol. 
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FIGURE 21-32 Origin of the carbon atoms of cholesterol. This can be 
deduced from tracer experiments with acetate labeled in the methyl 
carbon (black) or the carboxyl carbon (red). The individual rings in the 
fused-ring system are designated A through D. 
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FIGURE 21-33 Summary of cholesterol biosynthesis. The four stages 
are discussed in the text. Isoprene units in squalene are set off by red 
dashed lines. 



Stage (T) Synthesis of Mevalonate from Acetate 

The first stage in cholesterol biosynthesis leads to the 
intermediate mevalonate (Fig. 21-34). Two mole- 
cules of acetyl-CoA condense to form acetoacetyl-CoA, 
which condenses with a third molecule of acetyl-CoA 
to yield the six-carbon compound /3-hydroxy-/3- 
methylglutaryl-CoA (HMG-CoA). These first two 
reactions are catalyzed by thiolase and HMG-CoA 
synthase, respectively. The cytosolic HMG-CoA syn- 
thase in this pathway is distinct from the mitochondrial 
isozyme that catalyzes HMG-CoA synthesis in ketone 
body formation (see Fig. 17-18). 

The third reaction is the committed and rate-limiting 
step: reduction of HMG-CoA to mevalonate, for which 
each of two molecules of NADPH donates two electrons. 
HMG-CoA reductase, an integral membrane protein of 
the smooth ER, is the major point of regulation on the 
pathway to cholesterol, as we shall see. 
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FIGURE 21-34 Formation of mevalonate from acetyl-CoA. The origin 
of C-1 and C-2 of mevalonate from acetyl-CoA is shown in pink. 



Stage (D Conversion of Mevalonate to TWo Acti- 
vated Isoprenes In the next stage of cholesterol syn- 
thesis, three phosphate groups are transferred from 
three ATP molecules to mevalonate (Fig. 21-35). The 
phosphate attached to the C-3 hydroxyl group of meval- 
onate in the intermediate 3-phospho-5-pyrophospho- 
mevalonate is a good leaving group; in the next step, 
both this phosphate and the nearby carboxyl group 
leave, producing a double bond in the five-carbon prod- 
uct, A 3 -isopentenyl pyrophosphate. This is the first 
of the two activated isoprenes central to cholesterol for- 
mation. Isomerization of A 3 -isopentenyl pyrophosphate 
yields the second activated isoprene, dimethylallyl py- 
rophosphate. Synthesis of isopentenyl pyrophosphate 
in the cytoplasm of plant cells follows the pathway de- 
scribed here. However, plant chloroplasts and many 
bacteria use a mevalonate-independent pathway. This 
alternative pathway does not occur in animals, so it is an 
attractive target for the development of new antibiotics. 



CH 3 

00C-CH 2 -C-CH 2 -CH 2 -OH 

12 3 I 4 5 

OH 

Mevalonate 
ADP 



mevalonate 
5-phosphotransferase 



CH 3 0 

OOC— CH 2 — C— CH 2 — CH 2 — O— P— O" 
I I 
OH O" 

5-Phosphomevalonate 



p hos phut nuvalonm m 

ADP 

GH 3 O O 

I II II 
OOC— CH 2 -C-CH 2 — CH 2 -0— P-0— P-0" 

I I I 

OH O" O" 



pyrupbtjflpJich 
rlouarboxylfiHe 



CH, 



5-Py roph 0 sphomev a 1 ona te 
ATP 

ADP 



O 



0 
II 



OOC— CH a — C-CH 2 -CH 2 —0— P-0— P-0 
O O" O" 



3-Phospho-5- 
pyrophosphomevalonate 




Activated 
isoprenes 



CHo 



pyrophoepho* 

IIN-V th.ll.tf.l' 

dtif [irboxyJiiistf 

CH 3 

I 

-C — CH 2 — CH 2 

isopentenyl J 
pyrophosphate 
(IPP) isomerase 



^> C0 2 , Vi 
0 



o 



-0— P— O— P— 0 

I I 

0" O" 
A 3 -Isopentenyl pyrophosphate 



CH, 



CH 3 
-C 



0 O 

CH— CH 2 — O— P— O— P— O" 

1 I 

0" cr 

Dimethylallyl pyrophosphate 

FIGURE 21-35 Conversion of mevalonate to activated isoprene units. 

Six of these activated units combine to form squalene (see Fig. 21-36). 
The leaving groups of 3-phospho-5-pyrophosphomevalonate are 
shaded pink. The bracketed intermediate is hypothetical. 



Stage (3) Condensation of Six Activated Isoprene 
Units to Form Squalene Isopentenyl pyrophos- 
phate and dimethylallyl pyrophosphate now undergo a 
head-to-tail condensation, in which one pyrophosphate 
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group is displaced and a 10-carbon chain, geranyl py- 
rophosphate, is formed (Fig. 21-36). (The "head" is the end 
to which pyrophosphate is joined.) Geranyl pyrophosphate 
undergoes another head-to-tail condensation with isopen- 
tenyl pyrophosphate, yielding the 15-carbon intermediate 
farnesyl pyrophosphate. Finally, two molecules of farne- 
syl pyrophosphate join head to head, with the elimination of 
both pyrophosphate groups, to form squalene. 

The common names of these intermediates derive 
from the sources from which they were first isolated. 
Geraniol, a component of rose oil, has the aroma of gera- 
niums, and farnesol is an aromatic compound found in 
the flowers of the Farnese acacia tree. Many natural 
scents of plant origin are synthesized from isoprene 
units. Squalene, first isolated from the liver of sharks 
(genus Squalus), has 30 carbons, 24 in the main chain 
and 6 in the form of methyl group branches. 



Stage (4) Conversion of Squalene to the Four-Ring 
Steroid Nucleus When the squalene molecule is rep- 
resented as in Figure 21-37, the relationship of its linear 
structure to the cyclic structure of the sterols becomes 
apparent. All sterols have the four fused rings that form 
the steroid nucleus, and all are alcohols, with a hydroxyl 
group at C-3 — thus the name "sterol." The action of 
squalene monooxygenase adds one oxygen atom from 
0 2 to the end of the squalene chain, forming an epoxide. 
This enzyme is another mixed-function oxidase (Box 
21-1); NADPH reduces the other oxygen atom of 0 2 to 
H 2 0. The double bonds of the product, squalene 2,3- 
epoxide, are positioned so that a remarkable concerted 
reaction can convert the linear squalene epoxide to a 
cyclic structure. In animal cells, this cyclization results in 
the formation of lanosterol, which contains the four 
rings characteristic of the steroid nucleus. Lanosterol is 
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FIGURE 21-36 Formation of squalene. This 30-carbon structure arises 
through successive condensations of activated isoprene (five-carbon) 
Squalene units. 
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finally converted to cholesterol in a series of about 20 re- 
actions that include the migration of some methyl groups 
and the removal of others. Elucidation of this extraordi- 



nary biosynthetic pathway, one of the most complex 
known, was accomplished by Konrad Bloch, Feodor Ly- 
nen, John Cornforth, and George Popjak in the late 1950s. 
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FIGURE 21-37 Ring closure converts linear 
squalene to the condensed steroid nucleus. 

The first step in this sequence is catalyzed by a 
mixed-function oxidase (a monooxygenase), 
for which the cosubstrate is NADPH. The 
product is an epoxide, which in the next step is 
cyclized to the steroid nucleus. The final prod- 
uct of these reactions in animal cells is choles- 
terol; in other organisms, slightly different 
sterols are produced, as shown. 
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Cholesterol is the sterol characteristic of animal cells; 
plants, fungi, and protists make other, closely related 
sterols instead. They use the same synthetic pathway as 
far as squalene 2,3-epoxide, at which point the pathways 
diverge slightly yielding other sterols, such as stigmas- 
terol in many plants and ergosterol in fungi (Fig. 21-37). 

■■ WORKED EXAMPLE 21-1 Energetic Cost of 

Squalene Synthesis 

What is the energetic cost of the synthesis of squalene 
from acetyl-CoA, in number of ATPs per molecule of 
squalene synthesized? 

Solution: In the pathway from acetyl-CoA to squalene, 
ATP is consumed only in the steps that convert meval- 
onate to the activated isoprene precursors of squalene. 
Three ATP molecules are used to create each of the six 
activated isoprenes required to construct squalene, for a 
total cost of 18 ATP molecules. 



Cholesterol Has Several Fates 

Much of the cholesterol synthesis in vertebrates takes 
place in the liver. A small fraction of the cholesterol 
made there is incorporated into the membranes of hepa- 
tocytes, but most of it is exported in one of three forms: 
biliary cholesterol, bile acids, or cholesteryl esters. Bile 
acids and their salts are relatively hydrophilic choles- 
terol derivatives that are synthesized in the liver and aid 
in lipid digestion (see Fig. 17-1). Cholesteryl esters 
are formed in the liver through the action of acyl- 
CoA-cholesterol acyl transferase (ACAT). This 
enzyme catalyzes the transfer of a fatty acid from coen- 
zyme A to the hydroxyl group of cholesterol (Fig. 
21-38), converting the cholesterol to a more hydropho- 
bic form. Cholesteryl esters are transported in secreted 
lipoprotein particles to other tissues that use choles- 
terol, or they are stored in the liver. 

All growing animal tissues need cholesterol for 
membrane synthesis, and some organs (adrenal gland 
and gonads, for example) use cholesterol as a precursor 
for steroid hormone production (discussed below). Cho- 
lesterol is also a precursor of vitamin D (see Fig. 10-20). 
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FIGURE 21-38 Synthesis of cholesteryl esters. Esterification converts 
cholesterol to an even more hydrophobic form for storage and transport. 



Cholesterol and Other Lipids Are Carried on 
Plasma Lipoproteins 

Cholesterol and cholesteryl esters, like triacylglycerols 
and phospholipids, are essentially insoluble in water, 
yet must be moved from the tissue of origin to the tissues 
in which they will be stored or consumed. They are carried 
in the blood plasma as plasma lipoproteins, macromole- 
cular complexes of specific carrier proteins, apolipopro- 
teins, with various combinations of phospholipids, 
cholesterol, cholesteryl esters, and triacylglycerols. 

Apolipoproteins ("apo" designates the protein in its 
lipid-free form) combine with lipids to form several 
classes of lipoprotein particles, spherical complexes 
with hydrophobic lipids in the core and hydrophilic 
amino acid side chains at the surface (Fig. 21-39a). 
Different combinations of lipids and proteins produce 
particles of different densities, ranging from chylomi- 
crons to high-density lipoproteins. These particles can 
be separated by ultracentrifugation (Table 21-1) and 
visualized by electron microscopy (Fig. 21-39b). 



TABLE 21-1 



Major Classes of Human Plasma Lipoproteins; Some Propertie 

Composition (wt %) 



Lipoprotein Density (g/taL) Protein Phospholipids Free cholesterol Cholesteryl esters Triacylglycerols 



Chylomicrons 

VLDL 

LDL 

HDL 



<1.006 
0.95-1.006 
1.006-1.063 
1.063-1.210 



2 

10 

23 
55 



9 

18 
20 
24 



3 
12 
37 
15 



85 
50 
10 
4 



Source: Modified from Kritchevsky, D. (1986) Atherosclerosis and nutrition. Nutr. Int. 2, 290-297. 
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FIGURE 21-39 Lipoproteins, (a) Structure of a low-density lipoprotein 
(LDL). Apolipoprotein B-100 (apoB-100) is one of the largest single 
polypeptide chains known, with 4,636 amino acid residues (M r 
513,000). One particle of LDL contains a core with about 1,500 mol- 
ecules of cholesteryl esters, surrounded by a shell composed of about 
500 more molecules of cholesterol, 800 molecules of phospholipids, 
and one molecule of apoB-100. (b) Four classes of lipoproteins, visu- 
alized in the electron microscope after negative staining. Clockwise 
from top left: chylomicrons, 50 to 200 nm in diameter; VLDL, 28 to 70 
nm; HDL, 8 to 1 1 nm; and LDL, 20 to 25 nm. For properties of lipopro- 
teins, see Table 21-1 . 



Each class of lipoprotein has a specific function, 
determined by its point of synthesis, lipid composition, 
and apolipoprotein content. At least ten different 
apolipoproteins are found in the lipoproteins of human 
plasma (Table 21-2), distinguishable by their size, their 
reactions with specific antibodies, and their characteris- 
tic distribution in the lipoprotein classes. These protein 
components act as signals, targeting lipoproteins to 
specific tissues or activating enzymes that act on the 
lipoproteins. 



f TABLE 21-2 


Apolipoproteins of the Human Plasma Lipoproteins 


i 


Apolipoprotein 




Molecular weight 


Lipoprotein association 


Function (if known) 


ApoA-I 




28,331 


HDL 


Activates LCAT; interacts with 
ABC transporter 


ApoA-II 




17,380 


HDL 


Inhibits LCAT 


ApoA-IV 




44,000 


Chylomicrons, HDL 


Activates LCAT; cholesterol 
transport/clearance 


ApoB-48 




240,000 


Chylomicrons 


Cholesterol transport/clearance 


ApoB-lOO 




513,000 


VLDL, LDL 


Binds to LDL receptor 


ApoC-I 




7,000 


VLDL, HDL 




ApoC-H 




8,837 


Chylomicrons, VLDL, HDL 


Activates lipoprotein lipase 


ApoC-III 




8,751 


Chylomicrons, VLDL, HDL 


Inhibits lipoprotein lipase 


ApoD 




32,500 


HDL 




ApoE 




34,145 


Chylomicrons, VLDL, HDL 


Triggers clearance of VLDL and 
chylomicron remnants 



Source: Modified from Vance, D.E. & Vance, J.E. (eds) (1985) Biochemistry of Lipids and Membranes. The Benjamin/Cummings Publishing 
Company, Menlo Park, CA. 
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Chylomicrons, discussed in Chapter 17 in connec- 
tion with the movement of dietary triacylglycerols from 
the intestine to other tissues, are the largest of the 
lipoproteins and the least dense, containing a high pro- 
portion of triacylglycerols (see Fig. 17-2). Chylomi- 
crons are synthesized in the ER of epithelial cells that 
line the small intestine, then move through the lym- 
phatic system and enter the bloodstream via the left 
subclavian vein. The apolipoproteins of chylomicrons in- 
clude apoB-48 (unique to this class of lipoproteins), 
apoE, and apoC-II (Table 21-2). ApoC-II activates 
lipoprotein lipase in the capillaries of adipose, heart, 
skeletal muscle, and lactating mammary tissues, allow- 
ing the release of free fatty acids to these tissues. Chy- 
lomicrons thus carry dietary fatty acids to tissues where 
they will be consumed or stored as fuel (Fig. 21-40). 
The remnants of chylomicrons (depleted of most of 
their triacylglycerols but still containing cholesterol, 



apoE, and apoB-48) move through the bloodstream to 
the liver. Receptors in the liver bind to the apoE in the 
chylomicron remnants and mediate their uptake by en- 
docytosis. In the liver, the remnants release their cho- 
lesterol and are degraded in lysosomes. 

When the diet contains more fatty acids than are 
needed immediately as fuel, they are converted to tria- 
cylglycerols in the liver and packaged with specific 
apolipoproteins into very-low-density lipoprotein 
(VLDL). Excess carbohydrate in the diet can also be 
converted to triacylglycerols in the liver and exported as 
VLDLs (Fig. 21-40a). In addition to triacylglycerols, 
VLDLs contain some cholesterol and cholesteryl esters, 
as well as apoB-100, apoC-I, apoC-II, apoC-III, and apoE 
(Table 21-2). These lipoproteins are transported in the 
blood from the liver to muscle and adipose tissue, where 
activation of lipoprotein lipase by apoC-II causes the re- 
lease of free fatty acids from the VLDL triacylglycerols. 
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FIGURE 21-40 Lipoproteins and lipid transport, (a) Lipids are trans- 
ported in the bloodstream as lipoproteins, which exist as several vari- 
ants that have different functions, different protein and lipid 
compositions (see Tables 21-1, 21-2), and thus different densities. Di- 
etary lipids are packaged into chylomicrons; much of their triacylglyc- 
erol content is released by lipoprotein lipase to adipose and muscle 
tissues during transport through capillaries. Chylomicron remnants 
(containing largely protein and cholesterol) are taken up by the liver. 
Endogenous lipids and cholesterol from the liver are delivered to adi- 
pose and muscle tissue by VLDL. Extraction of lipid from VLDL (along 




HDL precursors 
(from liver and 
intestine) 



Blood plasma Blood plasma 
after fast after meal 

(b) 

with loss of some apolipoproteins) gradually converts some of it to 
LDL, which delivers cholesterol to extrahepatic tissues or returns to 
the liver. The liver takes up LDL, VLDL remnants (called intermediate 
density lipoprotein, or IDL), and chylomicron remnants by receptor- 
mediated endocytosis. Excess cholesterol in extrahepatic tissues is 
transported back to the liver as HDL. In the liver, some cholesterol is 
converted to bile salts, (b) Blood plasma samples collected after a fast 
(left) and after a high-fat meal (right). Chylomicrons produced after a 
fatty meal give the plasma a milky appearance. 
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ApoE Alleles Predict Incidence of Alzheimer's Disease 



IOX 21 



In the human population there are three common vari- 
ants, or alleles, of the gene encoding apolipoprotein E. 
The most common, accounting for about 78% of human 
apoE alleles, is APOE3; alleles APOE4 and APOE2 ac- 
count for 15% and 7%, respectively. The APOE4 allele is 
particularly common in humans with Alzheimer's dis- 
ease, and the link is highly predictive. Individuals who 
inherit APOE4 have an increased risk of late-onset 
Alzheimer's disease. Those who are homozygous for 
APOE4 have a 16-fold increased risk of developing the 
disease; for those who do, the mean age of onset is just 
under 70 years. For people who inherit two copies of 
APOE3, by contrast, the mean age of onset of Alzheimer's 
disease exceeds 90 years. 



The molecular basis for the association between 
apoE-4 and Alzheimer's disease is not yet known. It is 
also not clear how apoE-4 might affect the growth of the 
amyloid fibers that appear to be the primary causative 
agents of Alzheimer's (see Fig. 4-31). Speculation has 
focused on a possible role for apoE in stabilizing the cy- 
toskeletal structure of neurons. The apoE-2 and apoE-3 
proteins bind to a number of proteins associated with 
neuronal microtubules, whereas apoE-4 does not. This 
may accelerate the death of neurons. Whatever the 
mechanism proves to be, these observations promise to 
expand our understanding of the biological functions of 
apolipoproteins. 



Adipocytes take up these fatty acids, reconvert them to 
triacylglycerols, and store the products in intracellular 
lipid droplets; myocytes, in contrast, primarily oxidize 
the fatty acids to supply energy. Most VLDL remnants 
are removed from the circulation by hepatocytes. The 
uptake, like that for chylomicrons, is receptor-mediated 
and depends on the presence of apoE in the VLDL rem- 
nants. Box 21-2 describes a link between apoE and 
Alzheimer's disease. 

The loss of triacylglycerol converts some VLDL to 
VLDL remnants (also called intermediate density 
lipoprotein, IDL); further removal of triacylglycerol 
from VLDL produces low-density lipoprotein (LDL) 
(Table 21-1). Very rich in cholesterol and cholesteryl 
esters and containing apoB- 1 00 as their maj or 
apolipoprotein, LDLs carry cholesterol to extrahepatic 
tissues that have specific plasma membrane receptors 
that recognize apoB-100. These receptors mediate the 
uptake of cholesterol and cholesteryl esters in a process 
described below. 

The fourth major lipoprotein type, high-density 
lipoprotein (HDL), originates in the liver and small 
intestine as small, protein-rich particles that contain 
relatively little cholesterol and no cholesteryl esters 
(Fig. 21-40). HDLs contain apoA-I, apoC-I, apoC-II, 
and other apolipoproteins (Table 21-2), as well as 
the enzyme lecithin-cholesterol acyl transferase 
(LCAT), which catalyzes the formation of cholesteryl 
esters from lecithin (phosphatidylcholine) and choles- 
terol (Fig. 21-41). LCAT on the surface of nascent 
(newly forming) HDL particles converts the cholesterol 
and phosphatidylcholine of chylomicron and VLDL 
remnants to cholesteryl esters, which begin to form a 
core, transforming the disk-shaped nascent HDL to a 
mature, spherical HDL particle. This cholesterol-rich 
lipoprotein then returns to the liver, where the choles- 
terol is unloaded; some of this cholesterol is converted 
to bile salts. 
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FIGURE 21-41 Reaction catalyzed by lecithin-cholesterol acyl trans- 
ferase (LCAT). This enzyme is present on the surface of HDL and is 
stimulated by the HDL component apoA-I. Cholesteryl esters accumu- 
late within nascent HDLs, converting them to mature HDLs. 
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HDL may be taken up in the liver by receptor- 
mediated endocytosis, but at least some of the choles- 
terol in HDL is delivered to other tissues by a novel 
mechanism. HDL can bind to plasma membrane recep- 
tor proteins called SR-BI in hepatic and steroidogenic 
tissues such as the adrenal gland. These receptors me- 
diate not endocytosis but a partial and selective transfer 
of cholesterol and other lipids in HDL into the cell. De- 
pleted HDL then dissociates to recirculate in the blood- 
stream and extract more lipids from chylomicron and 
VLDL remnants. Depleted HDL can also pick up choles- 
terol stored in extrahepatic tissues and carry it to the 
liver, in reverse cholesterol transport pathways 
(Fig. 21-40). In one reverse transport path, interaction 
of nascent HDL with SR-BI receptors in cholesterol-rich 
cells triggers passive movement of cholesterol from the 
cell surface into HDL, which then carries it back to the 
liver. In a second pathway, apoA-I in depleted HDL in- 
teracts with an active transporter, the ABC1 protein, in 
a cholesterol-rich cell. The apoA-I (and presumably the 
HDL) is taken up by endocytosis, then resecreted with a 
load of cholesterol, which it transports to the liver. 

The ABC1 protein is a member of a large family of 
multidrug transporters, sometimes called ABC trans- 
porters because they all have 74TP-6inding cassettes; 



they also have two transmembrane domains with six 
transmembrane helices (Chapter 11). These proteins 
actively transport a variety of ions, amino acids, vita- 
mins, steroid hormones, and bile salts across plasma 
membranes. The CFTR protein that is defective in cys- 
tic fibrosis (see Box 11-3) is another member of this 
ABC family of multidrug transporters. 

Cholesteryl Esters Enter Cells by 
Receptor-Mediated Endocytosis 

Each LDL particle in the bloodstream contains apoB-100, 
which is recognized by specific surface receptor proteins, 
LDL receptors, on cells that need to take up cholesterol. 
The binding of LDL to an LDL receptor initiates endocy- 
tosis, which conveys the LDL and its receptor into the cell 
within an endosome (Fig. 21-42). The endosome even- 
tually fuses with a lysosome, which contains enzymes that 
hydrolyze the cholesteryl esters, releasing cholesterol 
and fatty acid into the cytosol. The apoB-100 of LDL is 
also degraded to amino acids that are released to the cy- 
tosol, but the LDL receptor escapes degradation and is 
returned to the cell surface, to function again in LDL 
uptake. ApoB-100 is also present in VLDL, but its recep- 
tor-binding domain is not available for binding to the 
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FIGURE 21-42 Uptake of cholesterol by receptor-mediated endocytosis. 
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LDL receptor; conversion of VLDL to LDL exposes the re- 
ceptor-binding domain of apoB-100. This pathway for the 
transport of cholesterol in blood and its receptor-medi- 
ated endocytosis by target tissues was elucidated by 
Michael Brown and Joseph Goldstein. 




Michael Brown and Joseph Goldstein 



Cholesterol that enters cells by this path may be in- 
corporated into membranes or reesterified by ACAT (Fig. 
21-38) for storage within cytosolic lipid droplets. Accu- 
mulation of excess intracellular cholesterol is prevented 
by reducing the rate of cholesterol synthesis when suffi- 
cient cholesterol is available from LDL in the blood. 

The LDL receptor also binds to apoE and plays a 
significant role in the hepatic uptake of chylomicrons 
and VLDL remnants. However, if LDL receptors are un- 
available (as, for example, in a mouse strain that lacks 
the gene for the LDL receptor), VLDL remnants and 
chylomicrons are still taken up by the liver even though 
LDL is not. This indicates the presence of a back-up 
system for receptor-mediated endocytosis of VLDL 
remnants and chylomicrons. One back-up receptor is 
lipoprotein receptor-related protein (LRP), which binds 
to apoE as well as to a number of other ligands. 



Cholesterol Biosynthesis Is Regulated at Several Levels 

Cholesterol synthesis is a complex and energy-expensive 
process, so it is clearly advantageous to an organism to 
regulate the biosynthesis of cholesterol to complement 
dietary intake. In mammals, cholesterol production is reg- 
ulated by intracellular cholesterol concentration and by 
the hormones glucagon and insulin. The rate-limiting step 
in the pathway to cholesterol (and a major site of regula- 
tion) is the conversion of HMG-CoA to mevalonate (Fig. 
21-34), the reaction catalyzed by HMG-CoA reductase. 

Regulation in response to cholesterol levels is medi- 
ated by an elegant system of transcriptional regulation 
of the gene encoding HMG-CoA reductase. This gene, 
along with more than 20 other genes encoding enzymes 
that mediate the uptake and synthesis of cholesterol 
and unsaturated fatty acids, is controlled by a small fam- 
ily of proteins called sterol regulatory element-binding 
proteins (SREBPs). When newly synthesized, these 
proteins are embedded in the ER. Only the soluble 
amino-terminal domain of an SREBP functions as a tran- 
scriptional activator, using mechanisms discussed in 
Chapter 28. However, this domain has no access to the 
nucleus and cannot participate in gene activation while 
it remains part of the SREBP molecule. To activate tran- 
scription of the HMG-CoA reductase gene and other 
genes, the transcriptionally active domain is separated 
from the rest of the SREBP by proteolytic cleavage. 
When cholesterol levels are high, SREBPs are inactive, 
secured to the ER in a complex with another protein 
called SREBP cleavage-activating protein (SCAP) (Fig. 
21-43). It is SCAP that binds cholesterol and a number 
of other sterols, thus acting as a sterol sensor. When 
sterol levels are high, the SCAP-SREBP complex proba- 
bly interacts with another protein that retains the entire 
complex in the ER. When the level of sterols in the cell 
declines, a conformational change in SCAP causes release 
of the SCAP-SREBP complex from the ER-retention ac- 
tivity, and the complex migrates within vesicles to the 
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FIGURE 21-43 SREBP activation. Sterol regulatory element-binding 
proteins (SREBPs, shown in green) are embedded in the ER when first 
synthesized, in a complex with the protein SREBP cleavage-activating 
protein (SCAP, red). (N and C represent the amino and carboxyl termini 
of the proteins.) When bound to SCAP, SREBPs are inactive. When 



sterol levels decline, the complex migrates to the Golgi complex, and 
SREBP is cleaved by two different proteases in succession. The liber- 
ated amino-terminal domain of SREBP migrates to the nucleus, where 
it activates transcription of sterol-regulated genes. 
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Golgi complex. In the Golgi complex, SREBP is cleaved 
twice by two different proteases, the second cleavage 
releasing the amino-terminal domain into the cytosol. 
This domain travels to the nucleus and activates tran- 
scription of its target genes. The amino-terminal domain 
of SREBP has a short half -life and is rapidly degraded by 
proteasomes (see Fig. 27-48). When sterol levels in- 
crease sufficiently, the proteolytic release of SREBP 
amino-terminal domains is again blocked, and protea- 
some degradation of the existing active domains results 
in a rapid shut-down of the gene targets. 

Several other mechanisms also regulate cholesterol 
synthesis (Fig. 21-44). Hormonal control is mediated 
by covalent modification of HMG-CoA reductase itself. 
The enzyme exists in phosphorylated (inactive) and de- 
phosphorylated (active) forms. Glucagon stimulates 
phosphorylation (inactivation) , and insulin promotes 
dephosphorylation, activating the enzyme and favoring 
cholesterol synthesis. High intracellular concentrations 
of cholesterol activate ACAT, which increases esterifi- 
cation of cholesterol for storage. Finally, a high cellular 
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FIGURE 21-44 Regulation of cholesterol formation balances synthesis 
with dietary uptake. Glucagon promotes phosphorylation (inactiva- 
tion) of HMG-CoA reductase; insulin promotes dephosphorylation (ac- 
tivation). X represents unidentified metabolites of cholesterol that 
stimulate proteolysis of HMG-CoA reductase. 
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Coronary heart disease is the leading cause of death in 
developed countries. The coronary arteries that bring 
blood to the heart become narrowed due to the forma- 
tion of fatty deposits called atherosclerotic plaques 
(containing cholesterol, fibrous proteins, calcium de- 
posits, blood platelets, and cell debris). Developing the 
link between artery occlusion (atherosclerosis) and 
blood cholesterol levels was a project of the 20 th cen- 
tury, triggering a dispute that was resolved only with the 
development of effective cholesterol-lowering drugs. 

In 1913, N.N. Anitschkov, an experimental patholo- 
gist in St. Petersburg, Russia, published a study showing 
that rabbits fed a diet rich in cholesterol developed le- 
sions very similar to the atherosclerotic plaques seen in 
aging humans. Anitschkov continued his work over the 
next few decades, publishing it in prominent western 
journals. Nevertheless, the work was not accepted as a 
model for human atherosclerosis, due to a prevailing 
view that the disease was a simple consequence of aging 
and could not be prevented. The link between serum 
cholesterol and atherosclerosis (the lipid hypothesis) 
was gradually strengthened, until some researchers in 
the 1960s openly suggested that therapeutic interven- 
tion might be helpful. However, controversy persisted 
until the results of a large study of cholesterol lowering, 
sponsored by the United States National Institutes of 
Health, was published in 1984: the Coronary Primary 
Prevention Trial. This study conclusively showed a sta- 
tistically significant decrease in heart attacks and 
strokes as a result of decreasing cholesterol level. The 
study made use of a bile acid-binding resin, cholestyra- 
mine, to control cholesterol. The results triggered a 



ipid Hypothesis and the Development of Statins 



search for more effective therapeutic interventions. 
Some controversy persisted until the development of 
the statins in the late 1980s and 1990s. 

Dr. Akira Endo, working at the Sankyo company in 
Tokyo, discovered the first statin and reported the work in 
1976. Endo had been interested in cholesterol metabolism 
for some time, and speculated in 1971 that the fungi being 
screened at that time for new antibiotics might also contain 
an inhibitor of cholesterol synthesis. Over a period of sev- 
eral years, he screened more than 6,000 fungus cultures 
until a positive result was found. The compound that re- 
sulted was named compactin (Fig. 1). The compound 
eventually proved effective in reducing cholesterol levels 
in dogs and monkeys, and the work came to the attention 
of Michael Brown and Joseph Goldstein at the University of 
Texas-Southwestern medical school. Brown and Goldstein 
began to work with Endo, and confirmed his results. Some 
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FIGURE 1 Statins as inhibitors of HMG-CoA reductase. A comparison 
of the structures of mevalonate and four pharmaceutical compounds 
(statins) that inhibit HMG-CoA reductase. 
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cholesterol level diminishes transcription of the gene 
that encodes the LDL receptor, reducing production of 
the receptor and thus the uptake of cholesterol from 
the blood. 

H Unregulated cholesterol production can lead to 
serious human disease. When the sum of choles- 
terol synthesized and cholesterol obtained in the diet 
exceeds the amount required for the synthesis of mem- 
branes, bile salts, and steroids, pathological accumula- 
tions of cholesterol in blood vessels (atherosclerotic 
plaques) can develop, resulting in obstruction of blood 
vessels (atherosclerosis). Heart failure due to oc- 
cluded coronary arteries is a leading cause of death in 
industrialized societies. Atherosclerosis is linked to high 
levels of cholesterol in the blood, and particularly to high 
levels of LDL-bound cholesterol; there is a negative cor- 
relation between HDL levels and arterial disease. 

In familial hypercholesterolemia, a human genetic dis- 
order, blood levels of cholesterol are extremely high and 
severe atherosclerosis develops in childhood. These indi- 
viduals have a defective LDL receptor and lack receptor- 



mediated uptake of cholesterol carried by LDL. Conse- 
quently, cholesterol is not cleared from the blood; it accu- 
mulates and contributes to the formation of 
atherosclerotic plaques. Endogenous cholesterol synthesis 
continues despite the excessive cholesterol in the blood, 
because extracellular cholesterol cannot enter the cell to 
regulate intracellular synthesis (Fig. 21-44). A class of 
drugs called statins, some isolated from natural sources 
and some synthesized by the pharmaceutical industry, are 
used to treat patients with familial hypercholesterolemia 
and other conditions involving elevated serum cholesterol. 
The statins resemble mevalonate (Box 21-3) and are com- 
petitive inhibitors of HMG-CoA reductase. 

In familial HDL deficiency, HDL levels are very low; 
they are almost undetectable in Tangier disease. Both 
genetic disorders are the result of mutations in the 
ABC1 protein. Cholesterol-depleted HDL cannot take 
up cholesterol from cells that lack ABC1 protein, and 
cholesterol-poor HDL is rapidly removed from the blood 
and destroyed. Both familial HDL deficiency and Tang- 
ier disease are very rare (worldwide, fewer than 100 
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dramatic results in the first limited clinical trials convinced 
several pharmaceutical firms to join the hunt for statins. A 
team at Merck led by Alfred Alberts and P. Roy Vagelos be- 
gan screening fungal cultures and found a positive result 
after screening just 18 cultures. The new statin was even- 
tually called lovastatin (Fig. 1). A rumor that compactin, at 
very high doses, was carcinogenic in dogs almost sidelined 
the race to develop statins in 1980, but the benefits to pa- 
tients with familial hypercholesterolemia were already evi- 
dent. After much consultation with experts around the 
world and the U.S. Food and Drug Administration, Merck 
proceeded carefully to develop lovastatin. Extensive test- 
ing over the next two decades revealed no carcinogenic ef- 
fects from lovastatin or from the newer generations of 
statins that have appeared since. 

Statins inhibit HMG-CoA reductase in part by mimick- 
ing the structure of mevalonate (Fig. 1), and thus inhibit 



cholesterol synthesis. Lovastatin treat- 
ment lowers serum cholesterol by as 
much as 30% in individuals with hyper- 
cholesterolemia resulting from one de- 
fective copy of the gene for the LDL 
receptor. When combined with an edible 
resin that binds bile acids and prevents 
their reabsorption from the intestine, 
the drug is even more effective. 

Statins are now the most widely 
used drugs for lowering serum choles- 
terol levels. Side effects are always a 
concern with drugs, but statins repre- 
sent a case where many of the side ef- 
fects are positive. These drugs can 
improve blood flow, enhance the stability of atheroscle- 
rotic plaques (so they don't rupture and obstruct blood 
flow), reduce platelet aggregation, and reduce vascular 
inflammation. Some of these effects occur before choles- 
terol levels drop in patients taking statins for the first 
time, and may be related to a secondary inhibition of iso- 
prenoid synthesis by statins. Not all the effects of statins 
are positive. A few patients, usually among those taking 
statins in combination with other cholesterol-lowering 
drugs, experience muscle pain or weakness that can be- 
come severe and even debilitating. A fairly long list of 
other side effects has been documented in patients; most 
are rare. However, for the vast majority of patients, the 
statin-mediated decrease in the risks associated with 
coronary heart disease can be dramatic. As with all med- 
ications, statins should be used only in consultation with 
a physician. 
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families with Tangier disease are known), but the exis- 
tence of these diseases establishes a role for ABC1 pro- 
tein in the regulation of plasma HDL levels. Because low 
plasma HDL levels correlate with a high incidence of 
coronary heart disease, the ABC1 protein may prove a 
useful target for drugs to control HDL levels. ■ 

Steroid Hormones Are Formed by Side-Chain Cleavage 
and Oxidation of Cholesterol 

Humans derive all their steroid hormones from choles- 
terol (Fig. 21-45). Two classes of steroid hormones are 
synthesized in the cortex of the adrenal gland: miner- 
alocorticoids, which control the reabsorption of inor- 
ganic ions (Na + , CI", and HCO3) by the kidney and 
glucocorticoids, which help regulate gluconeogenesis 
and reduce the inflammatory response. Sex hormones 
are produced in male and female gonads and the pla- 
centa. They include progesterone, which regulates the 
female reproductive cycle, and androgens (such as 
testosterone) and estrogens (such as estradiol), which 
influence the development of secondary sexual charac- 
teristics in males and females, respectively. Steroid hor- 
mones are effective at very low concentrations and are 
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FIGURE 21-45 Some steroid hormones derived from cholesterol. The 

structures of some of these compounds are shown in Figure 10-19 

FIGURE 21-46 Side-chain cleavage in the synthesis of steroid hor- 
mones. Cytochrome P-450 acts as electron carrier in this mixed-function 
oxidase system that oxidizes adjacent carbons. The process also requires 
the electron-transferring proteins adrenodoxin and adrenodoxin reduc- 
tase This system for cleaving side chains is found in mitochondria of the 
adrenal cortex, where active steroid production occurs. Pregnenolone is 
the precursor of all other steroid hormones (see Fig. 21-45). 



therefore synthesized in relatively small quantities. In 
comparison with the bile salts, their production con- 
sumes relatively little cholesterol. 




Progesterone 

Synthesis of steroid hormones requires removal of 
some or all of the carbons in the "side chain" on C-17 of 
the D ring of cholesterol Side-chain removal takes place 
in the mitochondria of steroidogenic tissues. Removal 
involves the hydroxylation of two adjacent carbons in 
the side chain (C-20 and C-22) followed by cleavage of 
the bond between them (Fig. 21-46). Formation of the 
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various hormones also involves the introduction of oxy- 
gen atoms. All the hydroxylation and oxygenation reac- 
tions in steroid biosynthesis are catalyzed by mixed- 
function oxidases (Box 21-1) that use NADPH, 0 2 , and 
mitochondrial cytochrome P-450. 

Intermediates in Cholesterol Biosynthesis 
Have Many Alternative Fates 

In addition to its role as an intermediate in cholesterol 
biosynthesis, isopentenyl pyrophosphate is the activated 
precursor of a huge array of biomolecules with diverse 
biological roles (Fig. 21-47). They include vitamins A, 
E, and K; plant pigments such as carotene and the phytol 
chain of chlorophyll; natural rubber; many essential oils 
(such as the fragrant principles of lemon oil, eucalyptus, 
and musk); insect juvenile hormone, which controls 
metamorphosis; dolichols, which serve as lipid-soluble 
carriers in complex polysaccharide synthesis; and 
ubiquinone and plastoquinone, electron carriers in mito- 
chondria and chloroplasts. Collectively, these molecules 
are called isoprenoids. More than 20,000 different iso- 
prenoid molecules have been discovered in nature, and 
hundreds of new ones are reported each year. 

Prenylation (covalent attachment of an isoprenoid; 
see Fig. 27-35) is a common mechanism by which pro- 
teins are anchored to the inner surface of cellular mem- 
branes in mammals (see Fig. 11-14). In some of these 
proteins the attached lipid is the 15-carbon farnesyl 
group; others have the 20-carbon geranylgeranyl group. 
Different enzymes attach the two types of lipids. It is 
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FIGURE 21-47 Overview of isoprenoid biosynthesis. The structures of 
most of the end products shown here are given in Chapter 1 0. 



possible that prenylation reactions target proteins to 
different membranes, depending on which lipid is at- 
tached. Protein prenylation is another important role for 
the isoprene derivatives of the pathway to cholesterol. 
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Biosynthesis of Cholesterol, 
Steroids, and Isoprenoids 



Cholesterol is formed from acetyl-CoA in a complex 
series of reactions, through the intermediates 
)3-hydroxy-j8-methylglutaryl-CoA, mevalonate, and 
two activated isoprenes, dimethylallyl pyrophosphate 
and isopentenyl pyrophosphate. Condensation of 
isoprene units produces the noncyclic squalene, 
which is cyclized to yield the steroid ring system and 
side chain. 

Cholesterol synthesis is under hormonal control 
and is also inhibited by elevated concentrations of 
intracellular cholesterol, which acts through 
covalent modification and transcriptional regulation 
mechanisms. 

Cholesterol and cholesteryl esters are carried in 
the blood as plasma lipoproteins. VLDL carries 
cholesterol, cholesteryl esters, and triacylglycerols 
from the liver to other tissues, where the 
triacylglycerols are degraded by lipoprotein lipase, 
converting VLDL to LDL. The LDL, rich in 
cholesterol and its esters, is taken up by receptor- 
mediated endocytosis, in which the apolipoprotein 
B-100 of LDL is recognized by receptors in the 
plasma membrane. HDL removes cholesterol from 
the blood, carrying it to the liver. Dietary conditions 
or genetic defects in cholesterol metabolism may 
lead to atherosclerosis and heart disease. 

The steroid hormones (glucocorticoids, 
mineralocorticoids, and sex hormones) are 
produced from cholesterol by alteration of the side 
chain and introduction of oxygen atoms into the 
steroid ring system. In addition to cholesterol, a wide 
variety of isoprenoid compounds are derived from 
mevalonate through condensations of isopentenyl 
pyrophosphate and dimethylallyl pyrophosphate. 

Prenylation of certain proteins targets them for 
association with cellular membranes and is 
essential for their biological activity 
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Terms in bold are defined in the glossary, 

malonyl-CoA 805 mixed-function 
acetyl-CoA carboxylase 805 oxidases 815 

fatty acid synthase 806 essential fatty acids 815 

acyl carrier protein mixed-function 

(ACP) 808 oxygenases 816 

fatty acyl-CoA cytochrome P-450 816 

desaturase 815 prostaglandins 8 1 7 
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1. Pathway of Carbon in Fatty Acid Synthesis Using 
your knowledge of fatty acid biosynthesis, provide an explana- 
tion for the following experimental observations: 

(a) Addition of uniformly labeled [ 14 C]acetyl-CoA to a sol- 
uble liver fraction yields palmitate uniformly labeled with 14 C. 

(b) However, addition of a trace of uniformly labeled 
[ 14 C]acetyl-CoA in the presence of an excess of unlabeled mal- 
onyl-CoA to a soluble liver fraction yields palmitate labeled 
with 14 C only in C-15 and C-16. 

2. Synthesis of Fatty Acids from Glucose After a person 
has ingested large amounts of sucrose, the glucose and fruc- 
tose that exceed caloric requirements are transformed to fatty 
acids for triacylglycerol synthesis. This fatty acid synthesis 
consumes acetyl-CoA, ATP, and NADPH. How are these sub- 
stances produced from glucose? 

3. Net Equation of Fatty Acid Synthesis Write the net 
equation for the biosynthesis of palmitate in rat liver, starting 
from mitochondrial acetyl-CoA and cytosolic NADPH, ATP, 
and C0 2 . 



4. Pathway of Hydrogen in Fatty Acid Synthesis Con- 
sider a preparation that contains all the enzymes and cofactors 
necessary for fatty acid biosynthesis from added acetyl-CoA 
and malonyl-CoA. 

(a) If [2- 2 H]acetyl-CoA (labeled with deuterium, the heavy 
isotope of hydrogen) 



7 / 

2 H— C— C 



'S-CoA 



and an excess of unlabeled malonyl-CoA are added as sub- 
strates, how many deuterium atoms are incorporated into 
every molecule of palmitate? What are their locations? Explain, 
(b) If unlabeled acetyl-CoA and [2- 2 H]malonyl-CoA 



2 H 

-ooc-c-c 

2 H 



O 



S-CoA 



are added as substrates, how many deuterium atoms are in- 
corporated into every molecule of palmitate? What are their 
locations? Explain. 

5. Energetics of jS-Ketoacyl-ACP Synthase In the con- 
densation reaction catalyzed by /3-ketoacyl-ACP synthase (see 
Fig. 21-6), a four-carbon unit is synthesized by the combina- 
tion of a two-carbon unit and a three-carbon unit, with the re- 
lease of C0 2 . What is the thermodynamic advantage of this 
process over one that simply combines two two-carbon units? 

6. Modulation of Acetyl-CoA Carboxylase Acetyl-CoA 
carboxylase is the principal regulation point in the biosynthe- 
sis of fatty acids. Some of the properties of the enzyme are 
described below. 

(a) Addition of citrate or isocitrate raises the 7 max of the 
enzyme as much as 10-fold. 

(b) The enzyme exists in two interconvertible forms that 
differ markedly in their activities: 

Protomer (inactive) filamentous polymer (active) 
Citrate and isocitrate bind preferentially to the filamentous 
form, and palmitoyl-CoA binds preferentially to the protomer, 
Explain how these properties are consistent with the reg- 
ulatory role of acetyl-CoA carboxylase in the biosynthesis of 
fatty acids. 

7. Shuttling of Acetyl Groups across the Mitochondrial 
Inner Membrane The acetyl group of acetyl-CoA, produced 
by the oxidative decarboxylation of pyruvate in the mitochon- 
drion, is transferred to the cytosol by the acetyl group shuttle 
outlined in Figure 21-10. 

(a) Write the overall equation for the transfer of one acetyl 
group from the mitochondrion to the cytosol. 

(b) What is the cost of this process in ATPs per acetyl 
group? 

(c) In Chapter 17 we encountered an acyl group shuttle in 
the transfer of fatty acyl-CoA from the cytosol to the mito- 
chondrion in preparation for j8 oxidation (see Fig. 17-6). One 
result of that shuttle was separation of the mitochondrial and 
cytosolic pools of CoA. Does the acetyl group shuttle also 
accomplish this? Explain. 
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8. Oxygen Requirement for Desaturases The biosynthe- 
sis of palmitoleate (see Fig, 21-12) a common unsaturated 
fatty acid with a cis double bond in the A 9 position, uses palmi- 
tate as a precursor. Can this be carried out under strictly 
anaerobic conditions? Explain. 

9. Energy Cost of Iriacylglycerol Synthesis Use a net 

equation for the biosynthesis of tripalmitoylglycerol (tri- 
palmitin) from glycerol and palmitate to show how many ATPs 
are required per molecule of tripalmitin formed 

10. Turnover of THacylglycerols in Adipose Tissue When 
[ 14 C]glucose is added to the balanced diet of adult rats, there is 
no increase in the total amount of stored triacylglycerols, but 
the triacylglycerols become labeled with 14 C, Explain. 

11. Energy Cost of Phosphatidylcholine Synthesis Write 
the sequence of steps and the net reaction for the biosynthesis 
of phosphatidylcholine by the salvage pathway from oleate, 
palmitate, dihydroxyacetone phosphate, and choline. Starting 
from these precursors, what is the cost (in number of ATPs) of 
the synthesis of phosphatidylcholine by the salvage pathway? 

12. Salvage Pathway for Synthesis of Phosphatidyl- 
choline A young rat maintained on a diet deficient in methio- 
nine fails to thrive unless choline is included in the diet. Explain. 

13. Synthesis of Isopentenyl Pyrophosphate If 2 [ 14 C]acetyl- 
CoA is added to a rat liver homogenate that is synthesizing 
cholesterol, where will the 14 C label appear in A 3 -isopentenyl py- 
rophosphate, the activated form of an isoprene unit? 

14. Activated Donors in Lipid Synthesis In the biosynthe- 
sis of complex lipids, components are assembled by transfer of 
the appropriate group from an activated donor. For example, 
the activated donor of acetyl groups is acetyl-CoA. For each of 
the following groups, give the form of the activated donor: 
(a) phosphate; (b) D-glucosyl; (c) phosphoethanolamine; (d) D- 
galactosyl; (e) fatty acyl; (f) methyl; (g) the two-carbon group 
in fatty acid biosynthesis; (h) A 3 -isopentenyl. 

15. Importance of Fats in the Diet When young rats are 
placed on a totally fat-free diet, they grow poorly, develop a 
scaly dermatitis, lose hair, and soon die— symptoms that can 
be prevented if linoleate or plant material is included in the 
diet. What makes linoleate an essential fatty acid? Why can 
plant material be substituted? 

16. Regulation of Cholesterol Biosynthesis Cholesterol 
in humans can be obtained from the diet or synthesized de 
novo. An adult human on a low-cholesterol diet typically syn- 
thesizes 600 mg of cholesterol per day in the liver. If the 
amount of cholesterol in the diet is large, de novo synthesis of 
cholesterol is drastically reduced. How is this regulation 
brought about? 

17. Lowering Serum Cholesterol Levels with 
. Statins Patients treated with a statin drug generally 
exhibit a dramatic lowering of serum cholesterol. However, the 
amount of the enzyme HMG-CoA reductase present in cells can 
increase substantially. Suggest an explanation for this effect. 

18. Roles of Thiol Esters in Cholesterol Biosynthesis 

Draw a mechanism for each of the three reactions shown in 



Figure 21-34, detailing the pathway for the synthesis of meval- 
onate from acetyl-CoA. 

19. Potential Side Effects of Treatment with 
Statins Although clinical trials have not yet been car- 
ried out to document benefits or side effects, some physicians 
have suggested that patients being treated with statins also 
take a supplement of coenzyme Q. Suggest a rationale for this 
recommendation. 
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20. Engineering E. coli to Produce Large Quantities of 
an Isoprenoid There is a huge variety of naturally occurring 
isoprenoids, some of which are medically or commercially im- 
portant and produced industrially. The production methods 
include in vitro enzymatic synthesis, which is an expensive 
and low-yield process. In 1999, Wang, Oh, and Liao reported 
their experiments to engineer the easily grown bacterium E. 
coli to produce large amounts of astaxanthin, a commercially 
important isoprenoid 

Astaxanthin is a red-orange carotenoid pigment (an an- 
tioxidant) produced by marine algae. Marine animals such as 
shrimp, lobster, and some fish that feed on the algae get their 
orange color from the ingested astaxanthin. Astaxanthin is 
composed of eight isoprene units; its molecular formula is 
C40H52O4: 




Astaxanthin 

(a) Circle the eight isoprene units in the astaxanthin mol- 
ecule. Hint: Use the projecting methyl groups as a guide. 

Astaxanthin is synthesized by the pathway shown on the 
facing page, starting with A 3 -isopentenyl pyrophosphate (IPP). 
Steps CD and © are shown in Figure 21-36, and the reaction 
catalyzed by IPP isomerase is shown in Figure 21-35. 

(b) In step @ of the pathway, two molecules of geranyl- 
geranyl pyrophosphate are linked to form phytoene. Is this a 
head-to-head or a head-to-tail joining? (See Figure 21-36 for 
details.) 

(c) Briefly describe the chemical transformation in step (5) . 

(d) The synthesis of cholesterol (Fig. 21-37) includes a 
cyclization (ring closure) that involves a net oxidation by 0 2 . 
Does the cyclization in step © of the astaxanthin synthetic 
pathway require a net oxidation of the substrate (lycopene)? 
Explain your reasoning. 

E. coli does not make large quantities of many iso- 
prenoids, and does not synthesize astaxanthin. It is known 
to synthesize small amounts of IPP, DMAPP, geranyl py- 
rophosphate, farnesyl pyrophosphate, and geranylgeranyl 
pyrophosphate. Wang and colleagues cloned several of the 
E coli genes that encode enzymes needed for astaxanthin 
synthesis in plasmids that allow their overexpression. These 
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A 3 -Isopentenyl pyrophosphate (IPP) 

^ IPP isomerase 



G 



Y Dimethylallyl pyrophosphate 
(DMAPP) 



Geranyl pyrophosphate (C 10 ) + PPi 
(5V IPP 




Farnesyl pyrophosphate (C 15 ) + PPi 
(3) - IPP 



Geranylgeranyl pyrophosphate (C 2 q) + PPi 



Geranylgeranyl pyrophosphate 




^-Carotene (C 40 ) 

© 



Astaxanthin (C 4 n) 



genes included idi, which encodes IPP isomerase, and ispA, 
which encodes a prenyl transferase that catalyzes steps ® 
and®. 

To engineer an E coli capable of the complete astax- 
anthin pathway, Wang and colleagues cloned several genes 
from other bacteria into plasmids that would allow their 
overexpression inE. coli. These genes included crtE from 
Erwinia uredovora, which encodes an enzyme that cat- 
alyzes step ®; and crtB, crtl, crtY, crtZ, and crtW from 
Agrobacterium aurantiacum, which encode enzymes for 
steps (4), ©, ©, ®, and®, respectively. 



The investigators also cloned the gene gps from Ar- 
chaeoglobusfulgidus, overexpressed this gene mE. coli, and 
extracted the gene product. When this extract was reacted 
with [ 14 C]IPP and DMAPP, or geranyl pyrophosphate, or farne- 
syl pyrophosphate, only 14 C-labeled geranylgeranyl pyrophos- 
phate was produced in all cases. 

(e) Based on these data, which step(s) in the pathway 
are catalyzed by the enzyme encoded by gpsl Explain your 
reasoning. 

Wang and coworkers then constructed several E. coli strains 
overexpressing different genes and measured the orange color of 
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the colonies (wild-type E. coli colonies are off-white) and the 
amount of astaxanthin produced. Their results are shown below. 





Gene(s) 


Orange 


Astaxanthin yield 


Strain 


overexpressed 


color 


Og/g dry weight) 


1 


rrtRTZVW 




ND 


2 






ND 


3 


crtBIZYW, idi 


- 


ND 


4 


crtBIZYW, idi, ispA 




ND 


5 


crtBIZYW, crtE 


+ 


32.8 


6 


crtBIZYW, crtE, ispA 


+ 


35.3 


7 


crtBIZYW, crtE, idi 


+ + 


234.1 


8 


crtBIZYW, crtE, idi, ispA 


+ + + 


390.3 


9 


crtBIZYW, gps 


+ 


35.6 


10 


crtBIZYW, gps, idi 


+ + + 


1,418.8 



Note: ND, not determined 



(f) Comparing the results for strains 1 through 4 with 
those for strains 5 through 8, what can you conclude about the 
expression level of an enzyme capable of catalyzing step © of 
the astaxanthin synthetic pathway in wild-type E. coli? Explain 
your reasoning. 

(g) Based on the data above, which enzyme is rate-limiting 
in this pathway, IPP isomerase or the enzyme encoded by idi? 
Explain your reasoning. 

(h) Would you expect a strain overexpressing crtBIZYW, 
gps, and crtE to produce low (+), medium (++), or high 
( + + + ) levels of astaxanthin, as measured by its orange color? 
Explain your reasoning. 

Reference 

Wang, C.-W., Oh, M.-K., & Liao, J.C. (1999) Engineered iso- 
prenoid pathway enhances astaxanthin production in Escherichia 
coli Biotechnol. Bioeng 62,235-241. 



Time passes rapidly when you are having fun. The thrill of seeing 
people get well who might otherwise have died of disease . . . cannot 
be described in words. The Nobel Prize was only the icing on the cake. 

— Gertrude Elion, quoted in an article in Science, 2002 
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itrogen ranks behind only carbon, hydrogen, and 
oxygen in its contribution to the mass of living sys- 
tems. Most of this nitrogen is bound up in amino 
acids and nucleotides. In this chapter we address all as- 
pects of the metabolism of these nitrogen-containing 
compounds except amino acid catabolism, which is cov- 
ered in Chapter 18. 

Discussing the biosynthetic pathways for amino 
acids and nucleotides together is a sound approach, not 
only because both classes of molecules contain nitrogen 
(which arises from common biological sources) but be- 
cause the two sets of pathways are extensively inter- 
twined, with several key intermediates in common. 
Certain amino acids or parts of amino acids are incorpo- 
rated into the structure of purines and pyrimidines, and 
in one case part of a purine ring is incorporated into an 
amino acid (histidine). The two sets of pathways also 
share much common chemistry, in particular a prepon- 
derance of reactions involving the transfer of nitrogen or 
one-carbon groups. 

The pathways described here can be intimidating to 
the beginning biochemistry student. Their complexity 
arises not so much from the chemistry itself, which in 
many cases is well understood, but from the sheer num- 
ber of steps and variety of intermediates. These path- 
ways are best approached by maintaining a focus on 
metabolic principles we have already discussed, on key 



intermediates and precursors, and on common classes 
of reactions. Even a cursory look at the chemistry can be 
rewarding, for some of the most unusual chemical trans- 
formations in biological systems occur in these path- 
ways; for instance, we find prominent examples of the 
rare biological use of the metals molybdenum, selenium, 
and vanadium. The effort also offers a practical divi- 
dend, especially for students of human or veterinary 
medicine. Many genetic diseases of humans and animals 
have been traced to an absence of one or more enzymes 
of amino acid and nucleotide metabolism, and many 
pharmaceuticals in common use to combat infectious 
diseases are inhibitors of enzymes in these pathways — 
as are a number of the most important agents in cancer 
chemotherapy. 

Regulation is crucial in the biosynthesis of the 
nitrogen-containing compounds. Because each amino 
acid and each nucleotide is required in relatively small 
amounts, the metabolic flow through most of these path- 
ways is not nearly as great as the biosynthetic flow lead- 
ing to carbohydrate or fat in animal tissues. Because the 
different amino acids and nucleotides must be made in 
the correct ratios and at the right time for protein and 
nucleic acid synthesis, their biosynthetic pathways must 
be accurately regulated and coordinated with each 
other. And because amino acids and nucleotides are 
charged molecules, their levels must be regulated to 
maintain electrochemical balance in the cell. As dis- 
cussed in earlier chapters, pathways can be controlled by 
changes in either the activity or the amounts of specific 
enzymes. The pathways we encounter in this chapter 
provide some of the best-understood examples of the 
regulation of enzyme activity. Control of the amounts of 
different enzymes in a cell (that is, of their synthesis and 
degradation) is a topic covered in Chapter 28. 



851 



J 852 ! Biosynthesis of Amino Acids, Nucleotides, and Related Molecules 



22.1 Overview of Nitrogen Metabolism 

The biosynthetic pathways leading to amino acids and 
nucleotides share a requirement for nitrogen. Because 
soluble, biologically useful nitrogen compounds are gen- 
erally scarce in natural environments, most organisms 
maintain strict economy in their use of ammonia, amino 
acids, and nucleotides. Indeed, as we shall see, free 
amino acids, purines, and pyrimidines formed during 
metabolic turnover of proteins and nucleic acids are 
often salvaged and reused. We first examine the path- 
ways by which nitrogen from the environment is intro- 
duced into biological systems. 

The Nitrogen Cycle Maintains a Pool 
of Biologically Available Nitrogen 

The most important source of nitrogen is air, which is 
four-fifths molecular nitrogen (N 2 ). However, relatively 
few species can convert atmospheric nitrogen into 
forms useful to living organisms. In the biosphere, the 
metabolic processes of different species function inter- 
dependent^ to salvage and reuse biologically available 
nitrogen in a vast nitrogen cycle (Fig. 22-1). The 
first step in the cycle is fixation (reduction) of atmo- 
spheric nitrogen by nitrogen-fixing bacteria to yield am- 
monia (NH 3 or NH^"). Although ammonia can be used 
by most living organisms, soil bacteria that derive 
their energy by oxidizing ammonia to nitrite (NO2 ) 
and ultimately nitrate (NO3) are so abundant and 
active that nearly all ammonia reaching the soil is oxi- 
dized to nitrate. This process is known as nitrifica- 
tion. Plants and many bacteria can take up and 
readily reduce nitrate and nitrite through the action of 
nitrate and nitrite reductases. The ammonia so formed 
is incorporated into amino acids by plants. Animals 
then use plants as a source of amino acids, both 
nonessential and essential, to build their proteins. 
When organisms die, microbial degradation of their 
proteins returns ammonia to the soil, where nitrifying 
bacteria again convert it to nitrite and nitrate. A bal- 
ance is maintained between fixed nitrogen and atmo- 
spheric nitrogen by bacteria that convert nitrate to N 2 



under anaerobic conditions, a process called denitri- 
fication (Fig. 22-1). These soil bacteria use NO3 
rather than 0 2 as the ultimate electron acceptor in a 
series of reactions that (like oxidative phosphoryla- 
tion) generates a transmembrane proton gradient, 
which is used to synthesize ATP. 

The nitrogen cycle is short-circuited by a recently 
discovered group of bacteria that promote anaerobic 
ammonia oxidation, or anammox (Fig. 22-1), a process 
that converts ammonia and nitrite to N 2 . As much as 
50% to 70% of the NH 3 -to-N 2 conversion in the bio- 
sphere may occur through this pathway, undetected until 
the 1980s. The obligate anaerobes that promote anam- 
mox are fascinating in their own right and are providing 
some useful solutions to waste-treatment problems 
(Box 22-1). 

Now let's examine the process of nitrogen fixation, 
the first step in the nitrogen cycle. 

Nitrogen Is Fixed by Enzymes of the Nitrogenase Complex 

Only certain bacteria and archaea can fix atmospheric 
nitrogen. These include the cyanobacteria of soils and 
fresh and salt waters, methanogenic archaea (strict 
anaerobes that obtain energy and carbon by converting 
H 2 and C0 2 to methane), other kinds of free-living soil 
bacteria such as Azotobacter species, and the nitrogen- 
fixing bacteria that live as symbionts in the root nod- 
ules of leguminous plants. The first important product 
of nitrogen fixation is ammonia, which can be used by 
all organisms either directly or after its conversion to 
other soluble compounds such as nitrites, nitrates, or 
amino acids. 

The reduction of nitrogen to ammonia is an exer- 
gonic reaction: 

N 2 + 3H 2 > 2NH 3 AG'° = -33.5 kJ/mol 

The N=N triple bond, however, is very stable, with a 
bond energy of 930 kJ/mol. Nitrogen fixation therefore 
has an extremely high activation energy, and atmo- 
spheric nitrogen is almost chemically inert under normal 
conditions. Ammonia is produced industrially by the 
Haber process (named for its inventor, Fritz Haber), 




FIGURE 22-1 The nitrogen cycle. The 

total amount of nitrogen fixed annually 
in the biosphere exceeds 10 11 kg. Reac- 
tions with red arrows occur largely or 
entirely in anaerobic environments. The 
redox states of the various nitrogen 
species are depicted at the bottom of the 
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BOX 22-1 



Unusual Lifestyles of the Obscure but Abundant 



Air-breathers that we are, we can easily overlook the 
bacteria and archaea that thrive in anaerobic environ- 
ments. Although rarely featured in introductory bio- 
chemistry textbooks, these organisms constitute much 
of the biomass of this planet, and their contributions to 
the balance of carbon and nitrogen in the biosphere are 
essential to all forms of life. 

As detailed in earlier chapters, the energy used to 
maintain living systems relies on the generation of proton 
gradients across membranes. Electrons derived from a 
reduced substrate are made available to electron carriers 
in membranes and pass through a series of electron trans- 
fers to a final electron acceptor. As a byproduct of this 
process, protons are released on one side of the mem- 
brane, generating the transmembrane proton gradient. 
The proton gradient is used to synthesize ATP or to drive 
other energy-requiring processes. For all eukaryotes, the 
reduced substrate is generally a carbohydrate (glucose or 
pyruvate) and the electron acceptor is oxygen. 

Many bacteria and archaea are much more versatile. 
In anaerobic environments such as marine and freshwa- 
ter sediments, the variety of life strategies is extraordi- 
nary- Almost any available redox pair can be an energy 
source for some specialized organism or group of organ- 
isms. For example, a large number of lithotrophic bacte- 
ria (a lithotroph is a chemotroph that uses inorganic 
energy sources; see Fig. 1-5) have a hydrogenase that 
uses molecular hydrogen to reduce NAD + : 



Ho + NAD + 



NADH + H 4 



The NADH is a source of electrons for a variety of mem- 
brane-bound electron acceptors, generating the proton 
gradient needed for ATP synthesis. Other lithotrophs 
oxidize sulfur compounds (H 2 S, elemental sulfur, or 
thiosulfate) or ferrous iron. A widespread group of ar- 
chaea called methanogens, all strict anaerobes, extract 
energy from the reduction of C0 2 to methane. And this 
is just a small sampling of what anaerobic organisms do 
for a living. Their metabolic pathways are replete with 
interesting reactions and highly specialized cofactors 
unknown in our own world of obligate aerobic metabo- 
lism. Study of these organisms can yield practical divi- 
dends. It can also provide clues about the origins of life 
on an early earth, in an atmosphere that lacked molecu- 
lar oxygen. 

The nitrogen cycle depends on a wide range of spe- 
cialized bacteria. There are two groups of nitrifying bac- 
teria: those that oxidize ammonia to nitrites and those 
that oxidize the resulting nitrites to nitrates (see Fig. 
22-1). Nitrate is second only to 0 2 as a biological elec- 
tron acceptor, and a great many bacteria and archaea 
can catalyze the denitrification of nitrates to nitrogen, 



which the nitrogen-fixing bacteria then convert back 
into ammonia. Ammonia is a major pollutant in sewage 
and in farm animal waste, and is a byproduct of fertilizer 
manufacture and oil refining. Waste-treatment plants 
have generally made use of communities of nitrifying 
and denitrifying bacteria to convert ammonia waste to 
atmospheric nitrogen. The process is expensive, requir- 
ing inputs of organic carbon and oxygen. 

In the 1960s and 1970s, a few articles appeared 
in the research literature suggesting that ammonia 
could be oxidized to nitrogen anaerobically, using ni- 
trite as an electron acceptor; this process was called 
anammox. The reports received little notice until bac- 
teria promoting anammox were discovered in a waste- 
treatment system in Delft, the Netherlands, in the 
mid-1980s. A team of Dutch microbiologists led by Gijs 
Kuenen and Mike Jetten began to study these bacteria, 
which were soon identified as belonging to an unusual 
bacterial phylum, Planctomycetes. Some surprises 
were to follow. 

The biochemistry underlying the anammox process 
was slowly unraveled (Fig. 1). Hydrazine (N 2 H 4 ), a highly 
reactive molecule used as a rocket fuel, was an unex- 
pected intermediate. As a small molecule, hydrazine 
is both highly toxic and difficult to contain. It readily 
diffuses across typical phospholipid membranes. The 

(continued on next page) 
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FIGURE 1 The anammox reactions. Ammonia and ammonium hydrox- 
ide are converted to hydrazine and H 2 0 by hydrazine hydrolase, and 
the hydrazine is oxidized by hydrazine-oxidizing enzyme, generating 
N 2 and protons, The protons generate a proton gradient for ATP 
synthesis. On the anammoxosome exterior, protons are used by the 
nitrite-reducing enzyme, producing ammonium oxide and completing 
the cycle. All of the anammox enzymes are embedded in the anam- 
moxosome membrane. 
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FIGURE 2 (a) Ladderane lipids of the anammoxosome membrane. The 
mechanism for synthesis of the unstable fused cyclobutane ring struc- 
tures is unknown, (b) Ladderanes can stack to form a very dense, im- 

anammox bacteria solve this problem by sequestering hy- 
drazine in a specialized organelle, dubbed the anammox- 
osome. The membrane of this organelle is composed of 
lipids known as ladderanes (Fig. 2), never before en- 
countered in biology. The fused cyclobutane rings of lad- 
deranes stack tightly to form a very dense barrier, greatly 
slowing the release of hydrazine. Cyclobutane rings are 
strained and difficult to synthesize; the bacterial mecha- 
nisms for synthesizing these lipids are not yet known. 

The anammoxosome was a surprising finding. Bac- 
terial cells generally do not have compartments, and the 
lack of a membrane-enclosed nucleus is often cited as 
the primary distinction between eukaryotes and bacte- 
ria. One type of organelle in a bacterium was interesting 
enough, but planctomycetes also have a nucleus: their 
chromosomal DNA is contained within a membrane 
(Fig. 3). Discovery of this subcellular organization has 
prompted further research to trace the origin of the 
planctomycetes and the evolution of eukaryotic nuclei. 
Planctomycetes are an ancient bacterial line with multi- 
ple genera, three of which are known to carry out the 
anammox reactions. Further study of this group may ul- 
timately bring us closer to a key goal of evolutionary bi- 
ology: a description of the organism affectionately 
referred to as LUCA— the Last Universal Common An- 
cestor of all life on our planet. 

For now, the anarnmox bacteria offer a major ad- 
vance in waste treatment, reducing the cost of ammonia 
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permeable, hydrophobic membrane structure, allowing sequestration 
of the hydrazine produced in the anammox reactions. 



removal by as much as 90% (the conventional denitrifi- 
cation steps are eliminated completely, and the aeration 
costs associated with nitrification are lower) and reduc- 
ing the release of polluting byproducts. Clearly, a 
greater familiarity with the bacterial underpinnings of 
the biosphere can pay big dividends as we deal with the 
environmental challenges of the twenty-first century. 
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FIGURE 3 Transmission electron micrograph of a cross section through 
Cemmata obscuriglobus, showing the DNA in a nucleus (N) with 
enclosing nuclear envelope (NE). Bacteria of the Gemmata genus 
(phylum Planctomycetes) do not promote the anammox reactions. 



which requires temperatures of 400 to 500 °C and nitro- 
gen and hydrogen at pressures of tens of thousands of 
kilopascals (several hundred atmospheres) to provide 
the necessary activation energy. Biological nitrogen fix- 
ation, however, must occur at biological temperatures 
and at 0.8 atm of nitrogen, and the high activation bar- 
rier is overcome by other means. This is accomplished, 



at least in part, by the binding and hydrolysis of ATP. 
The overall reaction can be written 

N 2 + 10H + + 8e~ + 16ATP ► 2NHJ + 16ADP + 16Pi + H 2 

Biological nitrogen fixation is carried out by a 
highly conserved complex of proteins called the nitro- 
genase complex (Fig. 22-2), the crucial components 
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4 pyruvate 
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FIGURE 22-3 Enzymes and cofactors of the nitrogenase complex. 

{PDB ID 1 N2C) (a) In this ribbon diagram, the dinitrogenase subunits 
are shown in gray and pink, the dinitrogenase reductase subunits in 
blue and green. The bound ADP is red, Note the 4Fe-4S complex (Fe 
atoms orange, S atoms yellow) and the iron-molybdenum cofactor (Mo 



FIGURE 22-2 Nitrogen fixation by the nitrogenase complex. Electrons 
are transferred from pyruvate to dinitrogenase via ferredoxin (or flavo- 
doxin) and dinitrogenase reductase. Dinitrogenase reductase reduces 
dinitrogenase one electron at a time, with at least six electrons re- 
quired to fix one molecule of N 2 . An additional two electrons are used 
to reduce 2 H + to H 2 in a process that obligatorily accompanies nitro- 
gen fixation in anaerobes, making a total of eight electrons required 
per N 2 molecule. The subunit structures and metal cofactors of the 
dinitrogenase reductase and dinitrogenase proteins are described in 
the text and in Figure 22-3. 



of which are dinitrogenase reductase and dinitro- 
genase (Fig. 22-3). Dinitrogenase reductase (M r 
60,000) is a dimer of two identical subunits. It con- 
tains a single 4Fe-4S redox center (see Fig. 19-5), 
bound between the subunits, and can be oxidized and 
reduced by one electron. It also has two binding sites 
for ATP/ADP (one site on each subunit). Dinitroge- 
nase (M r 240,000), a tetramer with two copies of two 
different subunits, contains both iron and molybde- 
num; its redox centers have a total of 2 Mo, 32 Fe, and 
30 S per tetramer. About half of the iron and sulfur is 
present as two bridged pairs of 4Fe-4S centers called 
P clusters; the remainder is present as part of a novel 
iron-molybdenum cofactor. A form of nitrogenase that 
contains vanadium rather than molybdenum has been 
discovered, and some bacterial species can produce 
both types of nitrogenase system. The vanadium-con- 
taining enzyme may be the primary nitrogen-fixing 
system under some environmental conditions, but it is 




black, homocitrate light gray). The P clusters (bridged pairs of 4Fe-4S 
complexes) are also shown, (b) The dinitrogenase complex cofactors 
without the protein (colors as in (a)), (c) The iron-molybdenum cofac- 
tor contains 1 Mo (black), 7 Fe (orange), 9 S (yellow), and one mole- 
cule of homocitrate (gray). 
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not yet as well characterized as the molybdenum- 
dependent enzyme. 

Nitrogen fixation is carried out by a highly reduced 
form of dinitrogenase and requires eight electrons: six 
for the reduction of N 2 and two to produce one molecule 
of H 2 as an obligate part of the reaction mechanism. 
Dinitrogenase is reduced by the transfer of electrons 
from dinitrogenase reductase (Fig. 22-2). The dinitro- 
genase tetramer has two binding sites for the reductase. 
The required eight electrons are transferred from re- 
ductase to dinitrogenase one at a time: a reduced reduc- 
tase molecule binds to the dinitrogenase and transfers a 
single electron, then the oxidized reductase dissociates 
from dinitrogenase, in a repeating cycle. Each turn of 
the cycle requires the hydrolysis of two ATP molecules 
by the dimeric reductase. The immediate source of elec- 
trons to reduce dinitrogenase reductase varies, with re- 
duced ferredoxin (p. 753; see also Fig. 19-5), reduced 
flavodoxin, and perhaps other sources playing a role. In 
at least one species, the ultimate source of electrons to 
reduce ferredoxin is pyruvate (Fig. 22-2). 

The role of ATP in this process is somewhat un- 
usual. As you will recall, ATP can contribute not only 
chemical energy, through the hydrolysis of one or more 
of its phosphoanhydride bonds, but also binding en- 
ergy (pp. 189, 297), through noncovalent interactions 
that lower the activation energy. In the reaction carried 
out by dinitrogenase reductase, both ATP binding and 
ATP hydrolysis bring about protein conformational 
changes that help overcome the high activation energy 
of nitrogen fixation. The binding of two ATP molecules 
to the reductase shifts the reduction potential (£"°) of 
this protein from -300 to -420 mV, an enhancement 
of its reducing power that is required to transfer elec- 
trons to dinitrogenase. The ATP molecules are then 
hydrolyzed just before the actual transfer of one electron 
to dinitrogenase. 



Another important characteristic of the nitroge- 
nase complex is its extreme lability in the presence of 
oxygen. The reductase is inactivated in air, with a half- 
life of 30 seconds; dinitrogenase has a half -life of 10 
minutes in air. Free-living bacteria that fix nitrogen 
cope with this problem in a variety of ways. Some live 
only anaerobically or repress nitrogenase synthesis 
when oxygen is present. Some aerobic species, such as 
Azotobacter vinelandii, partially uncouple electron 
transfer from ATP synthesis so that oxygen is burned 
off as rapidly as it enters the cell (see Box 19-1). When 
fixing nitrogen, cultures of these bacteria actually in- 
crease in temperature as a result of their efforts to rid 
themselves of oxygen. 

The symbiotic relationship between leguminous 
plants and the nitrogen-fixing bacteria in their root 
nodules (Fig. 22-4) takes care of both the energy re- 
quirements and the oxygen lability of the nitrogenase 
complex. The energy required for nitrogen fixation was 
probably the evolutionary driving force for this plant- 
bacteria association. The bacteria in root nodules have 
access to a large reservoir of energy in the form of 
abundant carbohydrate and citric acid cycle intermedi- 
ates made available by the plant. This may allow the 
bacteria to fix hundreds of times more nitrogen than 
do their free-living cousins under conditions generally 
encountered in soils. To solve the oxygen-toxicity 
problem, the bacteria in root nodules are bathed in a 
solution of the oxygen-binding heme protein leghemo- 
globin, produced by the plant (although the heme 
may be contributed by the bacteria). Leghemoglobin 
binds all available oxygen so that it cannot interfere 
with nitrogen fixation, and efficiently delivers the oxy- 
gen to the bacterial electron-transfer system. The ben- 
efit to the plant, of course, is a ready supply of reduced 
nitrogen. The efficiency of the symbiosis between 
plants and bacteria is evident in the enrichment of soil 



FIGURE 22-4 Nitrogen-fixing nodules, (a) Root nod- 
ules of bird's-foot trefoil, a legume. The flower of this 
common plant is shown in the inset, (b) Artificially 
colorized electron micrograph of a thin section 
through a pea root nodule. Symbiotic nitrogen-fixing 
bacteria, or bacteroids (red), live inside the nodule 
cell, surrounded by the peribacteroid membrane 
(blue). Bacteroids produce the nitrogenase complex 
that converts atmospheric nitrogen (N 2 ) to ammo- 
nium (NHj); without the bacteroids, the plant is un- 
able to utilize N 2 . The infected root cell provides 
some factors essential for nitrogen fixation, including 
leghemoglobin; this heme protein has a very high 
binding affinity for oxygen, which strongly inhibits 
nitrogenase. (The cell nucleus is shown in 
yellow/green. Not visible in this micrograph are 
other organelles of the infected root cell that are nor- 
mally found in plant cells.) 
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nitrogen brought about by leguminous plants. This en- 
richment is the basis of crop rotation methods, in 
which plantings of nonleguminous plants (such as 
maize) that extract fixed nitrogen from the soil are al- 
ternated every few years with plantings of legumes 
such as alfalfa, peas, or clover. 

Nitrogen fixation is the subject of intense study, 
because of its immense practical importance. Indus- 
trial production of ammonia for use in fertilizers re- 
quires a large and expensive input of energy, and this 
has spurred a drive to develop recombinant or trans- 
genic organisms that can fix nitrogen. Recombinant 
DNA techniques (Chapter 9) are being used to trans- 
fer the DNA that encodes the enzymes of nitrogen fix- 
ation into non-nitrogen-fixing bacteria and plants. 
Success in these efforts will depend on overcoming 
the problem of oxygen toxicity in any cell that pro- 
duces nitrogenase. 

Ammonia Is Incorporated into Biomolecuies 
through Glutamate and Glutamine 

Reduced nitrogen in the form of NH4 is assimilated into 
amino acids and then into other nitrogen-containing bio- 
molecules. Two amino acids, glutamate and gluta- 
mine, provide the critical entry point. Recall that these 
same two amino acids play central roles in the catabo- 
lism of ammonia and amino groups in amino acid oxida- 
tion (Chapter 18). Glutamate is the source of amino 
groups for most other amino acids, through transamina- 
tion reactions (the reverse of the reaction shown in 
Fig. 18-4). The amide nitrogen of glutamine is a source 
of amino groups in a wide range of biosynthetic 
processes. In most types of cells, and in extracellular flu- 
ids in higher organisms, one or both of these amino acids 
are present at higher concentrations — sometimes an or- 
der of magnitude or more higher — than other amino 
acids. An Escherichia coli cell requires so much gluta- 
mate that this amino acid is one of the primary solutes in 
the cytosol. Its concentration is regulated not only in re- 
sponse to the cell's nitrogen requirements but also to 
maintain an osmotic balance between the cytosol and 
the external medium. 

The biosynthetic pathways to glutamate and gluta- 
mine are simple, and all or some of the steps occur in 
most organisms. The most important pathway for the as- 
similation of NH4 into glutamate requires two reactions. 
First, glutamine synthetase catalyzes the reaction of 
glutamate and NH4 to yield glutamine. This reaction 
takes place in two steps, with enzyme-bound y-glutamyl 
phosphate as an intermediate (see Fig. 18-8): 

(1) Glutamate + ATP > y-glutamyl phosphate + ADP 

(2) y-Glutamyl phosphate + NHj > glutamine + P t + H + 

Sum: Glutamate + NHJ+ ATP > 

glutamine + ADP + P i + H + (22-1) 



Glutamine synthetase is found in all organisms. In addi- 
tion to its importance for NHj assimilation in bacteria, it 
has a central role in amino acid metabolism in mammals, 
converting free NH4 , which is toxic, to glutamine for 
transport in the blood (Chapter 18). 

In bacteria and plants, glutamate is produced from 
glutamine in a reaction catalyzed by glutamate syn- 
thase. a-Ketoglutarate, an intermediate of the citric 
acid cycle, undergoes reductive amination with gluta- 
mine as nitrogen donor: 

a-Ketoglutarate + glutamine + NADPH + H + > 

2 glutamate + NADP + (22-2) 

The net reaction of glutamine synthetase and glutamate 
synthase (Eqns 22-1 and 22-2) is 

a-Ketoglutarate + NH4 + NADPH + ATP ► 

L-glutamate + NADP + + ADP + ^ 

Glutamate synthase is not present in animals, which in- 
stead maintain high levels of glutamate by processes 
such as the transamination of a-ketoglutarate during 
amino acid catabolism. 

Glutamate can also be formed in yet another, albeit 
minor, pathway: the reaction of a-ketoglutarate and 
NH4 to form glutamate in one step. This is catalyzed by 
L-glutamate dehydrogenase, an enzyme present in all 
organisms. Reducing power is furnished by NADPH: 

a-Ketoglutarate + NH4 + NADPH ► 

L-glutamate + NADP + + H 2 0 

We encountered this reaction in the catabolism of 
amino acids (see Fig. 18-7). In eukaryotic cells, L-gluta- 
mate dehydrogenase is located in the mitochondrial 
matrix. The reaction equilibrium favors the reactants, 
and the K m for NH4 (~ 1 ium) is so high that the reaction 
probably makes only a modest contribution to NH4 as- 
similation into amino acids and other metabolites. (Re- 
call that the glutamate dehydrogenase reaction, in 
reverse (see Fig. 18-10), is one source of NH4 destined 
for the urea cycle.) Concentrations of NH4 high enough 
for the glutamate dehydrogenase reaction to make a 
significant contribution to glutamate levels generally 
occur only when NH 3 is added to the soil or when or- 
ganisms are grown in a laboratory in the presence of 
high NH 3 concentrations. In general, soil bacteria and 
plants rely on the two-enzyme pathway outlined above 
(Eqns 22-1,22-2). 

Glutamine Synthetase Is a Primary Regulatory 
Point in Nitrogen Metabolism 

The activity of glutamine synthetase is regulated in vir- 
tually all organisms — not surprising, given its central 
metabolic role as an entry point for reduced nitrogen. In 
enteric bacteria such as E. coli, the regulation is unusu- 
ally complex. The enzyme has 12 identical subunits of 
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M r 50,000 (Fig. 22-5) and is regulated both allosteri- 
cally and by covalent modification. Alanine, glycine, and 
at least six end products of glutamine metabolism are 
allosteric inhibitors of the enzyme (Fig. 22-6). Each 
inhibitor alone produces only partial inhibition, but the 
effects of multiple inhibitors are more than additive, 
and all eight together virtually shut down the enzyme. 
This control mechanism provides a constant adjust- 
ment of glutamine levels to match immediate metabolic 
requirements. 

Superimposed on the allosteric regulation is inhibi- 
tion by adenylylation of (addition of AMP to) Tyr 397 , lo- 
cated near the enzyme's active site (Fig. 22-7). This 
covalent modification increases sensitivity to the 
allosteric inhibitors, and activity decreases as more 



Glutamate 





FIGURE 22-5 Subunit structure of glutamine synthetase as deter- 
mined by x-ray diffraction. (PDB ID 2GLS) (a) Side view. The 12 sub- 
units are identical; they are differently colored to illustrate packing and 
placement, (b) Top view, showing active sites (green). 
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FIGURE 22-6 Allosteric regulation of glutamine synthetase. The en- 
zyme undergoes cumulative regulation by six end products of gluta- 
mine metabolism. Alanine and glycine probably serve as indicators of 
the general status of amino acid metabolism in the cell. 

subunits are adenylylated. Both adenylylation and 
deadenylylation are promoted by adenylyltrans- 
ferase (AT in Fig. 22-7), part of a complex enzymatic 
cascade that responds to levels of glutamine, a-ketoglu- 
tarate, ATP, and P it The activity of adenylyltransferase 
is modulated by binding to a regulatory protein called 
P n , and the activity of P n , in turn, is regulated by cova- 
lent modification (uridylylation) , again at a Tyr 
residue. The adenylyltransferase complex with uridy- 
lylated P n (Pn-UMP) stimulates deadenylylation, 
whereas the same complex with deuridylylated P n 
stimulates adenylylation of glutamine synthetase. Both 
uridylylation and deuridylylation of P n are brought 
about by a single enzyme, uridylyltransf erase. 
Uridylylation is inhibited by binding of glutamine and 
Pi to uridylyltransferase and is stimulated by binding of 
a-ketoglutarate and ATP to P n . 

The regulation does not stop there. The uridylylated 
P n also mediates the activation of transcription of the 
gene encoding glutamine synthetase, thus increasing 
the cellular concentration of the enzyme; the deuridyly- 
lated P n brings about a decrease in transcription of the 
same gene. This mechanism involves an interaction of 
P n with additional proteins involved in gene regulation, 
of a type described in Chapter 28. The net result of this 
elaborate system of controls is a decrease in glutamine 
synthetase activity when glutamine levels are high, and 
an increase in activity when glutamine levels are low and 
a-ketoglutarate and ATP (substrates for the synthetase 
reaction) are available. The multiple layers of regulation 
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FIGURE 22-7 Second level of regulation of glutamine synthetase: 
covalent modifications, (a) An adenylylated Tyr residue, (b) Cascade 
leading to adenylylation (inactivation) of glutamine synthetase. AT 
represents adenyly transferase; UT, uridyly transferase . Details of this 
cascade are discussed in the text. 




Glutamine 



permit a sensitive response in which glutamine synthe- 
sis is tailored to cellular needs. 

Several Classes of Reactions Play Special Roles in the 
Biosynthesis of Amino Acids and Nucleotides 

The pathways described in this chapter include a variety 
of interesting chemical rearrangements. Several of 
these recur and deserve special note before we progress 
to the pathways themselves. These are (1) transamina- 
tion reactions and other rearrangements promoted by 
enzymes containing pyridoxal phosphate; (2) transfer 
of one-carbon groups, with either tetrahydrofolate 
(usually at the — CHO and — CH 2 OH oxidation levels) 
or S-adenosylmethionine (at the — CH 3 oxidation level) 
as cofactor; and (3) transfer of amino groups derived 
from the amide nitrogen of glutamine. Pyridoxal 
phosphate (PLP), tetrahydrofolate (H 4 folate), and 
S-adenosylmethionine (adoMet) are described in some 
detail in Chapter 18 (see Figs 18-6, 18-17, and 18-18). 
Here we focus on amino group transfer involving the 
amide nitrogen of glutamine. 



More than a dozen known biosynthetic reactions 
use glutamine as the major physiological source of 
amino groups, and most of these occur in the pathways 
outlined in this chapter. As a class, the enzymes catalyz- 
ing these reactions are called glutamine amidotrans- 
ferases. All have two structural domains: one binding 
glutamine, the other binding the second substrate, 
which serves as amino group acceptor (Fig. 22-8). A 
conserved Cys residue in the glutamine-binding domain 
is believed to act as a nucleophile, cleaving the amide bond 
of glutamine and forming a covalent glutamyl-enzyme 
intermediate. The NH 3 produced in this reaction is not 
released, but instead is transferred through an "ammo- 
nia channel" to a second active site, where it reacts with 
the second substrate to form the aminated product. The 
covalent intermediate is hydrolyzed to the free enzyme 
and glutamate. If the second substrate must be acti- 
vated, the usual method is the use of ATP to generate an 
acyl phosphate intermediate (R — OX in Fig. 22-8, with 
X as a phosphoryl group). The enzyme glutaminase acts 
in a similar fashion but uses H 2 0 as the second sub- 
strate, yielding NHj and glutamate (see Fig. 18-8). 
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MECHANISM FIGURE 22-8 Proposed mechanism for glutamine amido- 
transferases. Each enzyme has two domains. The glutamine-binding 
domain contains structural elements conserved among many of these 
enzymes, including a Cys residue required for activity. The NH 3 - 
acceptor (second-substrate) domain varies. Two types of amino acceptors 
are shown. X represents an activating group, typically a phosphoryl 
group derived from ATP, that facilitates displacement of a hydroxyl 
group from R — OH by NH 3 . 



SUMMARY 22.1 



Overview of Nitrogen 
Metabolism 



The molecular nitrogen that makes up 80% of the 
earth's atmosphere is unavailable to most living 
organisms until it is reduced. This fixation of 
atmospheric N 2 takes place in certain free-living 



bacteria and in symbiotic bacteria in the root 
nodules of leguminous plants. 

■ The nitrogen cycle entails formation of ammonia 
by bacterial fixation of N 2 , nitrification of ammonia 
to nitrate by soil organisms, conversion of nitrate to 
ammonia by higher plants, synthesis of amino acids 
from ammonia by all organisms, and conversion of 
nitrate to N 2 by denitrifying soil bacteria. The 
anammox bacteria anaerobically oxidize ammonia 
to nitrogen, using nitrite as an electron acceptor. 

■ Fixation of N 2 as NH 3 is carried out by the 
nitrogenase complex, in a reaction that requires 
ATP. The nitrogenase complex is highly labile in 
the presence of 0 2 . 

■ In living systems, reduced nitrogen is incorporated 
first into amino acids and then into a variety of 
other biomolecules, including nucleotides. The key 
entry point is the amino acid glutamate. Glutamate 
and glutamine are the nitrogen donors in a wide 
range of biosynthetic reactions. Glutamine 
synthetase, which catalyzes the formation of 
glutamine from glutamate, is a main regulatory 
enzyme of nitrogen metabolism. 

■ The amino acid and nucleotide biosynthetic 
pathways make repeated use of the biological 
cofactors pyridoxal phosphate, tetrahydrofolate, 
and iS-adenosylmethionine. Pyridoxal phosphate is 
required for transamination reactions involving 
glutamate and for other amino acid transformations. 
One-carbon transfers require 5-adenosylmethionine 
and tetrahydrofolate. Glutamine amidotransferases 
catalyze reactions that incorporate nitrogen 
derived from glutamine. 

22,2 Biosynthesis of Amino Acids 

All amino acids are derived from intermediates in gly- 
colysis, the citric acid cycle, or the pentose phosphate 
pathway (Fig. 22-9). Nitrogen enters these pathways 
by way of glutamate and glutamine. Some pathways are 
simple, others are not. Ten of the amino acids are just 
one or several steps removed from the common metabo- 
lite from which they are derived. The biosynthetic path- 
ways for others, such as the aromatic amino acids, are 
more complex. 

Organisms vary greatly in their ability to synthesize 
the 20 common amino acids. Whereas most bacteria and 
plants can synthesize all 20, mammals can synthesize only 
about half of them — generally those with simple pathways. 
These are the nonessential amino acids, not needed in 
the diet (see Table 18-1). The remainder, the essential 
amino acids, must be obtained from food. Unless other- 
wise indicated, the pathways for the 20 common amino 
acids presented below are those operative in bacteria. 

A useful way to organize these biosynthetic path- 
ways is to group them into six families corresponding 
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FIGURE 22-9 Overview of amino acid biosynthesis. The carbon skele- 
ton precursors derive from three sources: glycolysis (pink), the citric 
acid cycle (blue), and the pentose phosphate pathway (purple). 



to their metabolic precursors (Table 22-1), and we 
use this approach to structure the detailed descriptions 
that follow. In addition to these six precursors, there is 
a notable intermediate in several pathways of amino 
acid and nucleotide synthesis: 5-phosphoribosyl- 
1 -pyrophosphate (PRPP): 




TABLE 22-1 



Amino Acid Biosynthetic Families, 
Grouped by Metabolic Precursor 



a-Ketoglutarate 

Glutamate 
Glutamine 
Proline 
Arginine 

3-Phosphoglycerate 

Serine 

Glycine 

Cysteine 

Oxaloacetate 

Aspartate 

Asparagine 

Methionine* 

Threonine* 

Lysine* 



Pyruvate 

Alanine 
Valine* 
Leucine* 
Isoleucine* 

Phosphoenolpyruvate 
and erythrose 
4-phosphate 

Tryptophan* 

Phenylalanine* 

Tyrosine^ 

Ribose 5-phosphate 

Histidine* 



* Essential amino acids. 

derived from phenylalanine in mammals. 



PRPP is synthesized from ribose 5-phosphate derived 
from the pentose phosphate pathway (see Fig. 14-21), 
in a reaction catalyzed by ribose phosphate py- 
rophosphokinase : 

Ribose 5-phosphate + ATP > 

5-phosphoribosyl-l-pyrophosphate + AMP 

This enzyme is allosterically regulated by many of the 
biomolecules for which PRPP is a precursor. 

a-Ketoglutarate Gives Rise to Glutamate, 
Glutamine, Proline, and Arginine 



flf-Ketoglutarate 

j 



Glutamate 



Glutamine 



Proline 



\ 



OH OH 



We have already described the biosynthesis of gluta- 
mate and glutamine. Proline is a cyclized derivative 
of glutamate (Fig. 22-10). In the first step of proline 
synthesis, ATP reacts with the y-carboxyl group of glu- 
tamate to form an acyl phosphate, which is reduced by 
NADPH or NADH to glutamate y-semialdehyde. This 
intermediate undergoes rapid spontaneous cyclization 
and is then reduced further to yield proline. 

Arginine is synthesized from glutamate via or- 
nithine and the urea cycle in animals (Chapter 18). In 
principle, ornithine could also be synthesized from glu- 
tamate y-semialdehyde by transamination, but the 
spontaneous cyclization of the semialdehyde in the 
proline pathway precludes a sufficient supply of this 
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FIGURE 22-10 Biosynthesis of proline and arginine from glutamate in 
bacteria. All five carbon atoms of proline arise from glutamate. In 
many organisms, glutamate dehydrogenase is unusual in that it uses ei- 
ther NADH or NADPH as a cofactor. The same may be true of other en- 
zymes in these pathways. The y-semialdehyde in the proline pathway 
undergoes a rapid, reversible cyclization to A^pyrroline-S-carboxylate 
(P5C), with the equilibrium favoring P5C formation. Cyclization is 
averted in the ornithine/arginine pathway by acetylation of the a- 
amino group of glutamate in the first step and removal of the acetyl 
group after the transamination. Although some bacteria lack arginase 
and thus the complete urea cycle, they can synthesize arginine from 
ornithine in steps that parallel the mammalian urea cycle, with cit- 
rulline and argininosuccinate as intermediates (see Fig. 18-10). 

Here, and in subsequent figures in this chapter, the reaction arrows 
indicate the linear path to the final products, without considering the re- 
versibility of individual steps. For example, the step of the pathway lead- 
ing to arginine that is catalyzed by N-acetylglutamate dehydrogenase is 
chemically similar to the glyceraldehyde 3-phosphate dehydrogenase 
reaction in glycolysis (see Fig. 14-7), and is readily reversible. 
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FIGURE 22-11 Ornithines-aminotransferase 
reaction: a step in the mammalian pathway 
to proline. This enzyme is found in the mito- 
chondrial matrix of most tissues. Although the 
equilibrium favors P5C formation, the reverse 
reaction is the only mammalian pathway for 
synthesis of ornithine (and thus arginine) 
when arginine levels are insufficient for pro- 
tein synthesis. 



intermediate for ornithine synthesis. Bacteria have a de 
novo biosynthetic pathway for ornithine (and thus argi- 
nine) that parallels some steps of the proline pathway 
but includes two additional steps that avoid the prob- 
lem of the spontaneous cyclization of glutamate y-semi- 
aldehyde (Fig. 22-10). In the first step, the a-amino 
group of glutamate is blocked by an acetylation requir- 
ing acetyl-CoA; then, after the transamination step, the 
acetyl group is removed to yield ornithine. 

The pathways to proline and arginine are some- 
what different in mammals. Proline can be synthesized 
by the pathway shown in Figure 22-10, but it is also 
formed from arginine obtained from dietary or tissue 
protein. Arginase, a urea cycle enzyme, converts argi- 
nine to ornithine and urea (see Figs 18-10, 18-26). 
The ornithine is converted to glutamate y-semialde- 
hyde by the enzyme ornithine ^-aminotransferase 
(Fig. 22-11). The semialdehyde cyclizes to A 1 -pyrro- 
line-5-carboxylate, which is then converted to proline 
(Fig. 22-10). The pathway for arginine synthesis 
shown in Figure 22-10 is absent in mammals. When 
arginine from dietary intake or protein turnover is in- 
sufficient for protein synthesis, the ornithine S-amino- 
transferase reaction operates in the direction of 
ornithine formation. Ornithine is then converted to cit- 
rulline and arginine in the urea cycle. 

Serine, Glycine, and Cysteine Are Derived 
from 3-Phosphoglycerate 

3 -Phosphogly cerate 



Serine 



Glycine 



Cysteine 



The major pathway for the formation of serine is the 
same in all organisms (Fig. 22-12). In the first step, the 
hydroxyl group of 3-phosphoglycerate is oxidized by a 
dehydrogenase (using NAD + ) to yield 3-phosphohy- 
droxypyruvate. Transamination from glutamate yields 
3-phosphoserine, which is hydrolyzed to free serine by 
phosphoserine phosphatase. 
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FIGURE 22-12 Biosynthesis of serine from 3-phosphoglycerate and of 
glycine from serine in all organisms. Glycine is also made from C0 2 
and NHj by the action of glycine synthase, with /V 5 ,/V 10 -methylenete- 
trahydrofolate as methyl group donor (see text). 
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Serine (three carbons) is the precursor of glycine 
(two carbons) through removal of a carbon atom by 
serine hydroxymethyltransferase (Fig. 22-12). 
Tetrahydrofolate accepts the /3 carbon (C-3) of serine, 
which forms a methylene bridge between N-5 and N-10 
to yield A^A/^-methylenetetrahydrofolate (see Fig. 
18-17). The overall reaction, which is reversible, also 
requires pyridoxal phosphate. In the liver of verte- 
brates, glycine can be made by another route: the re- 
verse of the reaction shown in Figure 18-20c, 
catalyzed by glycine synthase (also called glycine 
cleavage enzyme): 

C0 2 + NH 4 + + A/ r5 ,A/ rl0 -methylenetetrahydrofolate + 
NADH + H + ► 

glycine + tetrahydrofolate + NAD + 

Plants and bacteria produce the reduced sulfur re- 
quired for the synthesis of cysteine (and methionine, 



described later) from environmental sulfates; the path- 
way is shown on the right side of Figure 22-13. Sulfate 
is activated in two steps to produce 3'-phosphoadenosine 
5'-phosphosulfate (PAPS), which undergoes an eight- 
electron reduction to sulfide. The sulfide is then used in 
the formation of cysteine from serine in a two-step path- 
way. Mammals synthesize cysteine from two amino acids: 
methionine furnishes the sulfur atom, and serine fur- 
nishes the carbon skeleton. Methionine is first converted 
to S-adenosylmethionine (see Fig. 18-18), which can 
lose its methyl group to any of a number of acceptors to 
form 5-adenosylhomocysteine (adoHcy). This demethy- 
lated product is hydrolyzed to free homocysteine, which 
undergoes a reaction with serine, catalyzed by cystathio- 
nine j8-synthase, to yield cystathionine (Fig. 22-14). 
Finally, cystathionine y-lyase, a PLP-requiring en- 
zyme, catalyzes removal of ammonia and cleavage of 
cystathionine to yield free cysteine. 
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FIGURE 22-13 Biosynthesis of cysteine from serine in bacteria and plants. The origin of reduced sulfur is 
shown in the pathway on the right. 
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FIGURE 22-14 Biosynthesis of cysteine from homocysteine and serine 
in mammals. The homocysteine is formed from methionine, as de- 
scribed in the text. 



Three Nonessential and Six Essential Amino Acids 
Are Synthesized from Oxaloacetate and Pyruvate 

Oxaloacetate 




Asparagine Methionine Lysine Threonine 




Pyruvate 



Alanine and aspartate are synthesized from pyruvate 
and oxaloacetate, respectively, by transamination from 
glutamate. Asparagine is synthesized by amidation of 
aspartate, with glutamine donating the NH4 . These are 
nonessential amino acids, and their simple biosynthetic 
pathways occur in all organisms. 

For reasons incompletely understood, the malig- 
nant lymphocytes present in childhood acute lym- 
phoblastic leukemia (ALL) require serum asparagine for 
growth. The chemotherapy for ALL is administered 



together with an L-asparaginase derived from bacteria, 
with the enzyme functioning to reduce serum asparagine. 
The combined treatment results in a greater than 95% re- 
mission rate in cases of childhood ALL (L-asparaginase 
treatment alone produces remission in 40% to 60% of 
cases). However, the asparaginase treatment has some 
deleterious side effects, and about 10% of patients who 
achieve remission eventually suffer relapse, with tumors 
resistant to drug therapy. Researchers are now develop- 
ing inhibitors of human asparagine synthetase to aug- 
ment these therapies for childhood ALL. ■ 

Methionine, threonine, lysine, isoleucine, valine, 
and leucine are essential amino acids. Their biosynthetic 
pathways are complex and interconnected (Fig. 22-15). 
In some cases, the pathways in bacteria, fungi, and 
plants differ significantly. Figure 22-15 shows the bacte- 
rial pathways. 

Aspartate gives rise to methionine, threonine, 
and lysine. Branch points occur at aspartate /3- 
semialdehyde, an intermediate in all three pathways, 
and at homoserine, a precursor of threonine and me- 
thionine. Threonine, in turn, is one of the precursors 
of isoleucine. The valine and isoleucine pathways 
share four enzymes (Fig. 22-15, steps @ to (21)). 
Pyruvate gives rise to valine and isoleucine in path- 
ways that begin with condensation of two carbons of 
pyruvate (in the form of hydroxyethyl thiamine py- 
rophosphate; see Fig. 14-14) with another molecule of 
pyruvate (the valine path) or with a-ketobutyrate (the 
isoleucine path). The a-ketobutyrate is derived from 
threonine in a reaction that requires pyridoxal phos- 
phate (Fig. 22-15, step @). An intermediate in the 
valine pathway, a-ketoisovalerate, is the starting point 
for a four-step branch pathway leading to leucine 
(steps (£2) to @). 

Chorismate Is a Key Intermediate in the Synthesis 
of Tryptophan, Phenylalanine, and Tyrosine 

Ph oaphoenolpyru vate 



Erytkrose 4- phosphate 
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Phenylalanine | Tyrosine] [ Tryptophan | 



1 
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Aromatic rings are not readily available in the environ- 
ment, even though the benzene ring is very stable. The 
branched pathway to tryptophan, phenylalanine, and ty- 
rosine, occurring in bacteria, fungi, and plants, is the 
main biological route of aromatic ring formation. It pro- 
ceeds through ring closure of an aliphatic precursor fol- 
lowed by stepwise addition of double bonds. The first 
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FIGURE 22-15 Biosynthesis of six essential amino acids from oxalo- 
acetate and pyruvate in bacteria: methionine, threonine, lysine, 
isoleucine, valine, and leucine. Here, and in other multistep pathways, 
the enzymes are listed in the key. Note that L,L-a,€-diaminopimelate, 
the product of step @, is symmetric. The carbons derived from pyru- 
vate (and the amino group derived from glutamate) are not traced be- 
yond this point, because subsequent reactions may place them at 
either end of the lysine molecule. 
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FIGURE 22-16 Biosynthesis of chorismate, an intermediate in the 
synthesis of aromatic amino acids in bacteria and plants. All car- 
bons are derived from either erythrose 4-phosphate (light purple) 
or phosphoenolpyruvate (pink). Note that the NAD 4 " required as 
a cofactor in step@ is released unchanged; it may be transiently 
reduced to NADH during the reaction, with formation of an ox- 
idized reaction intermediate. Step® is competitively inhibited 
by glyphosate ("COO— CH 2 — NH— CH 2 — POf " ), the active 
ingredient in the widely used herbicide Roundup. The herbicide 
is relatively nontoxic to mammals, which lack this biosynthetic 
pathway. The chemical names quinate, shikimate, and choris- 
mate are derived from the names of plants in which these inter- 
mediates have been found to accumulate. 



four steps produce shikimate, a seven-carbon molecule 
derived from erythrose 4-phosphate and phospho- 
enolpyruvate (Fig. 22-16). Shikimate is converted to 
chorismate in three steps that include the addition of 
three more carbons from another molecule of phospho- 
enolpyruvate. Chorismate is the first branch point of the 
pathway, with one branch leading to tryptophan, the 
other to phenylalanine and tyrosine. 

In the tryptophan branch (Fig. 22-17), choris- 
mate is converted to anthranilate in a reaction in which 
glutamine donates the nitrogen that will become part of 
the indole ring. Anthranilate then condenses with PRPP. 



The indole ring of tryptophan is derived from the ring 
carbons and amino group of anthranilate plus two car- 
bons derived from PRPP. The final reaction in the se- 
quence is catalyzed by tryptophan synthase. This 
enzyme has an a 2 /3 2 subunit structure and can be disso- 
ciated into two a subunits and a /3 2 unit that catalyze dif- 
ferent parts of the overall reaction: 

Indole-3-glycerol phosphate > 

a subunit 

indole + glyceraldehyde 3-phosphate 

Indole + serine > tryptophan + H 2 0 

02 subunit 
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FIGURE 22-17 Biosynthesis of tryptophan from chorismate in bacte- 
ria and plants. In E. coii, enzymes catalyzing steps (T) and @ are 
subunits of a single complex. 



The second part of the reaction requires pyridoxal phos- 
phate (Fig. 22-18). Indole formed in the first part is 
not released by the enzyme, but instead moves through 
a channel from the a-subunit active site to one of the )8- 
subunit active sites, where it condenses with a Schiff 
base intermediate derived from serine and PLP. Inter- 
mediate channeling of this type may be a feature of the 
entire pathway from chorismate to tryptophan. Enzyme 
active sites catalyzing different steps (sometimes not 
sequential steps) of the pathway to tryptophan are 
found on single polypeptides in some species of fungi 
and bacteria, but are separate proteins in other species. 
In addition, the activity of some of these enzymes re- 
quires a noncovalent association with other enzymes of 
the pathway. These observations suggest that all the 
pathway enzymes are components of a large, multien- 
zyme complex in both bacteria and eukaryotes. Such 
complexes are generally not preserved intact when the 
enzymes are isolated using traditional biochemical 
methods, but evidence for the existence of multienzyme 
complexes is accumulating for this and other metabolic 
pathways (p. 619). 

In plants and bacteria, phenylalanine and tyro- 
sine are synthesized from chorismate in pathways much 
less complex than the tryptophan pathway The com- 
mon intermediate is prephenate (Fig. 22-19). The final 
step in both cases is transamination with glutamate. 

Animals can produce tyrosine directly from pheny- 
lalanine through hydroxylation at C-4 of the phenyl 
group by phenylalanine hydroxylase; this enzyme 
also participates in the degradation of phenylalanine 
(see Figs 18-23, 18-24). Tyrosine is considered a condi- 
tionally essential amino acid, or as nonessential insofar 
as it can be synthesized from the essential amino acid 
phenylalanine. 

Histidine Biosynthesis Uses Precursors 
of Purine Biosynthesis 

Ribose 5-phosphate 



Histidine 



The pathway to histidine in all plants and bacteria dif- 
fers in several respects from other amino acid biosyn- 
thetic pathways. Histidine is derived from three 
precursors (Fig. 22-20): PRPP contributes five carbons, 
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FIGURE 22-19 Biosynthesis of phenylalanine and tyrosine from cho- 
rismate in bacteria and plants. Conversion of chorismate to prephen- 
ate is a rare biological example of a Claisen rearrangement. 
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MECHANISM FIGURE 22-18 Tryptophan synthase reaction. This en- 
zyme catalyzes a multistep reaction with several types of chemical re- 
arrangements. The PLP-facilitated transformations occur at the j3 carbon 



(C-3) of the amino acid, as opposed to the a-carbon reactions de- 
scribed in Figure 1 8-6. The jS carbon of serine is attached to the indole 
ring system. £ Tryptophan Synthase Mechanism 
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FIGURE 22-20 Biosynthesis of histidine in bacteria and plants. Atoms 
derived from PRPP and ATP are shaded pink and blue, respectively. Two 
of the histidine nitrogens are derived from glutamine and glutamate 

the purine ring of ATP contributes a nitrogen and a car- 
bon, and glutamine supplies the second ring nitrogen. 
The key steps are condensation of ATP and PRPP, in 
which N-l of the purine ring is linked to the activated C-l 
of the ribose of PRPP (step © in Fig. 22-20); purine ring 
opening that ultimately leaves N-l and C-2 of adenine 
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(green). Note that the derivative of ATP remaining after step © (AICAR) 
is an intermediate in purine biosynthesis (see Fig. 22-33, step @), so 
ATP is rapidly regenerated. 



linked to the ribose (step (3)); and formation of the imi- 
dazole ring, a reaction in which glutamine donates a ni- 
trogen (step (5)). The use of ATP as a metabolite rather 
than a high-energy cofactor is unusual — but not wasteful, 
because it dovetails with the purine biosynthetic 
pathway. The remnant of ATP that is released after the 
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transfer of N-l and C-2 is 5-aminoimidazole-4-carbox- 
amide ribonucleotide (AICAR), an intermediate of 
purine biosynthesis (see Fig. 22-33) that is rapidly re- 
cycled to ATP. 

Amino Acid Biosynthesis Is under Allosteric Regulation 

As detailed in Chapter 15, the control of flux through a 
metabolic pathway often reflects the activity of multiple 
enzymes in that pathway. In the case of amino acid syn- 
thesis, regulation takes place in part through feedback 
inhibition of the first reaction by the end product of 
the pathway. This first reaction is often catalyzed by an 
allosteric enzyme that plays an important role in the 
overall control of flux through that pathway. As an ex- 
ample, Figure 22-21 shows the allosteric regulation of 
isoleucine synthesis from threonine (detailed in Fig. 
22-15). The end product, isoleucine, is an allosteric in- 
hibitor of the first reaction in the sequence. In bacteria, 
such allosteric modulation of amino acid synthesis con- 
tributes to the minute-to-minute adjustment of pathway 
activity to cellular needs. 

Allosteric regulation of an individual enzyme can be 
considerably more complex. An example is the remark- 
able set of allosteric controls exerted on glutamine syn- 
thetase ofE. coli (Fig. 22-6). Six products derived from 
glutamine serve as negative feedback modulators of the 
enzyme, and the overall effects of these and other mod- 
ulators are more than additive. Such regulation is called 
concerted inhibition. 

Additional mechanisms contribute to the regulation 
of the amino acid biosynthetic pathways. Because the 20 
common amino acids must be made in the correct 
proportions for protein synthesis, cells have developed 
ways not only of controlling the rate of synthesis of indi- 
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FIGURE 22-21 Allosteric regulation of isoleucine biosynthesis. The 

first reaction in the pathway from threonine to isoleucine is inhibited 
by the end product isoleucine. This was one of the first examples of 
allosteric feedback inhibition to be discovered. The steps from a- 
ketobutyrate to isoleucine correspond to steps @ through @> in Fig- 
ure 22-1 5 (five steps, because (19) is a two-step reaction). 



vidual amino acids but also of coordinating their forma- 
tion. Such coordination is especially well developed in 
fast-growing bacterial cells. Figure 22-22 shows how 
E. coli cells coordinate the synthesis of lysine, methion- 
ine, threonine, and isoleucine, all made from aspartate. 
Several important types of inhibition patterns are evi- 
dent. The step from aspartate to aspartyl-/3-phosphate 
is catalyzed by three isozymes, each independently con- 
trolled by different modulators. This enzyme multi- 
plicity prevents one biosynthetic end product from 
shutting down key steps in a pathway when other prod- 
ucts of the same pathway are required. The steps from 
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FIGURE 22-22 Interlocking regulatory mechanisms in the biosynthe- 
sis of several amino acids derived from aspartate in E. coli. Three en- 
zymes (A, B, C) have either two or three isozyme forms, indicated by 
numerical subscripts. In each case, one isozyme (A 2 , B-,, and C 2 ) has 
no allosteric regulation; these isozymes are regulated by changes in 
the amount synthesized (Chapter 28). Synthesis of isozymes A 2 and Bt 
is repressed when methionine levels are high, and synthesis of isozyme 
C 2 is repressed when isoleucine levels are high. Enzyme A is asparto- 
kinase; B, homoserine dehydrogenase; C, threonine dehydratase. 
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aspartate /3-semialdehyde to homoserine and from thre- 
onine to a-ketobutyrate (detailed in Fig. 22-15) are also 
catalyzed by dual, independently controlled isozymes. 
One isozyme for the conversion of aspartate to aspartyl- 
/3-phosphate is allosterically inhibited by two different 
modulators, lysine and isoleucine, whose action is more 
than additive — another example of concerted inhibition. 
The sequence from aspartate to isoleucine undergoes 
multiple, overlapping negative feedback inhibitions; for 
example, isoleucine inhibits the conversion of threonine 
to a-ketobutyrate (as described above), and threonine 
inhibits its own formation at three points: from ho- 
moserine, from aspartate /3-semialdehyde, and from as- 
partate (steps @, (3), and ® in Fig. 22-15). This overall 
regulatory mechanism is called sequential feedback 
inhibition. 

Similar patterns are evident in the pathways leading 
to the aromatic amino acids. The first step of the early 
pathway to the common intermediate chorismate is cat- 
alyzed by the enzyme 2-keto-3-deoxy-D-arabinoheptu- 
losonate 7-phosphate (DAHP) synthase (step ® in 
Fig. 22-16). Most microorganisms and plants have three 
DAHP synthase isozymes. One is allosterically inhibited 
(feedback inhibition) by phenylalanine, another by tyro- 
sine, and the third by tryptophan. This scheme helps the 
overall pathway to respond to cellular requirements for 
one or more of the aromatic amino acids. Additional reg- 
ulation takes place after the pathway branches at cho- 
rismate. For example, the enzymes catalyzing the first 
two steps of the tryptophan branch are subject to 
allosteric inhibition by tryptophan. 

SUMMARY 22.2 Biosynthesis of Amino 
Acids 

■ Plants and bacteria synthesize all 20 common amino 
acids. Mammals can synthesize about half; the others 
are required in the diet (essential amino acids). 

■ Among the nonessential amino acids, glutamate is 
formed by reductive amination of a-ketoglutarate 
and serves as the precursor of glutamine, proline, 
and arginine. Alanine and aspartate (and thus 
asparagine) are formed from pyruvate and 
oxaloacetate, respectively, by transamination. 
The carbon chain of serine is derived from 
3-phosphoglycerate. Serine is a precursor of 
glycine; the /3-carbon atom of serine is transferred 
to tetrahydrofolate. In microorganisms, cysteine is 
produced from serine and from sulfide produced by 
the reduction of environmental sulfate. Mammals 
produce cysteine from methionine and serine by a 
series of reactions requiring 5-adenosylmethionine 
and cystathionine. 

■ Among the essential amino acids, the aromatic amino 
acids (phenylalanine, tyrosine, and tryptophan) 
form by a pathway in which chorismate occupies a 



key branch point. Phosphoribosyl pyrophosphate is a 
precursor of tryptophan and histidine. The pathway 
to histidine is interconnected with the purine 
synthetic pathway. Tyrosine can also be formed 
by hydroxylation of phenylalanine (and thus is 
considered conditionally essential). The pathways 
for the other essential amino acids are complex. 

■ The amino acid biosynthetic pathways are subject to 
allosteric end-product inhibition; the regulatory en- 
zyme is usually the first in the sequence. Regulation 
of the various synthetic pathways is coordinated. 

22.3 Molecules Derived from Amino Acids 

In addition to their role as the building blocks of pro- 
teins, amino acids are precursors of many specialized 
biomolecules, including hormones, coenzymes, nu- 
cleotides, alkaloids, cell wall polymers, porphyrins, 
antibiotics, pigments, and neurotransmitters. We de- 
scribe here the pathways to a number of these amino 
acid derivatives. 

Glycine Is a Precursor of Porphyrins 

The biosynthesis of porphyrins, for which glycine is a 
major precursor, is our first example because of the cen- 
tral importance of the porphyrin nucleus in heme pro- 
teins such as hemoglobin and the cytochromes. The 
porphyrins are constructed from four molecules of the 
monopyrrole derivative porphobilinogen, which itself 
is derived from two molecules of S-aminolevulinate. 
There are two major pathways to 5-aminolevulinate. 
In higher eukaryotes (Fig. 22-23a), glycine reacts 
with succinyl-CoA in the first step to yield a-amino- 
/3-ketoadipate, which is then decarboxylated to 
5-aminolevulinate. In plants, algae, and most bacteria, 
5-airdnolevulinate is formed from glutamate (Fig. 22-23b). 
The glutamate is first esterified to glutamyl-tRNA Glu 
(see Chapter 27 on the topic of transfer RNAs); reduc- 
tion by NADPH converts the glutamate to glutamate 
1-semialdehyde, which is cleaved from the tRNA. An 
aminotransferase converts the glutamate 1-semialde- 
hyde to 6-aminolevulinate. 

In all organisms, two molecules of 5-aminolevulinate 
condense to form porphobilinogen and, through a series 
of complex enzymatic reactions, four molecules of por- 
phobilinogen come together to form protoporphyrin 
(Fig. 22-24). The iron atom is incorporated after the 
protoporphyrin has been assembled, in a step catalyzed 
by ferrochelatase. Porphyrin biosynthesis is regulated in 
higher eukaryotes by the concentration of the heme 
product, which serves as a feedback inhibitor of early 
steps in the synthetic pathway. Genetic defects in the 
biosynthesis of porphyrins can lead to the accumulation 
of pathway intermediates, causing a variety of human 
diseases known collectively as porphyrias (Box 22-2). 
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FIGURE 22-23 Biosynthesis of 5-aminolevulinate. (a) In most animals, 
including mammals, 5-aminolevulinate is synthesized from glycine and 
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succinyl-CoA, The atoms furnished by glycine are shown in red. (b) In 
bacteria and plants, the precursor of 5-aminolevulinate is glutamate. 
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BOX 22-2 



On Kings and Vampires 



Porphyrias are a group of genetic diseases that result from 
defects in enzymes of the biosynthetic pathway from 
glycine to porphyrins; specific porphyrin precursors accu- 
mulate in erythrocytes, body fluids, and the liver. The 
most common form is acute intermittent porphyria. Most 
individuals inheriting this condition are heterozygotes and 
are usually asymptomatic, because the single copy of the 
normal gene provides a sufficient level of enzyme func- 
tion. However, certain nutritional or environmental factors 
(as yet poorly understood) can cause a buildup of 8- 
aminolevulinate and porphobilinogen, leading to attacks 
of acute abdominal pain and neurological dysfunction. 
King George III, British monarch during the American 
Revolution, suffered several episodes of apparent mad- 
ness that tarnished the record of this otherwise accom- 
plished man. The symptoms of his condition suggest that 
George III suffered from acute intermittent porphyria. 

One of the rarer porphyrias results in an accumula- 
tion of uroporphyrinogen I, an abnormal isomer of a pro- 
toporphyrin precursor. This compound stains the urine 
red, causes the teeth to fluoresce strongly in ultraviolet 
light, and makes the skin abnormally sensitive to sunlight. 
Many individuals with this porphyria are anemic because 
insufficient heme is synthesized. This genetic condition 



may have given rise to the vampire myths of folk legend. 

The symptoms of most porphyrias are now readily 
controlled with dietary changes or the administration of 
heme or heme derivatives. 
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Heme Is the Source of Bile Pigments 

HThe iron-porphyrin (heme) group of hemoglobin, 
released from dying erythrocytes in the spleen, is 
degraded to yield free Fe 2+ and, ultimately, bilirubin. 
This pathway is arresting for its capacity to inject color 
into human biochemistry. 

The first step in the two-step pathway, catalyzed by 
heme oxygenase, converts heme to biliverdin, a linear 
(open) tetrapyrrole derivative (Fig. 22-25). The other 
products of the reaction are free Fe 2+ and CO. The Fe 2+ 
is quickly bound by ferritin. Carbon monoxide is a poi- 
son that binds to hemoglobin (see Box 5-1), and the 
production of CO by heme oxygenase ensures that, even 
in the absence of environmental exposure, about 1% of 
an individual's heme is complexed with CO. 

Biliverdin is converted to bilirubin in the second 
step, catalyzed by biliverdin reductase. You can moni- 
tor this reaction colorimetrically in a familiar in situ ex- 
periment. When you are bruised, the black and/or 
purple color results from hemoglobin released from 
damaged erythrocytes. Over time, the color changes to 
the green of biliverdin, and then to the yellow of biliru- 
bin. Bilirubin is largely insoluble, and it travels in the 
bloodstream as a complex with serum albumin. In the 
liver, bilirubin is transformed to the bile pigment biliru- 
bin diglucuronide. This product is sufficiently water- 



soluble to be secreted with other components of bile 
into the small intestine, where microbial enzymes con- 
vert it to several products, predominantly urobilino- 
gen. Some urobilinogen is reabsorbed into the blood 
and transported to the kidney, where it is converted to 
urobilin, the compound that gives urine its yellow color 
(Fig. 22-25, left branch). Urobilinogen remaining in 
the intestine is converted (in another microbe-depend- 
ent reaction) to stercobilin (Fig. 22-25, right branch), 
which imparts the red-brown color to feces. 

Impaired liver function or blocked bile secretion 
causes bilirubin to leak from the liver into the blood, re- 
sulting in a yellowing of the skin and eyeballs, a condition 
called jaundice. In cases of jaundice, determination of 
the concentration of bilirubin in the blood may be useful 
in the diagnosis of underlying liver disease. Newborn in- 
fants sometimes develop jaundice because they have not 
yet produced enough glucuronyl bilirubin transferase to 
process their bilirubin. A traditional treatment to reduce 
excess bilirubin, exposure to a fluorescent lamp, causes 
a photochemical conversion of bilirubin to compounds 
that are more soluble and easily excreted. 

These pathways of heme breakdown play significant 
roles in protecting cells from oxidative damage and in 
regulating certain cellular functions. The CO produced 
by heme oxygenase is toxic at high concentrations, but 
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FIGURE 22-25 Bilirubin and its breakdown 
products. M represents methyl; V, vinyl; Pr, pro- 
pionyl; E, ethyl. For ease of comparison, these 
structures are shown in linear form, rather than 
in their correct stereochemical conformations. 
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at the very low concentrations generated during heme 
degradation it seems to have some regulatory and/or sig- 
naling functions. It acts as a vasodilator, much the same 
as (but less potent than) nitric oxide (discussed below). 
Low levels of CO also have some regulatory effects on 
neurotransmission. Bilirubin is the most abundant an- 
tioxidant in mammalian tissues and is responsible for 
most of the antioxidant activity in serum. Its protective 
effects seem to be especially important in the develop- 
ing brain of newborn infants. The cell toxicity associated 
with jaundice may be due to bilirubin levels in excess of 
the serum albumin needed to solubilize it. 

Given these varied roles of heme degradation prod- 
ucts, the degradative pathway is subject to regulation, 
mainly at the first step. Humans have at least three 
isozymes of heme oxygenase (HO). HO-1 is highly regu- 
lated; the expression of its gene is induced by a wide 
range of stress conditions (shear stress, angiogenesis 
(uncontrolled development of blood vessels), hypoxia, 



hyperoxia, heat shock, exposure to ultraviolet light, hy- 
drogen peroxide, and many other metabolic insults). 
HO-2 is found mainly in brain and testes, where it is con- 
tinuously expressed. The third isozyme, HO-3, is not yet 
well characterized. ■ 

Amino Acids Are Precursors of Creatine and Glutathione 

Phosphocreatine, derived from creatine, is an im- 
portant energy buffer in skeletal muscle (see Fig. 
13-15). Creatine is synthesized from glycine and argi- 
nine (Fig. 22-26); methionine, in the form of 
S-adenosylmethionine, acts as methyl group donor. 

Glutathione (GSH), present in plants, animals, 
and some bacteria, often at high levels, can be thought 
of as a redox buffer. It is derived from glutamate, cys- 
teine, and glycine (Fig. 22-27). The y-carboxyl group 
of glutamate is activated by ATP to form an acyl phos- 
phate intermediate, which is then attacked by the 
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FIGURE 22-26 Biosynthesis of creatine and phosphocreatine. Crea- 
tine is made from three amino acids: glycine, arginine, and methion- 
ine. This pathway shows the versatility of amino acids as precursors of 
other nitrogenous biomolecules. 

a-amino group of cysteine. A second condensation reac- 
tion follows, with the a-carboxyl group of cysteine acti- 
vated to an acyl phosphate to permit reaction with 
glycine. The oxidized form of glutathione (GSSG), pro- 
duced in the course of its redox activities, contains two 
glutathione molecules linked by a disulfide bond. 

Glutathione probably helps maintain the sulfhydryl 
groups of proteins in the reduced state and the iron of 
heme in the ferrous (Fe 2+ ) state, and it serves as a re- 
ducing agent for glutaredoxin in deoxyribonucleotide 
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FIGURE 22-27 Glutathione metabolism, (a) Biosynthesis of glutathione, 
(b) Oxidized form of glutathione. 



synthesis (see Fig. 22-39). Its redox function is also 
used to remove toxic peroxides formed in the normal 
course of growth and metabolism under aerobic 
conditions: 



2 GSH + R— O— O— H - 



GSSG + H 2 0 + R— OH 



This reaction is catalyzed by glutathione peroxidase, 

a remarkable enzyme in that it contains a covalently 
bound selenium (Se) atom in the form of selenocysteine 
(see Fig. 3-8a), which is essential for its activity. 

D-Amino Acids Are Found Primarily in Bacteria 

Although D-amino acids do not generally occur 
in proteins, they do serve some special functions 
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in the structure of bacterial cell walls and peptide an- 
tibiotics. Bacterial peptidoglycans (see Fig. 20-31) 
contain both D-alanine and D-glutamate. d- Amino acids 
arise directly from the l isomers by the action of amino 
acid racemases, which have pyridoxal phosphate as co- 
factor (see Fig. 18-6). Amino acid racemization is 
uniquely important to bacterial metabolism, and en- 
zymes such as alanine racemase are prime targets for 
pharmaceutical agents. One such agent, L-fluoroala- 
nine, is being tested as an antibacterial drug. Another, 
cycloserine, is used to treat tuberculosis. Because 
these inhibitors also affect some PLP-requiring human 
enzymes, however, they have potentially undesirable 
side effects, ■ 
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Aromatic Amino Acids Are Precursors 
of Many Plant Substances 

Phenylalanine, tyrosine, and tryptophan are converted 
to a variety of important compounds in plants. The 
rigid polymer lignin, derived from phenylalanine and 
tyrosine, is second only to cellulose in abundance in 
plant tissues. The structure of the lignin polymer is com- 
plex and not well understood. Tryptophan is also the 
precursor of the plant growth hormone indole-3- 
acetate, or auxin (Fig. 22-28a), which is important 
in the regulation of a wide range of biological processes 
in plants. 

Phenylalanine and tyrosine also give rise to many 
commercially significant natural products, including the 
tannins that inhibit oxidation in wines; alkaloids such as 
morphine, which have potent physiological effects; and 
the flavoring of cinnamon oil (Fig. 22-28b), nutmeg, 
cloves, vanilla, cayenne pepper, and other products. 

Biological Amines Are Products 
of Amino Acid Decarboxylation 

Many important neurotransmitters are primary or 
secondary amines, derived from amino acids in 
simple pathways. In addition, some polyamines that form 
complexes with DNA are derived from the amino acid or- 
nithine, a component of the urea cycle. A common de- 
nominator of many of these pathways is amino acid 
decarboxylation, another PLP-requiring reaction (see 
Fig. 18-6). 

The synthesis of some neurotransmitters is illus- 
trated in Figure 22-29. Tyrosine gives rise to a family 
of catecholamines that includes dopamine, norepi- 
nephrine, and epinephrine. Levels of catecholamines 
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FIGURE 22-28 Biosynthesis of two plant substances from amino acids. 

(a) lndole-3-acetate (auxin) and (b) cinnamate (cinnamon flavor). 



are correlated with, among other things, changes in 
blood pressure. The neurological disorder Parkinson's 
disease is associated with an underproduction of 
dopamine, and it has traditionally been treated by ad- 
ministering L-dopa. Overproduction of dopamine in the 
brain may be linked to psychological disorders such as 
schizophrenia. 

Glutamate decarboxylation gives rise to y-amino- 
butyrate (GABA), an inhibitory neurotransmitter. Its 
underproduction is associated with epileptic seizures. 
GABA analogs are used in the treatment of epilepsy and 
hypertension. Levels of GABA can also be increased by 
administering inhibitors of the GABA-degrading enzyme 
GABA aminotransferase. Another important neuro- 
transmitter, serotonin, is derived from tryptophan in a 
two-step pathway. 

Histidine undergoes decarboxylation to histamine, 
a powerful vasodilator in animal tissues. Histamine is 
released in large amounts as part of the allergic re- 
sponse, and it also stimulates acid secretion in the stom- 
ach. A growing array of pharmaceutical agents are being 
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FIGURE 22-29 Biosynthesis of some neurotransmitters from amino acids. The key step is the same in each 
case: a PLP-dependent decarboxylation (shaded in pink). 



designed to interfere with either the synthesis or the ac- 
tion of histamine. A prominent example is the histamine 
receptor antagonist cimetidine (Tagamet), a structural 
analog of histamine: 

CH 3 CH 2 -S-CH 2 -CH 2 — NH— C-NH— CH 3 

f\ N— C=N 

N^NH 



It promotes the healing of duodenal ulcers by inhibiting 
secretion of gastric acid. 

Polyamines such as spermine and spermidine, in- 
volved in DNA packaging, are derived from methionine 
and ornithine by the pathway shown in Figure 22-30. 
The first step is decarboxylation of ornithine, a precursor 
of arginine (Fig. 22-10). Ornithine decarboxylase, a 
PLP-requiring enzyme, is the target of several powerful 
inhibitors used as pharmaceutical agents (Box 22-3). ■ 
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African sleeping sickness, or African trypanosomiasis, is 
caused by protists (single-celled eukaryotes) called 
trypanosomes (Fig. 1). This disease (and related try- 
panosome-caused diseases) is medically and economi- 
cally significant in many developing nations. Until the 
late twentieth century, the disease was virtually incur- 
able. Vaccines are ineffective because the parasite has a 
novel mechanism to evade the host immune system. 

The cell coat of trypanosomes is covered with a sin- 
gle protein, which is the antigen to which the immune 
system responds. Every so often, however, by a process 
of genetic recombination (see Table 28-1), a few cells in 
the population of infecting trypanosomes switch to a new 
protein coat, not recognized by the Immune system. This 
process of "changing coats" can occur hundreds of times. 
The result is a chronic cyclic infection: the human host 
develops a fever, which subsides as the immune system 
beats back the first infection; trypanosomes with 
changed coats then become the seed for a second infec- 
tion, and the fever recurs. This cycle can repeat for 
weeks, and the weakened person eventually dies. 

Some modern approaches to treating African sleep- 
ing sickness have been based on an understanding of 
enzymology and metabolism. In at least one such 
approach, this involves pharmaceutical agents designed 
as mechanism-based enzyme inactivators (suicide 



■ 




FIGURE 1 Trypanosoma brucei rhodesiense, one of several try- 
panosomes known to cause African sleeping sickness. 

inactivators; p. 204). A vulnerable point in trypanosome 
metabolism is the pathway of polyamine biosynthesis. 
The polyamines spermine and spermidine, involved in 
DNA packaging, are required in large amounts in rap- 
idly dividing cells. The first step in their synthesis is cat- 
alyzed by ornithine decarboxylase, a PLP-requiring 
enzyme (see Fig. 22-30). In mammalian cells, ornithine 
decarboxylase undergoes rapid turnover— that is, a 
constant round of enzyme degradation and synthesis. 
In some trypanosomes, however, the enzyme (for rea- 
sons not well understood) is stable, not readily replaced 
by newly synthesized enzyme. An inhibitor of ornithine 
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decarboxylase that binds permanently to the enzyme 
would thus have little effect on human cells, which 
could rapidly replace inactivated enzyme, but would 
adversely affect the parasite. 

The first few steps of the normal reaction catalyzed 
by ornithine decarboxylase are shown in Figure 2. Once 
C0 2 is released, the electron movement is reversed and 
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putrescine is produced (see Fig. 22-30). Based on this 
mechanism, several suicide inactivators have been de- 
signed, one of which is difluoromethylornithine 
(DFMO). DFMO is relatively inert in solution. When it 
binds to ornithine decarboxylase, however, the enzyme 
is quickly inactivated (Fig. 3). The inhibitor acts by 
providing an alternative electron sink in the form of 
two strategically placed fluorine atoms, which are ex- 
cellent leaving groups. Instead of electrons moving into 
the ring structure of PLP, the reaction results in dis- 
placement of a fluorine atom. The S of a Cys residue at 
the enzyme's active site then forms a covalent complex 
with the highly reactive PLP-inhibitor adduct in an es- 
sentially irreversible reaction. In this way, the inhibitor 
makes use of the enzyme's own reaction mechanisms 
to kill it. 

DFMO has proved highly effective against African 
sleeping sickness in clinical trials and is now used to 
treat African sleeping sickness caused by Trypanosoma 
brucei gambiense. Approaches such as this show great 
promise for treating a wide range of diseases. The 
design of drugs based on enzyme mechanism and struc- 
ture can complement the more traditional trial-and- 
error methods of developing pharmaceuticals. 
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FIGURE 22-31 Biosynthesis of nitric oxide. Both steps are catalyzed by nitric oxide synthase. The ni- 
trogen of the NO is derived from the guanidinium group of arginine 



Arginine Is the Precursor for Biological Synthesis 
of Nitric Oxide 

A surprise finding in the mid-1980s was the role of nitric 
oxide (NO)— previously known mainly as a component 
of smog— as an important biological messenger. This 
simple gaseous substance diffuses readily through mem- 
branes, although its high reactivity limits its range of dif- 
fusion to about a 1 mm radius from the site of synthesis. 
In humans NO plays a role in a range of physiological 
processes, including neurotransmission, blood clotting, 
and the control of blood pressure. Its mode of action is 
described in Chapter 12 (p. 446). 

Nitric oxide is synthesized from arginine in an 
NADPH-dependent reaction catalyzed by nitric oxide 
synthase (Fig. 22-31), a dimeric enzyme structurally 
related to NADPH cytochrome P-450 reductase (see 
Box 21-1). The reaction is a five-electron oxidation. 
Each subunit of the enzyme contains one bound mole- 
cule of each of four different cofactors: FMN, FAD, 
tetrahydrobiopterin, and Fe 3+ heme NO is an unstable 
molecule and cannot be stored. Its synthesis is stimu- 
lated by interaction of nitric oxide synthase with Ca 2+ - 
calmodulin (see Fig. 12-11). 

SUMMARY 22.3 Molecules Derived from 
Amino Acids 

■ Many important biomolecules are derived from 
amino acids. Glycine is a precursor of porphyrins. 
Degradation of iron-porphyrin (heme) generates 
bilirubin, which is converted to bile pigments, with 
several physiological functions. 

■ Glycine and arginine give rise to creatine and 
phosphocreatine, an energy buffer. Glutathione, 
formed from three amino acids, is an important 
cellular reducing agent. 

■ Bacteria synthesize D-amino acids from L-amino 
acids in racemization reactions requiring pyridoxal 



phosphate. D-Amino acids are commonly found in 
certain bacterial walls and certain antibiotics. 

■ The aromatic amino acids give rise to many plant 
substances. The PLP-dependent decarboxylation 
of some amino acids yields important biological 
amines, including neurotransmitters. 

■ Arginine is the precursor of nitric oxide, a biological 
messenger. 

22.4 Biosynthesis and Degradation 
of Nucleotides 

As discussed in Chapter 8, nucleotides have a variety of 
important functions in all cells. They are the precursors 
of DNA and RNA. They are essential carriers of chemical 
energy— a role primarily of ATP and to some extent 
GTP. They are components of the cofactors NAD, FAD, 
S-adenosylmethionine, and coenzyme A, as well as of 
activated biosynthetic intermediates such as UDP- 
glucose and CDP-diacylglycerol. Some, such as cAMP 
and cGMP, are also cellular second messengers. 

Two types of pathways lead to nucleotides: the de 
novo pathways and the salvage pathways. De novo 
synthesis of nucleotides begins with their metabolic pre- 
cursors: amino acids, ribose 5-phosphate, C0 2 , and NH 3 . 
Salvage pathways recycle the free bases and nucleo- 
sides released from nucleic acid breakdown. Both types 
of pathways are important in cellular metabolism and 
both are discussed in this section. 

The de novo pathways for purine and pyrimidine 
biosynthesis seem to be nearly identical in all living or- 
ganisms. Notably, the free bases guanine, adenine, 
thymine, cytidine, and uracil are not intermediates in 
these pathways; that is, the bases are not synthesized 
and then attached to ribose, as might be expected. The 
purine ring structure is built up one or a few atoms at a 
time, attached to ribose throughout the process. The 
pyrimidine ring is synthesized as orotate, attached to 
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ribose phosphate, and then converted to the common 
pyrimidine nucleotides required in nucleic acid synthe- 
sis. Although the free bases are not intermediates in the 
de novo pathways, they are intermediates in some of the 
salvage pathways. 

Several important precursors are shared by the de 
novo pathways for synthesis of pyrimidines and purines, 
Phosphoribosyl pyrophosphate (PRPP) is important in 
both, and in these pathways the structure of ribose is 
retained in the product nucleotide, in contrast to its fate in 
the tryptophan and histidine biosynthetic pathways dis- 
cussed earlier. An amino acid is an important precursor in 
each type of pathway: glycine for purines and aspartate 
for pyrimidines. Glutamine again is the most important 
source of amino groups — in five different steps in the de 
novo pathways. Aspartate is also used as the source of an 
amino group in the purine pathways, in two steps. 

Two other features deserve mention. First, there is 
evidence, especially in the de novo purine pathway, that 
the enzymes are present as large, multienzyme com- 
plexes in the cell, a recurring theme in our discussion of 
metabolism. Second, the cellular pools of nucleotides 
(other than ATP) are quite small, perhaps 1% or less of 
the amounts required to synthesize the cell's DNA. 
Therefore, cells must continue to synthesize nu- 
cleotides during nucleic acid synthesis, and in some 
cases nucleotide synthesis may limit the rates of DNA 
replication and transcription. Because of the impor- 
tance of these processes in dividing cells, agents that in- 
hibit nucleotide synthesis have become particularly 
important in medicine. 

We examine here the biosynthetic pathways of 
purine and pyrimidine nucleotides and their regulation, 
the formation of the deoxynucleotides, and the degrada- 
tion of purines and pyrimidines to uric acid and urea. We 
end with a discussion of chemotherapeutic agents that 
affect nucleotide synthesis. 

De Novo Purine Nucleotide Synthesis 
Begins with PRPP 

The two parent purine nucleotides of nucleic acids are 
adenosine 5 '-monophosphate (AMP; adenylate) and 
guanosine 5 '-monophosphate (GMP; guanylate), contain- 
ing the purine bases adenine and 
guanine. Figure 22-32 shows 
the origin of the carbon and ni- 
trogen atoms of the purine ring 
system, as determined by John 
Buchanan using isotopic tracer 
experiments in birds. The de- 
tailed pathway of purine biosyn- 
thesis was worked out primarily 
by Buchanan and G. Robert 
Greenberg in the 1950s. 
In the first committed step of 
John Buchanan the pathway, an amino group 
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FIGURE 22-32 Origin of the ring atoms of purines. This information 
was obtained from isotopic experiments with 14 C- or 15 N-labeled pre- 
cursors Formate is supplied in the form of N 10 -formyltetrahydrofolate. 



donated by glutamine is attached at C-l of PRPP (Fig. 
22-33). The resulting 5-phosphoribosylamine is 

highly unstable, with a half-life of 30 seconds at pH 7.5. 
The purine ring is subsequently built up on this struc- 
ture. The pathway described here is identical in all 
organisms, with the exception of one step that differs in 
higher eukaryotes as noted below. 

The second step is the addition of three atoms from 
glycine (Fig. 22-33, step (2)). An ATP is consumed to ac- 
tivate the glycine carboxyl group (in the form of an acyl 
phosphate) for this condensation reaction. The added 
glycine amino group is then formylated by AT 10 -formylte- 
trahydrofolate (step (3)), and a nitrogen is contributed 
by glutamine (step (4)) , before dehydration and ring clo- 
sure yield the five-membered imidazole ring of the 
purine nucleus, as 5-aminoimidazole ribonucleotide 
(AIR; step©). 

At this point, three of the six atoms needed for the 
second ring in the purine structure are in place. To 
complete the process, a carboxyl group is first added 
(step (6)). This carboxylation is unusual in that it does 
not require biotin, but instead uses the bicarbonate gen- 
erally present in aqueous solutions. A rearrangement 
transfers the carboxylate from the exocyclic amino 
group to position 4 of the imidazole ring (step ®) . Steps 
(6) and © are found only in bacteria and fungi. In higher 
eukaryotes, including humans, the 5-aminoimidazole ri- 
bonucleotide product of step (5) is carboxylated directly 
to carboxyaminoimidazole ribonucleotide in one step in- 
stead of two (step ©). The enzyme catalyzing this re- 
action is AIR carboxylase. 

Aspartate now donates its amino group in two steps 
(® and (9)): formation of an amide bond, followed by 
elimination of the carbon skeleton of aspartate (as 
fumarate). (Recall that aspartate plays an analogous 
role in two steps of the urea cycle; see Fig. 18-10.) The 
final carbon is contributed by Af 10 -formyltetrahydrofo- 
late (step ®), and a second ring closure takes place to 
yield the second fused ring of the purine nucleus (step 
©). The first intermediate with a complete purine ring 
is inosinate (IMP). 
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FIGURE 22-33 De novo synthesis of purine nucleotides: construction 
of the purine ring of inosinate (IMP). Each addition to the purine ring 
is shaded to match Figure 22-32. After step ©, R symbolizes the 
5-phospho-D-ribosyl group on which the purine ring is built. Formation 
of 5-phosphoribosylamine (step 0) is the first committed step in 
purine synthesis. Note that the product of step @ f AICAR, is the rem- 
nant of ATP released during histidine biosynthesis (see Fig. 22-20, step 
©). Abbreviations are given for most intermediates to simplify the 
naming of the enzymes. Step @ is the alternative path from AIR to 
CAIR occurring in higher eukaryotes. 
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As in the tryptophan and histidine biosynthetic 
pathways, the enzymes of IMP synthesis seem to be or- 
ganized as large, multienzyme complexes in the cell. 
Once again, evidence comes from the existence of single 
polypeptides with several functions, some catalyzing 
nonsequential steps in the pathway. In eukaryotic cells 
ranging from yeast to fruit flies to chickens, steps ®, (3), 
and (5) in Figure 22-33 are catalyzed by a multifunc- 
tional protein. An additional multifunctional protein cat- 
alyzes steps @ and (fj). In humans, a multifunctional 
enzyme combines the activities of AIR carboxylase and 
SAICAR synthetase (steps© and (§)). In bacteria, these 
activities are found on separate proteins, but the pro- 
teins may form a large noncovalent complex. The chan- 
neling of reaction intermediates from one enzyme to the 
next permitted by these complexes is probably espe- 
cially important for unstable intermediates such as 5- 
phosphoribosylamine. 

Conversion of inosinate to adenylate requires the 
insertion of an amino group derived from aspartate 
(Fig. 22-34); this takes place in two reactions similar 
to those used to introduce N-l of the purine ring (Fig. 
22-33, steps ©and (9)). A crucial difference is that GTP 
rather than ATP is the source of the high-energy phos- 
phate in synthesizing adenylosuccinate. Guanylate is 
formed by the NAD + -requiring oxidation of inosinate at 
C-2, followed by addition of an amino group derived 
from glutamine. ATP is cleaved to AMP and PPi in the 
final step (Fig. 22-34). 

Purine Nucleotide Biosynthesis Is Regulated 
by Feedback Inhibition 

Three major feedback mechanisms cooperate in regu- 
lating the overall rate of de novo purine nucleotide 
synthesis and the relative rates of formation of the two 
end products, adenylate and guanylate (Fig. 22-35). 
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FIGURE 22-35 Regulatory mechanisms in the biosynthesis of adenine 
and guanine nucleotides in £ colL Regulation of these pathways differs 
in other organisms. 

The first mechanism is exerted on the first reaction 
that is unique to purine synthesis: transfer of an amino 
group to PRPP to form 5-phosphoribosylamine. This 
reaction is catalyzed by the allosteric enzyme 
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glutamine-PRPP amidotransferase, which is inhibited 
by the end products IMP, AMP, and GMP AMP and 
GMP act synergistically in this concerted inhibition. 
Thus, whenever either AMP or GMP accumulates to 
excess, the first step in its biosynthesis from PRPP is 
partially inhibited. 

In the second control mechanism, exerted at a later 
stage, an excess of GMP in the cell inhibits formation of 
xanthylate from inosinate by IMP dehydrogenase, with- 
out affecting the formation of AMP (Fig. 22-35). Con- 
versely, an accumulation of adenylate inhibits formation 
of adenylosuccinate by adenylosuccinate synthetase, 
without affecting the biosynthesis of GMP. In the third 
mechanism, GTP is required in the conversion of IMP to 
AMP, whereas ATP is required for conversion of IMP to 
GMP (Fig. 22-34), a reciprocal arrangement that tends 
to balance the synthesis of the two ribonucleotides. 

The final control mechanism is the inhibition of 
PRPP synthesis by the allosteric regulation of ribose 
phosphate pyrophosphokinase. This enzyme is inhibited 
by ADP and GDP, in addition to metabolites from other 
pathways for which PRPP is a starting point. 

Pyrimidine Nucleotides Are Made from Aspartate, 
PRPP, and Carbamoyl Phosphate 

The common pyrimidine ribonucleotides are cytidine 
5 '-monophosphate (CMP; cytidylate) and uridine 5'- 
monophosphate (UMP; uridylate), which contain the 
pyrimidines cytosine and uracil. De novo pyrimidine 
nucleotide biosynthesis (Fig. 22-36) proceeds in a 
somewhat different manner from purine nucleotide 
synthesis; the six-membered pyrimidine ring is made 
first and then attached to ribose 5-phosphate. Re- 
quired in this process is carbamoyl phosphate, also an 
intermediate in the urea cycle (see Fig. 18-10). How- 
ever, as we noted in Chapter 18, in animals the car- 
bamoyl phosphate required in urea synthesis is made 
in mitochondria by carbamoyl phosphate synthetase 
I, whereas the carbamoyl phosphate required in 
pyrimidine biosynthesis is made in the cytosol by a 
different form of the enzyme, carbamoyl phosphate 
synthetase II. In bacteria, a single enzyme supplies 
carbamoyl phosphate for the synthesis of arginine 
and pyrimidines. The bacterial enzyme has three sep- 
arate active sites, spaced along a channel nearly 100 
A long (Fig. 22-37). Bacterial carbamoyl phosphate 
synthetase provides a vivid illustration of the chan- 
neling of unstable reaction intermediates between ac- 
tive sites. 



FIGURE 22-36 De novo synthesis of pyrimidine nucleotides: 
biosynthesis of UTP and CTP via orotidylate. The pyrimidine is con- 
structed from carbamoyl phosphate and aspartate. The ribose 5- 
phosphate is then added to the completed pyrimidine ring by orotate 
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phosphoribosyltransferase. The first step in this pathway (not shown 
here; see Fig. 1 8-1 1 a) is the synthesis of carbamoyl phosphate from 
C0 2 and NHj, catalyzed in eukaryotes by carbamoyl phosphate 
synthetase II. 
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FIGURE 22-37 Channeling of intermediates in bacterial carbamoyl 
phosphate synthetase. (Derived from PDB ID 1M6V)The reaction cat- 
alyzed by this enzyme is illustrated in Figure 1 8—1 1 a. The large and 
small subunits are shown in gray and blue, respectively; the channel 
between active sites (almost 100 A long) is shown as a yellow mesh. A 
glutamine molecule (green) binds to the small subunit, donating its 
amido nitrogen as NH4 in a glutamine amidotransferase-type reac- 
tion. The NH4 enters the channel, which takes it to a second active 
site, where it combines with bicarbonate in a reaction requiring ATP 
(bound ADP in blue). The carbamate then reenters the channel to reach 
the third active site, where it is phosphorylated to carbamoyl phos- 
phate (bound ADP in red). 

Carbamoyl phosphate reacts with aspartate to yield 
TV-carbamoylaspartate in the first committed step of 
pyrimidine biosynthesis (Fig. 22-36). This reaction is 
catalyzed by aspartate transcarbamoylase. In bacte- 
ria, this step is highly regulated, and bacterial aspartate 
transcarbamoylase is one of the most thoroughly stud- 
ied allosteric enzymes (see below). By removal of water 
fromTV-carbamoylaspartate, a reaction catalyzed by di- 
hydroorotase, the pyrimidine ring is closed to form L- 
dihydroorotate. This compound is oxidized to the 
pyrimidine derivative orotate, a reaction in which NAD + 
is the ultimate electron acceptor. In eukaryotes, the first 
three enzymes in this pathway — carbamoyl phosphate 
synthetase II, aspartate transcarbamoylase, and dihy- 
droorotase — are part of a single trifunctional protein. 
The protein, known by the acronym CAD, contains three 



identical polypeptide chains (each of M T 230,000), each 
with active sites for all three reactions. This suggests 
that large, multienzyme complexes may be the rule in 
this pathway. 

Once orotate is formed, the ribose 5-phosphate side 
chain, provided once again by PRPP, is attached to yield 
orotidylate (Fig. 22-36). Orotidylate is then decarboxy- 
lated to uridylate, which is phosphorylated to UTP. CTP 
is formed from UTP by the action of cytidylate syn- 
thetase, by way of an acyl phosphate intermediate 
(consuming one ATP). The nitrogen donor is normally 
glutamine, although the cytidylate synthetases in many 
species can use NH4 directly. 

Pyrimidine Nucleotide Biosynthesis Is Regulated 
by Feedback Inhibition 

Regulation of the rate of pyrimidine nucleotide synthe- 
sis in bacteria occurs in large part through aspartate 
transcarbamoylase (ATCase), which catalyzes the first 
reaction in the sequence and is inhibited by CTP, the 
end product of the sequence (Fig. 22-36). The bacterial 
ATCase molecule consists of six catalytic subunits and 
six regulatory subunits (see Fig. 6-32). The catalytic 
subunits bind the substrate molecules, and the allosteric 
subunits bind the allosteric inhibitor, CTP The entire 
ATCase molecule, as well as its subunits, exists in two 
conformations, active and inactive. When CTP is not 
bound to the regulatory subunits, the enzyme is maxi- 
mally active. As CTP accumulates and binds to the regu- 
latory subunits, they undergo a change in conformation. 
This change is transmitted to the catalytic subunits, 
which then also shift to an inactive conformation. ATP 
prevents the changes induced by CTP Figure 22-38 
shows the effects of the allosteric regulators on the 
activity of ATCase. 
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FIGURE 22-38 Allosteric regulation of aspartate transcarbamoylase 
by CTP and ATP. Addition of 0.8 mM CTP, the allosteric inhibitor of 
ATCase, increases the K 0 5 for aspartate (lower curve) and the rate of 
conversion of aspartate to N-carbamoylaspartate. ATP at 0.6 mM fully 
reverses this effect (middle curve). 
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Nucleoside Monophosphates Are Converted 
to Nucleoside Triphosphates 

Nucleotides to be used in biosynthesis are generally 
converted to nucleoside triphosphates. The conversion 
pathways are common to all cells. Phosphorylation of 
AMP to ADP is promoted by adenylate kinase, in the 
reaction 

ATP + AMP 2 ADP 

The ADP so formed is phosphorylated to ATP by the gly- 
colytic enzymes or through oxidative phosphorylation. 

ATP also brings about the formation of other nucle- 
oside diphosphates by the action of a class of enzymes 
called nucleoside monophosphate kinases. These 
enzymes, which are generally specific for a particular 
base but nonspecific for the sugar (ribose or deoxyri- 
bose), catalyze the reaction 

ATP + NMP ADP + NDP 

The efficient cellular systems for rephosphorylating 
ADP to ATP tend to pull this reaction in the direction of 
products. 

Nucleoside diphosphates are converted to triphos- 
phates by the action of a ubiquitous enzyme, nucleoside 
diphosphate kinase, which catalyzes the reaction 

NTP D + NDP A NDP D + NTP A 

This enzyme is notable in that it is not specific for the 
base (purines or pyrimidines) or the sugar (ribose or de- 
oxyribose). This nonspecificity applies to both phos- 
phate acceptor (A) and donor (D), although the donor 
(NTP D ) is almost invariably ATP because it is present in 
higher concentration than other nucleoside triphos- 
phates under aerobic conditions. 

Ribonucleotides Are the Precursors 
of Deoxyribonucleotides 

Deoxyribonucleotides, the building blocks of DNA, are 
derived from the corresponding ribonucleotides by 
direct reduction at the 2'-carbon atom of the D-ribose to 
form the 2'-deoxy derivative. For example, adenosine 
diphosphate (ADP) is reduced to 2'-deoxyadenosine 
diphosphate (dADP), and GDP is reduced to dGDP This 
reaction is somewhat unusual in that the reduction 
occurs at a nonactivated carbon; no closely analogous 
chemical reactions are known. The reaction is catalyzed 
by ribonucleotide reductase, best characterized in 
E. coli, in which its substrates are ribonucleoside 
diphosphates. 

The reduction of the D-ribose portion of a ribonu- 
cleoside diphosphate to 2'-deoxy-D-ribose requires a 
pair of hydrogen atoms, which are ultimately donated by 
NADPH via an intermediate hydrogen-carrying protein, 
thioredoxin. This ubiquitous protein serves a similar 
redox function in photosynthesis (see Fig. 20-19) and 
other processes. Thioredoxin has pairs of — SH groups 
that carry hydrogen atoms from NADPH to the ribonu- 
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FIGURE 22-39 Reduction of ribonucleotides to deoxyribonucleotides 
by ribonucleotide reductase. Electrons are transmitted (blue arrows) 
to the enzyme from NADPH via (a) glutaredoxin or (b) thioredoxin. 
The sulfide groups in glutaredoxin reductase are contributed by two 
molecules of bound glutathione (GSH; GSSG indicates oxidized glu- 
tathione). Note that thioredoxin reductase is a flavoenzyme, with FAD 
as prosthetic group. 

cleoside diphosphate. Its oxidized (disulfide) form is 
reduced by NADPH in a reaction catalyzed by thiore- 
doxin reductase (Fig. 22-39) ; and reduced thiore- 
doxin is then used by ribonucleotide reductase to 
reduce the nucleoside diphosphates (NDPs) to deoxyri- 
bonucleoside diphosphates (dNDPs). A second source 
of reducing equivalents for ribonucleotide reductase is 
glutathione (GSH). Glutathione serves as the reductant 
for a protein closely related to thioredoxin, glutare- 
doxin, which then transfers the reducing power to 
ribonucleotide reductase (Fig. 22-39). 

Ribonucleotide reductase is notable in that its reac- 
tion mechanism provides the best-characterized exam- 
ple of the involvement of free radicals in biochemical 
transformations, once thought to be rare in biological 
systems. The enzyme mE. coli and most eukaryotes is a 
dimer, with subunits designated Rl and R2 (Fig. 22-40). 
The Rl subunit contains two kinds of regulatory sites, as 
described below. The two active sites of the enzyme are 
formed at the interface between the Rl and R2 subunits. 
At each active site, Rl contributes two sulfhydryl groups 
required for activity and R2 contributes a stable tyrosyl 
radical. The R2 subunit also has a binuclear iron (Fe 3+ ) 
cofactor that helps generate and stabilize the tyrosyl 
radicals (Fig. 22^0). The tyrosyl radical is too far from 
the active site to interact directly with the site, but it 
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generates another radical at the active site that func- 
tions in catalysis. A likely mechanism for the ribonu- 
cleotide reductase reaction is illustrated in Figure 
22-41. InE. coli, likely sources of the required reducing 
equivalents for this reaction are thioredoxin and 
glutaredoxin, as noted above. 




FIGURE 22-40 Ribonucleotide reductase, (a) Subunit structure. The 
functions of the two regulatory sites are explained in Figure 22-42. Each 
active site contains two thiols and a group ( — XH) that can be converted 
to an active-site radical; this group is probably the — SH of Cys 439 , 
which functions as a thiyl radical, (b) The R2 subunits of E. coil ribonu- 
cleotide reductase (PDB ID 1 PFR). The Tyr residue that acts as the tyrosyl 
radical is shown in red; the binuclear iron center is orange, (c) The tyro- 
syl radical functions to generate the active-site radical ( — X*), which is 
used in the mechanism shown in Figure 22^1 . 
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MECHANISM FIGURE 22-41 
Proposed mechanism for ri- 
bonucleotide reductase. In the 

enzyme of E. coii and most eu- 
karyotes, the active thiol groups 
are on the R1 subunit; the ac- 
tive-site radical ( — X') is on the 
R2 subunit and in E . coii is prob- 
ably a thiyl radical of Cys 439 
(see Fig. 22-40). 
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FIGURE 22-42 Regulation of ribonucleotide reductase by deoxynu- 
cleoside triphosphates. The overall activity of the enzyme is affected 
by binding at the primary regulatory site (left). The substrate specificity 
of the enzyme is affected by the nature of the effector molecule bound 



at the second type of regulatory site, the substrate-specificity site 
(right). The diagram indicates inhibition or stimulation of enzyme ac- 
tivity with the four different substrates. The pathway from dUDP to 
dTTP is described later (see Figs 22-43, 22-44) 



Three classes of ribonucleotide reductase have been 
reported. Their mechanisms (where known) generally 
conform to the scheme in Figure 22^1 , but they differ in 
the identity of the group supplying the active-site radical 
and in the cofactors used to generate it. The E. coli en- 
zyme (class I) requires oxygen to regenerate the tyrosyl 
radical if it is quenched, so this enzyme functions only in 
an aerobic environment. Class II enzymes, found in other 
microorganisms, have 5'-deoxyadenosylcobalamin (see 
Box 17-2) rather than a binuclear iron center. Class III 
enzymes have evolved to function in an anaerobic envi- 
ronment. E. coli contains a separate class III ribonu- 
cleotide reductase when grown anaerobically; this 
enzyme contains an iron-sulfur cluster (structurally dis- 
tinct from the binuclear iron center of the class I enzyme) 
and requires NADPH and S-adenosylmethionine for ac- 
tivity. It uses nucleoside triphosphates rather than nucle- 
oside diphosphates as substrates. The evolution of 
different classes of ribonucleotide reductase for produc- 
tion of DNA precursors in different environments reflects 
the importance of this reaction in nucleotide metabolism. 

Regulation of E. coli ribonucleotide reductase is un- 
usual in that not only its activity but its substrate speci- 
ficity is regulated by the binding of effector molecules. 
Each Rl subunit has two types of regulatory site (Fig. 
22-40). One type affects overall enzyme activity and 
binds either ATP, which activates the enzyme, or dATP, 
which inactivates it. The second type alters substrate 
specificity in response to the effector molecule— ATP, 
dATP, dTTP, or dGTP— that is bound there (Fig. 22-42). 
When ATP or dATP is bound, reduction of UDP and CDP 
is favored. When dTTP or dGTP is bound, reduction of 
GDP or ADP, respectively, is stimulated. The scheme is 
designed to provide a balanced pool of precursors for 



DNA synthesis. ATP is also a general activator for 
biosynthesis and ribonucleotide reduction. The pres- 
ence of dATP in small amounts increases the reduction 
of pyrimidine nucleotides. An over supply of the pyrimi- 
dine dNTPs is signaled by high levels of dTTP, which 
shifts the specificity to favor reduction of GDP. High lev- 
els of dGTP, in turn, shift the specificity to ADP reduc- 
tion, and high levels of dATP shut the enzyme down. 
These effectors are thought to induce several distinct 
enzyme conformations with altered specificities. 

Thymidylate Is Derived from dCDP and dUMP 

DNA contains thymine rather than uracil, and the de 
novo pathway to thymine involves only deoxyribonu- 
cleotides. The immediate precursor of thymidylate 
(dTMP) is dUMP. In bacteria, the pathway to dUMP 
begins with formation of dUTP, either by deamination 
of dCTP or by phosphorylation of dUDP (Fig. 
22-43). The dUTP is converted to dUMP by a dUTP- 
ase. The latter reaction must be efficient to keep 
dUTP pools low and prevent incorporation of uridy- 
late into DNA. 

Conversion of dUMP to dTMP is catalyzed by 
thymidylate synthase. A one-carbon unit at the hydrox- 
ymethyl (— CH 2 OH) oxidation level (see Fig. 18-17) is 
transferred from A^,AT 10 -methylenetetrahydrofolate to 
dUMP, then reduced to a methyl group (Fig. 22-44). The 
reduction occurs at the expense of oxidation of tetrahy- 
drofolate to dihydrofolate, which is unusual in tetrahydro- 
folate-requiring reactions. (The mechanism of this 
reaction is shown in Fig. 22-50.) The dihydrofolate is re- 
duced to tetrahydrofolate by dihydrofolate reductase — 
a regeneration that is essential for the many processes that 
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FIGURE 22-43 Biosynthesis of thymidylate (dTMP). The pathways are 
shown beginning with the reaction catalyzed by ribonucleotide reduc- 
tase. Figure 22-44 gives details of the thymidylate synthase reaction. 



require tetrahydrofolate. In plants and at least one protist, 
thymidylate synthase and dihydrofolate reductase reside 
on a single bifunctional protein. 

About 10% of the human population (and up to 
50% of people in impoverished communities) suf- 
fers from folic acid deficiency. When the deficiency is se- 
vere, the symptoms can include heart disease, cancer, 
and some types of brain dysfunction. At least some of 
these symptoms arise from a reduction of thymidylate 
synthesis, leading to an abnormal incorporation of uracil 
into DNA. Uracil is recognized by DNA repair pathways 
(described in Chapter 25) and is cleaved from the DNA. 
The presence of high levels of uracil in DNA leads to 
strand breaks that can greatly affect the function and 
regulation of nuclear DNA, ultimately causing the ob- 
served effects on the heart and brain, as well as in- 
creased mutagenesis that leads to cancer. ■ 
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Degradation of Purines and Pyrimidines Produces 
Uric Acid and Urea, Respectively 

Purine nucleotides are degraded by a pathway in which 
they lose their phosphate through the action of 5 '-nu- 
cleotidase (Fig. 22-45). Adenylate yields adenosine, 
which is deaminated to inosine by adenosine deami- 
nase, and inosine is hydrolyzed to hypoxanthine (its 
purine base) and D-ribose. Hypoxanthine is oxidized suc- 
cessively to xanthine and then uric acid by xanthine ox- 
idase, a flavoenzyme with an atom of molybdenum and 
four iron-sulfur centers in its prosthetic group. Molecular 
oxygen is the electron acceptor in this complex reaction. 

GMP catabolism also yields uric acid as end product. 
GMP is first hydrolyzed to guanosine, which is then 
cleaved to free guanine. Guanine undergoes hydrolytic 
removal of its amino group to yield xanthine, which is 



converted to uric acid by xanthine oxidase (Fig. 22-45). 

Uric acid is the excreted end product of purine ca- 
tabolism in primates, birds, and some other animals. A 
healthy adult human excretes uric acid at a rate of about 
0.6 g/24 h; the excreted product arises in part from in- 
gested purines and in part from turnover of the purine 
nucleotides of nucleic acids. In most mammals and 
many other vertebrates, uric acid is further degraded to 
allantoin by the action of urate oxidase. In other or- 
ganisms the pathway is further extended, as shown in 
Figure 22-45. 

The pathways for degradation of pyrimidines gener- 
ally lead to NH4 production and thus to urea synthesis. 
Thymine, for example, is degraded to methylmalonylsemi- 
aldehyde (Fig. 22-46), an intermediate of valine catab- 
olism. It is further degraded through propionyl-CoA and 
methylmalonyl-CoA to succinyl-CoA (see Fig. 18-27). 



AMP 



p'-nuclootichu'c 1 




HN jp \ Hypoxanthine 
L / (keto form) 

N N 



HO^r g 



\ Xanthine 
(keto form) 



xanthin 



HN 
HO 



H 2 0 + 0 2 

0 

V 



OH 



H 



Uric acid 



0 

II 

ti l 



C— OH Uric acid 



Excreted by: 



Primates, birds, 
reptiles, insects 



unlit? uxirjuw 



i0 2 + H 2 0 
^C0 2 



o 



H 



NH 2 

C=0 Allantoin 

N h N 
H H 



Most mammals 



ulliHUoinjw 



r h 2 o 



NH 2 COO" NH 2 



L 



j#CL Allantoate 
H H 



Bony fishes 



;i|]jintrin - .'iHV 



^H 2 0 

v ^ COCT 
I 

CHO 

Glyoxylate 

? Urea 
2 H 2 N— C-NH 2 

^2H a O 

^ 2C0 2 
4NH , 



Amphibians, 
cartilaginous 
fishes 



Marine 
invertebrates 



FIGURE 22-45 Catabolism of purine nucleotides. Note that primates ex- 
crete much more nitrogen as urea via the urea cycle (Chapter 1 8) than as 



uric acid from purine degradation. Similarly, fish excrete much more 
nitrogen as NH4 than as urea produced by the pathway shown here. 
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lated in a sterile "bubble" environment. ADA deficiency 
was one of the first targets of human gene therapy trials 
(see Box 9-2). ■ 

Purine and Pyrimidine Bases Are Recycled 
by Salvage Pathways 

Free purine and pyrimidine bases are constantly released 
in cells during the metabolic degradation of nucleotides. 
Free purines are in large part salvaged and reused to 
make nucleotides, in a pathway much simpler than the de 
novo synthesis of purine nucleotides described earlier. 
One of the primary salvage pathways consists of a single 
reaction catalyzed by adenosine phosphoribosyltrans- 
ferase, in which free adenine reacts with PRPP to yield 
the corresponding adenine nucleotide: 



Adenine + PRPP 



AMP + PPi 



Free guanine and hypoxanthine (the deamination prod- 
uct of adenine; Fig. 22^15) are salvaged in the same way 
by hypoxanthine-guanine phosphoribosyltrans- 

f erase. A similar salvage pathway exists for pyrimidine 
bases in microorganisms, and possibly in mammals. 

A genetic lack of hypoxanthine-guanine phospho- 
ribosyltransferase activity, seen almost exclusively 
in male children, results in a bizarre set of symptoms 
called Lesch-Nyhan syndrome. Children with this ge- 
netic disorder, which becomes manifest by the age of 2 
years, are sometimes poorly coordinated and mentally re- 
tarded. In addition, they are extremely hostile and show 
compulsive self-destructive tendencies: they mutilate 
themselves by biting off their fingers, toes, and lips. 

The devastating effects of Lesch-Nyhan syndrome il- 
lustrate the importance of the salvage pathways. Hypox- 
anthine and guanine arise constantly from the breakdown 
of nucleic acids. In the absence of hypoxanthine-guanine 
phosphoribosyltransf erase, PRPP levels rise and purines 
are overproduced by the de novo pathway, resulting in 
high levels of uric acid production and goutlike damage to 
tissue (see below). The brain is especially dependent on 
the salvage pathways, and this may account for the cen- 
tral nervous system damage in children with Lesch-Ny- 
han syndrome. This syndrome was another target of early 
trials in gene therapy (see Box 9-2). ■ 



• ; - Genetic aberrations in human purine metabolism 
m>M have been found, some with serious conse- 
quences. For example, adenosine deaminase (ADA) 

deficiency leads to severe immunodeficiency disease 
in which T lymphocytes and B lymphocytes do not de- 
velop properly. Lack of ADA leads to a 100-fold increase 
in the cellular concentration of dATP, a strong inhibitor 
of ribonucleotide reductase (Fig. 22-42). High levels of 
dATP produce a general deficiency of other dNTPs in T 
lymphocytes. The basis for B-lymphocyte toxicity is 
less clear. Individuals with ADA deficiency lack an ef- 
fective immune system and do not survive unless iso- 



Excess Uric Acid Causes Gout 

R Long thought, erroneously, to be due to "high liv- 
mM ing," gout is a disease of the joints caused by an 
elevated concentration of uric acid in the blood and tis- 
sues. The joints become inflamed, painful, and arthritic, 
owing to the abnormal deposition of sodium urate crys- 
tals. The kidneys are also affected, as excess uric acid is 
deposited in the kidney tubules. Gout occurs predomi- 
nantly in males. Its precise cause is not known, but it of- 
ten involves an underexcretion of urate. A genetic 
deficiency of one or another enzyme of purine metabo- 
lism may also be a factor in some cases. 
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FIGURE 22-47 Allopurinol, an inhibitor of xanthine oxidase. Hypo- 
xanthine is the normal substrate of xanthine oxidase. Only a slight al- 
teration in the structure of hypoxanthine (shaded pink) yields the 
medically effective enzyme inhibitor allopurinol. At the active site, al- 
lopurinol is converted to oxypurinol, a strong competitive inhibitor 
that remains tightly bound to the reduced form of the enzyme. 

Gout is effectively treated by a combination of nu- 
tritional and drug therapies. Foods especially rich in 
nucleotides and nucleic acids, such as liver or glandular 
products, are withheld from the diet. Major alleviation of 
the symptoms is provided by the drug allopurinol 
(Fig. 22-47), which inhibits xanthine oxidase, the en- 
zyme that catalyzes the conversion of purines to uric 
acid. Allopurinol is a substrate of xanthine oxidase, 
which converts allopurinol to oxypurinol (alloxanthine). 
Oxypurinol inactivates the reduced form of the enzyme 
by remaining tightly bound in its active site^ When xan- 
thine oxidase is inhibited, the excreted products of 
purine metabolism are xanthine and hypoxanthine, 
which are more water-soluble than uric acid and less 
likely to form crystalline deposits. Allopurinol was de- 
veloped by Gertrude Elion and George Hitchings, who 
also developed acyclovir, used in treating people with 
genital and oral herpes infections, and other purine 
analogs used in cancer chemotherapy ■ 



Many Chemotherapeutic Agents Target Enzymes 
in the Nucleotide Biosynthetic Pathways 

■9 The growth of cancer cells is not controlled in the 
Kfl same way as cell growth in most normal tissues. 
Cancer cells have greater requirements for nucleotides 
as precursors of DNA and RNA, and consequently are 
generally more sensitive than normal cells to inhibitors 
of nucleotide biosynthesis. A growing array of important 
chemotherapeutic agents — for cancer and other dis- 
eases — act by inhibiting one or more enzymes in these 
pathways. We describe here several well-studied exam- 
ples that illustrate productive approaches to treatment 
and help us understand how these enzymes work. 

The first set of agents includes compounds that in- 
hibit glutamine amidotransferases. Recall that gluta- 
mine is a nitrogen donor in at least half a dozen separate 
reactions in nucleotide biosynthesis. The binding sites 
for glutamine and the mechanism by which NH4 is ex- 
tracted are quite similar in many of these enzymes. Most 
are strongly inhibited by glutamine analogs such as <i z el- 
se rine and acivicin (Fig. 22-48). Azaserine, charac- 
terized by John Buchanan in the 1950s, was one of the 
first examples of a mechanism-based enzyme inactivator 
(suicide inactivator; p. 204 and Box 22-3). Acivicin 
shows promise as a cancer chemotherapeutic agent. 

Other useful targets for pharmaceutical agents are 
thymidylate synthase and dihydrofolate reductase, en- 
zymes that provide the only cellular pathway for 
thymine synthesis (Fig. 22-49). One inhibitor that acts 
on thymidylate synthase, fluorouracil, is an important 
chemotherapeutic agent. Fluorouracil itself is not the 
enzyme inhibitor. In the cell, salvage pathways convert it 
to the deoxynucleoside monophosphate FdUMP, which 
binds to and inactivates the enzyme. Inhibition by 
FdUMP (Fig. 22-50) is a classic example of mecha- 
nism-based enzyme inactivation. Another prominent 
chemotherapeutic agent, methotrexate, is an inhibitor 
of dihydrofolate reductase. This folate analog acts as a 
competitive inhibitor; the enzyme binds methotrexate 
with about 100 times higher affinity than dihydrofolate. 
Aminopterin is a related compound that acts similarly. 
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FIGURE 22-48 Azaserine and acivicin, inhibitors of glutamine amido- 
transferases. These analogs of glutamine interfere in several amino 
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as targets of chemotherapy, (a) During thymidylate synthesis, 
N 5 ,N 10 -methylenetetrahydrofolate is converted to 7,8-dihydrofolate; 
the N 5 ,N 10 -methylenetetrahydrofolate is regenerated in two steps 
{see Fig. 22-44). This cycle is a major target of several chemothera- 
peutic agents, (b) Fluorouracil and methotrexate are important 
chemotherapeutic agents. In cells, fluorouracil is converted to FdUMP, 
which inhibits thymidylate synthase Methotrexate, a structural analog 
of tetrahydrofolate, inhibits dihydrofolate reductase; the shaded amino 
and methyl groups replace a carbonyl oxygen and a proton, respec- 
tively, in folate {see Fig. 22-44). Another important folate analog, 
aminopterin, is identical to methotrexate except that it lacks the 
shaded methyl group. Trimethoprim, a tight-binding inhibitor of bacte- 
rial dihydrofolate reductase, was developed as an antibiotic. 



MECHANISM FIGURE 22-50 Conversion of dUMP to dTMP and its in- 
hibition by FdUMP. The left side is the normal reaction mechanism of 
thymidylate synthase. The nucleophilic sulfhydryl group contributed 
by the enzyme in step ® and the ring atoms of dUMP taking part in 
the reaction are shown in red; iB denotes an amino acid side chain 
that acts as a base to abstract a proton after step (5) The hydrogens 
derived from the methylene group of N 5 ,N 10 -methylenetetrahydrofo- 
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The medical potential of inhibitors of nucleotide 
biosynthesis is not limited to cancer treatment. All 
fast-growing cells (including bacteria and protists) 
are potential targets. Trimethoprim, an antibiotic 
developed by Hitchings and Elion, binds to bacterial 
dihydrofolate reductase nearly 100,000 times better 
than to the mammalian enzyme. It is used to treat 
certain urinary and middle-ear bacterial infections. 
Parasitic protists, such as the trypanosomes that 
cause African sleeping sickness (African trypanoso- 
miasis), lack pathways for de novo nucleotide biosyn- 
thesis and are particularly sensitive to agents that 
interfere with their scavenging of nucleotides from 
the surrounding environment using salvage path- 
ways. Allopurinol (Fig. 22-47) and several similar 
purine analogs have shown promise for the treatment 
of African trypanosomiasis and related afflictions. 
See Box 22-3 for another approach to combating 
African trypanosomiasis, made possible by advances 
in our understanding of metabolism and enzyme 
mechanisms. ■ 
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Biosynthesis and Degradation 
of Nucleotides 



The purine ring system is built up step-by-step 
beginning with 5-phosphoribosylamine. The amino 
acids glut amine, glycine, and aspartate furnish all 
the nitrogen atoms of purines. Two ring-closure 
steps form the purine nucleus. 

Pyrimidines are synthesized from carbamoyl 
phosphate and aspartate, and ribose 5-phosphate 
is then attached to yield the pyrimidine 
ribonucleotides. 

Nucleoside monophosphates are converted to their 
triphosphates by enzymatic phosphorylation 
reactions. Ribonucleotides are converted to 
deoxyribonucleotides by ribonucleotide reductase, 
an enzyme with novel mechanistic and regulatory 
characteristics. The thymine nucleotides are 
derived from dCDP and dUMP. 

Uric acid and urea are the end products of purine 
and pyrimidine degradation. 

Free purines can be salvaged and rebuilt into 
nucleotides. Genetic deficiencies in certain 
salvage enzymes cause serious disorders such 
as Lesch-Nyhan syndrome and ADA deficiency. 

Accumulation of uric acid crystals in the joints, 
possibly caused by another genetic deficiency 
results in gout, 

Enzymes of the nucleotide biosynthetic pathways 
are targets for an array of chemotherapeutic agents 
used to treat cancer and other diseases. 
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Problems 

1. ATP Consumption by Root Nodules in Legumes Bac- 
teria residing in the root nodules of the pea plant consume 
more than 20% of the ATP produced by the plant. Suggest why 
these bacteria consume so much ATP. 

2. Glutamate Dehydrogenase and Protein Synthesis 

The bacterium Methylophilus methylotrophus can synthe- 
size protein from methanol and ammonia. Recombinant DNA 
techniques have improved the yield of protein by introducing 
into M. methylotrophus the glutamate dehydrogenase gene 
from E. coli. Why does this genetic manipulation increase the 
protein yield? 

3. PLP Reaction Mechanisms Pyridoxal phosphate can 
help catalyze transformations one or two carbons removed 
from the a carbon of an amino acid. The enzyme threonine 
synthase (see Fig. 22-15) promotes the PLP-dependent con- 
version of phosphohomoserine to threonine. Suggest a mecha- 
nism for this reaction. 

4. Transformation of Aspartate to Asparagine There are 
two routes for transforming aspartate to asparagine at the ex- 
pense of ATP. Many bacteria have an asparagine synthetase 
that uses ammonium ion as the nitrogen donor. Mammals have 
an asparagine synthetase that uses glutamine as the nitrogen 
donor, Given that the latter requires an extra ATP (for the syn- 
thesis of glutamine) , why do mammals use this route? 

5. Equation for the Synthesis of Aspartate from Glu- 
cose Write the net equation for the synthesis of aspartate (a 
nonessential amino acid) from glucose, carbon dioxide, and 
ammonia. 

6. Asparagine Synthetase Inhibitors in Leukemia 
Therapy Mammalian asparagine synthetase is a 
glutamine-dependent amidotransf erase. Efforts to identify an 
effective inhibitor of human asparagine synthetase for use in 
chemotherapy for patients with leukemia has focused not on 
the amino-terminal glutaminase domain but on the carboxyl- 
terminal synthetase active site. Explain why the glutaminase 
domain is not a promising target for a useful drug. 

7. Phenylalanine Hydroxylase Deficiency and Diet Tyro- 
sine is normally a nonessential amino acid, but individuals with 
a genetic defect in phenylalanine hydroxylase require tyrosine 
in their diet for normal growth. Explain, 

8. Cofactors for One-Carbon Transfer Reactions Most 
one-carbon transfers are promoted by one of three cofactors: 
bio tin, tetrahydrofolate, or 5-adenosylmethionine (Chapter 
18). S-Adenosylmethioiune is generally used as a methyl group 
donor; the transfer potential of the methyl group in A^-methyl- 
tetrahydrofolate is insufficient for most biosynthetic reactions. 
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However, one example of the use of A^-methyltetrahydrofolate 
in methyl group transfer is in methionine formation by the me- 
thionine synthase reaction (step (§) of Fig. 22-15); methionine 
is the immediate precursor of S-adenosylmethionine (see Fig. 
18-18). Explain how the methyl group of S-adenosylmethion- 
ine can be derived from A^-methyltetrahydrofolate, even 
though the transfer potential of the methyl group in A^-methyl- 
tetrahydrofolate is one one-thousandth of that in S-adenosyl- 
methionine. 

9. Concerted Regulation in Amino Acid Biosynthesis 

The glutamine synthetase of E coli is independently modulated 
by various products of glutamine metabolism (see Fig. 22-6) . In 
this concerted inhibition, the extent of enzyme inhibition is 
greater than the sum of the separate inhibitions caused by each 
product. For E coli grown in a medium rich in histidine, what 
would be the advantage of concerted inhibition? 

10. Relationship between Folic Acid Deficiency 
and Anemia Folic acid deficiency, believed to be the 
most common vitamin deficiency, causes a type of anemia in 
which hemoglobin synthesis is impaired and erythrocytes do 
not mature properly. What is the metabolic relationship be- 
tween hemoglobin synthesis and folic acid deficiency? 

11. Nucleotide Biosynthesis in Amino Acid Auxotrophic 
Bacteria Wild-type E coli cells can synthesize all 20 common 
amino acids, but some mutants, called amino acid auxotrophs, 
are unable to synthesize a specific amino acid and require its 
addition to the culture medium for optimal growth Besides 
their role in protein synthesis, some amino acids are also pre- 
cursors for other nitrogenous cell products. Consider the three 
amino acid auxotrophs that are unable to synthesize glycine, 
glutamine, and aspartate, respectively For each mutant, what 
nitrogenous products other than proteins would the cell fail to 
synthesize? 

12. Inhibitors of Nucleotide Biosynthesis Suggest mecha- 
nisms for the inhibition of (a) alanine racemase by L-fluoroalanine 
and (b) glutamine amidotransferases by azaserine. 

13. Mode of Action of Sulfa Drugs Some bacteria 
require p-aminobenzoate in the culture medium for 
normal growth, and their growth is severely inhibited by the 
addition of sulfanilamide, one of the earliest sulfa drugs. More- 
over, in the presence of this drug, 5-anunoimidazole-4-carbox- 
amide ribonucleotide (AICAR; see Fig. 22-33) accumulates in 
the culture medium. These effects are reversed by addition of 
excess p-aminobenzoate. 
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(a) What is the role of p-aminobenzoate in these bacteria? 
(Hint: See Fig. 18-16.) 

(b) Why does AICAR accumulate in the presence of sul- 
fanilamide? 

(c) Why are the inhibition and accumulation reversed by 
addition of excess p-aminobenzoate? 



14. Pathway of Carbon in Pyrimidine Biosynthesis Pre- 
dict the locations of 14 C in orotate isolated from cells grown on 



a small amount of uniformly labeled 
your prediction. 



C]succinate. Justify 



15. Nucleotides as Poor Sources of Energy Under starva- 
tion conditions, organisms can use proteins and amino acids as 
sources of energy. Deamination of amino acids produces carbon 
skeletons that can enter the glycolytic pathway and the citric 
acid cycle to produce energy in the form of ATP. Nucleotides, on 
the other hand, are not similarly degraded for use as energy- 
yielding fuels. What observations about cellular physiology sup- 
port this statement? What aspect of the structure of nucleotides 
makes them a relatively poor source of energy? 

16. Treatment of Gout Allopurinol (see Fig. 22-47), 
an inhibitor of xanthine oxidase, is used to treat 
chronic gout, Explain the biochemical basis for this treatment, 
Patients treated with allopurinol sometimes develop xanthine 
stones in the kidneys, although the incidence of kidney dam- 
age is much lower than in untreated gout. Explain this obser- 
vation in the light of the following solubilities in urine: uric 
acid, 0.15 g/L; xanthine, 0.05 g/L; and hypoxanthine, 1.4 g/L. 

17. Inhibition of Nucleotide Synthesis by Azaserine The 

diazo compound 0-(2-diazoacetyl)-L-serine, known also as 
azaserine (see Fig. 22^48), is a powerful inhibitor of glutamine 
amidotransferases. If growing cells are treated with azaserine, 
what intermediates of nucleotide biosynthesis will accumu- 
late? Explain. 



Data Analysis Problem 



18. Use of Modern Molecular Techniques to Determine 
the Synthetic Pathway of a Novel Amino Acid Most of the 
biosynthetic pathways described in this chapter were deter- 
mined before the development of recombinant DNA technology 
and genomics, so the techniques were quite different from 
those that researchers would use today Here we explore an ex- 
ample of the use of modern molecular techniques to investigate 
the pathway of synthesis of a novel amino acid, (2 i 5)-4-amino-2- 
hydroxybutyrate (AHBA). The techniques mentioned here are 
described in various places in the book; this problem is designed 
to show how they can be integrated in a comprehensive study. 

AHBA is a y-amino acid that is a component of some 
aminoglycoside antibiotics, including the antibiotic bu- 
tirosin. Antibiotics modified by the addition of an AHBA 
residue are often more resistant to inactivation by bacterial 
antibiotic-resistance enzymes. As a result, understanding 
how AHBA is synthesized and added to antibiotics is useful 
in the design of pharmaceuticals. 

In an article published in 2005, Li and coworkers 
describe how they determined the synthetic pathway of 
AHBA from glutamate. 
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(a) Briefly describe the chemical transformations needed 
to convert glutamate to AHBA. At this point, don't be con- 
cerned about the order of the reactions. 

Li and colleagues began by cloning the butirosin biosynthetic 
gene cluster from the bacterium Bacillus circulans, which 
makes large quantities of butirosin. They identified five genes that 
are essential for the pathway: btrl, btrJ, btrK, btrO, and btrV. 
They cloned these genes into E. coli plasmids that allow overex- 
pression of the genes, producing proteins with "histidine tags" 
(see p. 314) fused to their amino termini to facilitate purification, 

The predicted amino acid sequence of the Btrl protein 
showed strong homology to known acyl carrier proteins (see 
Fig. 21-5). Using mass spectrometry (see Box 3-2), Li and col- 
leagues found a molecular mass of 11,812 for the purified Btrl 
protein (including the His tag) . When the purified Btrl was in- 
cubated with coenzyme A and an enzyme known to attach CoA 
to other acyl carrier proteins, the majority molecular species 
had anM r of 12,153. 

(b) How would you use these data to argue that Btrl can 
function as an acyl carrier protein with a CoA prosthetic group? 

Using standard terminology, Li and coauthors called the 
form of the protein lacking CoA apo-Btrl and the form with CoA 
(linked as in Fig. 21-5) holo-Btrl. When holo-Btrl was incu- 
bated with glutamine, ATP, and purified BtrJ protein, the holo- 
Btrl species of M T 12,153 was replaced with a species of M T 
12,281, corresponding to the thioester of glutamate and holo- 
Btrl. Based on these data, the authors proposed the following 
structure for the M r 12,281 species (y-glutamyl-S-Btrl): 



NH3 




Btrl 

7-Glutamyl-S-Btrl 



Problems [899] 

(c) What other structure (s) is (are) consistent with the 
data above? 

(d) Li and coauthors argued that the structure shown 
here (y-glutamyl-S-Btrl) is likely to be correct because the a- 
carboxyl group must be removed at some point in the syn- 
thetic process. Explain the chemical basis of this argument. 
(Hint: See Fig. 18-6c.) 

The BtrK protein showed significant homology to PLP-de- 
pendent amino acid decarboxylases, and BtrK isolated fromi? 
coli was found to contain tightly bound PLP. When y-glutamyl- 
S-Btrl was incubated with purified BtrK, a molecular species 
of M T 12,240 was produced. 

(e) What is the most likely structure of this species? 

(f) Interestingly, when the investigators incubated gluta- 
mate and ATP with purified Btrl, BtrJ, and BtrK, they found a 
molecular species of M T 12,370. What is the most likely struc- 
ture of this species? Hint: Remember that BtrJ can use ATP to 
y-glutamylate nucleophilic groups. 

Li and colleagues found that BtrO is homologous to 
monooxygenase enzymes (see Box 21-1) that hydroxylate 
alkanes, using FMN as a cofactor, and BtrV is homologous to 
an NAD(P)H oxidoreductase. Two other genes in the cluster, 
btrG and MrH, probably encode enzymes that remove the 
y-glutamyl group and attach AHBA to the target antibiotic 
molecule. 

(g) Based on these data, propose a plausible pathway for 
the synthesis of AHBA and its addition to the target antibiotic. 
Include the enzymes that catalyze each step and any other 
substrates or cofactors needed (ATP, NAD, etc.). 

Reference 

Li, Y., Llewellyn, N.M., Giri, R., Huang, F., & Spencer, J.B. 

(2005) Biosynthesis of the unique amino acid side chain of bu- 
tirosin: possible protective-group chemistry in an acyl carrier pro- 
tein-mediated pathway Chem Biol 12, 665-675. 



We recognize that each tissue and, more generally, each cell of the or- 
ganism secretes . . . special products or ferments into the blood which 
thereby influence all the other cells thus integrated with each other by 
a mechanism other than the nervous system. 

— Charles Edouard Brown-Sequard and /. d'Arsonval, article 
in Comptes Rendus de la Societe de Biologie, 1891 
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I n Chapters 13 through 22 we have discussed metabo- 
I lism at the level of the individual cell, emphasizing 
I central pathways common to almost all cells, bacter- 
ial, archaeal, and eukaryotic. We have seen how meta- 
bolic processes within cells are regulated at the level of 
individual enzyme reactions, by substrate availability, by 
allosteric mechanisms, and by phosphorylation or other 
covalent modifications of enzymes. 

To appreciate fully the significance of individual meta- 
bolic pathways and their regulation, we must view these 
pathways in the context of the whole organism. An essen- 
tial characteristic of multicellular organisms is cell differ- 
entiation and division of labor. The specialized functions of 
the tissues and organs of complex organisms such as hu- 
mans impose characteristic fuel requirements and patterns 
of metabolism. Hormonal signals integrate and coordinate 
the metabolic activities of different tissues and optimize 
the allocation of fuels and precursors to each organ. 

In this chapter we focus on mammals, looking at the 
specialized metabolism of several major organs and tis- 
sues and the integration of metabolism in the whole or- 
ganism. We begin by examining the broad range of 



hormones and hormonal mechanisms, then turn to the 
tissue-specific functions regulated by these mecha- 
nisms. We discuss the distribution of nutrients to vari- 
ous organs — emphasizing the central role played by the 
liver — and the metabolic cooperation among these or- 
gans. To illustrate the integrative role of hormones, we 
describe the interplay of insulin, glucagon, and epineph- 
rine in coordinating fuel metabolism in muscle, liver, and 
adipose tissue. The metabolic disturbances in diabetes 
further illustrate the importance of hormonal regulation 
of metabolism. We discuss the long-term hormonal reg- 
ulation of body mass and, finally, the role of obesity in 
development of the metabolic syndrome and diabetes. 

23.1 Hormones: Diverse Structures for 
Diverse Functions 

Virtually every process in a complex organism is regu- 
lated by one or more hormones: maintenance of blood 
pressure, blood volume, and electrolyte balance; em- 
bryogenesis; sexual differentiation, development, and 
reproduction; hunger, eating behavior, digestion, and 
fuel allocation — to name but a few. We examine here the 
methods for detecting and measuring hormones and 
their interaction with receptors, and consider a repre- 
sentative selection of hormone types. 

The coordination of metabolism in mammals is 
achieved by the neuroendocrine system. Individual 
cells in one tissue sense a change in the organism's cir- 
cumstances and respond by secreting a chemical mes- 
senger that passes to another cell in the same or 
different tissue, where the messenger binds to a recep- 
tor molecule and triggers a change in this second cell. In 

[90!] 
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neuronal signaling (Fig. 23-la), the chemical messenger 
(neurotransmitter; acetylcholine, for example) may travel 
only a fraction of a micrometer, across the synaptic cleft to 
the next neuron in a network. In hormonal signaling, the 
messengers — hormones — are carried in the bloodstream 
to neighboring cells or to distant organs and tissues; they 
may travel a meter or more before encountering their 
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(b) Endocrine signaling 

FIGURE 23-1 Signaling by the neuroendocrine system, (a) In neuronal 
signaling, electrical signals (nerve impulses) originate in the cell body 
of a neuron and travel very rapidly over long distances to the axon tip, 
where neurotransmitters are released and diffuse to the target cell. The 
target cell (another neuron, a myocyte, or a secretory cell) is only a 
fraction of a micrometer or a few micrometers away from the site of 
neurotransmitter release, (b) In the endocrine system, hormones are 
secreted into the bloodstream, which carries them throughout the body to 
target tissues that may be a meter or more away from the secreting cell. 
Both neurotransmitters and hormones interact with specific receptors 
on or in their target cells, triggering responses. 



target cell (Fig. 23-lb). Except for this anatomic differ- 
ence, these two chemical signaling mechanisms are re- 
markably similar. Epinephrine and norepinephrine, for 
example, serve as neurotransmitters at certain synapses 
of the brain and neuromuscular junctions of smooth mus- 
cle, and as hormones that regulate fuel metabolism in liver 
and muscle. The following discussion of cellular signaling 
emphasizes hormone action, drawing on discussions of 
fuel metabolism in earlier chapters, but most of the funda- 
mental mechanisms described here also occur in neuro- 
transmitter action. 

The Detection and Purification of Hormones 
Requires a Bioassay 

How is a hormone detected and isolated? First, re- 
searchers find that a physiological process in one tis- 
sue depends on a signal that originates in another 
tissue. Insulin, for example, was first recognized as a 
substance that is produced in the pancreas and affects 
the volume and composition of urine (Box 23-1). 
Once a physiological effect of the putative hormone is 
discovered, a quantitative bioassay for the hormone 
can be developed. In the case of insulin, the assay con- 
sisted of injecting extracts of pancreas (a crude 
source of insulin) into experimental animals deficient 
in insulin, then quantifying the resulting changes in 
glucose concentration in blood and urine. To isolate a 
hormone, the biochemist fractionates extracts con- 
taining the putative hormone, with the same tech- 
niques used to purify other biomolecules (solvent 
fractionation, chromatography, and electrophoresis), 
and then assays each fraction for hormone activity. 
Once the chemical has been purified, its composition 
and structure can be determined. 

This protocol for hormone characterization is de- 
ceptively simple. Hormones are extremely potent and 
are produced in very small amounts. Obtaining suffi- 
cient hormone to allow its chemical characterization 
often involves biochemical isolations on a heroic scale. 
When Andrew Schally and Roger Guillemin independ- 
ently purified and characterized thyrotropin-releasing 
hormone (TRH) from the hypothalamus, Schally's 
group processed about 20 tons of hypothalamus from 
nearly two million sheep, and Guillemin's group ex- 
tracted the hypothalamus from about a million pigs! 
TRH proved to be a simple derivative of the tripeptide 
Glu-His-Pro (Fig. 23-2). Once the structure of the 
hormone was known, it could be chemically synthe- 
sized in large quantities for use in physiological and 
biochemical studies. 

For their work on hypothalamic hormones, Schally 
and Guillemin shared the Nobel Prize in Physiology or 
Medicine in 1977, along with Rosalyn Yalow, who (with 
Solomon A. Berson) developed the extraordinarily sen- 
sitive radioimmunoassay (RIA) for peptide hormones 
and used it to study hormone action. RIA revolutionized 
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BOX 23-1 




How Is a Hormone Discovered? The Arduous Path to 
Purified Insulin 




Millions of people with type 1 diabetes mellitus inject 
themselves daily with pure insulin to compensate for the 
lack of production of this critical hormone by their own 
pancreatic /3 cells. Insulin injection is not a cure for dia- 
betes, but it allows people who otherwise would have 
died young to lead long and productive lives. The dis- 
covery of insulin, which began with an accidental obser- 
vation, illustrates the combination of serendipity and 
careful experimentation that led to the discovery of 
many of the hormones. 

In 1889, Oskar Minkowski, a young assistant at 
the Medical College of Strasbourg, and Josef von Mer- 
ing, at the Hoppe-Seyler Institute in Strasbourg, had a 
friendly disagreement about whether the pancreas, 
known to contain lipases, was important in fat diges- 
tion in dogs. To resolve the issue, they began an ex- 
periment on the digestion of fats. They surgically 
removed the pancreas from a dog, but before their ex- 
periment got any farther, Minkowski noticed that the 
dog was now producing far more urine than normal (a 
common symptom of untreated diabetes). Also, the 
dog's urine had glucose levels far above normal (an- 
other symptom of diabetes). These findings sug- 
gested that lack of some pancreatic product caused 
diabetes. 

Minkowski tried unsuccessfully to prepare an ex- 
tract of dog pancreas that would reverse the effect of re- 
moving the pancreas — that is, would lower the urinary 
or blood glucose levels. We now know that insulin is a 
protein, and that the pancreas is very rich in proteases 
(trypsin and chymotrypsin) , normally released directly 
into the small intestine to aid in digestion. These pro- 
teases doubtless degraded the insulin in the pancreatic 
extracts in Minkowski's experiments. 

Despite considerable effort, no significant progress 
was made in the isolation or characterization of the 
"antidiabetic factor" until the summer of 1921, when 
Frederick G. Banting, a young scientist working in the 



laboratory of J. J. R. MacLeod at the University of 
Toronto, and a student assistant, Charles Best, took up 
the problem. By that time, several lines of evidence 
pointed to a group of specialized cells in the pancreas 
(the islets of Langerhans; see Fig. 23-27) as the source 
of the antidiabetic factor, which came to be called in- 
sulin (from Latin insula, "island"). 

Taking precautions to prevent proteolysis, Banting 
and Best (later aided by biochemist J. B. Collip) suc- 
ceeded in December 1921 in preparing a purified pan- 
creatic extract that cured the symptoms of experimental 
diabetes in dogs. On January 25, 1922 (just one month 
later!), their insulin preparation was injected into 
Leonard Thompson, a 14-year-old boy severely ill with 
diabetes mellitus. Within days, the levels of ketone 
bodies and glucose in Thompson's urine dropped dra- 
matically; the extract saved his life. In 1923, Banting 
and MacLeod won the Nobel Prize for their isolation of 
insulin. Banting immediately announced that he would 
share his prize with Best; MacLeod shared his with 
Collip. 

By 1923, pharmaceutical companies were supply- 
ing thousands of patients throughout the world with 
insulin extracted from porcine pancreas. With the de- 
velopment of genetic engineering techniques in the 
1980s (Chapter 9), it became possible to produce un- 
limited quantities of human insulin by inserting the 
cloned human gene for insulin into a microorganism, 
which was then cultured on an industrial scale. Some 
patients with diabetes are now fitted with implanted 
insulin pumps, which release adjustable amounts of 
insulin on demand to meet changing needs at meal 
times and during exercise. There is a reasonable 
prospect that, in the future, transplantation of pan- 
creatic tissue will provide diabetic patients with a 
source of insulin that responds as well as normal pan- 
creas, releasing insulin into the bloodstream only 
when blood glucose rises. 
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FIGURE 23-2 The structure of thyrotropin-releasing hormone (TRH). 

Purified (by heroic efforts) from extracts of hypothalamus, TRH proved 
to be a derivative of the tripeptide Glu-His-Pro. The side-chain car- 
boxyl group of the amino-terminal Glu forms an amide (red bond) 
with the residue's a-amino group, creating pyroglutamate, and the 
carboxyl group of the carboxyl-terminal Pro is converted to an amide 
(red — NH 2 ). Such modifications are common among the small pep- 
tide hormones. In a typical protein of M r -50,000, the charges on the 
amino- and carboxyl-terminal groups contribute relatively little to the 
overall charge on the molecule, but in a tripeptide these two charges 
dominate the properties of the molecule. Formation of the amide 
derivatives removes these charges, 



hormone research by making 
possible the rapid, quantitative, 
and specific measurement of 
hormones in minute amounts. 

Hormone-specific antibod- 
ies are the key to the radio-im- 
munoassay. Purified hormone, 
injected into rabbits, elicits an- 
tibodies that bind to the hor- 
mone with very high affinity 
and specificity. When a con- 
stant amount of isolated anti- 
body is incubated with a fixed 
amount of the radioactively la- 
beled hormone, a certain frac- 
tion of the radioactive hormone 
binds to the antibody (Fig. 
23-3). If, in addition to the ra- 
diolabeled hormone, unlabeled 
hormone is also present, the un- 
labeled hormone competes with 
and displaces some of the la- 
beled hormone from its binding 
site on the antibody. This bind- 
ing competition can be quanti- 
fied by reference to a standard 
curve obtained with known 
amounts of unlabeled hormone. 
The degree to which labeled hor- 
mone is displaced from antibody 
is a measure of the amount of 
(unlabeled) hormone in a sam- 
ple of blood or tissue extract. By 
using very highly radioactive 
hormone, researchers can make 
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FIGURE 23-3 Radioimmunoassay (RIA). (a) A low concentration of 
radiolabeled hormone (red) is incubated with (T) a fixed amount of anti- 
body specific for that hormone or @ a fixed amount of antibody and 
various concentrations of unlabeled hormone (blue). In the latter case, 
unlabeled hormone competes with labeled hormone for binding to the 
antibody; the amount of labeled hormone bound varies inversely with 
the concentration of unlabeled hormone present, (b) A radioim- 
munoassay for adrenocorticotropic hormone (ACTH; also called 
corticotropin) A standard curve of the ratio [bound]/[unbound] radio- 
labeled ACTH vs. [unlabeled ACTH added] (on a logarithmic scale) is 
constructed and used to determine the amount of (unlabeled) ACTH in 
an unknown sample. If an aliquot containing an unknown quantity of 
unlabeled hormone gives, say, a value of 0.4 for the ratio [bound]/[un- 
bound] (see arrow), the aliquot must contain about 20 pg of ACTH. 



the assay sensitive to picograms of hormone in a sample. A 
newer variation of this technique, enzyme-linked im- 
munosorbent assay (ELISA), is illustrated in Figure 5-26b. 

Hormones Act through Specific High-Affinity 
Cellular Receptors 

As we saw in Chapter 12, all hormones act through highly 
specific receptors in hormone-sensitive target cells, to 
which the hormones bind with high affinity (see 



23.1 Hormones: Diverse Structures for Diverse Functions [jwsj 



Fig. 12-la). Each cell type has its own combination of hor- 
mone receptors, which define the range of its hormone re- 
sponsiveness. Moreover, two cell types with the same type 
of receptor may have different intracellular targets of hor- 
mone action and thus may respond differently to the same 
hormone. The specificity of hormone action results from 
structural complementarity between the hormone and its 
receptor; this interaction is extremely selective, so struc- 
turally similar hormones can have different effects. The 
high affinity of the interaction allows cells to respond to 
very low concentrations of hormone. In the design of 
drugs intended to intervene in hormonal regulation, we 
need to know the relative specificity and affinity of the drug 
and the natural hormone. Recall that hormone-receptor in- 
teractions can be quantified by Scatchard analysis (see 
Box 12-1), which, under favorable conditions, yields a 
quantitative measure of affinity (the dissociation constant 
for the complex) and the number of hormone-binding sites 
in a preparation of receptor. 

The locus of the encounter between hormone and 
receptor may be extracellular, cytosolic, or nuclear, 
depending on the hormone type. The intracellular 
consequences of hormone-receptor interaction are of at 
least six general types: (1) a second messenger (such as 
cAMP or inositol trisphosphate) generated inside the cell 
acts as an allosteric regulator of one or more enzymes; 
(2) a receptor tyrosine kinase is activated by the extra- 
cellular hormone; (3) a receptor guanylyl cyclase is acti- 
vated and produces the second messenger cGMP; (4) a 
change in membrane potential results from the opening 
or closing of a hormone-gated ion channel; (5) an adhe- 
sion receptor on the cell surface interacts with molecules 
in the extracellular matrix and conveys information to 
the cytoskeleton; or (6) a steroid or steroidlike molecule 
causes a change in the level of expression (transcription 
of DNA into mRNA) of one or more genes, mediated by a 
nuclear hormone receptor protein (see Fig. 12-2). 

Water-soluble peptide and amine hormones (insulin 
and epinephrine, for example) act extracellularly by 
binding to cell surface receptors that span the plasma 
membrane (Fig. 23-4). When the hormone binds to its 
extracellular domain, the receptor undergoes a confor- 
mational change analogous to that produced in an al- 
losteric enzyme by binding of an effector molecule. The 
conformational change triggers the downstream effects 
of the hormone, 

A single hormone molecule, in forming a hormone- 
receptor complex, activates a catalyst that produces 
many molecules of second messenger, so the receptor 
serves not only as a signal transducer but also as a signal 
amplifier. The signal may be further amplified by a signal- 
ing cascade, a series of steps in which a catalyst activates 
a catalyst, resulting in very large amplifications of the 
original signal. A cascade of this type occurs in the regu- 
lation of glycogen synthesis and breakdown by 
epinephrine (see Fig. 12-7) . Epinephrine activates 
(through its receptor) adenylyl cyclase, which produces 
many molecules of cAMP for each molecule of receptor- 
bound hormone, Cyclic AMP in turn activates cAMP- 
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FIGURE 23-4 Two general mechanisms of hormone action. The pep- 
tide and amine hormones are faster acting than steroid and thyroid 
hormones. 

dependent protein kinase (protein kinase A), which acti- 
vates glycogen phosphorylase b kinase, which activates 
glycogen phosphorylase b. The result is signal amplifica- 
tion: one epinephrine molecule causes the production of 
many thousands of molecules of glucose 1 -phosphate 
from glycogen. 

Water-insoluble hormones (steroid, retinoid, and 
thyroid hormones) readily pass through the plasma 
membrane of their target cells to reach their receptor 
proteins in the nucleus (Fig. 23-4). With this class of 
hormones, the hormone-receptor complex itself carries 
the message; it interacts with DNA to alter the expres- 
sion of specific genes, changing the enzyme complement 
of the cell and thereby changing cellular metabolism 
(see Fig. 12-29). 

Hormones that act through plasma membrane re- 
ceptors generally trigger very rapid physiological or bio- 
chemical responses. Just seconds after the adrenal 
medulla secretes epinephrine into the bloodstream, 
skeletal muscle responds by accelerating the breakdown 
of glycogen. By contrast, the thyroid hormones and the 
sex (steroid) hormones promote maximal responses in 
their target tissues only after hours or even days. These 
differences in response time correspond to different 
modes of action. In general, the fast-acting hormones 
lead to a change in the activity of one or more preexist- 
ing enzymes in the cell, by allosteric mechanisms or 
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covalent modification. The slower-acting hormones gen- 
erally alter gene expression, resulting in the synthesis of 
more (upregulation) or less (downregulation) of the 
regulated protein(s). 

Hormones Are Chemically Diverse 

Mammals have several classes of hormones, distinguish- 
able by their chemical structures and their modes of 
action (Table 23-1). Peptide, amine, and eicosanoid 
hormones act from outside the target cell via surface re- 
ceptors. Steroid, vitamin D, retinoid, and thyroid hor- 
mones enter the cell and act through nuclear receptors. 
Nitric oxide also enters the cell, but activates a cytosolic 
enzyme, guanylyl cyclase (see Fig. 12-20). 

Hormones can also be classified by the way they get 
from their point of release to their target tissue. En- 
docrine (from the Greek endon, "within," and krinein, 
"to release") hormones are released into the blood and 
carried to target cells throughout the body (insulin and 
glucagon are examples). Paracrine hormones are re- 
leased into the extracellular space and diffuse to neigh- 
boring target cells (the eicosanoid hormones are of this 
type). Autocrine hormones affect the same cell that re- 
leases them, binding to receptors on the cell surface. 

Mammals are hardly unique in possessing hormonal 
signaling systems. Insects and nematode worms have 
highly developed systems for hormonal regulation, with 
fundamental mechanisms similar to those in mammals. 
Plants, too, use hormonal signals to coordinate the ac- 
tivities of their tissues (Chapter 12). The study of hor- 
mone action is not as advanced in plants as in animals, 
but we do know that some mechanisms are shared. 

To illustrate the structural diversity and range of ac- 
tion of mammalian hormones, we consider representa- 
tive examples of each major class listed in Table 23-1. 

Peptide Hormones Peptide hormones may have 
from 3 to 200 or more amino acid residues. They in- 
clude the pancreatic hormones insulin, glucagon, and 



somatostatin; the parathyroid hormone calcitonin; and 
all the hormones of the hypothalamus and pituitary (de- 
scribed below) . These hormones are synthesized on ribo- 
somes in the form of longer precursor proteins 
(prohormones), then packaged into secretory vesicles 
and proteolytically cleaved to form the active peptides. 
Insulin is a small protein (M T 5,800) with two polypeptide 
chains, A and B, joined by two disulfide bonds. It is syn- 
thesized in the pancreas as an inactive single-chain pre- 
cursor, preproinsulin (Fig. 23-5), with an ammo-terminal 
"signal sequence" that directs its passage into secretory 
vesicles. (Signal sequences are discussed in Chapter 27; 
see Fig. 27-38.) Proteolytic removal of the signal se- 
quence and formation of three disulfide bonds produces 
proinsulin, which is stored in secretory granules in pan- 
creatic jS cells. When blood glucose is elevated sufficiently 
to trigger insulin secretion, proinsulin is converted to ac- 
tive insulin by specific proteases, which cleave two pep- 
tide bonds to form the mature insulin molecule, 

In some cases, prohormone proteins, rather than 
yielding a single peptide hormone, produce several active 
hormones. Pro-opiomelanocortin (POMC) is a spectacu- 
lar example of multiple hormones encoded by a single 
gene. The POMC gene encodes a large polypeptide that is 
progressively carved up into at least nine biologically ac- 
tive peptides (Fig. 23-6). In many peptide hormones the 
terminal residues are modified, as in TRH (Fig. 23-2). 

The concentration of peptide hormones in secretory 
granules is so high that the vesicle contents are virtually 
crystalline; when the contents are released by exocyto- 
sis, a large amount of hormone is released suddenly. The 
capillaries that serve peptide-producing endocrine 
glands are fenestrated (and thus permeable to pep- 
tides), so the hormone molecules readily enter the 
bloodstream for transport to target cells elsewhere. As 
noted earlier, all peptide hormones act by binding to re- 
ceptors in the plasma membrane. They cause the gener- 
ation of a second messenger in the cytosol, which 
changes the activity of an intracellular enzyme, thereby 
altering the cell's metabolism. 



TABLE 23-1 



T*pe 



Cla sses of Hormones 
Example 



Synthetic path 



Mode of action 



Peptide 

Catecholamine 
Eicosanoid 

Steroid 
Vitamin D 
Retinoid 
Thyroid 
Nitric oxide 



Insulin, glucagon 

Epinephrine 
PGE X 

Testosterone 

1 ,25-Dihydroxycholecalciferol 
Retinoic acid 
Triiodothyronine (T 3 ) 
Nitric oxide 



Proteolytic processing of 
prohormone 

From tyrosine 

From arachidonate 
(20:4 fatty acid) 

From cholesterol 

From cholesterol 

From vitamin A 

From Tyr in thyroglobulin 

From arginine + 0 2 



Plasma membrane receptors; second 
messengers 



Nuclear receptors; transcriptional 
regulation 



Cytosolic receptor (guanylyl cyclase) 
and second messenger (cGMP) 
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Preproinsulin 



Proinsulin 



Mature 
insulin 




COO 



FIGURE 23-5 Insulin. Mature insulin is formed from 
its larger precursor preproinsulin by proteolytic pro- 
cessing. Removal of a 23 amino acid segment (the 
signal sequence) at the amino terminus of preproin- 
sulin and formation of three disulfide bonds produces 
proinsulin. Further proteolytic cuts remove the C pep- 
tide from proinsulin to produce mature insulin, com- 
posed of A and B chains. The amino acid sequence of 
bovine insulin is shown in Figure 3-24. 



Pro-opiomelanocortin (POMC) gene 
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FIGURE 23-6 Proteolytic processing of the pro-opiomelanocortin 
(POMC) precursor. The initial gene product of the POMC gene is 
a long polypeptide that undergoes cleavage by a series of specific 
proteases to produce ACTH, j8- and y-lipotropin, a-, j3-, and y- 
MSH (melanocyte-stimulating hormone, or melanocortin), CLIP 
(corticotropin-like intermediary peptide), j3-endorphin, and Met- 
enkephalin. The points of cleavage are paired basic residues, 
Arg-Lys, Lys-Arg, or Lys-Lys. 



ACTH jS-Lipotropin 
a-MSH CLIP y-Lipotropin p -Endorphin 

4 4 




p-MSB. Met-enkephalin 



Catecholamine Hormones The water-soluble com- 
pounds epinephrine (adrenaline) and norepineph- 
rine (noradrenaline) are catecholamines, named for 
the structurally related compound catechol. They are 
synthesized from tyrosine. 

Tyrosine > L-Dopa > Dopamine > 

Norepinephrine > Epinephrine 

Catecholamines produced in the brain and in other neural 
tissues function as neurotransmitters, but epinephrine 



and norepinephrine are also hormones, synthesized and 
secreted by the adrenal glands. Like the peptide hor- 
mones, catecholamines are highly concentrated in secre- 
tory vesicles and released by exocytosis, and they act 
through surface receptors to generate intracellular sec- 
ond messengers. They mediate a wide variety of physio- 
logical responses to acute stress (see Table 23-6). 

Eicosanoid Hormones The eicosanoid hormones 
(prostaglandins, thromboxanes, and leukotrienes) are 
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derived from the 20-carbon polyunsaturated fatty acid 
arachidonate. 

Phospholipids 



Arachidonate 
(20:4) 




Prostaglandins Thromboxanes Leukotrienes 

Unlike the hormones described above, they are not syn- 
thesized in advance and stored; they are produced, 
when needed, from arachidonate enzymatically released 
from membrane phospholipids by phospholipase A 2 . 
The enzymes of the pathway leading to prostaglandins 
and thromboxanes (see Fig. 21-15) are very widely dis- 
tributed in mammalian tissues; most cells can produce 
these hormone signals, and cells of many tissues can 
respond to them through specific plasma membrane 
receptors. The eicosanoid hormones are paracrine hor- 
mones, secreted into the interstitial fluid (not primarily 
into the blood) and acting on nearby cells. 

Prostaglandins promote the contraction of 
smooth muscle, including that of the intestine 
and uterus (and can therefore be used medically to in- 
duce labor). They also mediate pain and inflammation in 
all tissues. Many antiinflammatory drugs act by inhibit- 
ing steps in the prostaglandin synthetic pathway (see 
Fig. 21-15). Thromboxanes regulate platelet function 
and therefore blood clotting. Leukotrienes LTC 4 and 
LTD 4 act through plasma membrane receptors to stimu- 
late contraction of smooth muscle in the intestine, pul- 
monary airways, and trachea. They are mediators of 
anaphylaxis, a severe, detrimental immune response. ■ 

Steroid Hormones The steroid hormones (adreno- 
cortical hormones and sex hormones) are synthesized 
from cholesterol in several endocrine tissues. 

Cholesterol 



Progesterone 



Cortisol 
(glucocorticoid) 



Testosterone 



Aldosterone 
(mineralocorticoid) 



Estradiol 
(sex hormones) 



They travel to their target cells through the blood- 
stream, bound to carrier proteins. More than 50 corti- 
costeroid hormones are produced in the adrenal cortex 
by reactions that remove the side chain from the D ring 
of cholesterol and introduce oxygen to form keto and 



hydroxyl groups. Many of these reactions involve cy- 
tochrome P-450 enzymes (see Box 21-1). The steroid 
hormones are of two general types. Glucocorticoids 
(such as Cortisol) primarily affect the metabolism of car- 
bohydrates; mineralocorticoids (such as aldosterone) 
regulate the concentrations of electrolytes in the blood. 
Androgens (testosterone) and estrogens (such as estra- 
diol; see Fig. 10-19) are synthesized in the testes and 
ovaries. Their synthesis also involves cytochrome P-450 
enzymes that cleave the side chain of cholesterol and in- 
troduce oxygen atoms. These hormones affect sexual 
development, sexual behavior, and a variety of other re- 
productive and nonreproductive functions. 

All steroid hormones act through nuclear recep- 
tors to change the level of expression of specific genes 
(p. 456). They can also have more rapid effects, prob- 
ably mediated by receptors in the plasma membrane. 

Vitamin D Hormone Calcitriol (1 ,25-dihydroxy- 
cholecalciferol) is produced from vitamin D by enzyme- 
catalyzed hydroxylation in the liver and kidneys (see 
Fig. 10-20a). Vitamin D is obtained in the diet or by 
photolysis of 7-dehydrocholesterol in skin exposed to 
sunlight. 

7-Dehydrocholesterol 

UV light 



Vitamin D 3 
(cholecalciferol) 



25-Hydroxycholecalciferol 



1 ,25-Dihydroxycholecalciferol 

Calcitriol works in concert with parathyroid hor- 
mone in Ca 2+ homeostasis, regulating [Ca 2+ ] in 
the blood and the balance between Ca 2+ deposition and 
Ca 2+ mobilization from bone. Acting through nuclear re- 
ceptors, calcitriol activates the synthesis of an intestinal 
Ca 2+ -binding protein essential for uptake of dietary Ca 2+ . 
Inadequate dietary vitamin D or defects in the biosynthesis 
of calcitriol result in serious diseases such as rickets, in 
which bones are weak and malformed (see Fig. 10-20b). ■ 

Retinoid Hormones Retinoids are potent hormones 
that regulate the growth, survival, and differentiation of 
cells via nuclear retinoid receptors. The prohormone 
retinol is synthesized from /3-carotene, primarily in liver 
(see Fig. 10-21), and many tissues convert retinol to the 
hormone retinoic acid (RA). 

/3-Carotene 



Vitamin A t 
(retinol) 



Retinoic acid 
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All tissues are retinoid targets, as all cell types 
have at least one form of nuclear retinoid recep- 
tor. In adults, the most significant targets include 
cornea, skin, epithelia of the lungs and trachea, and the 
immune system. RA regulates the synthesis of proteins 
essential for growth or differentiation. Excessive vita- 
min A can cause birth defects, and pregnant women are 
advised not to use the retinoid creams that have been 
developed for treatment of severe acne. ■ 

Thyroid Hormones The thyroid hormones T 4 (thy- 
roxine) and T 3 (triiodothyronine) are synthesized from 
the precursor protein thyroglobulin (M T 660,000). Up to 
20 Tyr residues in thyroglobulin are enzymatically iodi- 
nated in the thyroid gland, then two iodotyrosine 
residues condense to form the precursor to thyroxine. 
When needed, thyroxine is released by proteolysis. Con- 
densation of monoiodotyrosine with diiodothyronine 
produces T 3 , which is also an active hormone released 
by proteolysis. 

Thyroglobulin - Tyr 



Thyroglobulin - Tyr - 1 
(iodinated Tyr residues) 

I proteolysis 

Thyroxine (T 4 ), 
triiodothyronine (T 3 ) 

The thyroid hormones act through nuclear receptors to 
stimulate energy-yielding metabolism, especially in liver 
and muscle, by increasing the expression of genes en- 
coding key catabolic enzymes. 

Nitric Oxide (NO) Nitric oxide is a relatively stable 
free radical synthesized from molecular oxygen and the 
guanidinium nitrogen of arginine (see Fig. 22-31) in a 
reaction catalyzed by NO synthase. 

Arginine + l\ NADPH + 20 2 ► 

NO + citrulline + 2H 2 0 + l\ NADP + 

This enzyme is found in many tissues and cell types: neu- 
rons, macrophages, hepatocytes, myocytes of smooth 
muscle, endothelial cells of the blood vessels, and ep- 
ithelial cells of the kidney NO acts near its point of re- 
lease, entering the target cell and activating the cytosolic 
enzyme guanylyl cyclase, which catalyzes the formation 
of the second messenger cGMP (see Fig. 12-20). 

Hormone Release Is Regulated by a Hierarchy of Neuronal 
and Hormonal Signals 

The changing levels of specific hormones regulate spe- 
cific cellular processes, but what regulates the level of 
each hormone? The brief answer is that the central 
nervous system receives input from many internal and 
external sensors — signals about danger, hunger, dietary 
intake, blood composition and pressure, for example — 
and orchestrates the production of appropriate hormonal 



signals by the endocrine tissues. For a more complete 
answer, we must look at the hormone-producing 
systems of the human body and some of their functional 
interrelationships. 

Figure 23-7 shows the anatomic location of the 
major endocrine glands in humans, and Figure 23-8 
represents the "chain of command" in the hormonal sig- 
naling hierarchy. The hypothalamus, a small region of 
the brain (Fig. 23-9), is the coordination center of the 
endocrine system; it receives and integrates messages 
from the central nervous system. In response to these 
messages, the hypothalamus produces regulatory hor- 
mones (releasing factors) that pass directly to the 
nearby pituitary gland, through special blood vessels 
and neurons that connect the two glands (Fig. 23-9b). 
The pituitary gland has two functionally distinct parts. 
The posterior pituitary contains the axonal endings of 
many neurons that originate in the hypothalamus. 
These neurons produce the short peptide hormones 
oxytocin and vasopressin (Fig. 23-10), which move 
down the axon to the nerve endings in the pituitary, 
where they are stored in secretory granules to await the 
signal for their release. 

The anterior pituitary responds to hypothalamic 
hormones carried in the blood, producing tropic 
hormones, or tropins (from the Greek tropos, "turn"). 
These relatively long polypeptides activate the next 
rank of endocrine glands (Fig. 23-8) , which includes the 
adrenal cortex, thyroid gland, ovaries, and testes. These 
glands in turn secrete their specific hormones, which 
are carried in the bloodstream to the target tissues. For 
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FIGURE 23-7 The major endocrine glands. The glands are shaded pink. 
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FIGURE 23-8 The major endocrine systems and their target 
tissues. Signals originating in the central nervous system (top) 
pass via a series of relays to the ultimate target tissues (bottom). 
In addition to the systems shown, the thymus, pineal gland, and 
groups of cells in the gastrointestinal tract also secrete hormones. 
Dashed lines represent neuronal connections. 
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example, corticotropin-releasing hormone from the hy- 
pothalamus stimulates the anterior pituitary to release 
ACTH, which travels to the zona fasciculata of the adre- 
nal cortex and triggers the release of Cortisol. Cortisol, 
the ultimate hormone in this cascade, acts through its 



receptor in many types of target cells to alter their 
metabolism. In hepatocytes, one effect of Cortisol is to 
increase the rate of gluconeogenesis. 

Hormonal cascades such as those responsible for 
the release of Cortisol and epinephrine result in large 
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FIGURE 23-9 Neuroendocrine origins of hormone signals, (a) Location of the hy- 
pothalamus and pituitary gland, (b) Details of the hypothalamus-pituitary system. 
Signals from connecting neurons stimulate the hypothalamus to secrete releasing 
factors into a blood vessel that carries the hormones directly to a capillary net- 
work in the anterior pituitary. In response to each hypothalamic releasing factor, 
the anterior pituitary releases the appropriate hormone into the general circula- 
tion. Posterior pituitary hormones are synthesized in neurons arising in the hypo- 
thalamus, transported along axons to nerve endings in the posterior pituitary, and 
stored there until released into the blood in response to a neuronal signal 
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FIGURE 23-10 Two hormones of the posterior pituitary gland. The 

carboxyl-terminal residue of both peptides is glycinamide, 
— NH— CH 2 — CONH 2 {as noted in Fig. 23-2, amidation of the car- 
boxyl terminus is common in short peptide hormones). These two hor- 
mones, identical in all but two residues (shaded), have very different 
biological effects, Oxytocin acts on the smooth muscle of the uterus 
and mammary gland, causing uterine contractions during labor and 
promoting milk release during lactation. Vasopressin (also called an- 
tidiuretic hormone) increases water reabsorption in the kidney and 
promotes the constriction of blood vessels, thereby increasing blood 
pressure. 

amplifications of the initial signal and allow exquisite 
fine-tuning of the output of the ultimate hormone 
(Fig. 23-11). At each level in the cascade, a small 
signal elicits a larger response. For example, the ini- 
tial electrical signal to the hypothalamus results in the 
release of a few nanograms of corticotropin-releasing 
hormone, which elicits the release of a few micrograms 
of corticotropin. Corticotropin acts on the adrenal cor- 
tex to cause the release of milligrams of Cortisol, for an 
overall amplification of at least a millionfold. 

At each level of a hormonal cascade, feedback inhi- 
bition of earlier steps in the cascade is possible; an un- 
necessarily elevated level of the ultimate hormone or of 
an intermediate hormone inhibits the release of earlier 
hormones in the cascade. These feedback mechanisms 
accomplish the same end as those that limit the output 
of a biosynthetic pathway (compare Fig. 23-11 with 
Fig. 6-33): a product is synthesized (or released) only 
until the necessary concentration is reached. 
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Hormones: Diverse 
Structures for Diverse 
Functions 



Hormones are chemical messengers secreted by 
certain tissues into the blood or interstitial fluid, 
serving to regulate the activity of other cells or 
tissues. 
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FIGURE 23-1 1 Cascade of hormone release following central nervous 
system input to the hypothalamus. In each endocrine tissue aiong the 
pathway, a stimulus from the level above is received, amplified, and 
transduced into the release of the next hormone in the cascade. The 
cascade is sensitive to regulation at several levels through feedback 
inhibition by the ultimate hormone (in this case, Cortisol). The product 
therefore regulates its own production, as in feedback inhibition of 
biosynthetic pathways within a single cell. 

■ Radioimmunoassay and ELISA are two very 
sensitive techniques for detecting and quantifying 
hormones. 

■ Peptide, amine, and eicosanoid hormones act 
outside the target cell on specific receptors in the 
plasma membrane, altering the level of an 
intracellular second messenger. 
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■ Steroid, vitamin D, retinoid, and thyroid hormones 
enter target cells and alter gene expression by 
interacting with specific nuclear receptors. 

■ Hormonal cascades, in which catalysts activate 
catalysts, amplify the initial stimulus by several 
orders of magnitude, often in a very short time 
(seconds). 

■ Nerve impulses stimulate the hypothalamus to send 
specific hormones to the pituitary gland, thus 
stimulating (or inhibiting) the release of tropic 
hormones. The anterior pituitary hormones in turn 
stimulate other endocrine glands (thyroid, adrenals, 
pancreas) to secrete their characteristic hormones, 
which in turn stimulate specific target tissues. 

23.2 Tissue-Specific Metabolism: 
The Division of Labor 

Each tissue of the human body has a specialized func- 
tion, reflected in its anatomy and metabolic activity 
(Fig. 23-12). Skeletal muscle allows directed motion; 
adipose tissue stores and releases energy in the form of 



fats, which serve as fuel throughout the body; in the brain, 
cells pump ions across their plasma membranes to pro- 
duce electrical signals. The liver plays a central processing 
and distributing role in metabolism and furnishes all other 
organs and tissues with an appropriate mix of nutrients via 
the bloodstream. The functional centrality of the liver is 
indicated by the common reference to all other tissues 
and organs as "extrahepatic" or "peripheral." We therefore 
begin our discussion of the division of metabolic labor by 
considering the transformations of carbohydrates, amino 
acids, and fats in the mammalian liver. This is followed by 
brief descriptions of the primary metabolic functions of 
adipose tissue, muscle, brain, and the medium that inter- 
connects all others: the blood. 

The Liver Processes and Distributes Nutrients 

During digestion in mammals, the three main classes of 
nutrients (carbohydrates, proteins, and fats) undergo 
enzymatic hydrolysis into their simple constituents. 
This breakdown is necessary because the epithelial cells 
lining the intestinal lumen absorb only relatively small 
molecules. Many of the fatty acids and monoacylglyc- 
erols released by digestion of fats in the intestine are 
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FIGURE 23-12 Specialized metabolic functions of mammalian tissues. 
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reassembled within these epithelial cells into triacyl- 
glycerols (TAGs). 

After being absorbed, most sugars and amino acids 
and some reconstituted TAGs pass from intestinal ep- 
ithelial cells into blood capillaries, and travel in the 
bloodstream to the liver; the remaining TAGs enter adi- 
pose tissue via the lymphatic system. The portal vein is 
a direct route from the digestive organs to the liver, and 
liver therefore has first access to ingested nutrients. The 
liver has two main cell types. Kupffer cells are phago- 
cytes, important in immune function. Hepatocytes, of 
primary interest here, transform dietary nutrients into 
the fuels and precursors required by other tissues, and 
export them via the blood. The kinds and amounts of nu- 
trients supplied to the liver vary with several factors, in- 
cluding the diet and the time between meals. The 
demand of extrahepatic tissues for fuels and precursors 
varies among organs and with the level of activity and 
overall nutritional state of the individual. 



To meet these changing circumstances, the liver has 
remarkable metabolic flexibility. For example, when the 
diet is rich in protein, hepatocytes supply themselves 
with high levels of enzymes for amino acid catabolism and 
gluconeogenesis. Within hours after a shift to a high- 
carbohydrate diet, the levels of these enzymes begin to 
drop and the hepatocytes increase their synthesis of en- 
zymes essential to carbohydrate metabolism and fat syn- 
thesis. Liver enzymes turn over (are synthesized and 
degraded) at 5 to 10 times the rate of enzyme turnover in 
other tissues, such as muscle. Extrahepatic tissues also 
can adjust their metabolism to prevailing conditions, but 
none is as adaptable as the liver, and none is so central to 
the organism's overall metabolism. What follows is a 
survey of the possible fates of sugars, amino acids, and 
lipids that enter the liver from the bloodstream. To help 
you recall the metabolic transformations discussed here, 
Table 23-2 shows the major pathways and processes and 
indicates by figure number where each pathway is 



ifffTy?g/ Pathways of Carbohydrate, Amino Acid, and Fat Metabolism Illustrated in Earlier Chapter: 
Pathway Figure reference 



Citric acid cycle: acetyl-CoA > 2C0 2 

Oxidative phosphorylation: ATP synthesis 
Carbohydrate catabolism 

Glycogenolysis: glycogen > glucose 1-phosphate > blood glucose 

Hexose entry into glycolysis: fructose, mannose, galactose > glucose 6-phosphate 

Glycolysis: glucose > pyruvate 

Pyruvate dehydrogenase reaction: pyruvate > acetyl-CoA 

Lactic acid fermentation: glucose > lactate + 2ATP 

Pentose phosphate pathway: glucose 6-phosphate > pentose phosphates + NADPH 

Carbohydrate anabolism 

Gluconeogenesis: citric acid cycle intermediates > glucose 

Glucose-alanine cycle: glucose > pyruvate > alanine > glucose 

Glycogen synthesis: glucose 6-phosphate > glucose 1-phosphate > glycogen 

Amino acid and nucleotide metabolism 

Amino acid degradation: amino acids > acetyl-CoA, citric acid cycle intermediates 

Amino acid synthesis 

Urea cycle: NH 3 > urea 

Glucose-alanine cycle: alanine > glucose 

Nucleotide synthesis: amino acids > purines, pyrimidines 

Hormone and neurotransmitter synthesis 
Fat catabolism 

p Oxidation of fatty acids: fatty acid ► acetyl-CoA 

Oxidation of ketone bodies: /3-hydroxybutyrate > acetyl-CoA > C0 2 via citric acid cycle 

Fat anabolism 

Fatty acid synthesis: acetyl-CoA > fatty acids 

Triacylglycerol synthesis: acetyl-CoA > fatty acids > triacylglycerol 

Ketone body formation: acetyl-CoA ► acetoacetate, /3-hydroxybutyrate 

Cholesterol and cholesteryl ester synthesis: acetyl-CoA > cholesterol > cholesteryl esters 

Phospholipid synthesis: fatty acids > phospholipids 
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presented in detail. Here, we provide summaries of the 
pathways, referring to the numbered pathways and reac- 
tions in Figures 23-13 to 23-15. 

Sugars The glucose transporter of hepatocytes 
(GLUT2) is so effective that the concentration of glu- 
cose in a hepatocyte is essentially the same as that in the 
blood. Glucose entering hepatocytes is phosphorylated 
by hexokinase IV (glucokinase) to yield glucose 6-phos- 
phate. Glucokinase has a much higher K m for glucose 
(10 mM) than do the hexokinase isozymes in other cells 
(p. 584) and, unlike these other isozymes, it is not inhib- 
ited by its product, glucose 6-phosphate. The presence 
of glucokinase allows hepatocytes to continue phospho- 
rylating glucose when the glucose concentration rises 
well above levels that would overwhelm other hexoki- 
nases. The high K m of glucokinase also ensures that the 
phosphorylation of glucose in hepatocytes is minimal 
when the glucose concentration is low, preventing the 
liver from consuming glucose as fuel via glycolysis. This 
spares glucose for other tissues. Fructose, galactose, 
and mannose, all absorbed from the small intestine, are 
also converted to glucose 6-phosphate by enzymatic 
pathways examined in Chapter 14. Glucose 6-phosphate 
is at the crossroads of carbohydrate metabolism in the 
liver. It may take any of several major metabolic routes 
(Fig. 23-13), depending on the current metabolic 




FIGURE 23-13 Metabolic pathways for glucose 6-phosphate in the 
liver. Here and in Figures 23-14 and 23-15, anabolic pathways are 
generally shown leading upward, cataboiic pathways leading down- 
ward, and distribution to other organs horizontally. The numbered 
processes in each figure are described in the text. 



needs of the organism. By the action of various alloster- 
ically regulated enzymes, and through hormonal regula- 
tion of enzyme synthesis and activity, the liver directs 
the flow of glucose into one or more of these pathways. 

(J) Glucose 6-phosphate is dephosphorylated by 
glucose 6-phosphatase to yield free glucose (see 
Fig. 15-28), which is exported to replenish blood glu- 
cose. Export is the predominant pathway when glucose 
6-phosphate is in limited supply, because the blood 
glucose concentration must be kept sufficiently high 
(4 mM) to provide adequate energy for the brain and 
other tissues. (2) Glucose 6-phosphate not immediately 
needed to form blood glucose is converted to liver glyco- 
gen, or has one of several other fates. Following glycoly- 
sis and the pyruvate dehydrogenase reaction, ® the 
acetyl-CoA so formed can be oxidized for energy produc- 
tion by the citric acid cycle, with ensuing electron trans- 
fer and oxidative phosphorylation yielding ATP. 
(Normally, however, fatty acids are the preferred fuel for 
energy production in hepatocytes.) (4) Acetyl-CoA can 
also serve as the precursor of fatty acids, which are in- 
corporated into TAGs and phospholipids, and of choles- 
terol. Much of the lipid synthesized in the liver is 
transported to other tissues by blood lipoproteins. © Al- 
ternatively, glucose 6-phosphate can enter the pentose 
phosphate pathway, yielding both reducing power 
(NADPH) , needed for the biosynthesis of fatty acids and 
cholesterol, and D-ribose 5-phosphate, a precursor for 
nucleotide biosynthesis. NADPH is also an essential co- 
factor in the detoxification and elimination of many 
drugs and other xenobiotics metabolized in the liver. 

Amino Acids Amino acids that enter the liver follow 
several important metabolic routes (Fig, 2:3-14). 
CD They axe precursors for protein synthesis, a process 
discussed in Chapter 27. The liver constantly renews its 
own proteins, which have a relatively high turnover rate 
(average half-life of hours to days), and is also the site of 
biosynthesis of most plasma proteins. (2) Alternatively, 
amino acids pass in the bloodstream to other organs, to 
be used in the synthesis of tissue proteins. (3) Other 
amino acids are precursors in the biosynthesis of nu- 
cleotides, hormones, and other nitrogenous compounds 
in the liver and other tissues. 

@ Amino acids not needed as biosynthetic precur- 
sors are transaminated or deaminated and degraded to 
yield pyruvate and citric acid cycle intermediates, with 
various fates; @ the ammonia released is converted to 
the excretory product urea. (5) Pyruvate can be con- 
verted to glucose and glycogen via gluconeogenesis, or 
(6) it can be converted to acetyl-CoA, which has several 
possible fates: (7) oxidation via the citric acid cycle and 
(8) oxidative phosphorylation to produce ATP, or © con- 
version to lipids for storage. @ Citric acid cycle inter- 
mediates can be siphoned off into glucose synthesis by 
gluconeogenesis. 

The liver also metabolizes amino acids that arrive in- 
termittently from other tissues. The blood is adequately 
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FIGURE 23-14 Metabolism of amino acids in the liver. 



supplied with glucose just after the digestion and ab- 
sorption of dietary carbohydrate or, between meals, by 
the conversion of liver glycogen to blood glucose. During 
the interval between meals, especially if prolonged, some 
muscle protein is degraded to amino acids. These amino 
acids donate their amino groups (by transamination) to 
pyruvate, the product of glycolysis, to yield alanine, 
which @ is transported to the liver and deaminated. 
Hepatocytes convert the resulting pyruvate to blood glu- 
cose (via gluconeogenesis (§)), and the ammonia to urea 
for excretion @. One benefit of this glucose-alanine cy- 
cle (see Fig. 18-9) is the smoothing out of fluctuations in 
blood glucose between meals. The amino acid deficit in- 
curred in muscles is made up after the next meal by in- 
coming dietary amino acids. 

Lipids The fatty acid components of lipids entering he- 
patocytes also have several different fates (Fig. 23-15). 
(T) Some are converted to liver lipids. (2) Under most cir- 
cumstances, fatty acids are the primary oxidative fuel in the 
liver. Free fatty acids may be activated and oxidized to yield 
acetyl-CoA and NADH. (3) The acetyl-CoA is further oxi- 
dized via the citric acid cycle, and @ oxidations in the cycle 
drive the synthesis of ATP by oxidative phosphorylation. 
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FIGURE 23-15 Metabolism of fatty acids in the liver. 
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(5) Excess acetyl-CoA, not required by the liver, is con- 
verted to acetoacetate and /3-hydroxybutyrate; these ke- 
tone bodies circulate in the blood to other tissues, to be 
used as fuel for the citric acid cycle. Ketone bodies may 
be regarded as a transport form of acetyl groups. They 
can supply a significant fraction of the energy in some ex- 
trahepatic tissues — up to one-third in the heart, and as 
much as 60% to 70% in the brain during prolonged fast- 
ing. (6) Some of the acetyl-CoA derived from fatty acids 
(and from glucose) is used for the biosynthesis of choles- 
terol, which is required for membrane synthesis. Choles- 
terol is also the precursor of all steroid hormones and of 
the bile salts, which are essential for the digestion and 
absorption of lipids. 

The other two metabolic fates of lipids involve spe- 
cialized mechanisms for the transport of insoluble lipids 
in blood. ® Fatty acids are converted to the phospho- 
lipids and TAGs of plasma lipoproteins, which carry 
lipids to adipose tissue for storage as TAGs. (8) Some 
free fatty acids are bound to serum albumin and carried 
to the heart and skeletal muscles, which take up and 
oxidize free fatty acids as a major fuel Serum albumin is 
the most abundant plasma protein; one molecule can 
carry up to 10 molecules of free fatty acid. 

The liver thus serves as the body's distribution center, 
exporting nutrients in the correct proportions to other 
organs, smoothing out fluctuations in metabolism caused 
by intermittent food intake, and processing excess amino 
groups into urea and other products to be disposed of by 
the kidneys. Certain nutrients are stored in the liver, in- 
cluding Fe ions and vitamin A. The liver also detoxifies 
foreign organic compounds, such as drugs, food additives, 
preservatives, and other possibly harmful agents with no 
food value. Detoxification often involves the cytochrome 
P-450-dependent hydroxylation of relatively insoluble 
organic compounds, making them sufficiently soluble for 
further breakdown and excretion (see Box 21-1). 



Adipose Tissues Store and Supply Fatty Acids 

There are two distinct types of adipose tissue, white and 
brown, with quite distinct roles, and we focus first on 
the more abundant of the two. White adipose tissue 
(WAT) (Fig. 23-16a) is amorphous and widely distrib- 
uted in the body: under the skin, around the deep blood 
vessels, and in the abdominal cavity. The adipocytes 
of WAT are large (diameter 30 to 70 /jlto), spherical 
cells, completely filled with a single large lipid (TAG) 
droplet that constitutes about 65% of the cell mass and 
squeezes the mitochondria and nucleus into a thin layer 
against the plasma membrane (Fig. 23-1 6b). In humans, 
WAT typically makes up about 15% of the mass of a 
healthy young adult. The adipocytes are metabolically very 
active, responding quickly to hormonal stimuli in a meta- 
bolic interplay with the liver, skeletal muscles, and heart. 

Like other cell types, adipocytes have an active gly- 
colytic metabolism, oxidize pyruvate and fatty acids via 
the citric acid cycle, and carry out oxidative phosphoryla- 
tion. During periods of high carbohydrate intake, adipose 
tissue can convert glucose (via pyruvate and acetyl-CoA) 
to fatty acids, convert the fatty acids to TAGs, and store 
the TAGs as large fat globules — although, in humans, 
much of the fatty acid synthesis occurs in hepatocytes. 
Adipocytes store TAGs arriving from the liver (carried in 
the blood as VLDLs; see Fig. 21-40a) and from the intes- 
tinal tract (carried in chylomicrons), particularly after 
meals rich in fat. 

When the demand for fuel rises, lipases in 
adipocytes hydrolyze stored TAGs to release free fatty 
acids, which can travel in the bloodstream to skeletal 
muscle and the heart. The release of fatty acids from 
adipocytes is greatly accelerated by epinephrine, which 
stimulates the cAMP-dependent phosphorylation of per- 
ilipin and thus gives the hormone-sensitive lipase access 
to TAGs in the lipid droplet (see Fig. 17-3). Hormone- 
sensitive lipase is also stimulated by phosphorylation, 
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FIGURE 23-16 Adipocytes of white and brown adipose tissue. 

(a) Colorized scanning electron micrograph of human adipocytes in 
white adipose tissue (WAT). In fat tissues, capillaries and collagen 
fibers form a supporting network around spherical adipocytes. Almost 
the entire volume of each of these metabolically active cells is taken up 
by a fat droplet, (b) A typical adipocyte from WAT and (c) an adipocyte 
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from brown adipose tissue (BAT). In BAT cells, mitochondria are much 
more prominent, the nucleus is near the center of the cell, and multi- 
ple fat droplets are present White adipocytes are larger and contain a 
single huge lipid droplet, which squeezes the mitochondria and nu- 
cleus against the plasma membrane. 
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but this is not the main cause of increased lipolysis. 
Insulin counterbalances this effect of epinephrine, 
decreasing the activity of the lipase. 

The breakdown and synthesis of TAGs in adipose 
tissue constitute a substrate cycle; up to 70% of the 
fatty acids released by hormone-sensitive lipase are 
reesterified in adipocytes, re-forming TAGs. Recall from 
Chapter 15 that such substrate cycles allow fine regula- 
tion of the rate and direction of flow of intermediates 
through a bidirectional pathway. In adipose tissue, glyc- 
erol liberated by hormone-sensitive lipase cannot be 
reused in the synthesis of TAGs, because adipocytes 
lack glycerol kinase. Instead, the glycerol phosphate 
required for TAG synthesis is made from pyruvate by 
glyceroneogenesis, involving the cytosolic PEP car- 
boxykinase (see Fig. 21-22). 

In addition to its central function as a fuel depot, 
adipose tissue plays an important role as an endocrine 
organ, producing and releasing hormones that signal the 
state of energy reserves and coordinate metabolism of 
fats and carbohydrates throughout the body. We return 
to this function later in the chapter as we discuss the 
hormonal regulation of body mass. 

Brown Adipose Tissue Is Thermogenic 

In small vertebrates and hibernating animals, a signifi- 
cant proportion of the adipose tissue is brown adipose 
tissue (BAT), distinguished from WAT by its smaller 
(diameter 20 to 40 jxia), differently shaped (polygonal, 
not round) adipocytes (Fig. 23-16c). Like white 
adipocytes, brown adipocytes store triacylglycerols, but 
in several smaller lipid droplets per cell rather than as a 
single central droplet. BAT cells have more mitochondria 
and a richer supply of capillaries than WAT cells, and it is 
the cytochromes of mitochondria and the hemoglobin in 
capillaries that give BAT its characteristic brown color. 
A unique feature of brown adipocytes is their strong 
expression of the gene UNCI, which encodes ther- 
mogenic the mitochondrial uncoupling protein (see 
Fig. 19-34). Thermogenin activity is responsible for the 
principal function of BAT: thermogenesis. 

In brown adipocytes, fatty acids stored in lipid 
droplets are released, enter mitochondria, and un- 
dergo complete conversion to C0 2 via /3 oxidation and 
the citric acid cycle. The reduced FADH 2 and NADH 
so generated pass their electrons through the respira- 
tory chain to molecular oxygen. In WAT, protons 
pumped out of the mitochondria during electron 
transfer reenter the matrix through ATP synthase, with 
the energy of electron transfer conserved in ATP syn- 
thesis. In BAT, thermogenin provides an alternative 
route for protons to reenter the matrix that bypasses 
ATP synthase; the energy of the proton gradient is 
thus dissipated as heat, which can maintain the body 
(especially the nervous system and viscera) at its op- 
timal temperature when the ambient temperature is 
relatively low. 




FIGURE 23-17 Distribution of brown adipose tissue in a newborn 
infant. At birth, human infants have brown fat distributed as shown 
here, to protect the major blood vessels and the internal organs. This 
brown fat recedes over time, so that an adult has no major reserves of 
brown adipose. 

In the human fetus, differentiation of fibroblast 
"preadipocytes" into BAT begins at the twentieth week 
of gestation, and at the time of birth BAT represents 1% 
of total body weight. The brown fat deposits are located 
where the heat generated by thermogenesis can ensure 
that vital tissues— blood vessels to the head, major ab- 
dominal blood vessels, and the viscera, including the 
pancreas, adrenal glands, and kidneys — are not chilled 
as the newborn enters a world of lower ambient temper- 
ature (Fig. 23-17). 

At birth, WAT development begins and BAT begins 
to disappear. By adulthood humans have no discrete 
deposits of BAT, although brown adipocytes remain 
scattered throughout the WAT, making up about 1% of 
all adipocytes. Adults also have preadipocytes that can 
be induced to differentiate into BAT during adapta- 
tion to chronic cold exposure. Humans with pheochro- 
mocytoma (tumors of the adrenal gland) overproduce 
epinephrine and norepinephrine, and one effect is dif- 
ferentiation of preadipocytes into discrete regions of 
BAT, localized roughly as in newborns. In the induced 
adaptation to chronic cold, and in the normal differen- 
tiation of WAT and BAT, the nuclear transcription factor 
PPARy (described later in the chapter) plays a central 
role. 
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Muscles Use ATP for Mechanical Work 

Metabolism in the cells of skeletal muscle — myocytes — 
is specialized to generate ATP as the immediate source of 
energy for contraction. Moreover, skeletal muscle is 
adapted to do its mechanical work in an intermittent fash- 
ion, on demand. Sometimes skeletal muscles must work 
at their maximum capacity for a short time, as in a 100 m 
sprint; at other times more prolonged work is required, as 
in running a marathon or in extended physical labor. 

There are two general classes of muscle tissue, 
which differ in physiological role and fuel utilization. 
Slow-twitch muscle, also called red muscle, provides 
relatively low tension but is highly resistant to fatigue. It 
produces ATP by the relatively slow but steady process 
of oxidative phosphorylation. Red muscle is very rich in 
mitochondria and is served by very dense networks of 
blood vessels, which bring the oxygen essential to ATP 
production. Fast-twitch muscle, or white muscle, has 
fewer mitochondria than red muscle and is less well sup- 
plied with blood vessels, but it can develop greater ten- 
sion, and do so faster. White muscle is quicker to fatigue 
because when active, it uses ATP faster than it can re- 
place it. There is a genetic component to the proportion 
of red and white muscle in any individual; with training, 
the endurance of fast-twitch muscle can be improved. 

Skeletal muscle can use free fatty acids, ketone bod- 
ies, or glucose as fuel, depending on the degree of mus- 
cular activity (Fig. 23-18). In resting muscle, the 
primary fuels are free fatty acids from adipose tissue 
and ketone bodies from the liver. These are oxidized and 
degraded to yield acetyl-CoA, which enters the citric 
acid cycle for oxidation to C0 2 . The ensuing transfer of 
electrons to 0 2 provides the energy for ATP synthesis by 
oxidative phosphorylation. Moderately active muscle 
uses blood glucose in addition to fatty acids and ketone 
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FIGURE 23-18 Energy sources for muscle contraction. Different fuels 
are used for ATP synthesis during bursts of heavy activity and during 
light activity or rest. Phosphocreatine can rapidly supply ATP 



bodies. The glucose is phosphorylated, then degraded 
by glycolysis to pyruvate, which is converted to acetyl- 
CoA and oxidized via the citric acid cycle and oxidative 
phosphorylation. 

In maximally active fast-twitch muscles, the de- 
mand for ATP is so great that the blood flow cannot pro- 
vide 0 2 and fuels fast enough to supply sufficient ATP by 
aerobic respiration alone. Under these conditions, 
stored muscle glycogen is broken down to lactate by fer- 
mentation (p. 530). Each glucose unit degraded yields 
three ATP, because phosphorolysis of glycogen pro- 
duces glucose 6-phosphate (via glucose 1 -phosphate), 
sparing the ATP normally consumed in the hexokinase 
reaction. Lactic acid fermentation thus responds more 
quickly than oxidative phosphorylation to an increased 
need for ATP, supplementing basal ATP production by 
aerobic oxidation of other fuels via the citric acid cycle 
and respiratory chain. The use of blood glucose and 
muscle glycogen as fuels for muscular activity is greatly 
enhanced by the secretion of epinephrine, which stimu- 
lates both the release of glucose from liver glycogen and 
the breakdown of glycogen in muscle tissue. 

The relatively small amount of glycogen (about 1% 
of the total weight of skeletal muscle) limits the amount 
of glycolytic energy available during all-out exertion. 
Moreover, the accumulation of lactate and consequent 
decrease in pH in maximally active muscles reduces 
their efficiency. Skeletal muscle, however, contains 
another source of ATP, phosphocreatine (10 to 30 mM), 
which can rapidly regenerate ATP from ADP by the cre- 
atine kinase reaction: 
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During periods of active contraction and glycolysis, this 
reaction proceeds predominantly in the direction of 
ATP synthesis; during recovery from exertion, the same 
enzyme resynthesizes phosphocreatine from creatine 
and ATP. Given the relatively high levels of ATP and 
phosphocreatine in muscle, these compounds can be 
detected in intact muscle, in real time, by NMR spec- 
troscopy (Fig. 23-19). 

After a period of intense muscular activity, the in- 
dividual continues breathing heavily for some time, 
using much of the extra 0 2 for oxidative phosphoryla- 
tion in the liver. The ATP produced is used for gluco- 
neogenesis (in the liver) from lactate that has been 
carried in the blood from the muscles. The glucose 
thus formed returns to the muscles to replenish their 
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FIGURE 23-19 Phosphocreatine buffers ATP concentration during 

exercise. A "stack plot" of magnetic resonance spectra {of 31 P) showing 
inorganic phosphate (Pj), phosphocreatine (PCr), and ATP (each of its 
three phosphates giving a signal). The series of plots represents the pas- 
sage of time, from a period of rest to one of exercise, and then of re- 
covery. Note that the ATP signal hardly changes during exercise, kept 
high by continued respiration and by the reservoir of phosphocreatine, 
which diminishes during exercise. During recovery, when ATP produc- 
tion by catabolism is greater than ATP utilization by the (now resting) 
muscle, the phosphocreatine reservoir is refilled. 



glycogen, completing the Cori cycle (Fig. 23-20; see 
also Box 15-4). 

Actively contracting skeletal muscle generates heat 
as a byproduct of imperfect coupling of the chemical en- 
ergy of ATP with the mechanical work of contraction. This 
heat production can be put to good use when ambient 
temperature is low: skeletal muscle carries out shivering 
thermogenesis, rapidly repeated muscle contraction 
that produces heat but little motion, helping to maintain 
the body at its preferred temperature of 37 °C. 

Heart muscle differs from skeletal muscle in that 
it is continuously active in a regular rhythm of 
contraction and relaxation, and it has a completely 
aerobic metabolism at all times. Mitochondria are 
much more abundant in heart muscle than in skeletal 
muscle, making up almost half the volume of the cells 
(Fig. 23-21). The heart uses mainly free fatty acids, but 
also some glucose and ketone bodies taken up from the 
blood, as sources of energy; these fuels are oxidized via 
the citric acid cycle and oxidative phosphorylation to 
generate ATP. Like skeletal muscle, heart muscle does 
not store lipids or glycogen in large amounts. It does 
have small amounts of reserve energy in the form of 
phosphocreatine, enough for a few seconds of contrac- 
tion. Because the heart is normally aerobic and obtains 
its energy from oxidative phosphorylation, the failure of 



FIGURE 23-21 Electron micrograph of heart muscle. In the profuse 
mitochondria of heart tissue, pyruvate (from glucose), fatty acids, and 
ketone bodies are oxidized to drive ATP synthesis. This steady aerobic 
metabolism allows the human heart to pump blood at a rate of nearly 
6 L/min, or about 350 L/hr— or 200 X 1 0 6 L over 70 years. 
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FIGURE 23-20 Metabolic cooperation between skeletal muscle and 
the liver: the Cori cycle. Extremely active muscles use glycogen as 
energy source, generating lactate via glycolysis. During recovery, some 
of this lactate is transported to the liver and converted to glucose via 
gluconeogenesis. This glucose is released to the blood and returned to 
the muscles to replenish their glycogen stores. The overall pathway 
(glucose -> lactate -> glucose) constitutes the Cori cycle. 
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0 2 to reach a portion of the heart muscle when the blood 
vessels are blocked by lipid deposits (atherosclerosis) or 
blood clots (coronary thrombosis) can cause that region 
of the heart muscle to die. This is what happens in 
myocardial infarction, more commonly known as a heart (a) 
attack. ■ 



The Brain Uses Energy for Transmission of 
Electrical Impulses 

The metabolism of the brain is remarkable in several 
respects. The neurons of the adult mammalian brain 
normally use only glucose as fuel (Fig. 23-22). (Astro- 
cytes, the other major cell type in the brain, can oxidize 
fatty acids.) The brain has a very active respiratory me- 
tabolism (Fig. 23-23); it uses 0 2 at a fairly constant 
rate, accounting for almost 20% of the total 0 2 con- 
sumed by the body at rest. Because the brain contains 
very little glycogen, it is constantly dependent on in- 
coming glucose in the blood. Should blood glucose fall 
significantly below a critical level for even a short time, 
severe and sometimes irreversible changes in brain 
function may result. 

Although the neurons of the brain cannot directly 
use free fatty acids or lipids from the blood as fuels, they 
can, when necessary, use /3-hydroxybutyrate (a ketone 
body), formed from fatty acids in the liver. The capacity 
of the brain to oxidize /3-hydroxybutyrate via acetyl-CoA 
becomes important during prolonged fasting or starva- 
tion, after liver glycogen has been depleted, because it 
allows the brain to use body fat as an energy source. 
This spares muscle proteins — until they become the 
brain's ultimate source of glucose (via gluconeogenesis 
in the liver) during severe starvation. 

Neurons oxidize glucose by glycolysis and the citric 
acid cycle, and the flow of electrons from these oxidations 
through the respiratory chain provides almost all the ATP 
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FIGURE 23-22 The fuels that supply ATP in the brain. The energy 
source used by the brain varies with nutritional state. The ketone body 
used during starvation is /3-hydroxybutyrate. Electrogenic transport by 
the Na + K + ATPase maintains the transmembrane potential essential to 
information transfer among neurons. 
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FIGURE 23-23 Glucose metabolism in the brain. The technique of 
positron emission tomography (PET) scanning shows metabolic activ- 
ity in specific regions of the brain. PET scans allow visualization of 
isotopically labeled glucose in precisely localized regions of the brain 
of a living person, in real time. A positron-emitting glucose analog 
(2-[ 18 F]-fluoro-2-deoxy-D-glucose) is injected into the bloodstream; a 
few seconds later, a PET scan shows how much of the glucose has 
been taken up by each region of the brain — a measure of metabolic ac- 
tivity. Shown here are PET scans of front-to-back cross sections of the 
brain at three levels, from the top (at the left) downward (to the right). 
The scans compare glucose metabolism when the experimental 
subject (a) is rested and (b) has been deprived of sleep for 48 hours. 



used by these cells. Energy is required to create and 
maintain an electrical potential across the neuronal 
plasma membrane. The membrane contains an electro- 
genic ATP-driven antiporter, the Na + K + ATPase, which 
simultaneously pumps 2 K + ions into and 3 Na + ions out 
of the neuron (see Fig. 11-37). The resulting transmem- 
brane potential changes transiently as an electrical signal 
(action potential) sweeps from one end of a neuron to the 
other (see Fig. 12-25). Action potentials are the chief 
mechanism of information transfer in the nervous system, 
so depletion of ATP in neurons would have disastrous ef- 
fects on all activities coordinated by neuronal signaling. 

Blood Carries Oxygen, Metabolites, and Hormones 

Blood mediates the metabolic interactions among all tis- 
sues. It transports nutrients from the small intestine to 
the liver, and from the liver and adipose tissue to other 
organs; it also transports waste products from the extra- 
hepatic tissues to the liver for processing and to the kid- 
neys for excretion. Oxygen moves in the bloodstream 
from the lungs to the tissues, and C0 2 generated by tis- 
sue respiration returns via the bloodstream to the lungs 
for exhalation. Blood also carries hormonal signals from 
one tissue to another. In its role as signal carrier, the cir- 
culatory system resembles the nervous system; both 
regulate and integrate the activities of different organs. 
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Inorganic components (10%) 
NaCI, bicarbonate, phosphate, 
CaCl 2 , MgCl 2 , KC1, Na 2 S0 4 



Organic metabolites and waste 
products (20%) 
glucose, amino acids, lactate, 
pyruvate, ketone bodies, 
citrate, urea, uric acid 



Plasma proteins (70%) 

Major plasma proteins: serum albumin, very-low-density 
lipoproteins (VLDL), low-density lipoproteins (LDL), 
high-density lipoproteins (HDL), immunoglobulins (hundreds 
of kinds), fibrinogen, prothrombin, many specialized transport 
proteins such as transferrin 

FIGURE 23-24 The composition of blood. Whole blood can be sepa- 
rated into blood plasma and cells by centrifugation. About 10% of 
blood plasma is solutes, of which about 10% consists of inorganic 
salts, 20% small organic molecules, and 70% plasma proteins. The 
major dissolved components are listed. Blood contains many other 
substances, often in trace amounts. These include other metabolites, 
enzymes, hormones, vitamins, trace elements, and bile pigments. 
Measurements of the concentrations of components in blood plasma 
are important in the diagnosis and treatment of many diseases. 

The average adult human has 5 to 6 L of blood. Almost 
half of this volume is occupied by three types of blood cells 
(Fig. 23-24): erythrocytes (red cells), filled with hemo- 
globin and specialized for carrying 0 2 and C0 2 ; much 
smaller numbers of leukocytes (white cells) of several 
types (including lymphocytes, also found in lymphatic 
tissue) , which are central to the immune system that de- 
fends against infections; and platelets, which help to me- 
diate blood clotting. The liquid portion is the blood 
plasma, which is 90% water and 10% solutes. Dissolved 
or suspended in the plasma is a large variety of proteins, 
lipoproteins, nutrients, metabolites, waste products, inor- 
ganic ions, and hormones. More than 70% of the plasma 
solids are plasma proteins, primarily immunoglobulins 
(circulating antibodies), serum albumin, apolipoproteins 
involved in the transport of lipids, transferrin (for iron 
transport), and blood-clotting proteins such as fibrinogen 
and prothrombin. 

The ions and low molecular weight solutes in blood 
plasma are not fixed components but are in constant flux 
between blood and various tissues. Dietary uptake of the 



inorganic ions that are the predominant electrolytes of 
blood and cytosol (Na + , K + , and Ca 2+ ) is, in general, 
counterbalanced by their excretion in the urine. For many 
blood components, something near a dynamic steady 
state is achieved: the concentration of the component 
changes little, although a continuous flux occurs between 
the digestive tract, blood, and urine. The plasma levels of 
Na + , K + , and Ca 2+ remain close to 140, 5, and 2.5 rtiM, re- 
spectively, with little change in response to dietary intake. 
Any significant departure from these values can result in 
serious illness or death. The kidneys play an especially im- 
portant role in maintaining ion balance by selectively fil- 
tering waste products and excess ions out of the blood 
while preventing the loss of essential nutrients and ions. 

The human erythrocyte loses its nucleus and mito- 
chondria during differentiation. It therefore relies on 
glycolysis alone for its supply of ATP. The lactate pro- 
duced by glycolysis returns to the liver, where gluconeo- 
genesis converts it to glucose, to be stored as glycogen 
or recirculated to the peripheral tissues. The erythro- 
cyte has constant access to glucose in the bloodstream. 

HThe concentration of glucose in plasma is subject 
to tight regulation. We have noted the constant 
requirement of the brain for glucose and the role of the 
liver in maintaining blood glucose in the normal range, 60 
to 90 mg/100 mL of whole blood (-4.5 ium). (Because ery- 
throcytes make up a significant fraction of blood volume, 
their removal by centrifugation leaves a supernatant fluid, 
the plasma, containing the "blood glucose" in a smaller 
volume. To convert blood glucose to plasma glucose con- 
centration, multiply the blood glucose level by 1.14.) 
When blood glucose in a human drops to 40 mg/100 mL 
(the hypoglycemic condition), the person experiences 
discomfort and mental confusion (Fig. 23-25); further 



FIGURE 23-25 Physiological effects of low 
blood glucose in humans. Blood glucose levels 
of 40 mg/100 mL and below constitute severe 
hypoglycemia. 
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reductions lead to coma, convulsions, and, in extreme 
hypoglycemia, death. Maintaining the normal concentra- 
tion of glucose in blood is therefore a very high priority of 
the organism, and a variety of regulatory mechanisms 
have evolved to achieve that end. Among the most impor- 
tant regulators of blood glucose are the hormones insulin, 
glucagon, and epinephrine, as discussed in Section 23.3. ■ 

SUMMARY 23.2 Tissue-Specific Metabolism: 
The Division of Labor 

■ In mammals there is a division of metabolic labor 
among specialized tissues and organs. The liver is 
the central distributing and processing organ for 
nutrients. Sugars and amino acids produced in 
digestion cross the intestinal epithelium and enter 
the blood, which carries them to the liver. Some 
triacylglycerols derived from ingested lipids also 
make their way to the liver, where the constituent 
fatty acids are used in a variety of processes. 

■ Glucose 6-phosphate is the key intermediate in 
carbohydrate metabolism. It may be polymerized 
into glycogen, dephosphorylated to blood glucose, 
or converted to fatty acids via acetyl-CoA. It may 
undergo oxidation by glycolysis, the citric acid 
cycle, and respiratory chain to yield ATP, or enter 
the pentose phosphate pathway to yield pentoses 
and NADPH. 

■ Amino acids are used to synthesize liver and plasma 
proteins, or their carbon skeletons are converted 

to glucose and glycogen by gluconeogenesis; the 
ammonia formed by deamination is converted to urea. 

■ The liver converts fatty acids to triacylglycerols, 
phospholipids, or cholesterol and its esters, for 
transport as plasma lipoproteins to adipose tissue 
for storage. Fatty acids can also be oxidized to yield 
ATP or to form ketone bodies, which are circulated 
to other tissues. 

■ White adipose tissue stores large reserves of 
triacylglycerols, and releases them into the blood in 
response to epinephrine or glucagon. Brown adipose 
tissue is specialized for thermogenesis, the result of 
fatty acid oxidation in uncoupled mitochondria. 

■ Skeletal muscle is specialized to produce and use 
ATP for mechanical work. During strenuous muscular 
activity, glycogen is the ultimate fuel, supplying 
ATP through lactic acid fermentation. During 
recovery, the lactate is reconverted (through 
gluconeogenesis) to glycogen and glucose in the 
liver. Phosphocreatine is an immediate source of 
ATP during active contraction. 

■ Heart muscle obtains nearly all its ATP from 
oxidative phosphorylation. 

■ The neurons of the brain use only glucose and 
jS-hydroxybutyrate as fuels, the latter being 
important during fasting or starvation. The brain 



uses most of its ATP for the active transport of Na + 
and K + to maintain the electrical potential across 
the neuronal membrane. 

■ The blood transfers nutrients, waste products, and 
hormonal signals among tissues and organs. 

23.3 Hormonal Regulation of Fuel Metabolism 

The minute-by-minute adjustments that keep the blood 
glucose level near 4.5 mM involve the combined actions 
of insulin, glucagon, epinephrine, and Cortisol on meta- 
bolic processes in many body tissues, but especially in 
liver, muscle, and adipose tissue. Insulin signals these 
tissues that blood glucose is higher than necessary; as a 
result, cells take up excess glucose from the blood and 
convert it to glycogen and triacylglycerols for storage. 
Glucagon signals that blood glucose is too low, and tis- 
sues respond by producing glucose through glycogen 
breakdown and (in the liver) gluconeogenesis and by 
oxidizing fats to reduce the use of glucose. Epinephrine 
is released into the blood to prepare the muscles, lungs, 
and heart for a burst of activity. Cortisol mediates the 
body's response to longer-term stresses. We discuss 
these hormonal regulations in the context of three nor- 
mal metabolic states — well-fed, fasted, and starving — 
and look at the metabolic consequences of diabetes 
mellitus, a disorder that results from derangements in 
the signaling pathways that control glucose metabolism. 

Insulin Counters High Blood Glucose 

Acting through plasma membrane receptors (see 
Figs 12-15, 12-16), insulin stimulates glucose uptake by 
muscle and adipose tissue (Table 23-3), where the 
glucose is converted to glucose 6-phosphate. In the 
liver, insulin also activates glycogen synthase and inacti- 
vates glycogen phosphorylase, so that much of the glu- 
cose 6-phosphate is channeled into glycogen. 

Insulin also stimulates the storage of excess fuel as 
fat in adipose tissue (Fig. 23-26). In the liver, insulin 
activates both the oxidation of glucose 6-phosphate to 
pyruvate via glycolysis and the oxidation of pyruvate to 
acetyl-CoA. If not oxidized further for energy produc- 
tion, this acetyl-CoA is used for fatty acid synthesis, and 
the fatty acids are exported from the liver as the TAGs of 
plasma lipoproteins (VLDLs) to adipose tissue. Insulin 
stimulates the synthesis of TAGs in adipocytes, from 
fatty acids released from the VLDL triacylglycerols. 
These fatty acids are ultimately derived from the excess 
glucose taken up from blood by the liver. In summary, 
the effect of insulin is to favor the conversion of excess 
blood glucose to two storage forms: glycogen (in the 
liver and muscle) and triacylglycerols (in adipose tis- 
sue) (Table 23-3). 

Besides acting directly on muscle and liver to 
change their metabolism of carbohydrates and fats, 
insulin can also act in the brain to signal these tissues 
indirectly, as described later. 
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FIGURE 23-26 The well-fed state: the lipogenic liver. Immediately after 
a calorie-rich meal, glucose, fatty acids, and amino acids enter the liver. 
Insulin released in response to the high blood glucose concentration 
stimulates glucose uptake by the tissues. Some glucose is exported to 
the brain for its energy needs, and some to adipose and muscle tissue. 
In the liver, excess glucose is oxidized to acetyl-CoA, which is used to 



synthesize fatty acids for export as triacylglycerols in VLDLs to adipose 
and muscle tissue. The NADPH necessary for lipid synthesis is obtained 
by oxidation of glucose in the pentose phosphate pathway. Excess 
amino acids are converted to pyruvate and acetyl-CoA, which are also 
used for lipid synthesis. Dietary fats move via the lymphatic system, as 
chylomicrons, from the intestine to muscle and adipose tissues. 



TABLE 23-3 



Metabolic effect 



Effects of Insulin on Blood Glucose: Uptake of Glucos 
Storage as Triacylglycerols and Glycogen 

Target enzyme 



ose by Cells and 



T Glucose uptake (muscle, adipose) 

T Glucose uptake (liver) 

t Glycogen synthesis (liver, muscle) 

i Glycogen breakdown (liver, muscle) 

T Glycolysis, acetyl-CoA production 
(liver, muscle) 

T Fatty acid synthesis (liver) 

T Triacylglycerol synthesis (adipose tissue) 



T Glucose transporter (GLUT4) 

T Glucokinase (increased expression) 

T Glycogen synthase 

1 Glycogen phosphorylase 

TPFK-1 (bytPFK-2) 

T Pyruvate dehydrogenase complex 

T Acetyl-CoA carboxylase 

T Lipoprotein lipase 



Pancreatic j3 Cells Secrete Insulin in Response to Changes 
in Blood Glucose 

When glucose enters the bloodstream from the intestine 
after a carbohydrate-rich meal, the resulting increase in 
blood glucose causes increased secretion of insulin (and 



decreased secretion of glucagon) by the pancreas. In- 
sulin release is largely regulated by the level of glucose 
in the blood supplying the pancreas. The peptide hor- 
mones insulin, glucagon, and somatostatin are produced 
by clusters of specialized pancreatic cells, the islets of 
Langerhans (Fig. 23-27). Each cell type of the islets 
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a eel] (glucagon) 

/ (3 cell 

(insulin) 




(somatostatin) 

FIGURE 23-27 The endocrine system of the pancreas. The pancreas 
contains both exocrine cells (see Fig. 18-3b), which secrete digestive 
enzymes in the form of zymogens, and clusters of endocrine cells, the 
islets of Langerhans The islets contain a, (3, and 8 cells (also known as 
A, B, and D cells, respectively), each cell type secreting a specific pep- 
tide hormone. 

produces a single hormone: a cells produce glucagon; /3 
cells, insulin; and 5 cells, somatostatin. 

As shown in Figure 23-28, when blood glucose 
rises, Q GLUT2 transporters carry glucose into the j3 
cells, where it is immediately converted to glucose 
6-phosphate by hexokinase IV (glucokinase) and enters 



glycolysis. With the higher rate of glucose catabolism, 
(2) [ATP] increases, causing the closing of ATP-gated 
K + channels in the plasma membrane. (3) Reduced 
efflux of K + depolarizes the membrane. (Recall from 
Section 12.6 that exit of K + through an open K + chan- 
nel hyperpolarizes the membrane; closing the K + chan- 
nel therefore effectively depolarizes the membrane.) 
Membrane depolarization opens voltage-gated Ca 2+ 
channels, and ® the resulting increase in cytosolic 
[Ca 2+ ] triggers © the release of insulin by exocytosis. 
Parasympathetic and sympathetic nervous system sig- 
nals also affect (stimulate and inhibit, respectively) in- 
sulin release. A simple feedback loop limits hormone 
release: insulin lowers blood glucose by stimulating glu- 
cose uptake by the tissues; the reduced blood glucose is 
detected by the (3 cell as a diminished flux through the 
hexokinase reaction; this slows or stops the release of 
insulin. This feedback regulation holds blood glucose 
concentration nearly constant despite large fluctua- 
tions in dietary intake. 

The activity of ATP-gated K + channels is central to 
the regulation of insulin secretion by /3 cells. The 
channels are octamers of four identical Kir6.2 subunits 
and four identical SUR1 subunits, and are constructed 
along the same lines as the K + channels of bacteria and 
those of other eukaryotic cells (see Figs 11^8, 11^9, 
and 11-50). The four Kir6.2 subunits form a cone around 
the K + channel and function as the selectivity filter and 
ATP-gating mechanism (Fig. 23-29). When [ATP] rises 
(indicating increased blood glucose), the K + channels 
close, depolarizing the plasma membrane and triggering 
insulin release as shown in Figure 23-28. 




FIGURE 23-28 Glucose regulation of insulin 
secretion by pancreatic p cells. When the 
blood glucose level is high, active metabolism 
of glucose in the /? cell raises intracellular 
[ATP], closing K + channels in the plasma mem- 
brane and thus depolarizing the membrane. In 
response to the change in membrane potential, 
voltage-gated Ca 2+ channels open, allowing 
Ca 2+ to flow into the cell. (Ca 2+ is also released 
from the endoplasmic reticulum, in response to 
the initial elevation of [Ca 2+ ] in the cytosol.) 
Cytosolic [Ca 2+ ] is now high enough to trigger 
insulin release by exocytosis. The numbered 
processes are discussed in the text. 
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FIGURE 23-29 ATP-gated K + channels in cells, (a) The octameric 
structure of the channel, viewed perpendicular to the membrane. The 
channel is formed by four identical Kir6.2 subunits, and outside them 
are the four SUR1 (sulfonylurea receptor) subunits. (b) The structure of 
the Kir6.2 portion of the channel, viewed in the plane of the mem- 
brane. For clarity, only two transmembrane domains and two cytosolic 
domains are shown. Three K + ions (green) are shown in the region of 
the selectivity filter. Mutation in certain amino acid residues (shown in 
red) leads to neonatal diabetes; mutation in others (shown in blue) 
leads to hyperinsulinism of infancy. This structure (coordinates cour- 
tesy of Frances Ashford and her colleagues at the University of Oxford) 
was not obtained directly by crystallography, but by mapping the 
known Kir6.2 sequence onto the crystal structures of a bacterial Kir 
channel (KirBad.l; PDB ID 1P7B) and the amino and carboxyl do- 
mains of another Kir protein, Kir3J (PDB ID 1U4E). Compare this 
structure with the gated K + channel in Figure 1 1-50. 

The sulfonylurea drugs, oral medications used 
in the treatment of type 2 diabetes mellitus, bind to 
the SUR1 (sulfonylurea receptor) subunits of the K + 
channels, closing the channels and stimulating insulin 
release. The first generation of these drugs (tolbutamide, 
for example) was developed in the 1950s. The second- 
generation drugs, including glyburide (Micronase), glip- 
izide (Glucotrol), and glimepiride (Amaryl), are more 
potent and have fewer side effects. (The sulfonylurea 
moiety is screened pink in the following structures.) 



The sulfonylureas are sometime used in combination 
with injected insulin, but often suffice alone for control- 
ling type 2 diabetes. 




Mutations in the ATP-gated K + channels of (5 cells are, 
fortunately, rare. Mutations in Kir6.2 that result in con- 
stantly open K + channels (red residues in Fig. 23-29b) 
lead to neonatal diabetes mellitus, with severe hyper- 
glycemia that requires insulin therapy. Other mutations 
in Kir6.2 or SUR1 (blue residues in Fig. 23-29b) pro- 
duce permanently closed K + channels and continuous 
release of insulin. If untreated, individuals with these 
mutations develop congenital hyperinsulinemia (hyper- 
insulinism of infancy); excessive insulin causes severe 
hypoglycemia (low blood glucose) leading to irre- 
versible brain damage. One effective treatment is surgi- 
cal removal of part of the pancreas to reduce insulin 
production. ■ 

Glucagon Counters Low Blood Glucose 

Several hours after the intake of dietary carbohydrate, 
blood glucose levels fall slightly because of the ongoing 
oxidation of glucose by the brain and other tissues. Low- 
ered blood glucose triggers secretion of glucagon and 
decreases insulin release (Fig. 23-30). 

Glucagon causes an increase in blood glucose con- 
centration in several ways (Table 23-4). Like epineph- 
rine, it stimulates the net breakdown of liver glycogen 
by activating glycogen phosphorylase and inactivating 
glycogen synthase; both effects are the result of phos- 
phorylation of the regulated enzymes, triggered by 
cAMP. Glucagon inhibits glucose breakdown by glycol- 
ysis in the liver, and stimulates glucose synthesis by 
gluconeogenesis. Both effects result from lowering the 
concentration of fructose 2,6-bisphosphate, an al- 
losteric inhibitor of the gluconeogenic enzyme fruc- 
tose 1,6-bisphosphatase (FBPase-1) and an activator 
of the glycolytic enzyme phosphofructokinase-1. Re- 
call that [fructose 2,6-bisphosphate] is ultimately con- 
trolled by a cAMP-dependent protein phosphorylation 
reaction (see Fig. 15-17). Glucagon also inhibits the 
glycolytic enzyme pyruvate kinase (by promoting its 
cAMP-dependent phosphorylation), thus blocking the 
conversion of phosphoenolpyruvate to pyruvate and 
preventing oxidation of pyruvate via the citric acid 
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FIGURE 23-30 The fasting state: the glucogenic 
liver. After some hours without a meal, the liver 
becomes the principal source of glucose for the 
brain. Liver glycogen is broken down, and the glu- 
cose 1 -phosphate produced is converted to glu- 
cose 6-phosphate, then to free glucose, which is 
released into the bloodstream. Amino acids from 
the degradation of proteins in liver and muscle, 
and glycerol from the breakdown of TAGs in adi- 
pose tissue, are used for gluconeogenesis. The 
liver uses fatty acids as its principal fuel, and ex- 
cess acetyl-CoA is converted to ketone bodies for 
export to other tissues; the brain is especially de- 
pendent on this fuel when glucose is in short sup- 
ply (see Fig. 23-22). 
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cycle. The resulting accumulation of phosphoenolpyru- 
vate favors gluconeogenesis. This effect is augmented 
by glucagon's stimulation of the synthesis of the gluco- 
neogenic enzyme PEP carboxykinase. By stimulating 
glycogen breakdown, preventing glycolysis, and promot- 
ing gluconeogenesis in hepatocytes, glucagon enables 
the liver to export glucose, restoring blood glucose to its 
normal level. 

Although its primary target is the liver, glucagon 
(like epinephrine) also affects adipose tissue, activat- 
ing TAG breakdown by causing cAMP-dependent 
phosphorylation of perilipin and hormone-sensitive li- 
pase. The activated lipase liberates free fatty acids, 
which are exported to the liver and other tissues as 
fuel, sparing glucose for the brain. The net effect of 



glucagon is therefore to stimulate glucose synthesis 
and release by the liver and to mobilize fatty acids 
from adipose tissue, to be used instead of glucose by 
tissues other than the brain (Table 23-4). All these ef- 
fects of glucagon are mediated by cAMP-dependent 
protein phosphorylation. 

During Fasting and Starvation, Metabolism Shifts to 
Provide Fuel for the Brain 

The fuel reserves of a healthy adult human are of three 
types: glycogen stored in the liver and, in smaller quan- 
tities, in muscles; large quantities of triacylglycerols in 
adipose tissues; and tissue proteins, which can be de- 
graded when necessary to provide fuel (Table 23-5). 



TABLE 23- 



Effects of Glucagon on Blood Glucose: Production and Release of Glucose by the Liver 
Metabolic effect Effect on glucose metabolism Target enzyme 



T Glycogen breakdown (liver) 
I Glycogen synthesis (liver) 
I Glycolysis (liver) 
T Gluconeogenesis (liver) 

T Fatty acid mobilization (adipose tissue) 
T Ketogenesis 



Glycogen > glucose 

Less glucose stored as glycogen 
Less glucose used as fuel in liver 
Amino acids 1 

Glycerol > > glucose 

Oxaloacetate J 

Less glucose used as fuel by liver, muscle 

Provides alternative to glucose as 
energy source for brain 



t Glycogen phosphorylase 

I Glycogen synthase 

i PFK-1 

T FBPase-2 

4 Pyruvate kinase 

t PEP carboxykinase 

T Hormone-sensitive lipase 
T PKA (perilipin— (P)) 
i Acetyl-CoA carboxylase 
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TABLE 23-5 



Type of fuel 



Available Metabolic Fuels in a Normal-Weight 70 kg Man and in an Obese 140 kg Man 
at the Beginning of a Fast 




Weight (kg) 



Caloric equivalent 
(thousands of kcal (kJ)) 



Estimated survival 
(months)* 



Normal-weight, 70 kg man 

Triacylglycerols (adipose tissue) 

Proteins (mainly muscle) 

Glycogen (muscle, liver) 

Circulating fuels (glucose, fatty acids, 
triacylglycerols, etc.) 



Total 

Obese, 140 kg man 

Triacylglycerols (adipose tissue) 
Proteins (mainly muscle) 
Glycogen (muscle, liver) 
Circulating fuels 



15 
6 

0.225 
0.023 



80 
8 

0 23 
0.025 



141 (589) 
24 (100) 
0.90 (3.8) 
0.10(0.42) 



166 (694) 

752 (3,140) 
32 (134) 
0.92 (3.8) 
0.11 (0.46) 



Total 



785 (3,280) 



14 



*Survival time is calculated on the assumption of a basal energy expenditure of 1,800 kcal/day. 



In the first two hours after a meal, the blood glu- 
cose level is diminished slightly, and tissues receive 
glucose released from liver glycogen. There is little or 
no synthesis of lipids. By four hours after a meal, blood 
glucose has fallen further, insulin secretion has slowed, 
and glucagon secretion has increased. These hormonal 
signals mobilize triacylglycerols, which now become 
the primary fuel for muscle and liver. Figure 23-31 



shows the responses to prolonged fasting. CD To pro- 
vide glucose for the brain, the liver degrades certain 
proteins — those most expendable in an organism not 
ingesting food. Their nonessential amino acids are 
transaminated or deaminated (Chapter 18), and (2) 
the extra amino groups are converted to urea, which 
is exported via the bloodstream to the kidneys and 
excreted. 
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FIGURE 23-31 Fuel metabolism in the liver 
during prolonged fasting or in uncontrolled di- 
abetes mellitus. After depletion of stored car- 
bohydrates, proteins become an important 
source of glucose, produced from glucogenic 
amino acids by gluconeogenesis (© to ®). 
Fatty acids imported from adipose tissue are 
converted to ketone bodies for export to the 
brain (© to ®). Broken arrows represent reac- 
tions with reduced flux under these conditions. 
The steps are further described in the text. 
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Also in the liver, the carbon skeletons of glucogenic 
amino acids are converted to pyruvate or intermedi- 
ates of the citric acid cycle. (3) These intermediates (as 
well as the glycerol derived from TAGs in adipose tis- 
sue) provide the starting materials for gluconeogenesis 
in the liver, @ yielding glucose for export to the brain. 
(5) Fatty acids are oxidized to acetyl-CoA, but as ox- 
aloacete is depleted by the use of citric acid cycle inter- 
mediates for gluconeogenesis, (6) entry of acetyl-CoA 
into the cycle is inhibited and acetyl-CoA accumulates. 
(7) This favors the formation of acetoacetyl-CoA and 
ketone bodies. After a few days of fasting, the levels of 
ketone bodies in the blood rise (Fig. 23-32) as they are 
exported from the liver to the heart, skeletal muscle, 
and brain, which use these fuels instead of glucose 
(Fig. 23-31,®). 

Acetyl-CoA is a critical regulator of the fate of pyru- 
vate; it allosterically inhibits pyruvate dehydrogenase and 
stimulates pyruvate carboxylase (see Fig. 15-20). In these 
ways acetyl-CoA prevents it own further production from 
pyruvate while stimulating the conversion of pyruvate to 
oxaloacetate, the first step in gluconeogenesis. 

Triacylglycerols stored in the adipose tissue of a 
normal-weight adult could provide enough fuel to main- 
tain a basal rate of metabolism for about three months; a 
very obese adult has enough stored fuel to endure a fast 
of more than a year (Table 23-5). When fat reserves are 
gone, the degradation of essential proteins begins; this 
leads to loss of heart and liver function and, in prolonged 
starvation, to death. Stored fat can provide adequate 
energy (calories) during a fast or rigid diet, but vitamins 
and minerals must be provided, and sufficient dietary 
glucogenic amino acids are needed to replace those 
being used for gluconeogenesis. Rations for those on a 
weight-reduction diet are commonly fortified with vita- 
mins, minerals, and amino acids or proteins. 

Epinephrine Signals Impending Activity 

When an animal is confronted with a stressful situation 
that requires increased activity — fighting or fleeing, in 
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FIGURE 23-32 Plasma concentrations of fatty acids, glucose, and ke- 
tone bodies during the first week of starvation. Despite the hormonal 
mechanisms for maintaining the level of glucose in blood, glucose 
begins to diminish after 2 days of fasting. The level of ketone bodies, 
almost unmeasurable before the fast, rises dramatically after 2 to 4 
days of fasting. These water-soluble ketones, acetoacetate and j9- 
hydroxybutyrate, supplement glucose as an energy source during a 
long fast. Fatty acids cannot serve as a fuel for the brain; they do not 
cross the blood-brain barrier. 



the extreme case — neuronal signals from the brain trig- 
ger the release of epinephrine and norepinephrine from 
the adrenal medulla. Both hormones dilate the respira- 
tory passages to facilitate the uptake of 0 2 , increase the 
rate and strength of the heartbeat, and raise the blood 
pressure, thereby promoting the flow of 0 2 and fuels to 
the tissues (Table 23-6). 

Epinephrine acts primarily on muscle, adipose, and 
liver tissues. It activates glycogen phosphorylase and 
inactivates glycogen synthase by cAMP-dependent phos- 
phorylation of the enzymes, thus stimulating the conver- 
sion of liver glycogen to blood glucose, the fuel for 
anaerobic muscular work. Epinephrine also promotes 



| TABLE 23-6 


logical and Metabolic Effects of Epinephrine: Preparation for Action 


Immediate effect 


Overall effect 



Physiological 

t Heart rate 

T Blood pressure 

T Dilation of respiratory passages 

Metabolic 

t Glycogen breakdown (muscle, liver) 

I Glycogen synthesis (muscle, liver) 

T Gluconeogenesis (liver) 

T Glycolysis (muscle) 

T Fatty acid mobilization (adipose tissue) 

T Glucagon secretion 
I Insulin secretion 



Increase delivery of 0 2 to tissues (muscle) 



Increase production of glucose for fuel 

Increases ATP production in muscle 
Increases availability of fatty acids as fuel 

Reinforce metabolic effects of epinephrine 
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the anaerobic breakdown of muscle glycogen by lactic 
acid fermentation, stimulating glycolytic ATP formation. 
The stimulation of glycolysis is accomplished by raising 
the concentration of fructose 2,6-bisphosphate, a potent 
allosteric activator of the key glycolytic enzyme phos- 
phofructokinase-1 (see Figs 15-16, 15-17). Epinephrine 
also stimulates fat mobilization in adipose tissue, activat- 
ing (by cAMP-dependent phosphorylation) hormone- 
sensitive lipase and moving aside the perilipin covering 
the lipid droplet surface (see Fig. 17-3). Finally, epi- 
nephrine stimulates glucagon secretion and inhibits in- 
sulin secretion, reinforcing its effect of mobilizing fuels 
and inhibiting fuel storage. 

Cortisol Signals Stress, Including Low Blood Glucose 

A variety of stressors (anxiety, fear, pain, hemorrhage, 
infection, low blood glucose, starvation) stimulate re- 
lease of the corticosteroid hormone Cortisol from the 
adrenal cortex. Cortisol acts on muscle, liver, and adi- 
pose tissue to supply the organism with fuel to with- 
stand the stress. Cortisol is a relatively slow-acting 
hormone that alters metabolism by changing the kinds 
and amounts of certain enzymes synthesized in its tar- 
get cells, rather than by regulating the activity of exist- 
ing enzyme molecules. 

In adipose tissue, Cortisol leads to an increase in 
the release of fatty acids from stored TAGs. The ex- 
ported fatty acids serve as fuel for other tissues, and 
the glycerol is used for gluconeogenesis in the liver. 
Cortisol stimulates the breakdown of muscle proteins 
and the export of amino acids to the liver, where they 
serve as precursors for gluconeogenesis. In the liver, 
Cortisol promotes gluconeogenesis by stimulating syn- 
thesis of the key enzyme PEP carboxykinase (see Fig. 
14-1 7b); glucagon has the same effect, whereas insulin 
has the opposite effect. Glucose produced in this way is 
stored in the liver as glycogen or exported immediately 
to tissues that need glucose for fuel. The net effect of 
these metabolic changes is to restore blood glucose to 
its normal level and to increase glycogen stores, ready 
to support the fight-or-flight response commonly asso- 
ciated with stress. The effects of Cortisol therefore 
counterbalance those of insulin. During extended peri- 
ods of stress, the continued release of Cortisol loses its 
positive adaptive value and begins to cause damage to 
muscle and bone, and to impair endocrine and immune 
function. 

Diabetes Mellitus Arises from Defects in Insulin 
Production or Action 

BS Diabetes mellitus is a relatively common dis- 
11 ease: nearly 6% of the U.S. population shows 
some degree of abnormality in glucose metabolism that 
is indicative of diabetes or a tendency toward the condi- 
tion. There are two major clinical classes of diabetes 
mellitus: type 1 diabetes, sometimes referred to as 



insulin-dependent diabetes mellitus (IDDM), and type 
2 diabetes, or non-insulin-dependent diabetes mellitus 
(NIDDM), also called insulin-resistant diabetes. 

Type 1 diabetes begins early in life, and symptoms 
quickly become severe. This disease responds to in- 
sulin injection, because the metabolic defect stems 
from an autoimmune destruction of pancreatic /3 cells 
and a consequent inability to produce sufficient in- 
sulin. Type 1 diabetes requires both insulin therapy 
and careful, lifelong control of the balance between di- 
etary intake and insulin dose. Characteristic symptoms 
of type 1 (and type 2) diabetes are excessive thirst and 
frequent urination (polyuria), leading to the intake of 
large volumes of water (polydipsia) ("diabetes melli- 
tus" means "excessive excretion of sweet urine"). 
These symptoms are due to the excretion of large 
amounts of glucose in the urine, a condition known as 
glucosuria. 

Type 2 diabetes is slow to develop (typically in 
older, obese individuals), and the symptoms are milder 
and often go unrecognized at first. This is really a group 
of diseases in which the regulatory activity of insulin is 
disordered: insulin is produced, but some feature of the 
insulin-response system is defective. Individuals with 
this disorder are insulin-resistant. The connection be- 
tween type 2 diabetes and obesity (discussed below) is 
an active and exciting area of research. 

Individuals with either type of diabetes are unable 
to take up glucose efficiently from the blood; recall 
that insulin triggers the movement of GLUT4 glucose 
transporters to the plasma membrane in muscle and 
adipose tissue (see Fig. 12-16). Another characteristic 
metabolic change in diabetes is excessive but incom- 
plete oxidation of fatty acids in the liver. The acetyl- 
CoA produced by (3 oxidation cannot be completely 
oxidized by the citric acid cycle, because the high 
[NADH]/[NAD + ] ratio produced by /3 oxidation inhibits 
the cycle (recall that three steps of the cycle convert 
NAD + to NADH). Accumulation of acetyl-CoA leads to 
overproduction of the ketone bodies, acetoacetate and 
jS-hydroxybutyrate, which cannot be used by extra- 
hepatic tissues as fast as they are made in the liver. In 
addition to /3-hydroxybutyrate and acetoacetate, the 
blood of individuals with diabetes also contains ace- 
tone, which results from the spontaneous decarboxyla- 
tion of acetoacetate: 

O 0 

II II 
CH 3 — C— CH 2 — COO +H 2 0 > CH 3 — C— CH3 + HCO3 

Acetoacetate Acetone 

Acetone is volatile and is exhaled, and in uncon- 
trolled diabetes the breath has a characteristic odor 
sometimes mistaken for ethanol. A diabetic individual 
who is experiencing mental confusion due to high blood 
glucose is occasionally misdiagnosed as intoxicated, an 
error that can be fatal. The overproduction of ketone 
bodies, called ketosis, results in greatly increased 
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concentrations of ketone bodies in the blood (ketone- 
mia) and urine (ketonuria). 

The ketone bodies are carboxylic acids, which ion- 
ize, releasing protons. In uncontrolled diabetes this acid 
production can overwhelm the capacity of the blood's bi- 
carbonate buffering system and produce a lowering of 
blood pH called acidosis or, in combination with ketosis, 
ketoacidosis, a potentially life-threatening condition. 

Biochemical measurements on blood and urine 
samples are essential in the diagnosis and treatment of 
diabetes. A sensitive diagnostic criterion is provided by 
the glucose-tolerance test. The individual fasts 
overnight, then drinks a test dose of 100 g of glucose dis- 
solved in a glass of water. The blood glucose concentra- 
tion is measured before the test dose and at 30 min 
intervals for several hours thereafter. A healthy individ- 
ual assimilates the glucose readily, the blood glucose ris- 
ing to no more than about 9 or 10 mM; little or no glucose 
appears in the urine. In diabetes, individuals assimilate 
the test dose of glucose poorly; their blood glucose level 
rises dramatically and returns to the fasting level very 
slowly Because the blood glucose levels exceed the kid- 
ney threshold (about 10 mM), glucose also appears in 
the urine. ■ 

SUMMARY 23.3 Hormonal Regulation of 
Fuel Metabolism 

■ The concentration of glucose in blood is hormonally 
regulated. Fluctuations in blood glucose (normally 
60 to 90 mg/100 mL, or about 4.5 mM) due to dietary 
intake or vigorous exercise are counterbalanced 

by a variety of hormonally triggered changes in 
metabolism in several organs. 

■ High blood glucose elicits the release of insulin, 
which speeds the uptake of glucose by tissues 
and favors the storage of fuels as glycogen and 
triacylglycerols, while inhibiting fatty acid 
mobilization in adipose tissue. 

■ Low blood glucose triggers release of glucagon, 
which stimulates glucose release from liver 
glycogen and shifts fuel metabolism in liver and 
muscle to fatty acid oxidation, sparing glucose for 
use by the brain. In prolonged fasting, triacylglycerols 
become the principal fuel; the liver converts the 
fatty acids to ketone bodies for export to other 
tissues, including the brain. 

■ Epinephrine prepares the body for increased 
activity by mobilizing glucose from glycogen and 
other precursors, releasing it into the blood. 

■ Cortisol, released in response to a variety of 
stressors (including low blood glucose), 
stimulates gluconeogenesis from amino acids 
and glycerol in the liver, thus raising blood 
glucose and counterbalancing the effects of 
insulin. 



■ In diabetes, insulin is either not produced or not 
recognized by the tissues, and the uptake of blood 
glucose is compromised. When blood glucose levels 
are high, glucose is excreted. Tissues then depend 
on fatty acids for fuel (producing ketone bodies) 
and degrade cellular proteins to provide glucogenic 
amino acids for glucose synthesis. Uncontrolled 
diabetes is characterized by high glucose levels 
in the blood and urine and the production and 
excretion of ketone bodies. 

23.4 Obesity and the Regulation 
of Body Mass 

I In the U.S. population, 30% of adults are obese and 
HI another 35% are overweight, as defined in terms of 
body mass index (BMI), calculated as (weight in 
kg)/(height in m) 2 . A BMI below 25 is considered normal; 
an individual with a BMI of 25 to 30 is overweight; a BMI 
greater than 30 indicates obesity. Obesity is life-threat- 
ening. It significantly increases the chances of developing 
type 2 diabetes as well as heart attack, stroke, and can- 
cers of the colon, breast, prostate, and endometrium. 
Consequently, there is great interest in understanding 
how body mass and the storage of fats in adipose tissue 
are regulated. ■ 

To a first approximation, obesity is the result of tak- 
ing in more calories in the diet than are expended by the 
body's energy-consuming activities. The body can deal 
with an excess of dietary calories in three ways: (1) con- 
vert excess fuel to fat and store it in adipose tissue, 
(2) burn excess fuel by extra exercise, and (3) "waste" 
fuel by diverting it to heat production (thermogenesis) 
by uncoupled mitochondria. In mammals, a complex set 
of hormonal and neuronal signals acts to keep fuel in- 
take and energy expenditure in balance, so as to hold 
the amount of adipose tissue at a suitable level. Dealing 
effectively with obesity requires understanding these 
various checks and balances under normal conditions, 
and how these homeostatic mechanisms can fail. 

Adipose Tissue Has important Endocrine Functions 

One early hypothesis to explain body-mass homeostasis, 
the "adiposity negative-feedback" model, postulated a 
mechanism that inhibits eating behavior and increases en- 
ergy consumption whenever body weight exceeds a cer- 
tain value (the set point); the inhibition is relieved when 
body weight drops below the set point (Fig. 23-33). This 
model predicts that a feedback signal originating in 
adipose tissue influences the brain centers that con- 
trol eating behavior and activity (metabolic and motor 
activity). The first such factor, leptin, was discovered 
in 1994, and subsequent research revealed that adi- 
pose tissue is an important endocrine organ that pro- 
duces peptide hormones, known as adipokines. 
Adipokines may act locally (autocrine and paracrine 
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FIGURE 23-33 Set-point model for maintaining constant mass. When 
the mass of adipose tissue increases (dashed outline), released leptin in- 
hibits feeding and fat synthesis and stimulates oxidation of fatty acids. 
When the mass of adipose tissue decreases (solid outline), a lowered lep- 
tin production favors a greater food intake and less fatty acid oxidation. 



FIGURE 23-34 Obesity caused by defective leptin production. Both of 
these mice, which are the same age, have defects in the OB gene. The 
mouse on the right was injected daily with purified leptin and weighs 
35 g. The mouse on the left got no leptin, and consequently ate more 
food and was less active; it weighs 67 g. 



action) or systemically (endocrine action), carrying 
information about the adequacy of the energy reserves 
(TAGs) stored in adipose tissue to other tissues and to 
the brain. Normally, adipokines produce changes in 
fuel metabolism and feeding behavior that reestablish 
adequate fuel reserves and maintain body mass. When 
adipokines are over- or underproduced, the resulting 
dysregulation may result in life-threatening disease. 

Leptin (Greek leptos, "thin") is an adipokine 
(167 amino acids) that, on reaching the brain, acts on 
receptors in the hypothalamus to curtail appetite. Leptin 
was first identified as the product of a gene designated OB 
(obese) in laboratory mice. Mice with two defective 
copies of this gene (oblob genotype; lowercase letters 
signify a mutant form of the gene) show the behavior 
and physiology of animals in a constant state of starva- 
tion: their plasma Cortisol levels are elevated; they are 
unable to stay warm and they grow abnormally, do not 
reproduce, and exhibit unrestrained appetite. As a con- 
sequence of the last effect, they become severely obese, 
weighing as much as three times more than normal mice 
(Fig. 23-34). They also have metabolic disturbances 
very similar to those seen in diabetes, and they are in- 
sulin-resistant. When leptin is injected into obiob mice, 
they lose weight and increase their locomotor activity 
and thermogenesis. 

A second mouse gene, designated DB (diabetic), 
has also been found to have a role in appetite regulation. 
Mice with two defective copies (dbidb) are obese and 
diabetic. The DB gene encodes the leptin receptor. 
When the receptor is defective, the signaling function of 
leptin is lost. 

The leptin receptor is expressed primarily in regions 
of the brain known to regulate feeding behavior — 

FIGURE 23-35 Hypothalamic regulation of food intake and energy 
expenditure, (a) Anatomy of the hypothalamus and its interaction with 
adipose tissue, (b) Details of the interaction between the hypothala- 
mus and an adipocyte, described later in the text. 



neurons of the arcuate nucleus of the hypothalamus (Fig. 
23-35a). Leptin carries the message that fat reserves are 
sufficient, and it promotes a reduction in fuel intake and 
increased expenditure of energy. Leptin-receptor 
interaction in the hypothalamus alters the release of 
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neuronal signals to the region of the brain that affects ap- 
petite. Leptin also stimulates the sympathetic nervous 
system, increasing blood pressure, heart rate, and ther- 
mogenesis by uncoupling the mitochondria of white 
adipocytes (Fig. 23-35b). Recall that thermogenic or 
UCP, forms a channel in the inner mitochondrial mem- 
brane that allows protons to reenter the mitochondrial 
matrix without passing through the ATP synthase com- 
plex. This permits continual oxidation of fuel (fatty acids 
in an adipocyte) without ATP synthesis, dissipating en- 
ergy as heat and consuming dietary calories or stored 
fats in potentially very large amounts. 



Leptin Stimulates Production of Anorexigenic 
Peptide Hormones 

Two types of neurons in the arcuate nucleus control fuel 
intake and metabolism (Fig. 23-36). The orexigenic 
(appetite-stimulating) neurons stimulate eating by pro- 
ducing and releasing neuropeptide Y (NPY), which 
causes the next neuron in the circuit to send the signal 
to the brain: Eat! The blood level of NPY rises during 
starvation, and is elevated in both ob/ob and db/db mice. 
The high NPY concentration presumably underlies the 
obesity of these mice, who eat voraciously. 
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FIGURE 23-36 Hormones that control eating. In the arcuate nucleus, 
two sets of neurosecretory cells receive hormonal input and relay neu- 
ronal signals to the cells of muscle, adipose tissue, and liver. Leptin 
and insulin are released from adipose tissue and pancreas, respec- 
tively, in proportion to the mass of body fat. The two hormones act on 
anorexigenic neurosecretory cells to trigger release of a-MSH 
(melanocortin); this produces neuronal signals to eat less and metabo- 
lize more fuel. Leptin and insulin also act on orexigenic neurosecre- 



tory cells to inhibit the release of NPY, reducing the "eat" signal sent to 
the tissues. As described later in the text, the gastric hormone ghrelin 
stimulates appetite by activating the NPY-expressing cells; PYY 3 _ 36/ re- 
leased from the colon, inhibits these neurons and thereby decreases 
appetite. Each of the two types of neurosecretory cells inhibits hor- 
mone production by the other, so any stimulus that activates orexi- 
genic cells inactivates anorexigenic cells, and vice versa. This 
strengthens the effect of stimulatory inputs. 
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The anorexigenic (appetite-suppressing) neurons 
in the arcuate nucleus produce a-melanocyte- 
stimulating hormone (a-MSH, also known as 
melanocortin), formed from its polypeptide precursor 
pro-opiomelanocortin (POMC; Fig. 23-6). Release of 
a-MSH causes the next neuron in the circuit to send the 
signal to the brain: Stop eating! 

The amount of leptin released by adipose tissue de- 
pends on both the number and the size of adipocytes. 
When weight loss decreases the mass of lipid tissue, lep- 
tin levels in the blood decrease, the production of NPY 
falls, and the processes in adipose tissue shown in Fig- 
ure 23-35 are reversed. Uncoupling is diminished, slow- 
ing thermogenesis and saving fuel, and fat mobilization 
slows in response to reduced signaling by cAMP. Con- 
sumption of more food combined with more efficient uti- 
lization of fuel results in replenishment of the fat reserve 
in adipose, bringing the system back into balance. 

Leptin may also be essential to the normal develop- 
ment of hypothalamic neuronal circuits. In mice, the 
outgrowth of nerve fibers from the arcuate nucleus dur- 
ing early brain development is slower in the absence of 
leptin, affecting both the orexigenic and (to a lesser ex- 
tent) anorexigenic outputs of the hypothalamus. It is 
possible that the leptin levels during development of 
these circuits determine the details of the hardwiring of 
this regulatory system. 

Leptin Triggers a Signaling Cascade That Regulates 
Gene Expression 

The leptin signal is transduced by a mechanism also 
used by receptors for interferon and growth factors, the 
JAK-STAT system (Fig. 23-37; see Fig. 12-18). The 
leptin receptor, which has a single transmembrane seg- 
ment, dimerizes when leptin binds to the extracellular 
domains of two monomers. Both monomers are phos- 
phorylated on a T^r residue of their intracellular domain 
by a Janus kinase (JAK). The (£)-Tyr residues be- 
come docking sites for three proteins that are signal 
transducers and activators of transcription (STATs 3, 5, 
and 6; sometimes called fat-STATS). The docked STATs 
are then phosphorylated on Tyr residues by the same 
JAK. After phosphorylation, the STATs dimerize then 
move to the nucleus, where they bind to specific DNA 
sequences and stimulate the expression of target genes, 
including the gene for POMC, from which a-MSH is 
produced. 

The increased catabolism and thermogenesis trig- 
gered by leptin are due in part to increased synthesis of 
the mitochondrial uncoupling protein thermogenin 
(product of the UCP1 gene) in adipocytes. Leptin stim- 
ulates the synthesis of thermogenin by altering synaptic 
transmissions from neurons in the arcuate nucleus to 
adipose and other tissues. In these tissues, leptin causes 
increased release of norepinephrine, which acts through 
jS 3 -adrenergic receptors to stimulate transcription of the 
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FIGURE 23-37 The JAK-STAT mechanism of leptin signal transduction 
in the hypothalamus. Leptin binding induces dimerization of the lep- 
tin receptor, followed by phosphorylation of specific Tyr residues, cat- 
alyzed by Janus kinase (JAK). STATs bound to the phosphorylated 
leptin receptor through their SH2 domains are now phosphorylated on 
Tyr residues by a separate activity of JAK. The STATs dimerize, binding 
each other's ®-Tyr residues, and enter the nucleus. Here, they bind 
specific regulatory regions in the DNA and alter the expression of cer- 
tain genes. The products of these genes ultimately influence the organ- 
ism's feeding behavior and energy expenditure. 

UCP1 gene. The resulting uncoupling of electron trans- 
fer from oxidative phosphorylation consumes fat and is 
thermogenic (Fig. 23-35). 

Might human obesity be the result of insufficient 
leptin production, and therefore treatable by the in- 
jection of leptin? Blood levels of leptin are in fact usu- 
ally much higher in obese animals (including humans) 
than in animals of normal body mass (except, of 
course, in oblob mutants, which cannot make leptin). 
Some downstream element in the leptin response sys- 
tem must be defective in obese individuals, and the 
elevation in leptin is the result of an (unsuccessful) 
attempt to overcome the leptin resistance. In those 
very rare humans with extreme obesity who have a de- 
fective leptin gene (05), leptin injection does result in 
dramatic weight loss. In the vast majority of obese in- 
dividuals, however, the OB gene is intact. In clinical 
trials, the injection of leptin did not have the weight- 
reducing effect observed in obese ob/ob mice. Clearly, 
most cases of human obesity involve one or more fac- 
tors in addition to leptin. 
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The Leptin System May Have Evolved to Regulate the 
Starvation Response 

The leptin system probably evolved to adjust an animal's 
activity and metabolism during periods of fasting and 
starvation, not as a means to restrict weight gain. The 
reduction in leptin level triggered by nutritional defi- 
ciency reverses the thermogenic processes illustrated in 
Figure 23-35, allowing fuel conservation. Leptin (acting 
in the hypothalamus) also triggers decreased produc- 
tion of thyroid hormone (slowing basal metabolism), de- 
creased production of sex hormones (preventing 
reproduction), and increased production of glucocorti- 
coids (mobilizing the body's fuel-generating resources). 
By minimizing energy expenditure and maximizing the 
use of endogenous reserves of energy, these leptin- 
mediated responses may allow an animal to survive 
periods of severe nutritional deprivation. In liver 
and muscle, leptin stimulates AMP-activated protein 
kinase (AMPK), and through its action inhibits fatty 
acid synthesis and activates fatty acid oxidation, favoring 
energy-producing processes. 

Insulin Acts in the Arcuate Nucleus to Regulate Eating and 
Energy Conservation 

Insulin secretion reflects both the size of fat reserves 
(adiposity) and the current energy balance (blood glu- 
cose level). Insulin acts on insulin receptors in the hy- 
pothalamus to inhibit eating (Fig. 23-36). Insulin 
receptors in the orexigenic neurons of the arcuate nu- 
cleus inhibit the release of NPY, and insulin receptors in 
the anorexigenic neurons stimulate a-MSH production, 
thereby decreasing fuel intake and increasing thermoge- 
nesis. By mechanisms discussed in Section 23.3, insulin 
also signals muscle, liver, and adipose tissues to increase 
the conversion of glucose to acetyl-CoA, providing the 
starting material for fat synthesis. 

Leptin makes the cells of liver and muscle more 
sensitive to insulin. One hypothesis to explain this ef- 
fect suggests cross-talk between the protein tyrosine 
kinases activated by leptin and those activated by in- 
sulin (Fig. 23-38); common second messengers in the 
two signaling pathways allow leptin to trigger some of the 
same downstream events that are triggered by insulin, 
through insulin receptor substrate-2 (IRS-2) and phos- 
phoinositide 3-kinase (PI-3K) (Fig. 12-16). 

Adiponectin Acts through AMPK to Increase 
Insulin Sensitivity 

Adiponectin is a peptide hormone (224 amino acids) 
produced almost exclusively in adipose tissue, an 
adipokine that sensitizes other organs to the effects of in- 
sulin, protects against atherosclerosis, and inhibits 
inflammatory responses (monocyte adhesion, macrophage 
transformation, and the proliferation and migration of the 
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FIGURE 23-38 A possible mechanism for cross-talk between receptors 
for insulin and leptin. The insulin receptor has intrinsic Tyr kinase activ- 
ity {see Fig. 12-1 5), and the leptin receptor, when occupied by its ligand, 
is phosphorylated by a soluble Tyr kinase OAK). One possible explana- 
tion for the observed interaction between leptin and insulin is that both 
may phosphorylate the same substrate — in the case shown here, insulin 
receptor substrate-2 (IRS-2), When phosphorylated, IRS-2 activates Pl- 
3K, which has downstream consequences that include inhibition of food 
intake. IRS-2 serves here as an integrator of the input from two receptors. 



cells of vascular smooth muscle). Adiponectin circulates 
in the blood and powerfully affects the metabolism of 
fatty acids and carbohydrates in liver and muscle. It 
increases the uptake of fatty acids from the blood by 
myocytes and the rate at which fatty acids undergo /3 
oxidation in muscle. It also blocks fatty acid synthesis and 
gluconeogenesis in hepatocytes, and stimulates glucose 
uptake and catabolism in muscle and liver. 

These effects of adiponectin are indirect and not 
fully understood, but AMPK clearly mediates many of 
them. Acting through its plasma membrane receptor, 
adiponectin triggers phosphorylation and activation of 
AMPK. Recall (see Fig. 15-6) that AMPK is activated 
by factors that signal the need to shift metabolism 
away from biosynthesis and toward energy production 
(Fig. 23-39). When activated, AMPK phosphorylates 
target proteins critical to the metabolism of lipids and 
carbohydrates, with profound effects on the metabolism 
of the whole animal (Fig. 23-40). Adiponectin receptors 
are also present in the brain; the hormone activates 
AMPK in the hypothalamus, stimulating food intake and 
reducing energy expenditure. 

One enzyme regulated by AMPK in the liver and in 
white adipose tissue is acetyl-CoA carboxylase, which 
produces malonyl-CoA, the first intermediate commit- 
ted to fatty acid synthesis. Malonyl-CoA is a powerful 
inhibitor of the enzyme carnitine acyltransferase I, 
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which starts the process of oxidation by transporting 
fatty acids into the mitochondrion (see Fig. 17-6). By 
phosphorylating and inactivating acetyl-CoA carboxy- 
lase, AMPK inhibits fatty acid synthesis while relieving 
the inhibition (by malonyl-CoA) of (3 oxidation (see 



FIGURE 23-39 The role of AMP-activated protein kinase (AMPK) in 
regulating ATP metabolism. ADP produced in synthetic reactions is 
converted to AMP by adenylate kinase. AMP activates AMPK, which 
regulates anabolic and catabolic pathways by phosphorylating key en- 
zymes (see Fig. 23-40). 

Fig. 17-12). Cholesterol synthesis is also inhibited by 
AMPK, which phosphorylates and inactivates HMG-CoA 
reductase, an enzyme in the path to cholesterol (see 
Fig. 21-34). Similarly, AMPK inhibits fatty acid synthase 
and acyl transferase, effectively blocking the synthesis 
of triacylglycerols. 

■ Mice with defective adiponectin genes are less 
sensitive to insulin than those with normal 
adiponectin, and they show poor glucose tolerance: in- 
gestion of dietary carbohydrate causes a long-lasting 
rise in blood glucose. These metabolic defects resemble 
those of humans with type 2 diabetes, who are similarly 
insulin-insensitive and clear glucose from the blood 
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FIGURE 23-40 Formation of adiponectin and its actions through 
AMPK. Extended fasting or starvation results in decreased reserves of 
triacylglycerols in adipose tissue, which triggers adiponectin produc- 
tion and release from adipocytes. The rise in plasma adiponectin acts 
through its plasma membrane receptors in various cell types and or- 
gans to inhibit energy-consuming processes and stimulate energy- 
producing processes. Adiponectin acts through its receptors in the 
brain to stimulate feeding behavior and inhibit energy-consuming 
physical activity, and to inhibit thermogenesis in brown fat. This hor- 
mone exerts its metabolic effects by activating AMPK, which regulates 
(by phosphorylation) specific enzymes in key metabolic processes (see 



T 



1 

<8> 



Pancreas 
0 cell: 
insulin secretion 



Liver 
fatty acid synthesis, 
cholesterol synthesis 



Adipose tissue 
fatty acid synthesis, 
lipolysis 



behavior expenditure 



Fig. 1 5-6). PFK-2, phosphofructokinase-2; GLUT1 and GLUT4, glucose 
transporters; FAS I, fatty acid synthase I; ACC, acetyl-CoA carboxylase; 
HSL, hormone-sensitive lipase; HMGR, HMG-CoA reductase; GPAT, 
acyl transferase; GS, glycogen synthase; eEF2, eukaryotic elongation 
factor 2 (required for protein synthesis; see Chapter 27); mTOR, mam- 
malian target of rapamycin (a protein kinase that regulates protein syn- 
thesis on the basis of nutrient availability), Thiazolidinedione drugs 
activate the transcription factor PPARy (see Figs 23-41, 23-42), which 
then turns on adiponectin synthesis, indirectly activating AMPK. Exer- 
cise, through conversion of ATP to ADP and AMP, also stimulates 
AMPK. 
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only slowly. Indeed, individuals with obesity or type 2 di- 
abetes have lower blood adiponectin levels than nondia- 
betic controls. Moreover, drugs used in the treatment of 
type 2 diabetes — the thiazolidinediones, such as rosigli- 
tazone (Avandia) and pioglitazone (Actos) (p. 824) — 
increase the expression of adiponectin mRNA in adipose 
tissue and increase blood adiponectin levels in experi- 
mental animals; they also activate AMPK (Fig. 23-40). 
(In 2007 the safety of Avandia was being reassessed be- 
cause of an associated increased risk of heart attack.) It 
seems likely that adiponectin will prove to be an impor- 
tant link between type 2 diabetes and its most important 
predisposing factor, obesity. ■ 

Diet Regulates the Expression of Genes Central to 
Maintaining Body Mass 

Proteins in a family of ligand-activated transcription 
factors, the peroxisome proliferator-activated 
receptors (PPARs), respond to changes in dietary 
lipid by altering the expression of genes involved in fat 
and carbohydrate metabolism. These transcription fac- 
tors were first recognized for their roles in peroxisome 
synthesis— thus their name. Their normal ligands are 
fatty acids or fatty acid derivatives, but they can also 
bind synthetic agonists and can be activated in the labo- 
ratory by genetic manipulation. PPARa, PPARS, and 
PPARy are members of this nuclear receptor superfam- 
ily. They act in the nucleus by forming heterodimers 
with another nuclear receptor, RXR (retinoid X recep- 
tor), binding to regulatory regions of DNA near the 
genes under their control and changing the rate of tran- 
scription of those genes (Fig. 23-41). 

PPARy, expressed primarily in liver and adipose 
tissue, is involved in turning on genes necessary to the 
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FIGURE 23-41 Mode of action of PPARs. PPARs are transcription factors 
that, when bound to their cognate ligand (L), form heterodimers with the 
nuclear receptor RXR. The dimer binds specific regions of DNA known 
as response elements, stimulating transcription of genes in those regions. 

differentiation of fibroblasts into adipocytes and genes 
that encode proteins required for lipid synthesis and 
storage in adipocytes (Fig. 23-42), PPARy is activated 
by the thiazolidinedione drugs that are used to treat 
type 2 diabetes. 



Liver 

^^^^^^^^^^ Fat 

Fatty acid oxidation and storage 
Starvation response 



Adipose tissue 




Fat synthesis 

and storage Fatty acid oxidation 
Adipokine production Thermogenesis 



FIGURE 23-42 Metabolic integration by 
PPARs. The three PPAR isoforms regulate lipid 
and glucose homeostasis through their coordi- 
nated effects on gene expression in liver, mus- 
cle, and adipose tissue. PPARa and PPARS (and 
its closely related isoform PPAR/3) regulate lipid 
utilization; PPARy regulates lipid storage and 
insulin sensitivity of various tissues. 



|PPAR« 



Fatty acid oxidation 

\ 



PARy 



Insulin 
sensitivity 



PPAR6 



Fatty acid oxidation 
Thermogenesis 




Muscle 



23.4 Obesity and the Regulation of Body Mass 937 



PPARa is expressed in liver, kidney, heart, skeletal 
muscle, and brown adipose tissue. The ligands that acti- 
vate this transcription factor include eicosanoids, free 
fatty acids, and the class of drugs called fibrates, such as 
fenofibrate (TriCor) and ciprofibrate (Modalim), which 
are used to treat coronary heart disease by raising HDL 
and lowering blood triacylglycerols. In hepatocytes, 
PPARa turns on the genes necessary for the uptake and 
jS oxidation of fatty acids and formation of ketone bodies 
during fasting. 

PPARS (and its closely related isotype, PPAR/3) are 
key regulators of fat oxidation, which act by sensing 
changes in dietary lipid. PPARS acts in liver and muscle, 
stimulating the transcription of at least nine genes en- 
coding proteins for oxidation and for energy dissipa- 
tion through uncoupling of mitochondria. Normal mice 
overfed on high-fat diets accumulate massive amounts of 
both brown and white fat, and fat droplets accumulate in 
the liver. But when the same overfeeding experiment is 
carried out with mice that have a genetically altered, al- 
ways active PPARS, this fat accumulation is prevented. 
In mice with a nonfunctioning leptin receptor (db/db), 
activated PPAR£ prevents the development of obesity 
that would otherwise occur (see Fig. 23-34). By stimu- 
lating fatty acid breakdown in uncoupled mitochondria, 
PPARS causes fat depletion, weight loss, and thermogen- 
esis. Seen in this light, thermogenesis is both a means of 
keeping warm and a defense against obesity. Clearly, 
PPAR5 is a potential target for drugs to treat obesity. 

Short-Term Eating Behavior Is influenced by 
Ghrelin and PYY 3 _ 36 

Ghrelin is a peptide hormone (28 amino acids) pro- 
duced in cells lining the stomach. It was originally rec- 
ognized as the stimulus for the release of growth 
hormone (ghre is the Proto-Indo-European root of 
"grow"), then subsequently shown to be a powerful ap- 
petite stimulant that works on a shorter time scale (be- 
tween meals) than leptin and insulin. Ghrelin receptors 
are located in the pituitary gland (presumably mediating 
growth hormone release) and in the hypothalamus (af- 
fecting appetite), as well as in heart muscle and adipose 
tissue. The concentration of ghrelin in the blood varies 
strikingly between meals, peaking just before a meal and 
dropping sharply just after the meal (Fig. 23-43). 
Injection of ghrelin into humans produces immediate 
sensations of intense hunger. Individuals with Prader- 
Willi syndrome, whose blood levels of ghrelin are excep- 
tionally high, have an uncontrollable appetite, leading to 
extreme obesity that often results in death before the 
age of 30. 

PYY 3 _3 6 is a peptide hormone (34 amino acids) se- 
creted by endocrine cells in the lining of the small intes- 
tine and colon in response to food entering from the 
stomach. The level of PYY 3 _3 6 in the blood rises after a 
meal and remains high for some hours. It is carried in the 



(a) 

Breakfast Lunch Dinner 



sun 




i \}\t 




600 




500 




400 




300 


_ 1 

1 1 1 


200 


.It !i i 



(b) 





€00 






500 




o 

a 


400 




3 
a 


300 




Insuli 


200 






100 





oil J I J_ 1 

t\ i.m. 12 noon 6 p.m. 12 midnight 

Time of day 

FIGURE 23-43 Variations in ghrelin and insulin relative to meal times. 

(a) Plasma levels of ghrelin rise sharply just before the normal time for 
meals (7 a.m. breakfast, 12 noon lunch, 5:30 p.m. dinner) and drop 
precipitously just after meals, paralleling the subjective feelings of 
hunger, (b) Insulin levels rise immediately after each meal, in response 
to the increase in blood glucose concentration. 



blood to the arcuate nucleus, where it acts on orexigenic 
neurons, inhibiting NPY release and reducing hunger (Fig. 
23-36). Humans injected with PYY 3 _ 36 feel little hunger 
and eat less than normal amounts for about 12 hours. 

This interlocking system of neuroendocrine controls of 
food intake and metabolism presumably evolved to pro- 
tect against starvation and to eliminate counterproduc- 
tive accumulation of fat (extreme obesity). The difficulty 
most people face in trying to lose weight testifies to the 
remarkable effectiveness of these controls. 

SUMMARY 23.4 Obesity and the 

Regulation of Body Mass 

■ Obesity is increasingly common in the United States 
and other developed countries, and predisposes 
people toward several life-threatening conditions. 

■ Adipose tissue produces leptin, a hormone that 
regulates feeding behavior and energy expenditure 
so as to maintain adequate reserves of fat. Leptin 
production and release increase with the number 
and size of adipocytes. 
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■ Leptin acts on receptors in the arcuate nucleus 
of the hypothalamus, causing the release of 
anorexigenic peptides, including a-MSH, that act 
in the brain to inhibit eating. Leptin also stimulates 
sympathetic nervous system action on adipocytes, 
leading to uncoupling of mitochondrial oxidative 
phosphorylation, with consequent thermogenesis. 

m The signal-transduction mechanism for leptin 
involves phosphorylation of the JAK-STAT system. 
On phosphorylation by JAK, STATs can bind to 
regulatory regions in nuclear DNA and alter the 
expression of genes for proteins that set the level of 
metabolic activity and determine feeding behavior. 
Insulin acts on receptors in the arcuate nucleus, 
with results similar to those caused by leptin. 

■ The hormone adiponectin stimulates fatty acid 
uptake and oxidation and inhibits fatty acid synthesis, 
as well as sensitizing muscle and liver to insulin. 

Its actions are mediated by AMPK, which is also 
activated by low [AMP] and exercise. 

■ Ghrelin, a hormone produced in the stomach, acts 
on orexigenic neurons in the arcuate nucleus to 
produce hunger before a meal. PYY 3 _ 36 , a peptide 
hormone of the intestine, acts at the same site to 
lessen hunger after a meal. 

23.5 Obesity, the Metabolic Syndrome, 
and Type 2 Diabetes 

EF9 In the industrialized world, where the food sup- 
HH ply is more than adequate, there is a growing 
epidemic of obesity and the type 2 diabetes associated 
with it. Worldwide, between 150 and 170 million people 
have diabetes, primarily of type 2. The projected num- 
ber of cases rises to 220 million in 2010, and 360 million 
in 2030. The pathology of diabetes includes cardiovas- 
cular disease, renal failure, blindness, amputations due 
to poor healing in the extremities, and neuropathy. In 
2000, the global mortality from diabetes was 2.9 mil- 
lion, a number that is sure to rise in coming years. 
Clearly, it is essential to understand type 2 diabetes 
and its relationship to obesity, and to find countermea- 
sures that prevent or reverse the damage done by this 
disease. ■ 

In Type 2 Diabetes the Tissues Become 
insensitive to insulin 

I The hallmark of type 2 diabetes is the develop- 
HH ment of insulin resistance: a state in which more 
insulin is needed to bring about the biological effects 
produced by a lower amount of insulin in the normal, 
healthy state. In the early stages of the disease, pancre- 
atic (3 cells secrete enough insulin to overcome the lower 
insulin sensitivity of muscle and liver. But the /3 cells 
eventually fail, and the lack of insulin becomes apparent 



in the body's inability to regulate blood glucose. The 
intermediate stage, preceding type 2 diabetes melli- 
tus, is sometimes called the metabolic syndrome, or 
syndrome X. This is typified by obesity, especially in 
the abdomen; hypertension (high blood pressure); ab- 
normal blood lipids (high TAG and LDL, low HDL); 
slightly high blood glucose; and a reduced ability to clear 
glucose in the glucose-tolerance test. Individuals with 
metabolic syndrome often also show changes in blood 
proteins, changes that are associated with abnormal 
clotting (high fibrinogen concentration) or inflammation 
(high concentrations of the C-reactive peptide, which 
typically increases with an inflammatory response). 
About 27% of the adult population in the United States 
has these symptoms of metabolic syndrome! 

According to the "lipid burden" hypothesis to ex- 
plain the onset of type 2 diabetes, the action of PPARy on 
adipocytes normally keeps the cells ready to synthesize 
and store triacylglycerols — the adipocytes are insulin- 
sensitive and produce leptin, which leads to their contin- 
ued intracellular deposition of TAG. However, in obese 
individuals the adipocytes become filled with TAG, and 
the adipose tissue cannot meet any increased demand 
for TAG storage. Adipocytes and their precursors, 
preadipocytes, become less sensitive to insulin. Gene ex- 
pression normally associated with the development of 
new adipocytes (genes for the transcription factors 
SREBP1 and PPARy, for example) is dowmregulated in 
the adipocytes of obese individuals, but is unregulated in 
other tissues, including skeletal muscle and liver, which 
begin to store TAGs (Fig. 23-44). Substantial quantities 
of triacylglycerols are now stored "ectopically" — in ab- 
normal locations. Moreover, according to this hypothesis, 
excess stored fatty acids and TAGs are toxic to liver and 
muscle. Some individuals are less well equipped geneti- 
cally to handle this burden of ectopic lipids and are more 
susceptible to the cellular damage that leads to the de- 
velopment of type 2 diabetes. Insulin resistance probably 
involves impairment of several of the mechanisms by 
which insulin exerts its metabolic effects — changes in 
protein levels; changes in the activities of signaling en- 
zymes and transcription factors. For example, both 
adiponectin synthesis in adipocytes and adiponectin 
level in the blood decrease with obesity, and increase 
with weight loss. 

Chronic inflammation of adipose tissue is a com- 
mon feature of obesity. Genes associated with inflam- 
mation and macrophage activity are upregulated, and 
macrophage invasion of adipose tissue is frequently 
detected in obese individuals. Lipid-laden adipocytes 
also undergo lipoapoptosis — programmed cell death 
triggered by abnormal concentrations of lipids. 

Several of the drugs that are effective in improving in- 
sulin sensitivity in type 2 diabetes are known to act on spe- 
cific proteins in signaling pathways, and their actions are 
consistent with the lipotoxicity model. Thiazolidinediones 
bind to PPARy, turning on a set of adipocyte-specific genes 
and promoting the differentiation of preadipocytes to small 
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FIGURE 23-44 "Lipid burden" hypothesis for the development of type 
2 diabetes mellitus. Normally, adipocytes have reserve capacity for 
storing triacylglycerols, maintained in this state by expression of the 
transcription factor PPARy. In obesity, the capacity of adipose tissue to 
store TAGs is exhausted. The lipid-synthesizing machinery maintained 
by SREBP1 decreases as the expression of this transcription factor 

adipocytes, thereby increasing the body's capacity for ab- 
sorbing fatty acids from the diet and storing them as TAGs. 

There are clearly genetic factors that predispose 
toward type 2 diabetes. Although 80% of individuals 
with type 2 diabetes are obese, most obese individuals 
do not develop type 2 diabetes. Given the complexity 
of the regulatory mechanisms we have discussed in 
this chapter, it is not surprising that the genetics of di- 
abetes is complex, involving interactions between vari- 
ant genes and environmental factors, including diet 
and lifestyle. At least 10 genetic loci have been reliably 
linked to type 2 diabetes; variation in any of these "di- 
abetogenes" alone would cause a relatively small in- 
crease in the likelihood of developing type 2 diabetes. 
For example, people with a variant PPARy in which an 
Ala residue replaces Pro at position 12 are at a slightly, 
though significantly, increased risk of developing type 
2 diabetes. ■ 
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decreases in adipocytes and increases in liver and muscle, and the latter 
tissues begin to store TAGs. This ectopic storage of lipids is better toler- 
ated in some individuals than in others; the more-tolerant individuals 
are genetically less susceptible to the deleterious changes resulting 
from ectopic lipid storage (leading to type 2 diabetes) or are genetically 
better equipped to manage this storage. 

Type 2 Diabetes Is Managed with Diet Exercise, and 
Medication 

Studies show that three factors improve the 
health of individuals with type 2 diabetes: dietary 
restriction, regular exercise, and drugs that increase in- 
sulin sensitivity or insulin production. Dietary restric- 
tion (and accompanying weight loss) reduces the 
overall burden of handling fatty acids. The lipid compo- 
sition of the diet influences, through PPARs and other 
transcription factors, the expression of genes that encode 
proteins involved in fatty acid oxidation and in energy 
expenditure via thermogenesis. Exercise activates 
AMPK, as does adiponectin; AMPK shifts metabolism to- 
ward fat oxidation and inhibits fat synthesis. 

Several classes of drugs are used in the management 
of type 2 diabetes, some of which we have discussed 
earlier in the chapter (Table 23-7). Biguanides such as 



:| Treatments for Type 2 Diabetes Mellitus 



Intervention/treatment 



Direct target 



Weight loss 



Exercise 

Sulfonylureas: glipizide (Glucotrol), 
glyburide (several brands), 
glimepiride (Amaryl) 

Biguanides: metformin (Glucophage) 

Thiazoladinediones: troglitazone 
(Rezulin),* rosiglitazone (Avandia), 1 
pioglitazone (Actos) 

GLP-1 modulators: exenatide (Byetta), 
sitagliptin (Januvia) 



Adipose tissue; reduces TAG 
content 

AMPK, activated by increasing 

[AMP]/[ATP] 
Pancreatic jS cells; K + channels 

blocked 

AMPK, activated 
PPARy 



Glucagon-like peptide- 1, 
dipeptide protease IV 



Effect of treatment 



Reduces lipid burden; increases capacity 
for lipid storage in adipose tissue; 
restores insulin sensitivity 

See Fig. 23-40; aids weight loss 

Stimulates insulin secretion by pancreas 



Increases glucose uptake by muscle; 

decreases glucose production in liver 
Stimulates expression of genes, potentiating 

the action of insulin in liver, muscle, 

adipose tissue; increases glucose uptake; 

decreases glucose synthesis in liver 
Enhances insulin secretion by pancreas 



♦Voluntarily withdrawn because of side effects 
f Being reevaluated because of emerging side effects. 



940J Hormonal Regulation and Integration of Mammalian Metabolism 



metformin (Glucophage) activate AMPK, mimicking 
the effects of adiponectin. Thiazolidinediones act 
through PPARs to increase the concentration of 
adiponectin in plasma and to stimulate adipocyte dif- 
ferentiation, thereby increasing the capacity for TAG 
storage. Sulfonylureas act on the ATP-gated K + chan- 
nels in p cells to stimulate insulin release. Inhibitors of 
dipeptide protease IV (DPP IV) prevent the prote- 
olytic degradation of glucagon-like peptide- 1 (GLP-1), 
a peptide hormone produced in the gut that stimulates 
pancreatic insulin secretion. Inhibition of the pepti- 
dase prolongs the action of GLP-1, effectively increas- 
ing insulin secretion. 

Clearly, a combination of weight loss and exer- 
cise is the preferred way to prevent development of 
the metabolic syndrome and type 2 diabetes. Recent 
findings suggest an interesting possibility for aiding 
weight loss and reducing the amount of TAG that 
must be stored. The protein PRDM16 is expressed 
strongly, and perhaps uniquely, in brown adipose 
tissue. Its exact function is unknown, but the pro- 
tein has a zinc finger, typical of many proteins that 
interact with DNA and influence transcription (see 
Fig. 28-12). When overexpressed in the adipose tis- 
sue of mice, PRDM16 induces the differentiation of 
preadipocytes in white adipose tissue into brown 
adipocytes, with high levels of thermogenin and 
strikingly uncoupled respiration. Such cells could, 
in principle, consume fatty acids above the amount 
needed for ATP production, converting the energy 
of oxidation to heat. Given the widespread and in- 
creasing occurrence of type 2 diabetes, research 
into methods for avoiding or reversing the disease is 
sure to attract a great deal of interest. ■ 

SUMMARY 23.5 Obesity, the Metabolic 
Syndrome, and Type 2 
Diabetes 

■ The metabolic syndrome, which includes obesity, 
hypertension, elevated blood lipids, and insulin 
resistance, is often the prelude to type 2 
diabetes. 

■ The insulin resistance that characterizes type 

2 diabetes may be a consequence of abnormal lipid 
storage in muscle and liver, in response to a lipid 
intake that cannot be accommodated by adipose 
tissue. 

■ Expression of the enzymes of lipid synthesis is 
under tight and complex regulation. PPARs are 
transcription factors that determine the rate of 
synthesis of many enzymes involved in lipid 
metabolism and adipocyte differentiation. 

r Effective treatments for type 2 diabetes include 
exercise, appropriate diet, and drugs that increase 
insulin sensitivity or insulin production. 
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pose tissue as source and target for novel therapies. Trends En- 
docrinol Metab 17, 26-32. 
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Muoio, D.M. & Newgard, C.B. (2006) Obesity-related derange- 
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Nadler, S.T. & Attie, A.D. (2001) Please pass the chips: genomic 
insights into obesity and diabetes J. Nutr. 131, 2078-2081. 

Clear statement of the "lipid burden" hypothesis for the origin of 
type 2 diabetes. 

Powell, K. (2007) The two faces of fat. Nature 447, 525-527. 

Signals from fat cells directly promote insulin resistance and trig- 
ger inflammation, which may in turn cause type 2 diabetes, cardio- 
vascular disease, increased cancer risk, and other obesity-associated 
problems, 

Qatanani, M. & Lazar, M.A. (2007) Mechanisms of obesity- 
associated insulin resistance: many choices on the menu. Genes Dev. 
21, 1443-1455 

Savage, D.B., Petersen, K.F., & Shulman, G.I. (2007) Disordered 
lipid metabolism and the pathogenesis of insulin resistance. Physiol 
Rev 87, 507-520. 

Saxena, R., Voight, B.F., I^yssenko, V., Burtt, N.P., de Bakker, 
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1331-1336 

Seale, P., Kajimura, S., Yang, W., Chin, S., Rohas, L.M., Uldry, 
M., Tavernier, G., Langin, D., & Spiegelman, B.M. (2007) Tran- 
scriptional control of brown fat determination by PRDM16. Cell 
Metab 6,38-54. 

Overexpression of PRDM16 leads to development of brown adi- 
pose tissue. 

Semple, R.K., Chatterjee, V.K.K., & O'Rahilly, S. (2006) PPARy 
and human metabolic disease J. Clin Invest. 116,581-589. 
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1821-1830. 

Problems 

1. Peptide Hormone Activity Explain how two peptide 
hormones as structurally similar as oxytocin and vasopressin 
can have such different effects (see Fig. 23-10). 

2. ATP and Phosphocreatine as Sources of Energy for 
Muscle During muscle contraction, the concentration of 
phosphocreatine in skeletal muscle drops while the concentra- 
tion of ATP remains fairly constant. However, in a classic ex- 
periment, Robert Davies found that if he first treated muscle 
with l-fluoro-2,4-dinitrobenzene (p. 94), the concentration of 
ATP declined rapidly while the concentration of phosphocrea- 
tine remained unchanged during a series of contractions. Sug- 
gest an explanation. 

3. Metabolism of Glutamate in the Brain Brain tissue 
takes up glutamate from the blood, transforms it into gluta- 
mine, and releases the glutamine into the blood. What is ac- 
complished by this metabolic conversion? How does it take 
place? The amount of glutamine produced in the brain can ac- 
tually exceed the amount of glutamate entering from the 
blood. How does this extra glutamine arise? (Hint: You may 
want to review amino acid catabolism in Chapter 18; recall that 
NH4 is very toxic to the brain.) 

4. Proteins as Fuel during Fasting When muscle proteins 
are catabolized in skeletal muscle during a fast, what are the 
fates of the amino acids? 

5. Absence of Glycerol Kinase in Adipose Tissue Glyc- 
erol 3-phosphate is required for the biosynthesis of triacyl- 
glycerols. Adipocytes, specialized for the synthesis and 
degradation of triacylglycerols, cannot use glycerol directly, 
because they lack glycerol kinase, which catalyzes the reaction 

Glycerol + ATP > glycerol 3-phosphate + ADP 

How does adipose tissue obtain the glycerol 3-phosphate nec- 
essary for triacylglycerol synthesis? 

6. Oxygen Consumption during Exercise A sedentary 
adult consumes about 0.05 L of 0 2 in 10 seconds. A sprinter, 
running a 100 m race, consumes about 1 L of 0 2 in 10 seconds 
After finishing the race, the sprinter continues to breathe at an 
elevated (but declining) rate for some minutes, consuming an 
extra 4 L of 0 2 above the amount consumed by the sedentary 
individual 

(a) Why does the need for 0 2 increase dramatically dur- 
ing the sprint? 

(b) Why does the demand for 0 2 remain high after the 
sprint is completed? 

7. Thiamine Deficiency and Brain Function Individuals 
with thiamine deficiency show some characteristic neurologi- 
cal signs and symptoms, including loss of reflexes, anxiety, and 



mental confusion. Why might thiamine deficiency be mani- 
fested by changes in brain function? 

8. Potency of Hormones Under normal conditions, the hu- 
man adrenal medulla secretes epinephrine (C 9 H 13 N0 3 ) at a rate 
sufficient to maintain a concentration of 10" 10 m in circulating 
blood. To appreciate what that concentration means, calculate 
the diameter of a round swimming pool, with a water depth of 
2 0 m, that would be needed to dissolve 1.0 g (about 1 teaspoon) 
of epinephrine to a concentration equal to that in blood. 

9. Regulation of Hormone Levels in the Blood The half- 
life of most hormones in the blood is relatively short. For ex- 
ample, when radioactively labeled insulin is injected into an 
animal, half of the labeled hormone disappears from the blood 
within 30 min. 

(a) What is the importance of the relatively rapid inacti- 
vation of circulating hormones? 

(b) In view of this rapid inactivation, how is the level of 
circulating hormone kept constant under normal conditions? 

(c) In what ways can the organism make rapid changes in 
the level of a circulating hormone? 

10. Water-Soluble versus Lipid-Soluble Hormones On 

the basis of their physical properties, hormones fall into one of 
two categories: those that are very soluble in water but rela- 
tively insoluble in lipids (e.g., epinephrine) and those that are 
relatively insoluble in water but highly soluble in lipids (e.g., 
steroid hormones). In their role as regulators of cellular activ- 
ity, most water-soluble hormones do not enter their target 
cells. The lipid-soluble hormones, by contrast, do enter their 
target cells and ultimately act in the nucleus. What is the cor- 
relation between solubility, the location of receptors, and the 
mode of action of these two classes of hormones? 

11. Metabolic Differences between Muscle and Liver in 
a "Fight-or-Flight" Situation When an animal confronts a 
"fight-or-flight" situation, the release of epinephrine promotes 
glycogen breakdown in the liver, heart, and skeletal muscle. 
The end product of glycogen breakdown in the liver is glucose; 
the end product in skeletal muscle is pyruvate. 

(a) What is the reason for the different products of 
glycogen breakdown in the two tissues? 

(b) What is the advantage to an animal that must fight or 
flee of these specific glycogen breakdown routes? 

12. Excessive Amounts of Insulin Secretion: 
HI Hyperinsulinism Certain malignant tumors of the 
pancreas cause excessive production of insulin by the f3 cells. 
Affected individuals exhibit shaking and trembling, weakness 
and fatigue, sweating, and hunger. 

(a) What is the effect of hyperinsulinism on the metabo- 
lism of carbohydrates, amino acids, and lipids by the liver? 

(b) What are the causes of the observed symptoms? Sug- 
gest why this condition, if prolonged, leads to brain damage. 

13. Thermogenesis Caused by Thyroid Hormones Thy- 
roid hormones are intimately involved in regulating the basal 
metabolic rate. Liver tissue of animals given excess thyroxine 
shows an increased rate of 0 2 consumption and increased heat 
output (thermogenesis), but the ATP concentration in the tissue 



is normal. Different explanations have been offered for the ther- 
mogenic effect of thyroxine. One is that excess thyroxine causes 
uncoupling of oxidative phosphorylation in mitochondria. How 
could such an effect account for the observations? Another ex- 
planation suggests that the thermogenesis is due to an increased 
rate of ATP utilization by the thyroxine-stimulated tissue. Is this 
a reasonable explanation? Why? 

14. Function of Prohormones What are the possible ad- 
vantages of synthesizing hormones as prohormones? 

15. Sources of Glucose during Starvation The typical 
human adult uses about 160 g of glucose per day, 120 g of 
which is used by the brain. The available reserve of glucose 
(-20 g of circulating glucose and -190 g of glycogen) is ade- 
quate for about one day. After the reserve has been depleted 
during starvation, how would the body obtain more glucose? 

16. Parabiotic ob/ob Mice By careful surgery, researchers 
can connect the circulatory systems of two mice so that the 
same blood circulates through both animals. In these parabi- 
otic mice, products released into the blood by one animal 
reach the other animal via the shared circulation. Both animals 
are free to eat independently. If a mutant ob/ob mouse (both 
copies of the OB gene are defective) and a normal OB/OB 
mouse (two good copies of the OB gene) were made parabi- 
otic, what would happen to the weight of each mouse? 

17. Calculation of Body Mass Index A portly biochem- 
istry professor weighs 260 lb (118 kg) and is 5 feet 8 inches 
(173 cm) tall. What is his body mass index? How much weight 
would he have to lose to bring his body mass index down to 25 
(normal)? 

18. Insulin Secretion Predict the effects on insulin secre- 
tion by pancreatic /3 cells of exposure to the potassium 
ionophore valinomycin (p. 404). Explain your prediction. 

19. Effects of a Deleted Insulin Receptor A strain of 
mice specifically lacking the insulin receptor of liver is found to 
have mild fasting hyperglycemia (blood glucose = 132 mg/dL, 
vs. 101 mg/dL in controls) and a more striking hyperglycemia 
in the fed state (glucose = 363 mg/dL, vs. 135 mg/dL in con- 
trols). The mice have higher than normal levels of glucose 6- 
phosphatase in the liver and elevated levels of insulin in the 
blood. Explain these observations. 

B9 20. Decisions on Drug Safety The drug Avandia 

Efl (rosiglitazone) is effective in lowering blood glucose in 
patients with type 2 diabetes, but also seems to carry an in- 
creased risk of heart attack, If it were your responsibility to de- 
cide whether this drug should remain on the market (labeled 
with suitable warnings of its side effects) or should be with- 
drawn, what factors would you weigh in making your decision? 

21. Type 2 Diabetes Medication The drugs acar- 

wLm bose (Precose) and miglitol (Glyset), used in the treat- 
ment of type 2 diabetes mellitus, inhibit a-glucosidases in the 
brush border of the small intestine. These enzymes degrade 
oligosaccharides derived from glycogen or starch to monosac- 
charides. Suggest a possible mechanism for the salutary effect 
of these drugs on individuals with diabetes. What side effects, 
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if any, would you expect from these drugs. Why? (Hint: Review 
lactose intolerance, p. 545). 

Data Analysis Problem 

22. Cloning the Pancreatic j3-Cell Sulfonylurea Recep- 
tor Glyburide, a member of the sulfonylurea family of drugs 
shown on p. 925, is used to treat type 2 diabetes. It binds to 
and closes the ATP-gated K + channel shown in Figures 23-28 
and 23-29 

(a) Given the mechanism shown in Figure 23-28, would 
treatment with glyburide result in increased or decreased in- 
sulin secretion by pancreatic /3 cells? Explain your reasoning. 

(b) How does treatment with glyburide help reduce the 
symptoms of type 2 diabetes? 

(c) Would you expect glyburide to be useful for treating 
type 1 diabetes? Why or why not? 

Aguilar-Bryan and coauthors (1995) cloned the gene for 
the sulfonylurea receptor (SUR) portion of the ATP-gated K + 
channel from hamsters. The research team went to great 
lengths to ensure that the gene they cloned was in fact the 
SUR-encoding gene. Here we explore how it is possible for re- 
searchers to demonstrate that they have actually cloned the 
gene of interest rather than another gene. 

The first step was to obtain pure SUR protein. As was al- 
ready known, drugs such as glyburide bind SUR with very high 
affinity (K d < 10 nM), and SUR has a molecular weight of 140 
to 170 kDa. Aguilar-Bryan and coworkers made use of the 
high-affinity glyburide binding to tag the SUR protein with a 
radioactive label that would serve as a marker to purify the 
protein from a cell extract. First, they made a radiolabeled de- 
rivative of glyburide, using radioactive iodine ( 125 I): 




[125 jj 5_iodo-2-hydroxyglyburide 



(d) In preliminary studies, the 125 I-labeled glyburide deriv- 
ative (hereafter, [ 125 I]glyburide) was shown to have the same 
K d and binding characteristics as unaltered glyburide. Why 
was it necessary to demonstrate this (what alternative possi- 
bilities did it rule out)? 

Even though [ 125 I]glyburide bound to SUR with high affin- 
ity, a significant amount of the labeled drug would probably 
dissociate from the SUR protein during purification. To pre- 
vent this, [ 125 I]glyburide had to be covalently cross-linked to 
SUR. There are many methods for covalent cross-linking; 
Aguilar-Bryan and coworkers used UV light. When aromatic 
molecules are exposed to short-wave UV, they enter an ex- 
cited state and readily form covalent bonds with nearby mole- 
cules. By cross-linking the radiolabeled glyburide to the SUR 
protein, the researchers could simply track the 125 I radioactiv- 
ity to follow SUR through the purification procedure. 

Aguilar-Bryan and colleagues treated hamster HIT cells 
(which express SUR) with [ 125 I]glyburide and UV light, purified 
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the I-labeled 140 kDa protein, and sequenced its amino-ter- 
minal 25 amino acid segment; they found the sequence 
PLAFCGTENHSAAYRVDQGVLNNGC. The investigators then 
generated antibodies that bound to two short peptides in this 
sequence, one that bound to PLAFCGTE and the other to 
HSAAYRVDQGV, and showed that these antibodies bound the 
purified 125 I-labeled 140 kDa protein. 

(e) Why was it necessary to include this antibody-binding 
step? 

Next, the researchers designed PCR primers based on the 
sequences above, and cloned a gene from a hamster cDNA li- 
brary that encoded a protein that included these sequences 
(see Chapter 9 on biotechnology methods). The cloned puta- 
tive SUR cDNA hybridized to an mRNA of the appropriate 
length that was present in cells known to contain SUR. The pu- 
tative SUR cDNA did not hybridize to any mRNA fraction of the 
mRNAs isolated from hepatocytes, which do not express SUR. 

(f) Why was it necessary to include this putative SUR 
cDNA-mRNA hybridization step? 

Finally, the cloned gene was inserted into and expressed 
in COS cells, which do not normally express the SUR gene. 
The investigators mixed these cells with [ 125 I]glyburide with or 
without a large excess of unlabeled glyburide, exposed the 



cells to UV light, and measured the radioactivity of the 140 kDa 
protein produced. Their results are shown in the table. 







Added 


Added excess 


125 I label 




Cell 


putative SUR 


unlabeled 


in 140 kDa 


Experiment 


type 


cDNA? 


glyburide? 


protein 


1 


HIT 


No 


No 


+ + + 


2 


HIT 


No 


Yes 




3 


COS 


No 


No 




4 


COS 


Yes 


No 


+ + + 


5 


COS 


Yes 


Yes 





(g) Why was no 125 I-labeled 140 kDa protein found in ex- 
periment 2? 

(h) How would you use the information in the table to ar- 
gue that the cDNA encoded SUR? 

(i) What other information would you want to collect to 
be more confident that you had cloned the SUR gene? 
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Boyd, A.E. III, Gonzalez, G., Herrera-Sosa, H., Nguy, K., Bryan, 
J., & Nelson, D.A. (1995) Cloning of the /3 cell high-affinity sulfony- 
lurea receptor: a regulator of insulin secretion. Science 268, 423^26. 



PART 1 1 1 




INFORMATION 
PATHWAYS 



24 Genes and Chromosomes 947 

25 DNA Metabolism 975 

26 RNA Metabolism 1021 

27 Protein Metabolism 1065 

28 Regulation of Gene Expression 1115 



*phe third and final part of this book explores the bio- 
chemical mechanisms underlying the apparently 
I contradictory requirements for both genetic conti- 
nuity and the evolution of living organisms. What is the 
molecular nature of genetic material? How is genetic in- 
formation transmitted from one generation to the next 
with high fidelity? How do the rare changes in genetic 
material that are the raw material of evolution arise? 
How is genetic information ultimately expressed in the 
amino acid sequences of the astonishing variety of pro- 
tein molecules in a living cell? 

Today's understanding of information pathways 
has arisen from the convergence of genetics, physics, 
and chemistry in modern biochemistry. This was epit- 
omized by the discovery of the double-helical struc- 
ture of DNA, postulated by James Watson and Francis 
Crick in 1953 (see Fig. 8-13). Genetic theory con- 
tributed the concept of coding by genes. Physics per- 
mitted the determination of molecular structure by 
x-ray diffraction analysis. Chemistry revealed the com- 
position of DNA. The profound impact of the Watson- 
Crick hypothesis arose from its ability to account for a 
wide range of observations derived from studies in 
these diverse disciplines. 

This revolutionized our understanding of the struc- 
ture of DNA and inevitably stimulated questions about 
its function. The double-helical structure itself clearly 
suggested how DNA might be copied so that the infor- 
mation it contains can be transmitted from one genera- 
tion to the next. Clarification of how the information in 



DNA is converted into functional proteins came with the 
discovery of messenger RNA and transfer RNA and with 
the deciphering of the genetic code. 

These and other major advances gave rise to the 
central dogma of molecular biology, comprising the 
three major processes in the cellular utilization of ge- 
netic information. The first is replication, the copying of 
parental DNA to form daughter DNA molecules with 
identical nucleotide sequences. The second is transcrip- 
tion, the process by which parts of the genetic message 
encoded in DNA are copied precisely into RNA. The 
third is translation, whereby the genetic message en- 
coded in messenger RNA is translated on the ribosomes 
into a polypeptide with a particular sequence of amino 
acids. 




Replication I DNA 

Transcription 

RNA 
^ Translation 

Protein 

The central dogma of molecular biology, showing the general pathways 
of information flow via replication, transcription, and translation. The 
term "dogma" is a misnomer, and is retained for historical reasons only. 
Introduced by Francis Crick at a time when little evidence supported 
these ideas, the dogma has become a well-established principle. 
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Part III explores these and related processes. In 
Chapter 24 we examine the structure, topology, and 
packaging of chromosomes and genes. The processes 
underlying the central dogma are elaborated in Chap- 
ters 25 through 27. Finally, we turn to regulation, exam- 
ining how the expression of genetic information is 
controlled (Chapter 28). 

A major theme running through these chapters is 
the added complexity inherent in the biosynthesis of 
macromolecules that contain information. Assembling 
nucleic acids and proteins with particular sequences of 
nucleotides and amino acids represents nothing less 
than preserving the faithful expression of the template 
upon which life itself is based. We might expect the for- 
mation of phosphodiester bonds in DNA or peptide 
bonds in proteins to be a trivial feat for cells, given the 
arsenal of enzymatic and chemical tools described in 
Part II. However, the framework of patterns and rules 
established in our examination of metabolic pathways 
thus far must be enlarged considerably to take into ac- 
count molecular information. Bonds must be formed be- 
tween particular subunits in informational biopolymers, 
avoiding either the occurrence or the persistence of 
sequence errors. This has an enormous impact on the 
thermodynamics, chemistry, and enzymology of the 
biosynthetic processes. Formation of a peptide bond 
requires an energy input of only about 21 kJ/mol of 
bonds and can be catalyzed by relatively simple en- 
zymes. But to synthesize a bond between two specific 
amino acids at a particular point in a polypeptide, the 



cell invests about 125 kJ/mol while making use of more 
than 200 enzymes, RNA molecules, and specialized pro- 
teins. The chemistry involved in peptide bond formation 
does not change because of this requirement, but addi- 
tional processes are layered over the basic reaction to 
ensure that the peptide bond is formed between partic- 
ular amino acids. Information is expensive. 

The dynamic interaction between nucleic acids and 
proteins is another central theme of Part III. With the im- 
portant exception of a few catalytic RNA molecules (dis- 
cussed in Chapters 26 and 27), the processes that make 
up the pathways of cellular information flow are catalyzed 
and regulated by proteins. An understanding of these en- 
zymes and other proteins can have practical as well as in- 
tellectual rewards, because they form the basis of 
recombinant DNA technology (introduced in Chapter 9). 

Evolution again constitutes an overarching theme. 
Many of the processes outlined in Part III can be traced 
back billions of years, and a few can be traced to LUCA, 
the Last Universal Common Ancestor. The ribosome, 
most of the translational apparatus, and some parts of 
the transcriptional machinery are shared by every living 
organism on this planet. Genetic information is a kind of 
molecular clock that can help define ancestral relation- 
ships among species. Shared information pathways con- 
nect humans to every other species now living on Earth, 
and to all species that came before. Exploration of these 
pathways is allowing scientists to slowly open the cur- 
tain on the first act— the events that may have heralded 
the beginning of life on Earth. 



Superceding, in fact does more for DNA than act as an executive 
enhancer; it keeps rhe unruly, spreading DNA inside the cramped 
confines that the cell has provided for tl. 
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24.1 Chromosomal Elements 947 

24.2 DNA Supercoiling 954 

24.3 The Structure of Chromosomes 962 

The size of DNA molecules presents an interesting bi- 
ological puzzle. Given that these molecules are gen- 
erally much longer than the cells or viral particles 
that contain them (Fig. 24-1), how do they fit into 
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FIGURE 24-1 Bacteriophage T2 protein coat surrounded by its single, 
linear molecule of DNA. The DNA was released by lysing the bacterio- 
phage particle in distilled water and allowing the DNA to spread on the 
water surface. An undamaged T2 bacteriophage particle consists of a 
head structure that tapers to a tail by which the bacteriophage attaches 
itself to the outer surface of a bacterial cell. All the DNA shown in this 
electron micrograph is normally packaged inside the phage head 



their cellular or viral packages? To address this ques- 
tion, we shift our focus from the secondary structure of 
DNA, considered in Chapter 8, to the extraordinary de- 
gree of organization required for the tertiary packaging 
of DNA into chromosomes — the repositories of genetic 
information. The chapter begins with an examination of 
the elements that make up viral and cellular chromo- 
somes, and then considers chromosomal size and orga- 
nization. We then discuss DNA topology, describing the 
coiling and supercoiling of DNA molecules. Finally, we 
consider the protein-DNA interactions that organize 
chromosomes into compact structures. 

24.1 Chromosomal Elements 

Cellular DNA contains genes and intergenic regions, 
both of which may serve functions vital to the cell. The 
more complex genomes, such as those of eukaryotic 
cells, demand increased levels of chromosomal organi- 
zation, and this is reflected in the chromosomes' struc- 
tural features. We begin by considering the different 
types of DNA sequences and structural elements within 
chromosomes. 

Genes Are Segments of DNA That Code for 
Polypeptide Chains and RNAs 

Our understanding of genes has evolved tremen- 
dously over the last century. Classically, a gene was 
defined as a portion of a chromosome that determines 
or affects a single character or phenotype (visible 
property), such as eye color. George Beadle and Ed- 
ward Tatum proposed a molecular definition of a gene 
in 1940. After exposing spores of the fungus Neu- 
rospora crassa to x rays and other agents known to 
damage DNA and cause alterations in DNA sequence 
(mutations), they detected mutant fungal strains 
that lacked one or another specific enzyme, some- 
times resulting in the failure of an entire metabolic 
pathway. Beadle and Tatum concluded that a gene is a 
segment of genetic material that determines, or codes 
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for, one enzyme: the one gene-one enzyme hypoth- 
esis. Later this concept was broadened to one gene- 
one polypeptide, because many genes code for a 
protein that is not an enzyme or for one polypeptide 
of a multisubunit protein. 



DNA 





George W. Beadle, 
1903-1989 
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The modern biochemical definition of a gene is 
even more precise. A gene is all the DNA that encodes 
the primary sequence of some final gene product, which 
can be either a polypeptide or an RNA with a structural 
or catalytic function. DNA also contains other segments 
or sequences that have a purely regulatory function. 
Regulatory sequences provide signals that may de- 
note the beginning or the end of genes, or influence the 
transcription of genes, or function as initiation points 
for replication or recombination (Chapter 28). Some 
genes can be expressed in different ways to generate 
multiple gene products from a single segment of DNA. 
The special transcriptional and translational mecha- 
nisms that allow this are described in Chapters 26 
through 28. 

We can estimate directly the minimum overall size 
of genes that encode proteins. As described in detail in 
Chapter 27, each amino acid of a polypeptide chain is 
coded for by a sequence of three consecutive nu- 
cleotides in a single strand of DNA (Fig. 24-2), with 
these "codons" arranged in a sequence that corresponds 
to the sequence of amino acids in the polypeptide that 
the gene encodes. A polypeptide chain of 350 amino 
acid residues (an average-size chain) corresponds to 
1,050 bp. Many genes in eukaryotes and a few in bacte- 
ria and archaea are interrupted by noncoding DNA seg- 
ments and are therefore considerably longer than this 
simple calculation would suggest. 

How many genes are in a single chromosome? The 
Escherichia coli chromosome, one of the bacterial 
genomes that have been completely sequenced, is a cir- 
cular DNA molecule (in the sense of an endless loop 
rather than a perfect circle) with 4,639,675 bp. These 
base pairs encode about 4,300 genes for proteins and 
another 157 genes for structural or catalytic RNA mole- 
cules. Among eukaryotes, the approximately 3.1 billion 
base pairs of the human genome include almost 29,000 
genes on 24 different chromosomes. 
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FIGURE 24-2 Colinearity of the coding nucleotide sequences of DNA 
and mRNA and the amino acid sequence of a polypeptide chain. The 

triplets of nucleotide units in DNA determine the amino acids in a pro- 
tein through the intermediary mRNA. One of the DNA strands serves 
as a template for synthesis of mRNA, which has nucleotide triplets 
(codons) complementary to those of the DNA. In some bacterial and 
many eukaryotic genes, coding sequences are interrupted at intervals 
by regions of noncoding sequences (called introns). 

DNA Molecules Are Much Longer Than the Cellular or 
Viral Packages That Contain Them 

Chromosomal DNAs are often many orders of magni- 
tude longer than the cells or viruses in which they are lo- 
cated (Fig. 24-1; Table 24-1). This is true of every class 
of organism or viral parasite. 

Viruses Viruses are not free-living organisms; rather, 
they are infectious parasites that use the resources of a 
host cell to carry out many of the processes they require 
to propagate. Many viral particles consist of no more 
than a genome (usually a single RNA or DNA molecule) 
surrounded by a protein coat. 

Almost all plant viruses and some bacterial and ani- 
mal viruses have RNA genomes. These genomes tend to 
be particularly small. For example, the genomes of 
mammalian retroviruses such as HIV are about 9,000 nu- 
cleotides long, and the genome of the bacteriophage Q/3 
has 4,220 nucleotides. Both types of virus have single- 
stranded RNA genomes. 
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| TABLE 24-1 


The Sizes of DN A and Viral Particles for Some Bacterial Vim 






Size of viral 


Length of 


Long dimension of 


Virus 


DNA (bp) 


viral DNA (nm) 


viral particle (nm) 


0X174 


5,386 


1,939 


OR 
AD 


T7 


39,936 


14,377 


78 


A (lambda) 


48,502 


17,460 


190 


T4 


168,889 


60,800 


210 



Note: Data on size of DNA are for the replicative form (double-stranded). The contour length is calculated assuming that 
each base pair occupies a length of 3 4 A (see Fig. 8-13). 



The genomes of DNA viruses vary greatly in size 
(Table 24-1). Many viral DNAs are circular for at least 
part of their life cycle. During viral replication within a 
host cell, specific types of viral DNA called replicative 
forms may appear; for example, many linear DNAs be- 
come circular and all single-stranded DNAs become 
double-stranded. A typical medium-sized DNA virus is 
bacteriophage A (lambda), which infects E. coli. In its 
replicative form inside cells, A DNA is a circular double 
helix. This double-stranded DNA contains 48,502 bp 
and has a contour length of 17.5 jxm. Bacteriophage 
<£X174 is a much smaller DNA virus; the DNA in the vi- 
ral particle is a single-stranded circle, and the double- 
stranded replicative form contains 5,386 bp. Although 
viral genomes are small, the contour lengths of their 
DNAs are typically hundreds of times longer than the 
long dimensions of the viral particles that contain them 
(Table 24-1). 

Bacteria A single E, coli cell contains almost 100 
times as much DNA as a bacteriophage A particle. The 
chromosome of an E. coli cell is a single double- 
stranded circular DNA molecule. Its 4,639,675 bp 
have a contour length of about 1.7 mm, some 850 
times the length of thei?. coli cell (Fig. 24-3). In ad- 
dition to the very large, circular DNA chromosome in 
their nucleoid, many bacteria contain one or more 
small circular DNA molecules that are free in the cy- 
tosol. These extrachromosomal elements are called 
plasmids (Fig. 24-4; see also p. 307). Most plasmids 
are only a few thousand base pairs long, but some 
contain more than 10,000 bp. They carry genetic in- 
formation and undergo replication to yield daughter 
plasmids, which pass into the daughter cells at cell di- 
vision. Plasmids have been found in yeast and other 
fungi as well as in bacteria. 

In many cases plasmids confer no obvious advan- 
tage on their host, and their sole function seems to be 
self-propagation. However, some plasmids carry genes 
that are useful to the host bacterium. For example, 
some plasmid genes make a host bacterium resistant to 
antibacterial agents. Plasmids carrying the gene for the 
enzyme /3-lactamase confer resistance to /3-lactam an- 



tibiotics such as penicillin, ampicillin, and amoxicillin 
(see Fig. 6-28). These and similar plasmids may pass 
from an antibiotic-resistant cell to an antibiotic-sensitive 
cell of the same or another bacterial species, making the 
recipient cell antibiotic resistant. The extensive use of 
antibiotics in some human populations has served as a 
strong selective force, encouraging the spread of antibi- 
otic resistance-coding plasmids (as well as transposable 
elements, described below, that harbor similar genes) in 
disease-causing bacteria. Physicians are becoming in- 
creasingly reluctant to prescribe antibiotics unless a 
clear clinical need is confirmed. For similar reasons, the 
widespread use of antibiotics in animal feeds is being 
curbed. 

Eukaryotes A yeast cell, one of the simplest eukary- 
otes, has 2.6 times more DNA in its genome than an 
E. coli cell (Table 24-2). Cells of Drosophila, the fruit 
fly used in classical genetic studies, contain more than 
35 times as much DNA as E. coli cells, and human cells 
have almost 700 times as much. The cells of many plants 
and amphibians contain even more. The genetic mate- 
rial of eukaryotic cells is apportioned into chromosomes, 
the diploid (2n) number depending on the species 
(Table 24-2). A human somatic cell, for example, has 
46 chromosomes (Fig. 24-5). Each chromosome of a 
eukaryotic cell, such as that shown in Figure 24-5a, con- 
tains a single, very large, duplex DNA molecule. The 
DNA molecules in the 24 different types of human 
chromosomes (22 matching pairs plus the X and Y sex 
chromosomes) vary in length over a 25-fold range. Each 
type of chromosome in eukaryotes carries a characteris- 
tic set of genes. 

The DNA of one human genome (22 chromosomes 
plus X and Y or two X chromosomes), placed end to 
end, would extend for about a meter. Most human cells 
are diploid and each cell contains a total of 2 m of DNA. 
An adult human body contains approximately 10 14 cells 
and thus a total DNA length of 2 X 10 11 km. Compare 
this with the circumference of the earth (4 X 10 4 km) 
or the distance between the earth and the sun (1.5 X 
10 8 km)— a dramatic illustration of the extraordinary 
degree of DNA compaction in our cells. 
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FIGURE 24—3 The length of the E. coli chromosome (1.7 mm) depicted 
in linear form relative to the length of a typical E. coli cell (2 fim). 






FIGURE 24-4 DNA from a lysed f. coli cell. In this electron micrograph several small, circular 
plasmid DNAs are indicated by white arrows. The black spots and white specks are artifacts of the 
preparation. 



TABLE 24-2 



I A, Gene, and Chromosome Content in Some Genomes 





Total DNA (bp) 


Number of 
chromosomes* 


Approximate 
number of genes 


Escherichia coli K12 (bacterium) 


4,639,675 


1 


4,435 


Saccharomyces cerevisiae (yeast) 


12,080,000 


16 f 


5,860 


Caenorhabditis elegans (nematode) 


90,269,800 


12* 


23,000 


Arabidopsis thaliana (plant) 


119,186,200 


10 


33,000 


Drosophila melanogaster (fruit fly) 


120,367,260 


18 


20,000 


Oryza saliva (rice) 


480,000,000 


24 


57,000 


Mus musculus (mouse) 


2,634,266,500 


40 


27,000 


Homo sapiens (human) 


3,070,128,600 


46 


29,000 



Note: This information is constantly being refined. For the most current information, consult the websites for the individual genome projects. 
*The diploid chromosome number is given for all eukaryotes except yeast. 

t Haploid chromosome number. Wild yeast strains generally have eight (octoploid) or more sets of these chromosomes. 
*Number for females, with two X chromosomes. Males have an X but no Y, thus 11 chromosomes in all. 
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FIGURE 24-5 Eukaryotic chromosomes, (a) A pair of linked and condensed sis- 
ter chromatids from a human chromosome. Eukaryotic chromosomes are in this 
state after replication at metaphase during mitosis, (b) A complete set of chromo- 
somes from a leukocyte from one of the authors. There are 46 chromosomes in 
every normal human somatic cell. 



Eukaryotic cells also have organelles, mitochon- 
dria (Fig. 24-(>) and chloroplasts, that contain DNA. 
Mitochondrial DNA (mtDNA) molecules are much 
smaller than the nuclear chromosomes. In animal 
cells, mtDNA contains fewer than 20,000 bp (16,569 bp 




FIGURE 24-6 A dividing mitochondrion. Some mitochondrial proteins 
and RNAs are encoded by one of the copies of the mitochondrial DNA 
{none of which are visible here). The DNA (mtDNA) is replicated each 
time the mitochondrion divides, before cell division. 



in human mtDNA) and is a circular duplex. Each 
mitochondrion typically has 2 to 10 copies of this 
mtDNA molecule, and the number can rise to hun- 
dreds in certain cells of an embryo that is undergoing 
cell differentiation. In a few organisms (trypanosomes, 
for example) each mitochondrion contains thousands 
of copies of mtDNA, organized into a complex and in- 
terlinked matrix known as a kinetoplast. Plant cell 
mtDNA ranges in size from 200,000 to 2,500,000 bp. 
Chloroplast DNA (cpDNA) also exists as circular du- 
plexes and ranges in size from 120,000 to 160,000 bp. 
The evolutionary origin of mitochondrial and chloro- 
plast DNAs has been the subject of much speculation. 
A widely accepted view is that they are vestiges of the 
chromosomes of ancient bacteria that gained access to 
the cytoplasm of host cells and became the precursors 
of these organelles (see Fig. 1-36). Mitochondrial 
DNA codes for the mitochondrial tRNAs and rRNAs 
and for a few mitochondrial proteins. More than 95% 
of mitochondrial proteins are encoded by nuclear 
DNA. Mitochondria and chloroplasts divide when the 
cell divides. Their DNA is replicated before and during 
division, and the daughter DNA molecules pass into 
the daughter organelles. 
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FIGURE 24-7 Introns in two eukaryotic genes. The gene for ovalbumin 
has seven introns (A to G), splitting the coding sequences into eight ex- 
ons (L, and 1 to 7) The gene for the (3 subunit of hemoglobin has two 



G 



3 

126 bp 



Exon 
Intron 



B 

851 bp 

introns and three exons, including one intron that alone contains more 
than half the base pairs of the gene. 



Eukaryotic Genes and Chromosomes Are Very Complex 

Many bacterial species have only one chromosome per 
cell and, in nearly all cases, each chromosome contains 
only one copy of each gene. A very few genes, such as 
those for rRNAs, are repeated several times. Genes and 
regulatory sequences account for almost all the DNA in 
bacteria. Moreover, almost every gene is precisely colin- 
ear with the amino acid sequence (or RNA sequence) 
for which it codes (Fig. 24-2). 

The organization of genes in eukaryotic DNA is 
structurally and functionally much more complex. The 
study of eukaryotic chromosome structure, and more 
recently the sequencing of entire eukaryotic genomes, 
has yielded many surprises. Many, if not most, eukary- 
otic genes have a distinctive and puzzling structural fea- 
ture: their nucleotide sequences contain one or more 
intervening segments of DNA that do not code for the 
amino acid sequence of the polypeptide product. These 
nontranslated inserts interrupt the otherwise colinear 
relationship between the nucleotide sequence of the 
gene and the amino acid sequence of the polypeptide it 
encodes. Such nontranslated DNA segments in genes 
are called intervening sequences or introns, and the 




coding segments are called exons. Few bacterial genes 
contain introns. 

In higher eukaryotes, the typical gene has much 
more intron sequence than sequences devoted to exons. 
For example, in the gene coding for the single pclypep- 
tide chain of ovalbumin, an avian egg protein (Fig. 24-7), 
the introns are much longer than the exons; altogether, 
seven introns make up 85% of the gene's DNA. In the 
gene for the (3 subunit of hemoglobin, a single intrcn con- 
tains more than half of the gene's DNA. The gene for the 
muscle protein titin is the intron champion, with 78 in- 
trons. Genes for histones seem to have no introns. Li most 
cases the function of introns is not clear. In total, only 
about 1.5% of human DNA is "coding" or exon DNA, car- 
rying information for protein or RNA products. However, 
when the much larger introns are included in the count, 
as much as 30% of the human genome consists of genes. 

The relative paucity of genes in the human genome 
leaves a lot of DNA unaccounted for. Figure 24-8 pro- 
vides a summary of sequence types. Much of the ncngene 

FIGURE 24-8 Types of sequences in the human genome. This p ie chart 
divides the genome into transposons (transposable elements) genes, 
and miscellaneous sequences. There are four main classes of trans- 
posons (three of them shown here). Long interspersed elements (LINEs), 
6 to 8 kbp long (1 kbp = 1,000 bp), typically include a few genes en- 
coding proteins that catalyze transposition. The genome has about 
850,000 LINEs. Short interspersed elements (SINEs) are about 100 to 
300 bp long. Of the 1 .5 million in the human genome more than 1 mil- 
lion are Alu elements, so called because they generally include one 
copy of the recognition sequence for Alul, a restriction endonuclease 
(see Fig, 9-2). The genome also contains 450,000 copies of retrovirus- 
like transposons, 1.5 to 1 1 kbp long. Although these are "trapped" in 
the genome and cannot move from one cell to another, they an? evolu- 
tionarily related to the retroviruses (Chapter 26), which include HIV. A 
final class of transposons (making up <3% and not shown here) con- 
sists of a variety of transposon remnants that differ greatly in ler gth. 

About 30% of the genome consists of sequences included in 
genes for proteins, but only a small fraction of this DNA is in exons 
(coding sequences). Miscellaneous sequences include simple- 
sequence repeats (SSR) and large segmental duplications (SD), the latter 
being segments that appear more than once in different locations. As 
described in Chapter 26, almost all of the genome appears to be tran- 
scribed into RNA, with many of the RNAs not yet characterized. Also 
present are remnants of transposons that have been evolutionarily 
altered so that they are now hard to identify. 
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DNA is in the form of repeated sequences of several 
kinds. Perhaps most surprising, about half the human 
genome is made up of moderately repeated sequences 
that are derived from transposable elements — segments 
of DNA, ranging from a few hundred to several thousand 
base pairs long, that can move from one location to an- 
other in the genome. Transposable elements (trans- 
posons) are a kind of molecular parasite, efficiently 
making a home within the host genome. Many have genes 
encoding the proteins that catalyze the transposition 
process, described in more detail in Chapters 25 and 26. 
Some transposons in the human genome are active, mov- 
ing at a low frequency, but most are inactive relics, evolu- 
tionary altered by mutations. Although these elements 
generally do not encode proteins or RNAs that are used in 
human cells, they have played a major role in human evo- 
lution: movement of transposons can lead to the redistri- 
bution of other genomic sequences. 

Another 3% or so of the human genome consists 
of highly repetitive sequences, also referred to as 
simple-sequence DNA or simple sequence repeats 
(SSR). These short sequences, generally less than 10 bp 
long, are sometimes repeated millions of times per cell. 
The simple-sequence DNA has also been called satellite 
DNA, so named because its unusual base composition of- 
ten causes it to migrate as "satellite" bands (separated 
from the rest of the DNA) when fragmented cellular DNA 
samples are centrifuged in a cesium chloride density gra- 
dient. Studies suggest that simple-sequence DNA does 
not encode proteins or RNAs. Unlike the transposable el- 
ements, the highly repetitive DNA can have identifiable 
functional importance in human cellular metabolism, be- 
cause much of it is associated with two defining features 
of eukaryotic chromosomes: centromeres and telomeres. 

The centromere (Fig. 24-9) is a sequence of DNA 
that functions during cell division as an attachment 
point for proteins that link the chromosome to the mi- 
totic spindle. This attachment is essential for the equal 
and orderly distribution of chromosome sets to daugh- 
ter cells. The centromeres of Saccharomyces cere- 
visiae have been isolated and studied. The sequences 
essential to centromere function are about 130 bp long 
and are very rich in A=T pairs. The centromeric se- 
quences of higher eukaryotes are much longer and, un- 
like those of yeast, generally contain simple-sequence 
DNA, which consists of thousands of tandem copies of 
one or a few short sequences of 5 to 10 bp, in the same 
orientation. The precise role of simple-sequence DNA in 
centromere function is not yet understood. 



Telomere 



Centromere 



Telomere 



I TABLE 24-3 | 






Telomere repeat 




sequence 



Homo sapiens (human) 


(TTAGGG) n 


Tetrahymena thermophila 


(TTGGGG)^ 


(ciliated protozoan) 




Saccharomyces cerevisiae 


((TG) 1 _ 3 (TG) 2 _ 3 ). 


(yeast) 




Arabidopsis thaliana (plant) 


(TTTAGGG) n 



Telomeres (Greek telos, "end") are sequences at 
the ends of eukaryotic chromosomes that help stabilize 
the chromosome. Telomeres end with multiple repeated 
sequences of the form 



Unique sequences (genes), dispersed repeats, 
and multiple replication origins 

FIGURE 24-9 Important structural elements of a yeast chromosome. 



■ 

(3')(A^) 

where x and y are generally between 1 and 4 (Table 24-3) . 
The number of telomere repeats, n, is in the range of 20 
to 100 for most single-celled eukaryotes and is gener- 
ally more than 1,500 in mammals. The ends of a linear 
DNA molecule cannot be routinely replicated by the 
cellular replication machinery (which may be one rea- 
son why bacterial DNA molecules are circular). Re- 
peated telomeric sequences are added to eukaryotic 
chromosome ends primarily by the enzyme telomerase 
(see Fig. 26-39). 

Artificial chromosomes (Chapter 9) have been con- 
structed as a means of better understanding the func- 
tional significance of many structural features of 
eukaryotic chromosomes. A reasonably stable artificial 
linear chromosome requires only three components: a 
centromere, a telomere at each end, and sequences that 
allow the initiation of DNA replication. Yeast artificial 
chromosomes (YACs; see Fig. 9-7) have been developed 
as a research tool in biotechnology. Similarly, human ar- 
tificial chromosomes (HACs) are being developed for 
the treatment of genetic diseases by somatic gene ther- 
apy (see Box 9-2 on somatic gene therapy). 

SUMMARY 24.1 Chromosomal Elements 

■ Genes are segments of a chromosome that contain 
the information for a functional polypeptide or 
RNA molecule. In addition to genes, chromosomes 
contain a variety of regulatory sequences involved 
in replication, transcription, and other processes. 

■ Genomic DNA and RNA molecules are generally 
orders of magnitude longer than the viral particles 
or cells that contain them. 

* Many genes in eukaryotic cells (but few in bacteria 
and archaea) are interrupted by noncoding 
sequences, or introns. The coding segments 
separated by introns are called exons. 
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■ Less than one-third of human genomic DNA 
consists of genes. Much of the remainder consists 
of repeated sequences of various types. Nucleic 
acid parasites known as transposons account for 
about half of the human genome. 

■ Eukaryotic chromosomes have two important 
special-function repetitive DNA sequences: 
centromeres, which are attachment points for the 
mitotic spindle, and telomeres, located at the ends 
of chromosomes. 

24.2 DNA Superceding 

Cellular DNA, as we have seen, is extremely compacted, 
implying a high degree of structural organization. The 
folding mechanism not only must pack the DNA but also 
must permit access to the information in the DNA. Be- 
fore considering how this is accomplished in processes 
such as replication and transcription, we need to exam- 
ine an important property of DNA structure known as 
supercoiling. 

"Supercoiling" means the coiling of a coil. A tele- 
phone cord, for example, is typically a coiled wire. The 
path taken by the wire between the base of the phone 
and the receiver often includes one or more supercoils 
(Fig. 24-10). DNA is coiled in the form of a double he- 
lix, with both strands of the DNA coiling around an axis. 




FIGURE 24-10 Supercoils. A typical phone cord is coiied like a DNA 
helix, and the coiled cord can itself coil in a supercoil. The illustration 
is especially appropriate because an examination of phone cords 
helped lead Jerome Vinograd and his colleagues to the insight that 
many properties of small circular DNAs can be explained by supercoil- 
ing. They first detected DNA supercoiling— in small circular viral 
DNAs— in 1965. 



DNA double 
helix (coil) 



Axis 




FIGURE 24-1 1 Supercoiling of DNA. When the axis of the DNA dou- 
ble helix is coiled on itself, it forms a new helix (superhelix). The DNA 
superhelix is usually called a supercoil. 

The further coiling of that axis upon itself (Fig. 24-11) 
produces DNA supercoiling. As detailed below, DNA su- 
percoiling is generally a manifestation of structural 
strain. When there is no net bending of the DNA axis 
upon itself, the DNA is said to be in a relaxed state. 

We might have predicted that DNA compaction in- 
volved some form of supercoiling. Perhaps less pre- 
dictable is that replication and transcription of DNA also 
affect and are affected by supercoiling. Both processes 
require a separation of DNA strands — a process compli- 
cated by the helical interwinding of the strands (as 
demonstrated in Fig. 24-12). 




FIGURE 24-12 Supercoiling induced by separating the strands of a 
helical structure. Twist two linear strands of rubber band into a right- 
handed double helix as shown. Fix one end by having a friend hold 
onto it, then pull apart the two strands at the other end. The resulting 
strain will produce supercoiling. 
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FIGURE 24-13 Relaxed and supercoiled plasmid DNAs. The molecule in the leftmost 
electron micrograph is relaxed; the degree of supercoiling increases from left to right. 



That a DNA molecule would bend on itself and be- 
come supercoiled in tightly packaged cellular DNA 
would seem logical, then, and perhaps even trivial, were 
it not for one additional fact: many circular DNA mole- 
cules remain highly supercoiled even after they are ex- 
tracted and purified, freed from protein and other 
cellular components. This indicates that supercoiling is 
an intrinsic property of DNA tertiary structure. It occurs 
in all cellular DNAs and is highly regulated by each cell. 

Several measurable properties of supercoiling have 
been established, and the study of supercoiling has pro- 
vided many insights into DNA structure and function. 
This work has drawn heavily on concepts derived from a 
branch of mathematics called topology, the study of 
the properties of an object that do not change under 
continuous deformations. For DNA, continuous defor- 
mations include conformational changes due to thermal 
motion or an interaction with proteins or other mole- 
cules; discontinuous deformations involve DNA strand 
breakage. For circular DNA molecules, a topological 
property is one that is unaffected by deformations of the 
DNA strands as long as no breaks are introduced. Topo- 
logical properties are changed only by breakage and 
rejoining of the backbone of one or both DNA strands. 

We now examine the fundamental properties and 
physical basis of supercoiling. 

Most Cellular DNA Is Underwound 

To understand supercoiling we must first focus on the 
properties of small circular DNAs such as plasmids and 
small viral DNAs. When these DNAs have no breaks in 
either strand, they are referred to as closed-circular 
DNAs. If the DNA of a closed-circular molecule con- 
forms closely to the B-form structure (the Watson-Crick 
structure; see Fig. 8-13), with one turn of the double 
helix per 10.5 bp, the DNA is relaxed rather than super- 
coiled (Fig. 24-13). Supercoiling results when DNA is 
subject to some form of structural strain. Purified 
closed-circular DNA is rarely relaxed, regardless of its 
biological origin. Furthermore, DNAs derived from a 
given cellular source have a characteristic degree of 



supercoiling. DNA structure is therefore strained in a 
manner that is regulated by the cell to induce the 
supercoiling. 

In almost every instance, the strain is a result of un- 
derwinding of the DNA double helix in the closed cir- 
cle. In other words, the DNA has fewer helical turns 
than would be expected for the B-form structure. The 
effects of underwinding are summarized in Figure 
24-14. An 84 bp segment of a circular DNA in the re- 
laxed state would contain eight double-helical turns, or 




(a) Relaxed (8 turns) 




(b) Strained (7 turns) 




(d) Strand separation 



FIGURE 24-14 Effects of DNA underwinding. (a) A segment of DNA in 
a closed-circular molecule, 84 bp long, in its relaxed form with eight 
helical turns, (b) Removal of one turn induces structural strain, (c) The 
strain is generally accommodated by formation of a supercoil. (d) DNA 
underwinding also makes the separation of strands somewhat easier. In 
principle, each turn of underwinding should facilitate strand separa- 
tion over about 1 0 bp, as shown. However, the hydrogen-bonded base 
pairs would generally preclude strand separation over such a short dis- 
tance, and the effect becomes important only for longer DNAs and 
higher levels of DNA underwinding. 
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one for every 10.5 bp. If one of these turns were re- 
moved, there would be (84 bp)/7 = 12.0 bp per turn, 
rather than the 10.5 found in B-DNA (Fig. 24-14b). This 
is a deviation from the most stable DNA form, and the 
molecule is thermodynamically strained as a result. 
Generally, much of this strain would be accommodated 
by coiling the axis of the DNA on itself to form a super- 
coil (Fig. 24-14c; some of the strain in this 84 bp seg- 
ment would simply become dispersed in the untwisted 
structure of the larger DNA molecule). In principle, 
the strain could also be accommodated by separating 
the two DNA strands over a distance of about 10 bp 
(Fig. 24-14d). In isolated closed-circular DNA, strain 
introduced by underwinding is generally accommodated 
by supercoiling rather than strand separation, because 
coiling the axis of the DNA usually requires less energy 
than breaking the hydrogen bonds that stabilize paired 
bases. Note, however, that the underwinding of DNA in 
vivo makes separation of the DNA strands easier, facili- 
tating access to the information they contain. 

Every cell actively underwinds its DNA with the aid 
of enzymatic processes (described below), and the re- 
sulting strained state represents a form of stored en- 
ergy. Cells maintain DNA in an underwound state to 
facilitate its compaction by coiling. The underwinding of 
DNA is also important to enzymes of DNA metabolism 
that must bring about strand separation as part of their 
function. 

The underwound state can be maintained only if the 
DNA is a closed circle or if it is bound and stabilized by 
proteins so that the strands are not free to rotate about 
each other. If there is a break in one strand of an isolated, 
protein-free circular DNA, free rotation at that point will 
cause the underwound DNA to revert spontaneously to 
the relaxed state. In a closed-circular DNA molecule, 
however, the number of helical turns cannot be changed 
without at least transiently breaking one of the DNA 
strands. The number of helical turns in a DNA molecule 
therefore provides a precise description of supercoiling. 

DNA Underwinding Is Defined by Topological 
Linking Number 

The field of topology provides some ideas that are useful 
to the discussion of DNA supercoiling, particularly the 
concept of linking number. Linking number is a topo- 
logical property of double-stranded DNA, because it 
does not vary when the DNA is bent or deformed, as 
long as both DNA strands remain intact. Linking number 
(LA;) is illustrated in Figure 24-15. 

Let's begin by visualizing the separation of the two 
strands of a double-stranded circular DNA. If the two 
strands are linked as shown in Figure 24- 15a, they are 
effectively joined by what can be described as a topolog- 
ical bond. Even if all hydrogen bonds and base-stacking 
interactions were abolished such that the strands were 
not in physical contact, this topological bond would still 
link the two strands. Visualize one of the circular strands 
as the boundary of a surface (such as a soap film span- 




(b) Lk = 6 



FIGURE 24-15 Linking number, Lk. Here, as usual, each blue ribbon 
represents one strand of a double-stranded DNA molecule. For the 
molecule in (a), Lk = 1. For the molecule in (b), Lk - 6. One of the 
strands in (b) is kept untwisted for illustrative purposes, to define 
the border of an imaginary surface (shaded blue). The number of times 
the twisting strand penetrates this surface provides a rigorous definition 
of linking number. 

ning the space framed by a circular wire before you blow 
a soap bubble). The linking number can be defined as 
the number of times the second strand pierces this sur- 
face. For the molecule in Figure 24-15a, Lk = 1; for that 
in Figure 24-15b, Lk = 6. The linking number for a 
closed-circular DNA is always an integer. By convention, 
if the links between two DNA strands are arranged so 
that the strands are interwound in a right-handed helix, 
the linking number is defined as positive (+); for 
strands interwound in a left-handed helix, the linking 
number is negative (-). Negative linking numbers are, 
for all practical purposes, not encountered in DNA. 

We can now extend these ideas to a closed-circular 
DNA with 2,100 bp (Fig. 24-16a). When the molecule 
is relaxed, the linking number is simply the number of 
base pairs divided by the number of base pairs per turn, 
which is close to 10.5; so in this case, Lk = 200. For a 
circular DNA molecule to have a topological property 
such as linking number, neither strand may contain a 
break. If there is a break in either strand, the strands 
can, in principle, be unraveled and separated com- 
pletely. In this case, no topological bond exists and Lk is 
undefined (Fig. 24-1 6b). 

We can now describe DNA underwinding in terms of 
changes in the linking number. The linking number in 
relaxed DNA, Lk Q , is used as a reference. For the mole- 
cule shown in Fig ure 24— 16a, Lk$ — 200; if two turns are 
removed from this molecule, Lk = 198. The change can 
be described by the equation 

ALk = Lk - Lk 0 (24-1) 
- 198 - 200 = -2 

It is often convenient to express the change in linking 
number in terms of a quantity that is independent of the 
length of the DNA molecule. This quantity, called the 
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tc) Lk = 198 



FIGURE 24-16 Linking number applied to closed-circular DNA mole- 
cules. A 2,100 bp circular DNA is shown in three forms: (a) relaxed, 
Lk = 200; (b) relaxed with a nick (break) in one strand, Lk undefined; 
and (c) underwound by two turns, Lk = 198. The underwound mole- 
cule generally exists as a supercoiled molecule, but underwinding also 
facilitates the separation of DNA strands. 

specific linking difference (er) or superhelical den- 
sity, is a measure of the number of turns removed rela- 
tive to the number present in relaxed DNA: 



In the example in Figure 24-16c, a = -0.01, which 
means that 1% (2 of 200) of the helical turns present in 
the DNA (in its B form) have been removed. The degree 
of underwinding in cellular DNAs generally falls in the 
range of 5% to 7%; that is, a = -0.05 to -0.07. The neg- 
ative sign indicates that the change in linking number is 
due to underwinding of the DNA. The supercoiling in- 
duced by underwinding is therefore defined as negative 
supercoiling. Conversely, under some conditions DNA 
can be overwound, resulting in positive supercoiling. 
Note that the twisting path taken by the axis of the DNA 
helix when the DNA is underwound (negative supercoil- 
ing) is the mirror image of that taken when the DNA 
is overwound (positive supercoiling) (Fig. 24-17). Su- 
percoiling is not a random process; the path of the 
supercoiling is largely prescribed by the torsional strain 
imparted to the DNA by decreasing or increasing the 
linking number relative to B-DNA. 

Linking number can be changed by ± 1 by breaking 
one DNA strand, rotating one of the ends 360° about the 
unbroken strand, and rejoining the broken ends. This 
change has no effect on the number of base pairs or the 
number of atoms in the circular DNA molecule. Two 
forms of a circular DNA that differ only in a topological 
property such as linking number are referred to as 
topoisomers. 




FIGURE 24-17 Negative and positive supercoils. For the relaxed DNA 
molecule of Figure 24-1 6a, underwinding or overwinding by two hel- 
ical turns (Lk = 198 or 202) will produce negative or positive super- 
coiling, respectively. Note that the DNA axis twists in opposite 
directions in the two cases. 

Mm WORKED EXAMPLE 24-1 Calculation of 

Superhelical Density 

What is the superhelical density (er) of a closed-circular 
DNA with a length of 4,200 bp and a linking number 
(Lk) of 374? What is the superhelical density of the 
same DNA when Lk = 412? Are these molecules nega- 
tively or positively supercoiled? 

Solution: First, calculate Lk 0 by dividing the length of the 
closed-circular DNA (in bp) by 10.5 bp/turn: (4,200 
bp)/(10.5 bp/turn) = 400. We can now calculate ALk 
from Equation 24-1: ALk = Lk - Lk 0 = 374 400 = 
-26. Substituting the values for ALk and Lk 0 into Equa- 
tion 24-2: cr = ALk/Lko = -26/400 = -0.065. Since the 
superhelical density is negative, this DNA molecule is 
negatively supercoiled. 

When the same DNA molecule has an Lk of 412, 
ALk = 412 - 400 = 12, and a = 12/400 = 0.03. The 
superhelical density is positive, and the molecule is 
positively supercoiled. 



Linking number can be broken down into two structural 
components, twist (Tw) and writhe (Wr) (Fig. 24-18). 

These are more difficult to describe than linking number, but 
writhe may be thought of as a measure of the coiling of the 
helix axis, and twist as determining the local twisting or spa- 
tial relationship of neighboring base pairs. When the linking 
number changes, some of the resulting strain is usually com- 
pensated for by writhe (supercoiling) and some by changes 
in twist, giving rise to the equation 

Lk = Tw + Wr 

Tw and Wr need not be integers. Twist and writhe are 
geometric rather than topological properties, because 
they may be changed by deformation of a closed-circular 
DNA molecule. 

In addition to causing supercoiling and making 
strand separation somewhat easier, the underwinding of 
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Straight ribbon (relaxed DNA) 




Large writhe, small change in twist 



Zero writhe, large change in twist 

FIGURE 24-18 Ribbon model for illustrating twist and writhe. The 

pink ribbon represents the axis of a relaxed DNA molecule. Strain in- 
troduced by twisting the ribbon (underwinding the DNA) can be man- 
ifested as writhe or twist. Topological changes in linking number are 
usually accompanied by geometric changes in both writhe and twist. 

DNA facilitates structural changes in the molecule. 
These are of less physiological importance but help illus- 
trate the effects of underwinding. Recall that a cruci- 
form (see Fig. 8-19) generally contains a few unpaired 
bases; DNA underwinding helps to maintain the re- 
quired strand separation (Fig. 24-19). Underwinding 
of a right-handed DNA helix also facilitates the forma- 
tion of short stretches of left-handed Z-DNA in regions 
where the base sequence is consistent with the Z form 
(see Chapter 8). 







Relaxed DNA 







Underwound DNA 




Cruciform DNA 

FIGURE 24-19 Promotion of cruciform structures by DNA underwind- 
ing. In principle, cruciforms can form at palindromic sequences (see 
Fig. 8-19), but they seldom occur in relaxed DNA because the linear 
DNA accommodates more paired bases than does the cruciform struc- 
ture. Underwinding of the DNA facilitates the partial strand separation 
needed to promote cruciform formation at appropriate sequences. 



Topoisomerases Catalyze Changes in the 
Linking Number of DNA 

DNA superceding is a precisely regulated process that 
influences many aspects of DNA metabolism. Every cell 
has enzymes with the sole function of underwinding 
and/or relaxing DNA. The enzymes that increase or 
decrease the extent of DNA underwinding are topoiso- 
merases; the property of DNA that they change is the 
linking number. These enzymes play an especially impor- 
tant role in processes such as replication and DNA pack- 
aging. There are two classes of topoisomerases. T^pe I 
topoisomerases act by transiently breaking one of the 
two DNA strands, passing the unbroken strand through 
the break and rejoining the broken ends; they change Lk 
in increments of 1. Type II topoisomerases break both 
DNA strands and change Lk in increments of 2. 

The effects of these enzymes can be demonstrated 
with agarose gel electrophoresis (Fig. 24-20). A popu- 
lation of identical plasmid DNAs with the same linking 
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FIGURE 24-20 Visualization of topoisomers. In this experiment, all 
DNA molecules have the same number of base pairs but exhibit some 
range in the degree of supercoiling. Because supercoiled DNA mole- 
cules are more compact than relaxed molecules, they migrate more 
rapidly during gel electrophoresis. The gels shown here separate topo- 
isomers (moving from top to bottom) over a limited range of superheli- 
cal density. In lane 1, highly supercoiled DNA migrates in a single 
band, even though different topoisomers are probably present Lanes 2 
and 3 illustrate the effect of treating the supercoiled DNA with a type I 
topoisomerase; the DNA in lane 3 was treated for a longer time than 
that in lane 2. As the superhelical density of the DNA is reduced to the 
point where it corresponds to the range in which the gel can resolve in- 
dividual topoisomers, distinct bands appear. Individual bands in the 
region indicated by the bracket next to lane 3 each contain DNA cir- 
cles with the same linking number; the linking number changes by 1 
from one band to the next. 
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number migrates as a discrete band during electrophore- 
sis. Topoisomers with Lk values differing by as little as 1 
can be separated by this method, so changes in linking 
number induced by topoisomerases are readily detected. 

E coli has at least four individual topoisomerases 
(I through IV). Those of type I (topoisomerases I and 
III) generally relax DNA by removing negative super- 
coils (increasing Lk). The way in which bacterial type I 
topoisomerases change linking number is illustrated in 
Figure 24-21. A bacterial type II enzyme, called either 
topoisomerase II or DNA gyrase, can introduce negative 



supercoils (decrease Lk). It uses the energy of ATP to 
accomplish this. To alter DNA linking number, type II 
topoisomerases cleave both strands of a DNA molecule 
and pass another duplex through the break. The degree 
of supercoiling of bacterial DNA is maintained by regula- 
tion of the net activity of topoisomerases I and II. 

Eukaryotic cells also have type I and type II topoiso- 
merases. The type I enzymes are topoisomerases I and III; 
the single type II enzyme has two isoforms in vertebrates 
called lla and II/3. Most of the type II enzymes, including 
a DNA gyrase in archaea, are related and define a family 
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MECHANISM FIGURE 24-21 Bacterial type I topoisomerases alter link- 
ing number. A proposed reaction sequence for the bacterial topoiso- 
merase 1 is illustrated. The enzyme has closed and open conformations, 
(a) A DNA molecule binds to the closed conformation and one DNA 



DNA is released, or 
new cycle begins. 



strand is cleaved, (b) The enzyme changes to its open conformation, 
and the other DNA strand moves through the break in the first strand, 
(c) In the closed conformation, the DNA strand is religated. 
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called type IIA. Archaea also have an unusual enzymes, 
topoisomerase VI, which alone defines the type IIB family. 
The eukaryotic type II topoisomerases cannot underwind 
DNA (introduce negative supercoils), but they can relax 
both positive and negative supercoils (Fig. 24-22). 

FIGURE 24-22 Proposed mechanism for the alteration of linking num- 
ber by eukaryotic type IIA topoisomerases. (T) The multisubunit 
enzyme binds one DNA molecule (blue). Gated cavities above and 
below the bound DNA are called the N-gate and the C-gate. (2) A 
second segment of the same DNA molecule (red) is bound at the 
N-gate, and (3) is trapped. Both strands of the first DNA are now 
cleaved (the chemistry is similar to that in Fig. 24-20b), and® the 
second DNA segment is passed through the break. (5) The broken 
DNA is religated, and the second DNA segment is released through the 
C-gate. Two ATPs are bound and hydrolyzed during this cycle; it is 
likely that one is hydrolyzed in the step leading to the complex in 
step @. Additional details of the ATP hydrolysis component of the 
reaction remain to be worked out 
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Curing Disease by Inhibiting Topoisomerases 



The topological state of cellular DNA is intimately con- 
nected with its function. Without topoisomerases, cells 
cannot replicate or package their DNA, or express their 
genes — and they die. Inhibitors of topoisomerases have 
therefore become important pharmaceutical agents, tar- 
geted at infectious agents and malignant cells. 

Two classes of bacterial topoisomerase inhibitors 
have been developed as antibiotics. The coumarins, in- 
cluding novobiocin and coumermycin Al, are natural 
products derived from Streptomyces species. They in- 
hibit the ATP binding of the bacterial type II topoiso- 
merases, DNA gyrase and topoisomerase IV. These 
antibiotics are not often used to treat infections in 
humans, but research continues to identify clinically 
effective variants. 

The quinolone antibiotics, also inhibitors of bacter- 
ial DNA gyrase and topoisomerase IV, first appeared in 
1962 with the introduction of nalidixic acid. This com- 
pound had limited effectiveness and is no longer used 
clinically in the United States, but the continued devel- 
opment of this class of drugs eventually led to the 
introduction of the fluoroquinolones, exemplified by 
ciprofloxacin (Cipro). The quinolones act by blocking 
the last step of the topoisomerase reaction, the reseal- 
ing of the DNA strand breaks. Ciprofloxacin is a wide- 
spectrum antibiotic. It is one of the few antibiotics 
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reliably effective in treating anthrax infections, and is 
considered a valuable agent in protection against possi- 
ble bioterrorism. Quinolones are selective for the bacte- 
rial topoisomerases, inhibiting the eukaryotic enzymes 
only at concentrations several orders of magnitude 
greater than the therapeutic doses. 

Some of the most important chemotherapeutic 
agents used in cancer treatment are inhibitors of 
human topoisomerases. Topoisomerases are generally 
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As we will show in the next few chapters, topoiso- 
merases play a critical role in every aspect of DNA 
metabolism. As a consequence, they are important drug 
targets for the treatment of bacterial infections and 
cancer (Box 24-1). 

DNA Compaction Requires a Special Form 
of Supercoiling 

Supercoiled DNA molecules are uniform in a number 
of respects. The supercoils are right-handed in a 
negatively supercoiled DNA molecule (Fig. 24-17), 
and they tend to be extended and narrow rather 
than compacted, often with multiple branches (Fig. 
24-23). At the superhelical densities normally en- 
countered in cells, the length of the supercoil axis, 
including branches, is about 40% of the length of the 
DNA. This type of supercoiling is referred to as plec- 
tonemic (from the Greek plektos, "twisted," and 



nema, "thread"). This term can be applied to any 
structure with strands intertwined in some simple 
and regular way, and it is a good description of the 
general structure of supercoiled DNA in solution. 

Plectonemic supercoiling, the form observed in iso- 
lated DNAs in the laboratory, does not produce suffi- 
cient compaction to package DNA in the cell. A second 
form of supercoiling, solenoidal (Fig. 24-24), can be 
adopted by an underwound DNA. Instead of the ex- 
tended right-handed supercoils characteristic of the 
plectonemic form, solenoidal supercoiling involves tight 
left-handed turns, similar to the shape taken up by a 
garden hose neatly wrapped on a reel. Although their 
structures are dramatically different, plectonemic and 
solenoidal supercoiling are two forms of negative super- 
coiling that can be taken up by the same segment of 
underwound DNA. The two forms are readily intercon- 
vertible. Although the plectonemic form is more stable 
in solution, the solenoidal form can be stabilized by 



present at elevated levels in tumor cells, and agents tar- 
geted to these enzymes are much more toxic to the 
tumors than to most other tissue types. Inhibitors of both 
type I and type II topoisomerases have been developed 
as anticancer drugs. 

Camptothecin, isolated from a Chinese ornamental 
tree and first tested clinically in the 1970s, is an inhibitor 
of eukaryotic type I topoisomerases. Clinical trials indi- 
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cated limited effectiveness, despite its early promise in 
preclinical work on mice. However, two effective deriva- 
tives, irinotecan (Campto) and topotecan (Hycamtin) — 
used to treat colorectal cancer and ovarian cancer, 
respectively — were developed in the 1990s. Additional 
derivatives are likely to be approved for clinical use in 
the coming years. All of these drugs act by trapping the 
topoisomerase-DNA complex in which the DNA is 
cleaved, inhibiting religation. 

The human type II topoisomerases are targeted by a 
variety of antitumor drugs, including doxorubicin (Adri- 
amycin), etoposide (Etopophos), and ellipticine. Dox- 
orubicin, effective against several kinds of human 
tumors, is an anthracycline in clinical use. Most of 
these drugs stabilize the covalent topoisomerase-DNA 
(cleaved) complex. 

All of these anticancer agents generally increase the 
levels of DNA damage in the targeted, rapidly growing 
tumor cells. However, noncancerous tissues can also be 
affected, leading to a more general toxicity and unpleas- 
ant side effects that must be managed during therapy. 
As cancer therapies become more effective and survival 
statistics for cancer patients improve, the independent 
appearance of new tumors is becoming a greater prob- 
lem. In the continuing search for new cancer therapies, 
the topoisomerases are likely to remain prominent tar- 
gets for research. 
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FIGURE 24-23 Plectonemic supercoiling. (a) 

Electron micrograph of plectonemically super- 
coiled plasmid DNA and (b) an interpretation of 
the observed structure. The purple lines show 
the axis of the supercoil; note the branching of 
the supercoil. (c) An idealized representation of 



W (b) (c) this structure. 



Plectonemic 



Solenoidal 




(a) (b) 



protein binding and is the form found in chromatin. 
It provides a much greater degree of compaction 
(Fig. 24-24b). Solenoidal supercoiling is the mechanism 
by which underwinding contributes to DNA compaction. 

SUMMARY 24.2 DNA Supercoiling 

■ Most cellular DNAs are supercoiled. Underwmding 
decreases the total number of helical turns in the 
DNA relative to the relaxed, B form. To maintain an 
underwound state, DNA must be either a closed 
circle or bound to protein. Underwinding is quantified 
by a topological parameter called linking number, Lk 

■ Underwinding is measured in terms of specific 
linking difference, a (also called superhelical 
density), which is (Lk - Lk 0 yLk 0 . For cellular 
DNAs, a is typically -0.05 to -0.07, which means 
that approximately 5% to 7% of the helical turns in 
the DNA have been removed. DNA underwinding 
facilitates strand separation by enzymes of DNA 
metabolism. 

■ DNAs that differ only in linking number are called 
topoisomers. Enzymes that underwind and/or relax 
DNA, the topoisomerases, catalyze changes in 
linking number. The two classes of topoisomerases, 
type I and type II, change Lk in increments of 1 or 
2, respectively, per catalytic event. 



FIGURE 24-24 Plectonemic and solenoidal supercoiling. (a) Plectone- 
mic supercoiling takes the form of extended right-handed coils. Sole- 
noidal negative supercoiling takes the form of tight left-handed turns 
about an imaginary tubelike structure. The two forms are readily inter- 
converted, although the solenoidal form is generally not observed un- 
less certain proteins are bound to the DNA. (b) Plectonemic (top) and 
solenoidal supercoiling of the same DNA molecule, drawn to scale. 
Solenoidal supercoiling provides a much greater degree of compaction. 

24.3 The Structure of Chromosomes 

The term "chromosome" is used to refer to a nucleic 
acid molecule that is the repository of genetic informa- 
tion in a virus, a bacterium, a eukaryotic cell, or an or- 
ganelle. It also refers to the densely colored bodies seen 
in the nuclei of dye-stained eukaryotic cells, as visual- 
ized using a light microscope. 

Chromatin Consists of DNA and Proteins 

The eukaryotic cell cycle (see Fig. 12-43) produces re- 
markable changes in the structure of chromosomes (Fig. 
24-25). In nondividing eukaryotic cells (in GO) and those 
in interphase (Gl, S, and G2), the chromosomal material, 
chromatin, is amorphous and seems to be randomly dis- 
persed in certain parts of the nucleus. In the S phase of 
interphase the DNA in this amorphous state replicates, 
each chromosome producing two sister chromosomes 
(called sister chromatids) that remain associated with 
each other after replication is complete. The chromo- 
somes become much more condensed during prophase of 
mitosis, taking the form of a species-specific number of 
well-defined pairs of sister chromatids (Fig. 24-5). 

Chromatin consists of fibers containing protein and 
DNA in approximately equal proportions (by mass), 
along with a small amount of RNA. The DNA in the chro- 
matin is very tightly associated with proteins called 
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FIGURE 24-25 Changes in chromosome structure during the eukary- 
otic cell cycle. The relative lengths of the phases shown here are for 
convenience only. The duration of each phase varies with cell type 
and growth conditions (for single-celled organisms) or metabolic state 
(for multi-celled organisms); mitosis is typically the shortest. Cellular 
DNA is uncondensed throughout interphase, as shown in the cartoons 
of the nucleus in the diagram. The interphase period can be divided 
(see Fig, 12-43) into the G1 (gap) phase; the S (synthesis) phase, when 
the DNA is replicated; and the G2 phase, throughout which the repli- 
cated chromosomes (chromatids) cohere to each another. Mitosis can 
be divided into four stages. The DNA undergoes condensation in 
prophase. During metaphase, the condensed chromosomes line up in 
pairs along the plane halfway between the spindle poles. The two 
chromosomes of each pair are linked to different spindle poles via mi- 
crotubules that extend between the spindle and the centromere. The 
sister chromatids separate at anaphase, each drawn toward the spin- 
dle pole to which it is connected. The process is completed in 
telophase. After cell division is complete, the chromosomes decon- 
dense and the cycle begins anew. 
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TABLE 24-4 


Types and Properties of Histories 








Molecular 


Number of 
amino acid 


Content of basic amino 
acids (% of total) 


Histone 


weight 


residues 




Arg 


HI* 


21,130 


223 


29.5 


11.3 


H2A* 


13,960 


129 


10.9 


19.3 


H2B* 


13,774 


125 


16.0 


16.4 


H3 


15,273 


135 


19.6 


13.3 


H4 


11,236 


102 


10.8 


13.7 



*The sizes of these histories vary somewhat from species to species. The numbers given here are for bovine histones, 



FIGURE 24-26 Nucleosomes. Regularly spaced nucleosomes consist 
of histone complexes bound to DNA. (a) Schematic illustration and 
(b) electron micrograph. 



histones, which package and order the DNA into struc- 
tural units called nucleosomes (Fig. 24-26). Also 
found in chromatin are many nonhistone proteins, some 
of which help maintain chromosome structure and others 
that regulate the expression of specific genes (Chapter 
28). Beginning with nucleosomes, eukaryotic chromoso- 
mal DNA is packaged into a succession of higher-order 
structures that ultimately yield the compact chromosome 
seen with the light microscope. We now turn to a descrip- 
tion of this structure in eukaryotes and compare it with 
the packaging of DNA in bacterial cells. 

Histories Are Small, Basic Proteins 

Found in the chromatin of all eukaryotic cells, histones 
have molecular weights between 11,000 and 21,000 and 
are very rich in the basic amino acids 
arginine and lysine (together these 
make up about one-fourth of the 
amino acid residues). All eukaryotic 
cells have five major classes of his- 
tones, differing in molecular weight 
and amino acid composition (Table 
24-4). The H3 histones are nearly 
identical in amino acid sequence in all 
eukaryotes, as are the H4 histones, 
suggesting strict conservation of their 
functions. For example, only 2 of 102 
amino acid residues differ between 
the H4 histone molecules of peas and 
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cows, and only 8 differ between the H4 histones of hu- 
mans and yeast. Histones HI, H2A, and H2B show less 
sequence similarity among eukaryotic species. 

Each type of histone is subject to enzymatic modifi- 
cation by methylation, acetylation, ADP-ribosylation, 
phosphorylation, glycosylation, sumoylation, or ubiquiti- 
nation. Such modifications affect the net electric charge, 
shape, and other properties of histones, as well as the 
structural and functional properties of the chromatin, 
and they play a role in the regulation of transcription. 

In addition, eukaryotes generally have several vari- 
ant forms of certain histones, most notably histones 
H2A and H3, described in more detail below. The variant 
forms, along with their modifications, have specialized 
roles in DNA metabolism. 

Nucleosomes Are the Fundamental Organizational 
Units of Chromatin 

The eukaryotic chromosome depicted in Figure 24-5 rep- 
resents the compaction of a DNA molecule about 10 5 /xm 
long into a cell nucleus that is typically 5 to 10 fim in di- 
ameter. This compaction involves several levels of highly 
organized folding. Subjection of chromosomes to treat- 
ments that partially unfold them reveals a structure in 
which the DNA is bound tightly to beads of protein, often 
regularly spaced. The beads in this "beads-on-a-string" 
arrangement are complexes of histones and DNA. The 
bead plus the connecting DNA that leads to the next bead 
form the nucleosome, the fundamental unit of organiza- 
tion on which the higher-order packing of chromatin is 
built (Fig. 24-27). The bead of each nucleosome con- 
tains eight histone molecules: two copies each of H2A, 
H2B, H3, and H4. The spacing of the nucleosome beads 
provides a repeating unit typically of about 200 bp, of 
which 146 bp are bound tightly around the eight-part his- 
tone core and the remainder serve as linker DNA be- 
tween nucleosome beads. Histone HI binds to the linker 
DNA. Brief treatment of chromatin with enzymes that di- 
gest DNA causes preferential degradation of the linker 
DNA, releasing histone particles containing 146 bp of 
bound DNA that have been protected from digestion. Re- 
searchers have crystallized nucleosome cores obtained in 
this way, and x-ray diffraction analysis reveals a particle 
made up of the eight histone molecules with the DNA 
wrapped around it in the form of a left-handed solenoidal 
supercoil (Fig. 24-27). 

A close inspection of this structure reveals why eu- 
karyotic DNA is underwound even though eukaryotic 
cells lack enzymes that underwind DNA. Recall that the 
solenoidal wrapping of DNA in nucleosomes is but one 
form of supercoiling that can be taken up by under- 
wound (negatively supercoiled) DNA. The tight wrap- 
ping of DNA around the histone core requires the 
removal of about one helical turn in the DNA. When the 
protein core of a nucleosome binds in vitro to a relaxed, 
closed-circular DNA, the binding introduces a negative 
supercoil. Because this binding process does not break 




FIGURE 24-27 DNA wrapped around a nucleosome core, (a) Space- 
filling representation of a nucleosome protein core from the African 
frog Xenopus laevis, with different colors for the different histones 
(PDB ID 1 AOI). (b) Top and (c) side views of the crystal structure of a 
nucleosome (gray surface contour) with 146 bp of bound DNA (blue). 
The DNA binds in a left-handed solenoidal supercoil that circumnavi- 
gates the histone complex 1 .8 times. A schematic drawing is included 
in (c) for comparison with other figures depicting nucleosomes. 
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the DNA or change the linking number, the formation of 
a negative solenoidal supercoil must be accompanied by 
a compensatory positive supercoil in the unbound re- 
gion of the DNA (Fig. 24-28). As mentioned earlier, 
eukaryotic topoisomerases can relax positive supercoils. 
Relaxing the unbound positive supercoil leaves the neg- 
ative supercoil fixed (through its binding to the micro- 
some histone core) and results in an overall decrease in 
linking number. Indeed, topoisomerases have proved 
necessary for assembling chromatin from purified his- 
tones and closed-circular DNA in vitro. 

Another factor that affects the binding of DNA to 
histones in nucleosome cores is the sequence of the 
bound DNA. Histone cores do not bind randomly to DNA; 
rather, they tend to position themselves at certain loca- 
tions. This positioning is not fully understood but in some 
cases seems to depend on a local abundance of A=T base 
pairs in the DNA helix where it is in contact with the his- 



DNA 




Bound 1 
negative \ 

supercoil Unbound positive 

(solenoidal) supercoil (plectonemic) 



\Lk - -1 

lopoisomei 



(O 




One (net) negative 
supercoil 



FIGURE 24-28 Chromatin assembly, (a) Relaxed, closed-circular DNA. 
(b) Binding of a histone core to form a nucleosome induces one 
negative supercoiL In the absence of any strand breaks, a positive 
supercoil must form elsewhere in the DNA {ALk = 0). (c) Relaxation of 
this positive supercoil by a topoisomerase leaves one net negative 
supercoil (ALk = -1 ). 




FIGURE 24-29 Positioning of a nucleosome to make optimal use of 
A— T base pairs where the histone core is in contact with the minor 
groove of the DNA helix. 

tones (Fig. 24-29), facilitating the compression of the 
minor groove that is needed for the tight wrapping of the 
DNA around the nucleosome's histone core. A cluster of 
two or three A=T base pairs makes this compression 
more likely. Nucleosomes bind particularly well to se- 
quences where AA or AT or TT dinucleotides are stag- 
gered at 10 bp intervals, an arrangement that can account 
for up to 50% of the positions of bound histones in vivo. 

Other proteins are required for the positioning of 
some nucleosome cores on DNA. In several organisms, 
certain proteins bind to a specific DNA sequence and 
facilitate the formation of a nucleosome core nearby. 
Nucleosomes are deposited on the DNA during replica- 
tion, or following other processes that require a tran- 
sient displacement of nucleosomes. Deposition seems to 
occur in a stepwise manner. A tetramer of two H3 and 
two H4 histones binds first, followed by deposition of 
H2A-H2B dimers. The incorporation of nucleosomes 
into chromosomes after chromosomal replication is me- 
diated by a protein complex, replication-coupling as- 
sembly factor (RCAF). RCAF includes acetylated 
histones H3 and H4, a three-subunit protein known as 
chromatin assembly factor 1 (CAF1), and a protein 
known as anti-silencing factor 1 (ASF1). The mecha- 
nism of nucleosome deposition is not understood in de- 
tail, although parts of the RCAF complex are known to 
directly interact with parts of the replication machinery. 
When nucleosomes must be assembled after DNA repair 
or other processes, RCAF is replaced by other special- 
ized protein complexes to mediate the deposition. His- 
tone exchange factors permit the substitution of histone 
variants for core histones in some contexts. Proper 
placement of these variant histones is important. Stud- 
ies have shown that mice lacking one of these variant 
histones die as early embryos (Box 24-2). Precise 
positioning of nucleosome cores also plays a role in the 
expression of some eukaryotic genes (Chapter 28). 
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BOX 24-2 :;| Epigenetks, Nucleosome Structure, and Histone Variants 



Information that is passed from one generation to the 
next — to daughter cells at cell division or from parent to 
offspring — but is not encoded in DNA sequences is re- 
ferred to as epigenetic information. Much of it is in the 
form of covalent modification of histones and/or the 
placement of histone variants in chromosomes. 

The chromatin regions where active gene expres- 
sion (transcription) is occurring tend to be partially 
decondensed and are called euchromatin. In these re- 
gions, histones H3 and H2A are often replaced by the 
histone variants H3.3 and H2AZ, respectively (Fig. 1). 
The complexes that deposit nucleosomes containing 
histone variants on the DNA are similar to those that 
deposit nucleosomes with the more common histones. 
Nucleosomes containing histone H3.3 are deposited by 
a complex in which chromatin assembly factor 1 
(CAF1) is replaced by the protein HIRA (the name is 
derived from a class of proteins called HIR, for histone 
repressor). Both CAF1 and HIRA can be considered hi- 
stone chaperones, helping to ensure the proper assem- 
bly and placement of nucleosomes. Histone H3.3 
differs in sequence from H3 by only four amino acid 
residues, but these residues all play key roles in his- 
tone deposition. 

Like histone H3.3, H2AZ is associated with a dis- 
tinct nucleosome deposition complex, and it is gener- 
ally associated with chromatin regions involved in 
active transcription. Incorporation of H2AZ stabilizes 
the nucleosome octamer, but impedes some coopera- 
tive interactions between nucleosomes that are needed 
to compact the chromosome. This leads to a more open 
chromosome structure that facilitates the expression 
of genes in the region where H2AZ is located. The 
gene encoding H2AZ is essential in mammals. In fruit 
flies, loss of H2AZ prevents development beyond the 
larval stages. 

Another H2A variant is H2AX, which is associated 
with DNA repair and genetic recombination. In mice, 
the absence of H2AX results in genome instability and 
male infertility. Modest amounts of H2AX seem to be 
scattered throughout the genome. When a double- 
strand break occurs, nearby molecules of H2AX become 
phosphorylated at Ser 139 in the carboxyl-terminal 



region. If this phosphorylation is blocked experimentally, 
formation of the protein complexes necessary for DNA 
repair is inhibited. 

The H3 histone variant known as CENPA is associ- 
ated with the repeated DNA sequences in centromeres. 
The chromatin in the centromere region contains the hi- 
stone chaperones CAF1 and HIRA, and both proteins 
could be involved in the deposition of nucleosomes con- 
taining CENPA. Elimination of the gene for CENPA is 
lethal in mice. 

The function and positioning of the histone variants 
can be studied by an application of technologies used in 
genomics. One useful technology is chromatin immuno- 
precipitation, or chromatin IP (ChIP). Nucleosomes 
containing a particular histone variant are precipitated 
by an antibody that binds specifically to this variant. 
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FIGURE 1 Several variants of histones H3, H2A, and H2B are known. 
Shown here are the core histones and a few of the known variants. 
Sites of Lys/Arg residue methylation and Ser phosphorylation are indi- 
cated. HFD denotes the histone-fold domain/ a structural domain 
common to all core histones. 



Nucleosomes Are Packed into Successively 
Higher-Order Structures 

Wrapping of DNA around a nucleosome core compacts 
the DNA length about sevenfold. The overall com- 
paction in a chromosome, however, is greater than 
10,000-fold — ample evidence for even higher orders of 
structural organization. In chromosomes isolated by 
very gentle methods, nucleosome cores seem to be 
organized into a structure called the 30 nm fiber 



(Fig. 24-30). This packing requires one molecule of 
histone HI per nucleosome core. Organization into 30 
nm fibers does not extend over the entire chromo- 
some but is punctuated by regions bound by se- 
quence-specific (nonhistone) DNA-binding proteins. 
The 30 nm structure also seems to depend on the tran- 
scriptional activity of the particular region of DNA. 
Regions in which genes are being transcribed are ap- 
parently in a less-ordered state that contains little, if 
any, histone HI. 
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These nucleosomes can be studied in isolation from 
their DNA, but more commonly the DNA associated 
with them is included in the study to determine where 
the nucleosomes of interest bind. The DNA can be la- 
beled and used to probe a microarray (see Fig. 9-22), 
yielding a map of genomic sequences to which those 
particular nucleosomes bind. Because microarrays are 
often referred to as chips, this technique is called a 
ChlP-chip experiment (Fig. 2). 



The histone variants, along with the many covalent 
modifications that histones undergo, help define and 
isolate the functions of chromatin. They mark the chro- 
matin, facilitating or suppressing specific functions such 
as chromosome segregation, transcription, and DNA re- 
pair. The histone modifications do not disappear at cell 
division or during meiosis, and thus they become part of 
the information transmitted from one generation to the 
next in all eukaryotic organisms. 
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FIGURE 2 A ChlP-chip experiment is designed to reveal the genomic 
DNA sequences to which a particular histone variant binds, (a) A his- 
tone variant with an epitope tag (a protein or chemical structure recog- 
nized by an antibody; see Chapters 5 and 9) is introduced into a 
particular cell type, where it is incorporated into nucleosomes. (In 
some cases, an epitope tag is unnecessary because antibodies may be 
available that bind directly to the histone modification of interest.) 
Chromatin is isolated from the cells and digested briefly with micro- 
coccal nuclease (MNase). The DNA bound by nucleosomes is protected 
from digestion, but the linker DNA is cleaved, releasing segments of DNA 
bound to one or two nucleosomes. Antibody is added, and the nucle- 



osomes containing the epitope-tagged histone variant are selectively 
precipitated. The DNA in these nucleosomes is extracted from the pre- 
cipitate, labeled, and used to probe a microarray representing all or 
part of the genomic sequences of that particular cell type, (b) The pat- 
tern of hybridization on the microarray reveals the DNA sequences 
bound by the nucleosomes with the histone variant. In the example 
here (top panel), patterns of hybridization are shown for DNA bound 
by histone H3.3 in a small segment of the Drosophila genome. Anno- 
tated genes in this segment of the genome are shown in the bottom 
panel (thickened bars). The histone H3.3 binding is concentrated in 
and near genes undergoing active transcription (red bars). 
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FIGURE 24-30 The 30 nm fiber, a higher-order organization of nucleosomes. 

(a) Schematic illustration of the probable structure of the fiber, showing nucleo- 
some packing, (b) Electron micrograph. 
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FIGURE 24-31 A partially unraveled h uman chromosome, revealing 
numerous loops of DNA attached to a scaffoldlike structure. 

The 30 nm fiber — a second level of chromatin 
organization— provides an approximately 100-fold com- 
paction of the DNA. The higher levels of folding are 
not yet understood, but certain regions of DNA seem to 
associate with a nuclear scaffold (Fig. 24-31). The 
scaffold-associated regions are separated by loops of 
DNA with perhaps 20 to 100 kbp. The DNA in a loop may 
contain a set of related genes. In Drosophila, for exam- 
ple, complete sets of histone-coding genes seem to clus- 
ter together in loops that are bounded by scaffold 
attachment sites (Fig. 24-32). The scaffold itself may 
contain several proteins, notably large amounts of his- 
tone HI (located in the interior of the fiber) and topo- 
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FIGURE 24-32 Loops of chromosomal DNA attached to a nuclear 
scaffold. The DNA in the loops is packaged as 30 nm fibers, so the 
loops are the next level of organization. Loops often contain groups of 
genes with related functions. Complete sets of histone-coding genes, 
as shown in this schematic illustration, seem to be clustered in loops of 
this kind. Unlike most genes, histone genes occur in multiple copies in 
many eukaryotic genomes. 



isomerase II. The presence of topoisomerase II further 
emphasizes the relationship between DNA underwind- 
ing and chromatin structure. Topoisomerase II is so im- 
portant to the maintenance of chromatin structure that 
inhibitors of this enzyme can kill rapidly dividing cells. 
Several drugs used in cancer chemotherapy are topoiso- 
merase II inhibitors that allow the enzyme to promote 
strand breakage but not the resealing of the breaks. 

Evidence exists for additional layers of organiza- 
tion in eukaryotic chromosomes, each dramatically 
enhancing the degree of compaction. One model for 
achieving this compaction is illustrated in Figure 24-33. 
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FIGURE 24-33 Compaction of 
DNA in a eukaryotic chromo- 
some. This mode! shows the 
evels of organization that 
could provide the observed de- 
gree of DNA compaction in 
the chromosomes of eukaryotes. 
The levels take the form of 
coils upon coils. In cells, the 
higher-order structures (above 
the 30 nm fibers) are unlikely to 
be as uniform as depicted here. 
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Higher-order chromatin structure probably varies from 
chromosome to chromosome, from one region to the 
next in a single chromosome, and from moment to mo- 
ment in the life of a cell. No single model can adequately 
describe these structures. Nevertheless, the principle is 
clear: DNA compaction in eukaryotic chromosomes is 

likely to involve COils upon Coils upon coils $ Three- 
Dimensional Packaging of Nuclear Chromosomes 

Condensed Chromosome Structures Are 
Maintained by SMC Proteins 

A third major class of chromatin proteins, in addition to 
the histones and topoisomerases, is the SMC proteins 
(structural maintenance of chromosomes). The primary 
structure of SMC proteins consists of five distinct do- 
mains (Fig. 24-34a). The amino- and carboxyl-terminal 
globular domains, N and C, each of which contains part 
of an ATP-hydrolytic site, are connected by two regions 
of a-helical coiled-coil motifs (see Fig. 4-10) that are 
joined by a hinge domain. The proteins are generally 
dimeric, forming a V-shaped complex that is thought to 
be tied together through the protein's hinge domains 
(Fig. 24-34b, c). One N and one C domain come to- 
gether to form a complete ATP-hydrolytic site at each 
free end of the V. 



(d) Cohesin Condensin 




FIGURE 24-34 Structure of SMC proteins, (a) The five domains of the 
SMC primary structure. N and C denote the amino-terminal and 
carboxyl-terminal domains, (b) Each polypeptide is folded so that the 
two coiled-coil domains wrap around each other and the N and C 
domains come together to form a complete ATP-binding site. Two 
polypeptides are linked at the hinge region to form the dimeric 
V-shaped SMC molecule, (c) Electron micrograph of SMC proteins 



Proteins in the SMC family are found in all types of 
organisms, from bacteria to humans. Eukaryotes have 
two major types, cohesins and condensins, both of 
which are bound by regulatory and accessory proteins 
(Fig. 24-34d). The cohesins play a substantial role in 
linking together sister chromatids immediately after 
replication and keeping them together as the chromo- 
somes condense to metaphase. This linkage is essential 
if chromosomes are to segregate properly at cell divi- 
sion. The cohesins, along with a third protein, kleisin, 
are thought to form a ring around the replicated chro- 
mosomes that ties them together until separation is 
required at cell division. The ring may expand and con- 
tract in response to ATP hydrolysis (Fig. 24-34e). The 
condensins are essential to the condensation of chro- 
mosomes as cells enter mitosis. In the laboratory, con- 
densins bind to DNA in a manner that creates positive 
supercoils; that is, condensin binding causes the DNA 
to become overwound, in contrast to the underwinding 
induced by the binding of nucleosomes. It is not yet 
clear how this helps to compact the chromatin. How- 
ever, wrapping of the DNA about condensin may 
contribute to DNA condensation. The cohesins and 
condensins are essential in orchestrating the many 
changes in chromosome structure during the eukary- 
otic cell cycle (Fig. 24-35). 



(e) 




from Bacillus subtilis. (d) Cohesins are made up of pairs of Smc1 and 
Smc3 proteins, and condensins consist of Smc2-Smc4 pairs. These 
eukaryotic SMC proteins are bound by kleisin and some additional 
regulatory proteins (not shown), (e) ATP hydrolysis may serve to open 
and close the ATPase domain ends of the SMC protein dimer, which 
remain linked by kleisin (and other proteins not shown). 
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FIGURE 24-35 Model for the roles of cohesins and condensins during 
the eukaryotic cell cycle. Cohesins are loaded onto the chromosomes 
during the Gl phase, tethering the sister chromatids together during 
replication. With the onset of mitosis, condensins bind and maintain 



the chromatids in a condensed state. During anaphase, an activity 
called separase removes the cohesin links. Once the chromatids sepa- 
rate, the condensins begin to unload and the daughter chromosomes 
return to an uncondensed state. 



Bacterial DNA Is Also Highly Organized 

We now turn briefly to the structure of bacterial chro- 
mosomes. Bacterial DNA is compacted in a structure 
called the nucleoid, which can occupy a significant 
fraction of the cell volume (Fig. 24-36). The DNA 
seems to be attached at one or more points to the inner 
surface of the plasma membrane. Much less is known 
about the structure of the nucleoid than of eukaryotic 
chromatin, but a complex organization is slowly being 
revealed. InE. coli, a scaffoldlike structure seems to or- 
ganize the circular chromosome into a series of about 
500 looped domains, each encompassing 10,000 bp on 
average (Fig. 24-37), as described above for chro- 
matin. The domains are topologically constrained; for 
example, if the DNA is cleaved in one domain, only the 
DNA within that domain will be relaxed. The domains do 
not have fixed end points. Instead, the boundaries are 
most likely in constant motion along the DNA, coordi- 
nated with DNA replication. Bacterial DNA does not 
seem to have any structure comparable to the local 
organization provided by nucleosomes in eukaryotes. 
Histonelike proteins are abundant in E. coli — the best- 
characterized example is a two-subunit protein called 
HU (M T 19,000) — but these proteins bind and dissociate 
within minutes, and no regular, stable DNA-histone 
structure has been found. The dynamic structural 
changes in the bacterial chromosome may reflect a re- 
quirement for more ready access to its genetic informa- 
tion. The bacterial cell division cycle can be as short as 
15 min, whereas a typical eukaryotic cell may not divide 
for hours or even months. In addition, a much greater 
fraction of bacterial DNA is used to encode RNA and/or 
protein products. Higher rates of cellular metabolism in 
bacteria mean that a much higher proportion of the 
DNA is being transcribed or replicated at a given time 
than in most eukaryotic cells. 
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FIGURE 24-36 E. coli nucleoids. The DNA of these cells is stained with 
a dye that fluoresces when exposed to UV light. The light area defines 
the nucleoid. Note that some cells have replicated their DNA but have 
not yet undergone ceil division and hence have multiple nucleoids. 




FIGURE 24-37 Looped domains of the E. coli chromosome. Each do- 
main is about 1 0,000 bp in length. The domains are not static, but move 
along the DNA as replication proceeds. Barriers at the boundaries of 
the domains, of unknown composition, prevent the relaxation of 
DNA beyond the boundaries of the domain where a strand break 
occurs. The putative boundary complexes are shown as gray shaded 
ovoids. The arrows denote movement of DNA through the boundary 
complexes. 
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With this overview of the complexity of DNA struc- 
ture, we are now ready to turn, in the next chapter, to a 
discussion of DNA metabolism. 
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SUMMARY 24.3 



The Structure of 
Chromosomes 



The fundamental unit of organization in the 
chromatin of eukaryotic cells is the nucleosome, 
which consists of histones and a 200 bp segment 
of DNA. A core protein particle containing eight 
histones (two copies each of histones H2A, H2B, 
H3 ; and H4) is encircled by a segment of DNA 
(about 146 bp) in the form of a left-handed 
solenoidal supercoil. 

Nucleosomes are organized into 30 nm fibers, and 
the fibers are extensively folded to provide the 
10,000-fold compaction required to fit a typical 
eukaryotic chromosome into a cell nucleus. The 
higher-order folding involves attachment to a 
nuclear scaffold that contains histone HI, 
topoisomerase II, and SMC proteins. The SMC 
proteins, principally cohesins and condensins, 
play important roles in keeping the chromosomes 
organized during each stage of the cell cycle. 

Bacterial chromosomes are extensively compacted 
into the nucleoid, but the chromosome seems to 
be much more dynamic and irregular in structure 
than eukaryotic chromatin, reflecting the shorter 
cell cycle and very active metabolism of a 
bacterial cell. 
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Problems . . 

1. Packaging of DNA in a \lrus Bacteriophage T2 has a 
DNA of molecular weight 120 x 10 6 contained in a head about 
210 nm long. Calculate the length of the DNA (assume the mo- 
lecular weight of a nucleotide pair is 650) and compare it with 
the length of the T2 head. 

2. The DNA of Phage M13 The base composition of phage 
M13 DNA is A, 23%; T, 36%; G, 21%; C, 20%. What does this 
tell you about the DNA of phage M13? 

3. The Mycoplasma Genome The complete genome of the 
simplest bacterium known, Mycoplasma genitalium, is a cir- 
cular DNA molecule with 580,070 bp. Calculate the molecular 
weight and contour length (when relaxed) of this molecule. 
What is Lk 0 for the Mycoplasma chromosome? If a = -0.06, 
what is Lk? 

4. Size of Eukaryotic Genes An enzyme isolated from rat 
liver has 192 amino acid residues and is coded for by a gene 
with 1,440 bp. Explain the relationship between the number of 
amino acid residues in the enzyme and the number of nu- 
cleotide pairs in its gene. 

5. Linking Number A closed-circular DNA molecule in its 
relaxed form has an Lk of 500. Approximately how many base 



pairs are in this DNA? How is the linking number altered (in- 
creases, decreases, doesn't change, becomes undefined) when 
(a) a protein complex binds to form a nucleosome, (b) one 
DNA strand is broken, (c) DNA gyrase and ATP are added to 
the DNA solution, or (d) the double helix is denatured by 
heat? 

6. DNA Topology In the presence of a eukaryotic condensin 
and a type II topoisomerase, the Lk of a relaxed closed-circular 
DNA molecule does not change. However, the DNA becomes 
highly knotted. 




Formation of the knots requires breakage of the DNA, passage 
of a segment of DNA through the break, and religation by the 
topoisomerase. Given that every reaction of the topoisomerase 
would be expected to result in a change in linking number, 
how can Lk remain the same? 

7. Superhelical Density Bacteriophage X infects E. coli by 
integrating its DNA into the bacterial chromosome. The suc- 
cess of this recombination depends on the topology of the 
E coli DNA. When the superhelical density (a) of the E. coli 
DNA is greater than -0.045, the probability of integration is 
<20%; when a is less than -0.06, the probability is >70%. 
Plasmid DNA isolated from an E. coli culture is found to have 
a length of 13,800 bp and an Lk of 1,222. Calculate a for this 
DNA and predict the likelihood that bacteriophage X will be 
able to infect this culture. 

8. Altering Linking Number (a) What is the Lk of a 
5,000 bp circular duplex DNA molecule with a nick in one 
strand? (b) What is the Lk of the molecule in (a) when the 
nick is sealed (relaxed)? (c) How would the Lk of the mol- 
ecule in (b) be affected by the action of a single molecule of 
E. coli topoisomerase I? (d) What is the Lk of the molecule 
in (b) after eight enzymatic turnovers by a single molecule 
of DNA gyrase in the presence of ATP? (e) What is the Lk 
of the molecule in (d) after four enzymatic turnovers by a 
single molecule of bacterial type I topoisomerase? (f) 
What is the Lk of the molecule in (d) after binding of one 
nucleosome? 

9. Chromatin Early evidence that helped researchers define 
nucleosome structure is illustrated by the agarose gel below, in 
which the thick bands represent DNA. It was generated by 
briefly treating chromatin with an enzyme that degrades DNA, 
then removing all protein and subjecting the purified DNA to 
electrophoresis. Numbers at the side of the gel denote the po- 
sition to which a linear DNA of the indicated size would mi- 
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grate. What does this gel tell you about chromatin structure? 
Why are the DNA bands thick and spread out rather than 
sharply defined? 



to compensate. In the experiment shown here, topoisomerases 
were used to make preparations of the same DNA circle with 
different superhelical densities (a). Completely relaxed DNA 
migrated to the position labeled N (for kicked), and highly su- 
percoiled DNA (above the limit where individual topoisomers 
can be distinguished) to the position labeled X. 




10. DNA Structure Explain how the underwinding of a 
B-DNA helix might facilitate or stabilize the formation of Z-DNA. 

11. Maintaining DNA Structure (a) Describe two struc- 
tural features required for a DNA molecule to maintain a neg- 
atively supercoiled state, (b) List three structural changes 
that become more favorable when a DNA molecule is nega- 
tively supercoiled. (c) What enzyme, with the aid of ATP, can 
generate negative superhelicity in DNA? (d) Describe the 
physical mechanism by which this enzyme acts. 

12. Yeast Artificial Chromosomes (YACs) YACs are used 
to clone large pieces of DNA in yeast cells. What three types of 
DNA sequence are required to ensure proper replication and 
propagation of a YAC in a yeast cell? 

13. Nucleoid Structure in Bacteria In bacteria, the tran- 
scription of a subset of genes is affected by DNA topology, with 
expression increasing or (more often) decreasing when the 
DNA is relaxed. When a bacterial chromosome is cleaved at a 
specific site by a restriction enzyme (one that cuts at a long, 
and thus rare, sequence), only nearby genes (within 10,000 bp) 
exhibit either an increase or decrease in expression. The 
transcription of genes elsewhere in the chromosome is unaf- 
fected. Explain (Hint: See Fig. 24-37.) 

14. DNA Topoisomers When DNA is subjected to elec- 
trophoresis in an agarose gel, shorter molecules migrate faster 
than longer ones. Closed-circular DNAs of the same size but 
different linking number also can be separated on an agarose 
gel: topoisomers that are more supercoiled, and thus more con- 
densed, migrate faster through the gel — from top to bottom in 
the gels shown at right. A dye, chloroquine, was added to these 
gels. Chloroquine intercalates between base pairs and stabilizes 
a more underwound DNA structure. When the dye binds to a 
relaxed, closed-circular DNA, the DNA is underwound where 
the dye binds, and unbound regions take on positive supercoils 
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Gel B 

(a) In gel A, why does the a = 0 lane (i.e., DNA prepared 
so that a = 0, on average) have multiple bands? 

(b) In gel B, is the DNA from the a = 0 preparation 
positively or negatively supercoiled in the presence of the 
intercalating dye? 

(c) In both gels, the cr = -0.115 lane has two bands, one 
a highly supercoiled DNA and one relaxed. Propose a reason 
for the presence of relaxed DNA in these lanes (and others). 

(d) The native DNA (leftmost lane in each gel) is the same 
DNA circle isolated from bacterial cells and untreated. What is 
the approximate superhelical density of this native DNA? 
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15. Defining the Functional Elements of Yeast Chromo- 
somes Figure 24-9 shows the major structural elements of a 
chromosome of baker's yeast (Saccharomyces cerevisiae). 
Heiter, Mann, Snyder, and Davis (1985) determined the prop- 
erties of some of these elements. They based their study on 
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the finding that in yeast cells, plasmids (which have genes and 
an origin of replication) act differently from chromosomes 
(which have these elements plus centromeres and telomeres) 
during mitosis. The plasmids are not manipulated by the mi- 
totic apparatus and segregate randomly between daughter 
cells. Without a selectable marker to force the host cells to re- 
tain them (see Fig. 9-4), these plasmids are rapidly lost. In 
contrast, chromosomes, even without a selectable marker, are 
manipulated by the mitotic apparatus and are lost at a very low 
rate (about 10" 5 per cell division). 

Heiter and colleagues set out to determine the important 
components of yeast chromosomes by constructing plasmids 
with various parts of chromosomes and observing whether 
these "synthetic chromosomes" segregated properly during 
mitosis. To measure the rates of different types of failed chro- 
mosome segregation, the researchers needed a rapid assay to 
determine the number of copies of synthetic chromosomes 
present in different cells. This assay took advantage of the fact 
that wild-type yeast colonies are white whereas certain ade- 
nine-requiring (ade") mutants yield red colonies on nutrient 
media. Specifically, ade2~ cells lack functional AIR carboxylase 
(the enzyme of step © in Figure 22-33) and accumulate AIR 
(5-aminoimidazole ribonucleotide) in their cytoplasm. This ex- 
cess AIR is converted to a conspicuous red pigment. The other 
part of the assay involved the gene SUP11, which encodes an 
ochre suppressor (a type of nonsense suppressor; see Box 27-4) 
that suppresses the phenotype of some ade2~ mutants. 

Heiter and coworkers started with a diploid strain of yeast 
homozygous for ade2~\ these cells are red. When the mutant 
cells contain one copy of SUP1 1 , the metabolic defect is partly 
suppressed and the cells are pink. When the cells contain two 
or more copies of SUP11, the defect is completely suppressed 
and the cells are white. 

The researchers inserted one copy of SUP 11 into syn- 
thetic chromosomes containing various elements thought to 
be important in chromosome function, and then observed how 
well these chromosomes were passed from one generation to 
the next. These pink cells were plated on nonselective media, 
and the behavior of the synthetic chromosomes was observed. 
Specifically, Heiter and coworkers looked for colonies in which 
the synthetic chromosomes segregated improperly at the first 
division after plating, giving rise to a colony that is half one 
genotype and half the other. Because yeast cells are non- 
motile, this will be a sectored colony, with one half one color 
and the other half another color. 

(a) One way for the mitotic process to fail is nondisjunc- 
tion: the chromosome replicates but the sister chromatids fail 
to separate, so both copies of the chromosome end up in the 
same daughter cell. Explain how nondisjunction of the syn- 
thetic chromosome would give rise to a colony that is half red 
and half white. 

(b) Another way for the mitotic process to fail is chromo- 
some loss: the chromosome does not enter the daughter nu- 
cleus or is not replicated. Explain how loss of the synthetic 



chromosome would give rise to a colony that is half red and 
half pink. 

By counting the frequency of the different colony types, 
Heiter and colleagues could estimate the frequency of these 
aberrant mitotic events with different types of synthetic chro- 
mosome. First, they explored the requirement for centromeric 
sequences by constructing synthetic chromosomes with dif- 
ferent-sized DNA fragments containing a known centromere. 
Their results are shown below. 

Size of 
centromere- 
Synthetic containing Chromosome Nondisjunction 
chromosome fragment (kbp) loss (%) (%) 
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none 




>50 


2 


0.63 


1.6 


1.1 


3 


1.6 


1.9 


0.4 


4 


3.0 


1.7 


0.35 


5 


6.0 


1.6 


0.35 



(c) Based on these data, what can you conclude about the 
size of the centromere required for normal mitotic segrega- 
tion? Explain your reasoning. 

(d) Interestingly, all the synthetic chromosomes created 
in these experiments were circular and lacked telomeres. Ex- 
plain how they could be replicated more-or-less properly. 

Heiter and colleagues next constructed a series of linear 
synthetic chromosomes that included the functional cen- 
tromeric sequence and telomeres, and measured the total mi- 
totic error rate (% loss + % nondisjunction) as a function of size: 



Synthetic 


Size 


Total error 


chromosome 


(kbp) 


rate (%) 


6 


15 


11.0 


7 


55 


1.5 




95 


0.44 


9 


137 


0.14 



(e) Based on these data, what can you conclude about the 
chromosome size required for normal mitotic segregation? 
Explain your reasoning. 

(f) Normal yeast chromosomes are linear, range from 250 
kbp to 2,000 kbp in length, and have a mitotic error rate of 
about 10~ 5 per cell division. Extrapolating the results from (e), 
do the centromeric and telomeric sequences used in these ex- 
periments explain the mitotic stability of normal yeast chro- 
mosomes, or must other elements be involved? Explain your 
reasoning. (Hint: A plot of log (error rate) vs. length will be 
helpful.) 

Reference 
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I have never known a dull enzyme. 

— Arthur Kornberg, in the essay "For Love of Enzymes/' 1975 
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As the repository of genetic information, DNA occu- 
pies a unique and central place among biological 
macromolecules. The nucleotide sequences of 
DNA encode the primary structures of all cellular RNAs 
and proteins and, through enzymes, indirectly affect the 
synthesis of all other cellular constituents. This passage 
of information from DNA to RNA and protein guides the 
size, shape, and functioning of every living thing. 

DNA is a marvelous device for the stable storage of 
genetic information. The phrase "stable storage," how- 
ever, conveys a static and misleading picture. It fails to 
capture the complexity of processes by which genetic 
information is preserved in an uncorrupted state and 
then transmitted from one generation of cells to the 
next. DNA metabolism comprises both the process that 
gives rise to faithful copies of DNA molecules (replica- 
tion) and the processes that affect the inherent struc- 
ture of the information (repair and recombination). 
Together, these activities are the focus of this chapter. 

The metabolism of DNA is shaped by the require- 
ment for an exquisite degree of accuracy. The chemistry 
of joining one nucleotide to the next in DNA replication 
is elegant and simple, almost deceptively so. As we shall 
see, complexity arises in the form of enzymatic devices 
to ensure that the genetic information is transmitted in- 
tact. Uncorrected errors that arise during DNA synthe- 
sis can have dire consequences, not only because they 
can permanently affect or eliminate the function of a 
gene but also because the change is inheritable. 

The enzymes that synthesize DNA may copy DNA 
molecules that contain millions of bases. They do so 
with extraordinary fidelity and speed, even though the 
DNA substrate is highly compacted and bound with 
other proteins. Formation of phosphodiester bonds to 



link nucleotides in the backbone of a growing DNA 
strand is therefore only one part of an elaborate process 
that requires myriad proteins and enzymes. 

Maintaining the integrity of genetic information lies 
at the heart of DNA repair. As detailed in Chapter 8, 
DNA is susceptible to many types of damaging reac- 
tions. Such reactions are infrequent but significant nev- 
ertheless, because of the very low biological tolerance 
for changes in DNA sequence. DNA is the only macro- 
molecule for which repair systems exist; the number, 
diversity, and complexity of DNA repair mechanisms 
reflect the wide range of insults that can harm DNA. 

Cells can rearrange their genetic information by 
processes collectively called recombination — seemingly 
undermining the principle that the stability and in- 
tegrity of genetic information are paramount. However, 
most DNA rearrangements in fact play constructive 
roles in maintaining genomic integrity, contributing in 
special ways to DNA replication, DNA repair, and chro- 
mosome segregation. 

Special emphasis is given in this chapter to the en- 
zymes of DNA metabolism. They merit careful study not 
only because of their intrinsic biological importance and 
interest but also for their increasing importance in med- 
icine and for their everyday use as reagents in a wide 
range of modern biochemical technologies. Many of 
the seminal discoveries in DNA metabolism have been 
made with Escherichia coli, so its well-understood en- 
zymes are generally used to illustrate the ground rules. A 
quick look at some relevant genes on the E. coli genetic 
map (Fig. 25-1) provides just a hint of the complexity 
of the enzymatic systems involved in DNA metabolism. 

Before taking a closer look at replication, we must 
make a short digression into the use of abbreviations in 
naming bacterial genes and proteins— you will encounter 
many of these in this and later chapters. Similar conven- 
tions exist for naming eukaryotic genes, although the 
exact form of the abbreviations may vary with the 
species and no single convention applies to all eukary- 
otic systems. 
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Mismatch repair protein mutL 
Single-stranded DNA-binding protein ssb 

DNA repair uvrA 
Helicase dnaB 
RNA polymerase frpoB 
subunits \rpoC j 



DNA polymerase I polA 
DNA helicase/mismatch repair uvrD 



holC DNA polymerase III subunit 



Helicase 3' 



( Replication origin ) \oriC 

Replication initiation < dnaN { 
LdnaA 

Recombinational repair recF 
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recombinational repair 



Recombination and 
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holD DNA polymerase III subunit 
dnaC Primosome component 

polB 1 1 lNA polymerase II 

mutT >■ Oxo-dGTP degradation 
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dnaQ \ N T A polymerase III subunit 
dinB NA polymerase IV 
recR Recombinational repair 



holA DNA polymerase III subunit 
phr DNA photolyase 
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DNA gyrase subunit gyrA 

FIGURE 25-1 Map of the £ coli chromosome. The map shows the 
relative positions of genes encoding many of the proteins important in 
DNA metabolism. The number of genes known to be involved provides 
a hint of the complexity of these processes. The numbers 0 to 100 in- 
side the circular chromosome denote a genetic measurement called 
minutes. Each minute corresponds to -40,000 bp along the DNA mol- 



uvrB DNA repair 
holB DNA polymerase III subunit 



~{lmul)} DNA P^erae V 



ogt 0 6 -G alkyltransferase 



Ter (Replication termination) 
xthA AP endonuclease 
holE DNA polymerase III subunit 

' ruvCl 

i ruvA V Recombination and recombinational repair 
LruvBj 



uvrC DNA repair 

sbcB Exonuclease I 
i 

nfo AP endonuclease 

ecule. The three-letter names of genes and other elements generally re- 
flect some aspect of their function. These include mut, mutagenesis; 
dna, DNA replication; pol, DNA polymerase; rpo, RNA polymerase; 
uvr, UV resistance; rec, recombination; dam, DNA adenine methyla- 
tion; lig, DNA //gase;Ter, termination of replication; and ori, or/gin of 
replication {oriC in E. coli, as shown here). 



KEY CONVENTION: Bacterial genes generally are named us- 
ing three lowercase, italicized letters that often reflect a 
gene's apparent function. For example, the dna, uvr, and 
rec genes affect DNA replication, resistance to the damag- 
ing effects of UV radiation, and recombination, respec- 
tively. Where several genes affect the same process, the 
letters A, B, C, and so forth, are added— as in dnaA, dnaB, 
dnaQ, for example— usually reflecting their order of dis- 
covery rather than their order in a reaction sequence. ■ 

The use of abbreviations in naming proteins is less 
straightforward. During genetic investigations, the protein 
product of each gene is usually isolated and characterized. 



Many bacterial genes have been identified and named be- 
fore the roles of their protein products are understood in 
detail. Sometimes the gene product is found to be a previ- 
ously isolated protein, and some renaming occurs. Often, 
however, the product turns out to be an as yet unknown 
protein, with an activity not easily described by a simple 
enzyme name. 

KEY CONVENTION: Bacterial proteins often retain the name 
of their genes. When referring to the protein product of 
an E. coli gene, roman type is used and the first letter is 
capitalized: for example, the dnaA and recA gene prod- 
ucts are the DnaA and RecA proteins, respectively. ■ 
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Long before the structure of DNA was known, scien- 
tists wondered at the ability of organisms to create 
faithful copies of themselves and, later, at the ability of 
cells to produce many identical copies of large, com- 
plex macromolecules. Speculation about these prob- 
lems centered around the concept of a template, a 
structure that would allow molecules to be lined up in 
a specific order and joined to create a macromolecule 
with a unique sequence and function. The 1940s 
brought the revelation that DNA was the genetic mol- 
ecule, but not until James Watson and Francis Crick 
deduced its structure did the way in which DNA 
could act as a template for the replication and trans- 
mission of genetic information become clear: one 
strand is the complement of the other. The strict 
base-pairing rules mean that each strand provides 
the template for a new strand with a predictable 
and complementary sequence (see Figs 8-14, 8-15). 

j| Nucleotides: Building Blocks of Nucleic Acids 

The fundamental properties of the DNA replication 
process and the mechanisms used by the enzymes that 
catalyze it have proved to be essentially identical in all 
species. This mechanistic unity is a major theme as we 
proceed from general properties of the replication 
process, to E % coli replication enzymes, and, finally, to 
replication in eukaryotes. 

DNA Replication Follows a Set of Fundamental Rules 

Early research on bacterial DNA replication and its 
enzymes helped to establish several basic properties 
that have proven applicable to DNA synthesis in every 
organism. 



DNA extracted and centrifuged 
to equilibrium in CsCl 
density gradient 



(a) 



Heavy 
DNA( 15 Nj - 



Original parent 
molecule 



m 



Hybrid DNA 
( 15 N _14 N ) _ 




First-generation 
daughter molecules 




light 

DNA( 14 N) - 

(c> Hybrid DNA— 



Secon d-genera tion 
daughter molecules 

FIGURE 25-2 The Meselson-Stahl experiment, (a) Cells were grown 
for many generations in a medium containing only heavy nitrogen, 
15 N, so that ail the nitrogen in their DNA was 15 N, as shown by a sin- 
gle band (blue) when centrifuged in a CsCl density gradient, (b) Once 
the cells had been transferred to a medium containing only light nitro- 
gen, 14 N, cellular DNA isolated after one generation equilibrated at a 
higher position in the density gradient (purple band), (c) A second cy- 
cle of replication yielded a hybrid DNA band (purple) and another 
band (red), containing only [ 14 N]DNA, confirming semiconservative 
replication. 



DNA Replication Is Semiconservative Each DNA 
strand serves as a template for the synthesis of a new 
strand, producing two new DNA molecules, each with 
one new strand and one old strand. This is semiconser- 
vative replication. 

Watson and Crick proposed the hypothesis of semi- 
conservative replication soon after publication of their 
1953 paper on the structure of DNA, and the hypothesis 
was proved by ingeniously designed experiments car- 
ried out by Matthew Meselson and Franklin Stahl in 
1957. Meselson and Stahl grew E. coli cells for many 
generations in a medium in which the sole nitrogen 
source (NH 4 C1) contained 15 N, the "heavy" isotope of ni- 
trogen, instead of the normal, more abundant "light" iso- 
tope, 14 N. The DNA isolated from these cells had a 
density about 1% greater than that of normal [ 14 N]DNA 
(Fig. 25-2a). Although this is only a small difference, a 
mixture of heavy [ 15 N]DNA and light [ 14 N]DNA can be 
separated by centrifugation to equilibrium in a cesium 
chloride density gradient. 

The E. coli cells grown in the 15 N medium were 
transferred to a fresh medium containing only the 14 N 



isotope, where they were allowed to grow until the cell 
population had just doubled. The DNA isolated from 
these first-generation cells formed a single band in the 
CsCl gradient at a position indicating that the double- 
helical DNA molecules of the daughter cells were hy- 
brids containing one new 14 N strand and one parent 15 N 
strand (Fig. 25-2b). 

This result argued against conservative replication, 
an alternative hypothesis in which one progeny DNA 
molecule would consist of two newly synthesized DNA 
strands and the other would contain the two parent 
strands; this would not yield hybrid DNA molecules in 
the Meselson-Stahl experiment. The semiconservative 
replication hypothesis was further supported in the next 
step of the experiment (Fig. 25-2c). Cells were again al- 
lowed to double in number in the 14 N medium. The iso- 
lated DNA product of this second cycle of replication 
exhibited two bands in the density gradient, one with a 
density equal to that of light DNA and the other with the 
density of the hybrid DNA observed after the first cell 
doubling. 
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Replication Begins at an Origin and Usually Pro- 
ceeds Bidirectionally Following the confirmation of 
a semiconservative mechanism of replication, a host of 
questions arose. Are the parent DNA strands completely 
unwound before each is replicated? Does replication 
begin at random places or at a unique point? After initi- 
ation at any point in the DNA, does replication proceed 
in one direction or both? 

An early indication that replication is a highly coor- 
dinated process in which the parent strands are simulta- 
neously unwound and replicated was provided by John 
Cairns, using autoradiography He madeE" coli DNA ra- 
dioactive by growing cells in a medium containing 
thymidine labeled with tritium ( 3 H). When the DNA was 
carefully isolated, spread, and overlaid with a photo- 
graphic emulsion for several weeks, the radioactive 
thymidine residues generated "tracks" of silver grains in 
the emulsion, producing an image of the DNA molecule. 
These tracks revealed that the intact chromosome of 
E. coli is a single huge circle, 1.7 mm long. Radioactive 
DNA isolated from cells during replication showed an 
extra loop (Fig. 25-3a). Cairns concluded that the loop 
resulted from the formation of two radioactive daughter 
strands, each complementary to a parent strand. One or 
both ends of the loop are dynamic points, termed repli- 
cation forks, where parent DNA is being unwound and 
the separated strands quickly replicated. Cairns's 
results demonstrated that both DNA strands are repli- 
cated simultaneously, and a variation on his experiment 
(Fig. 25-3b) indicated that replication of bacterial chro- 
mosomes is bidirectional: both ends of the loop have 
active replication forks. 

The determination of whether the replication loops 
originate at a unique point in the DNA required land- 
marks along the DNA molecule, These were provided by 
a technique called denaturation mapping, developed 
by Ross Inman and colleagues. Using the 48,502 bp chro- 
mosome of bacteriophage A, Inman showed that DNA 
could be selectively denatured at sequences unusually 
rich in A=T base pairs, generating a reproducible pat- 
tern of single-strand bubbles (see Fig. 8-28). Isolated 
DNA containing replication loops can be partially dena- 
tured in the same way. This allows the position and 
progress of the replication forks to be measured and 
mapped, using the denatured regions as points of refer- 
ence. The technique revealed that in this system the 
replication loops always initiate at a unique point, which 
was termed an origin. It also confirmed the earlier ob- 
servation that replication is usually bidirectional. For 
circular DNA molecules, the two replication forks meet 
at a point on the side of the circle opposite to the origin. 
Specific origins of replication have since been identified 
and characterized in bacteria and lower eukaryotes. 

DNA Synthesis Proceeds in a 5 '^3' Direction and 
Is Semidiscontinuous A new strand of DNA is always 
synthesized in the 5'->3' direction, with the free 3' OH 
as the point at which the DNA is elongated (the 5' and 3' 



(a) 




FIGURE 25-3 Visualization of bidirectional DNA replication, (a) Replica- 
tion of a circular chromosome produces a structure resembling the 
Greek letter theta, 6, as both strands are replicated simultaneously (new 
strands shown in pink), (b) Replication could be either unidirectional or 
bidirectional, and these can be distinguished by autoradiography: when 
tritium ( 3 H) is added for a short period just before replication is stopped, 
label (pink) would be found at one or both replication forks, respec- 
tively. This technique has revealed bidirectional replication in £ coli, 
Bacillus subtilis, and other bacteria. The autoradiogram here shows a 
replication bubble from B. subtilis. The heaviest grain density (arrows) is 
at the two ends, where replication is occurring. The unreplicated part of 
the chromosome, outside the bubble, is unlabeled and thus not visible, 

ends of a DNA strand are denned in Fig. 8-7). Because 
the two DNA strands are antiparallel, the strand serving 
as the template is read from its 3' end toward its 5' end. 

If synthesis always proceeds in the 5'^>3' direction, 
how can both strands be synthesized simultaneously? If 
both strands were synthesized continuously while the 
replication fork moved, one strand would have to un- 
dergo 3'-^5' synthesis. This problem was resolved by 
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specific nucleotide sequences (such as the restriction en- 
donucleases that are so important in biotechnology; see 
Chapter 9, Fig. 9-2). You will encounter many types of 
nucleases in this and subsequent chapters. 
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FIGURE 25-4 Defining DNA strands at the replication fork. A new 

DNA strand (red) is always synthesized in the 5'— »3' direction. The 
template is read in the opposite direction, 3'— »5'. The leading strand is 
continuously synthesized in the direction taken by the replication fork. 
The other strand, the lagging strand, is synthesized discontinuously in 
short pieces (Okazaki fragments) in a direction opposite to that in 
which the replication fork moves. The Okazaki fragments are spliced 
together by DNA ligase. In bacteria, Okazaki fragments are —1,000 to 
2,000 nucleotides long. In eukaryotic cells, they are 150 to 200 nu- 
cleotides long. 

Reiji Okazaki and colleagues in the 1960s. Okazaki found 
that one of the new DNA strands is synthesized in short 
pieces, now called Okazaki fragments. This work 
ultimately led to the conclusion that one strand is syn- 
thesized continuously and the other discontinuously 
(Fig. 25-4). The continuous strand, or leading strand, 
is the one in which 5'-»3' synthesis proceeds in the 
same direction as replication fork movement. The dis- 
continuous strand, or lagging strand, is the one in 
which 5'— »3' synthesis proceeds in the direction oppo- 
site to the direction of fork movement. Okazaki frag- 
ments range in length from a few hundred to a few 
thousand nucleotides, depending on the cell type. As we 
shall see later, leading and lagging strand syntheses are 
tightly coordinated. 

DNA is Degraded by Nucleases 

To explain the enzymology of DNA replication, we first in- 
troduce the enzymes that degrade DNA rather than syn- 
thesize it. These enzymes are known as nucleases, or 
DNases if they are specific for DNA rather than RNA. 
Every cell contains several different nucleases, belonging 
to two broad classes: exonucleases and endonucleases. 
Exonucleases degrade nucleic acids from one end of the 
molecule. Many operate in only the 5'^3' or the 3'— >5' 
direction, removing nucleotides only from the 5' or the 3' 
end, respectively, of one strand of a double-stranded nu- 
cleic acid or of a single-stranded DNA. Endonucleases 
can begin to degrade at specific internal sites in a nucleic 
acid strand or molecule, reducing it to smaller and 
smaller fragments. A few exonucleases and endonucle- 
ases degrade only single-stranded DNA. There are a few 
important classes of endonucleases that cleave only at 



DNA Is Synthesized by DNA Polymerases 

f m The search for an enzyme that 

could synthesize DNA began in 
1955. Work by Arthur Korn- 
berg and colleagues led to the 
purification and characteriza- 
tion of a DNA polymerase from 
E. coli cells, a single-polypep- 
tide enzyme now called DNA 
polymerase I (M T 103,000; 
■ ^g" m m encoded by the polA gene). 

I L ; v ^ Much later, investigators found 

that E. coli contains at least 
™ four other distinct DNA poly- 

Arthur Kornberg, 1 91 8-2007 meraseSj described below. 

Detailed studies of DNA polymerase I revealed fea- 
tures of the DNA synthetic process that are now known 
to be common to all DNA polymerases. The fundamental 
reaction is a phosphoryl group transfer. The nucleophile 
is the 3'-hydroxyl group of the nucleotide at the 3' end 
of the growing strand. Nucleophilic attack occurs at the 
a phosphorus of the incoming deoxynucleoside 5'- 
triphosphate (Fig. 25-5). Inorganic pyrophosphate is 
released in the reaction. The general reaction is 




(dNMP)„ + dNTP 
DNA 



(dNMP)„ +1 + PPi 
Lengthened 
DNA 



(25-1) 



where dNMP and dNTP are deoxynucleoside 5'- 
monophosphate and 5 '-triphosphate, respectively. The 
reaction seems to proceed with only a minimal change in 
free energy, given that one phosphodiester bond is 
formed at the expense of a somewhat less stable phos- 
phate anhydride. However, noncovalent base-stacking 
and base-pairing interactions provide additional stabi- 
lization to the lengthened DNA product relative to the 
free nucleotide. Also, the formation of products is facili- 
tated in the cell by the 19 kJ/mol generated in the subse- 
quent hydrolysis of the pyrophosphate product by the 
enzyme pyrophosphatase (p. 508). 

Early work on DNA polymerase I led to the defini- 
tion of two central requirements for DNA polymeriza- 
tion. First, all DNA polymerases require a template. The 
polymerization reaction is guided by a template DNA 
strand according to the base-pairing rules predicted by 
Watson and Crick: where a guanine is present in the 
template, a cytosine deoxynucleotide is added to the 
new strand, and so on. This was a particularly important 
discovery, not only because it provided a chemical basis 
for accurate semiconservative DNA replication but also 
because it represented the first example of the use of a 
template to guide a biosynthetic reaction. 
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MECHANISM FIGURE 25-5 Elongation of a DNA chain, (a) DNA poly- 
merase I activity requires a single unpaired strand to act as template 
and a primer strand to provide a free hydroxyl group at the 3' end, to 
which a new nucleotide unit is added. Each incoming nucleotide is se- 
lected in part by base pairing to the appropriate nucleotide in the tem- 
plate strand. The reaction product has a new free 3' hydroxyl, allowing 
the addition of another nucleotide, (b) The catalytic mechanism likely 
involves two Mg 2+ ions, coordinated to the phosphate groups of the 




DNA 
polymerase* 



OH H 



incoming nucleotide triphosphate and to three Asp residues, two of 
which are highly conserved in all DNA polymerases. The Mg 2+ ion de- 
picted on the right facilitates attack of the 3'-hydroxyl group of the 
primer on the a phosphate of the nucleotide triphosphate; the other 
Mg 2+ ion facilitates displacement of the pyrophosphate. Both ions sta- 
bilize the structure of the pentacovalent transition state. RNA poly- 
merases use a similar mechanism (see Fig. 26-1 b). £ Nucleotide 
Polymerization by DNA Polymerase 



Second, the polymerases require a primer. A primer is 
a strand segment (complementary to the template) with a 
free 3 '-hydroxyl group to which a nucleotide can be added; 
the free 3' end of the primer is called the primer termi- 
nus. In other words, part of the new strand must already be 
in place: all DNA polymerases can only add nucleotides to a 
preexisting strand. Many primers are oligonucleotides of 
RNA rather than DNA, and specialized enzymes synthesize 
primers when and where they are required. 

After adding a nucleotide to a growing DNA strand, 
a DNA polymerase either dissociates or moves along the 
template and adds another nucleotide. Dissociation and 
reassociation of the polymerase can limit the overall 
polymerization rate— the process is generally faster 
when a polymerase adds more nucleotides without 
dissociating from the template. The average number of 
nucleotides added before a polymerase dissociates de- 
fines its processivity. DNA polymerases vary greatly in 
processivity; some add just a few nucleotides before dis- 
sociating, Others add many thousands, g Nucleotide Poly- 
merization by DNA Polymerase 

Replication Is Very Accurate 

Replication proceeds with an extraordinary degree of 
fidelity. In E. coli, a mistake is made only once for every 



10 9 to 10 10 nucleotides added. For the E. coli chromo- 
some of -4.6 x 10 6 bp, this means that an error occurs 
only once per 1,000 to 10,000 replications. During poly- 
merization, discrimination between correct and incor- 
rect nucleotides relies not just on the hydrogen bonds 
that specify the correct pairing between complementary 
bases but also on the common geometry of the standard 
A=T and G=C base pairs (Fig. 25-6). The active site 
of DNA polymerase I accommodates only base pairs 
with this geometry. An incorrect nucleotide may be able 
to hydrogen-bond with a base in the template, but it 
generally will not fit into the active site. Incorrect bases 
can be rejected before the phosphodiester bond is 
formed. 

The accuracy of the polymerization reaction itself, 
however, is insufficient to account for the high degree of 
fidelity in replication. Careful measurements in vitro 
have shown that DNA polymerases insert one incorrect 
nucleotide for every 10 4 to 10 5 correct ones. These mis- 
takes sometimes occur because a base is briefly in an un- 
usual tautomeric form (see Fig. 8-9), allowing it to 
hydrogen-bond with an incorrect partner. In vivo, the er- 
ror rate is reduced by additional enzymatic mechanisms. 

One mechanism intrinsic to virtually all DNA poly- 
merases is a separate 3 »5' exonuclease activity that 
double-checks each nucleotide after it is added. This 
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FIGURE 25-6 Contribution of base-pair geometry to the fidelity of DNA 
replication, (a) The standard A=T and G^C base pairs have very similar 
geometries, and an active site sized to fit one (blue shading) will gener- 
ally accommodate the other, (b) The geometry of incorrectly paired bases 
can exclude them from the active site, as occurs on DNA polymerase. 

nuclease activity permits the enzyme to remove a newly 
added nucleotide and is highly specific for mismatched 
base pairs (Fig. 25-7). If the polymerase has added 
the wrong nucleotide, translocation of the enzyme to 
the position where the next nucleotide is to be added 
is inhibited. This kinetic pause provides the opportu- 
nity for a correction. The 3'— »5' exonuclease activity 




DNA polymerase I 

DNA polymerase 
active site 

3'->5' (proofreading) 
exonuclease 
active site 



is a rare tautomeric 
form of cytosine (C*) 
that pairs with A and 
is incorporated into 
the growing strand. 



Before the polymerase 
moves on, the cytosine 
undergoes a tautomeric 
shift from C ' to C. The 
new nucleotide is now 
mis paired. 



The mispaired 3' -OH 
end of the growing 
strand blocks further 
elongation. DNA 
polymerase slides back 
to position the 
mispaired base in the 
3'-»5' exonuclease 
active site. 



The mispaired 
rj nucleotide is removed 

Wv 

DNA polymerase slides 
forward and resumes its 
polymerization activity. 




FIGURE 25-7 An example of error correction by the 3'->5' exonu- 
clease activity of DNA polymerase I. Structural analysis has located 
the exonuclease activity ahead of the polymerase activity as the en- 
zyme is oriented in its movement along the DNA. A mismatched base 
(here, a C-A mismatch) impedes translocation of DNA polymerase I to 
the next site. Sliding backward, the enzyme corrects the mistake with 
its 3'->5' exonuclease activity, then resumes its polymerase activity in 
the 5'— >3' direction. 

removes the mispaired nucleotide, and the polymerase 
begins again. This activity, known as proofreading, is 
not simply the reverse of the polymerization reaction 
(Eqn 25-1), because pyrophosphate is not involved. 
The polymerizing and proofreading activities of a DNA 
polymerase can be measured separately. Proofreading 
improves the inherent accuracy of the polymerization 
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reaction 10 2 - to 10 3 -fold. In the monomeric DNA poly- 
merase I, the polymerizing and proofreading activities 
have separate active sites within the same polypeptide. 

When base selection and proofreading are com- 
bined, DNA polymerase leaves behind one net error for 
every 10 6 to 10 s bases added. Yet the measured accu- 
racy of replication in E. coli is higher still. The addi- 
tional accuracy is provided by a separate enzyme system 
that repairs the mismatched base pairs remaining after 
replication. We describe this mismatch repair, along 
with other DNA repair processes, in Section 25.2. 

£ coli Has at Least Five DNA Polymerases 

More than 90% of the DNA polymerase activity ob- 
served in E. coli extracts can be accounted for by DNA 
polymerase I. Soon after the isolation of this enzyme in 
1955, however, evidence began to accumulate that it is 
not suited for replication of the large E, coli chromo- 
some. First, the rate at which it adds nucleotides (600 
nucleotides/min) is too slow (by a factor of 100 or more) 
to account for the rates at which the replication fork 
moves in the bacterial cell. Second, DNA polymerase I 
has a relatively low processivity. Third, genetic studies 
have demonstrated that many genes, and therefore 
many proteins, are involved in replication: DNA poly- 
merase I clearly does not act alone. Fourth, and most 
important, in 1969 John Cairns isolated a bacterial strain 
with an altered gene for DNA polymerase I that pro- 
duced an inactive enzyme. Although this strain was ab- 
normally sensitive to agents that damaged DNA, it was 
nevertheless viable! 

A search for other DNA polymerases led to the dis- 
covery of E. coli DNA polymerase II and DNA poly- 
merase III in the early 1970s. DNA polymerase II is an 
enzyme involved in one type of DNA repair (Section 
25.3). DNA polymerase III is the principal replication en- 
zyme in E. coli. The properties of these three DNA poly- 
merases are compared in Table 25-1. DNA polymerases 




FIGURE 25-8 Large (Klenow) fragment of DNA polymerase I. This 
polymerase is widely distributed in bacteria. The Klenow fragment, 
produced by proteolytic treatment of the polymerase, retains the poly- 
merization and proofreading activities of the enzyme. The Klenow 
fragment shown here is from the thermophilic bacterium Bacillus 
stearothermophilus (PDB ID 3BDP) The active site for addition of 
nucleotides is deep in the crevice at the far end of the bound DNA 
(blue; the dark blue strand is the template). 

IV and V, identified in 1999, are involved in an unusual 
form of DNA repair (Section 25.2). 

DNA polymerase I, then, is not the primary en- 
zyme of replication; instead it performs a host of 
clean-up functions during replication, recombination, 
and repair. The polymerase's special functions are en- 
hanced by its 5 '—>3 f exonuclease activity. This activ- 
ity, distinct from the 3'— >5' proofreading exonuclease 
(Fig. 25-7), is located in a structural domain that can 
be separated from the enzyme by mild protease treat- 
ment. When the 5'— >3' exonuclease domain is re- 
moved, the remaining fragment (M T 68,000), the large 
fragment or Klenow fragment (Fig. 25-8), retains 
the polymerization and proofreading activities. The 
5'— >3' exonuclease activity of intact DNA polymerase 
I can replace a segment of DNA (or RNA) paired to the 
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Comparison of Three DNA Polymerases off* coli 

DNA polymerase 





I 


II 


III 


Structural gene* 


polA 


polB 


polC (dnaE) 


Subunits (number of different types) 


1 


7 


>10 


M r 


103,000 


88,000+ 


791,500 


3'— »5' Exonuclease (proofreading) 


Yes 


Yes 


Yes 


5'— >3' Exonuclease 


Yes 


No 


No 


Polymerization rate (nucleotides/s) 


16-20 


40 


250-1,000 


Processivity (nucleotides added 








before polymerase dissociates) 


3-200 


1,500 


>500,000 



*For enzymes with more than one subunit, the gene listed here encodes the subunit with polymerization activity. Note that 
dnaE is an earlier designation for the gene now referred to as polC. 

Polymerization subunit only. DNA polymerase II shares several subunits with DNA polymerase III, including the 0, y, 8, 8', 
X, and i/f subunits (see Table 25-2). 
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Nick 



template strand, in a process known as nick transla- 
tion (Fig. 25-9). Most other DNA polymerases lack a 
5'— >3' exonuclease activity. 

DNA polymerase III is much more complex than 
DNA polymerase I, having 10 types of subunits (Table 
25-2). Its polymerization and proofreading activities 



FIGURE 25-9 Nick translation. In this process, an RNA or DNA strand 
paired to a DNA template is simultaneously degraded by the 5'->3' 
exonuclease activity of DNA polymerase I and replaced by the poly- 
merase activity of the same enzyme. These activities have a role in 
DNA repair and in the removal of RNA primers during replication 
(both described later). The strand of nucleic acid to be removed (either 
DNA or RNA) is shown in green, the replacement strand in red. DNA 
synthesis begins at a nick (a broken phosphodiester bond, leaving a 
free 3' hydroxyl and a free 5' phosphate). Polymerase I extends the 
nontemplate DNA strand and moves the nick along the DNA— a 
process called nick translation. A nick remains where DNA poly- 
merase I dissociates, and is later sealed by another enzyme. 

reside in its a and s (epsilon) subunits, respectively. 
The 6 subunit associates with a and s to form a core 
polymerase, which can polymerize DNA but with limited 
processivity. Two core polymerases can be linked by an- 
other set of subunits, a clamp-loading complex, or y 
complex, consisting of five subunits of four different 
types, T 2 yd8'. The core polymerases are linked through 
the r (tau) subunits. Two additional subunits, x (chi) 
and if/ (psi), are bound to the clamp-loading complex. 
The entire assembly of 13 protein subunits (nine differ- 
ent types) is called DNA polymerase III* (Fig. 25-10a). 

DNA polymerase III* can polymerize DNA, but with 
a much lower processivity than one would expect for the 
organized replication of an entire chromosome. The 
necessary increase in processivity is provided by the ad- 
dition of the /3 subunits, four of which complete the DNA 
polymerase III holoenzyme. The subunits associate in 
pairs to form donut-shaped structures that encircle the 
DNA and act like clamps (Fig. 25-1 Ob). Each dimer as- 
sociates with a core subassembly of polymerase III* 
(one dimeric clamp per core subassembly) and slides 



Subui 


lits of DMA Polyme 






Number of 








subunits per 








Subunit holoenzyme 


M Y of subunit 


Gene 


Function of subunit 



a 
e 

6 

T 

y 

8 
8* 
X 

P 



2 
2 

2 
2 

1 
1 
1 
1 

1 
4 



129,900 
27,500 

8,600 
71,100 

47,500 
38,700 
36,900 
16,600 
15,200 
40,600 



polC (dnaE') 
dnaQ (jnutU) 

holE 
dnaX 

dnaX* 
holA 
holB 
holC 
holD 
dnaN 



Polymerization activity 

3'— »5' Proofreading 
exonuclease 

Stabilization of s subunit 

Stable template binding; 
core enzyme dimerization 

Clamp loader 

Clamp opener 

Clamp loader 

Interaction with SSB 

Interaction with y and x 

DNA clamp required for 
optimal processivity 



* Core polymerase 



Clamp-loading (y) complex that 
loads 0 submiiLs on lagging strand 
at each Okazaki fragment 



*The y subunit is encoded by a portion of the gene for the t subunit, such that the amino-terminal 66% of the t subunit has the same amino acid sequence as the y subunit. The y 
subunit is generated by a translational frameshifting mechanism (see p. -) that leads to premature translational termination. 
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FIGURE 25-10 DNA polymerase 111. (a) Architecture of bacterial DNA 
polymerase III. Two core domains, composed of subunits a, e, and 0, 
are linked by a five-subunit clamp-loading complex (also known as 
the y complex) with the composition T 2 y88'. The y and r subunits are 
encoded by the same gene. The y subunit is a shortened version of the 
r subunit; r thus contains a domain identical to y along with an addi- 
tional segment that interacts with the core polymerase. The other two 
subunits of DNA polymerase III*, x and if/ (not shown), also bind to the 
clamp-loading complex. Two /3 clamps interact with the two-core sub- 
assembly, each clamp a dimer of the j8 subunit The complex interacts 
with the DnaB helicase through the r subunits. (b) Two (3 subunits of E 
coli polymerase III form a circular clamp that surrounds the DNA. The 
clamp slides along the DNA molecule, increasing the processivity of 
the polymerase III holoenzyme to greater than 500,000 nucleotides by 
preventing its dissociation from the DNA. The end view shows the two 
subunits as gray and light-blue ribbon structures surrounding a 
space-filling model of DNA. in the side view, surface contour models 
of the /3 subunits (gray) surround a stick representation of a DNA dou- 
ble helix (light and dark blue) (derived from PDB ID 2POL). 



along the DNA as replication proceeds. The /3 sliding 
clamp prevents the dissociation of DNA polymerase III 
from DNA, dramatically increasing processivity — to 
greater than 500,000 (Table 25-1). 

DNA Replication Requires Many Enzymes 
and Protein Factors 

Replication in E. coli requires not just a single DNA 
polymerase but 20 or more different enzymes and pro- 
teins, each performing a specific task. The entire com- 
plex has been termed the DNA replicase system or 
replisome. The enzymatic complexity of replication re- 
flects the constraints imposed by the structure of DNA 
and by the requirements for accuracy. The main classes 
of replication enzymes are considered here in terms of 
the problems they overcome. 

Access to the DNA strands that are to act as tem- 
plates requires separation of the two parent strands. This 
is generally accomplished by helicases, enzymes that 
move along the DNA and separate the strands, using 




Side view 
(b) 



chemical energy from ATP. Strand separation creates 
topological stress in the helical DNA structure (see Fig. 
24-12), which is relieved by the action of topoiso- 
merases. The separated strands are stabilized by DNA- 
binding proteins. As noted earlier, before DNA 
polymerases can begin synthesizing DNA, primers must 
be present on the template — generally short segments of 
RNA synthesized by enzymes known as primases. Ulti- 
mately, the RNA primers are removed and replaced by 
DNA; in E. coli, this is one of the many functions of DNA 
polymerase I. After an RNA primer is removed and the 
gap is filled in with DNA, a nick remains in the DNA back- 
bone in the form of a broken phosphodiester bond. These 
nicks are sealed by DNA ligases. All these processes re- 
quire coordination and regulation, an interplay best char- 
acterized in the E. coli system. 
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Replication of the Lcoli Chromosome Proceeds 
in Stages 

The synthesis of a DNA molecule can be divided into 
three stages: initiation, elongation, and termination, dis- 
tinguished both by the reactions taking place and by 
the enzymes required. As you will find here and in the 
next two chapters, synthesis of the major information- 
containing biological polymers — DNAs, RNAs, and 
proteins — can be understood in terms of these same 
three stages, with the stages of each pathway having 
unique characteristics. The events described below re- 
flect information derived primarily from in vitro experi- 
ments using purified E coli proteins, although the 
principles are highly conserved in all replication systems. 

Initiation The E. coli replication origin, oriC, con- 
sists of 245 bp and contains DNA sequence elements 
that are highly conserved among bacterial replication 
origins. The general arrangement of the conserved se- 
quences is illustrated in Figure 25-11. Two types of se- 
quences are of special interest: five repeats of a 9 bp 
sequence (R sites) that serve as binding sites for the key 
initiator protein DnaA, and a region rich in A=T base 
pairs called the DNA unwinding element (DUE). 
There are three additional DnaA-binding sites (I sites), 



and binding sites for the proteins IHF (integration host 
factor) and FIS (factor for inversion stimulation). These 
two proteins were discovered as required components 
of certain recombination reactions described later in 
this chapter, and their names reflect those roles. An- 
other DNA-binding protein, HU (a histonelike bacterial 
protein originally dubbed factor U), also participates but 
does not have a specific binding site. 

At least 10 different enzymes or proteins (summa- 
rized in Table 25-3) participate in the initiation phase 
of replication. They open the DNA helix at the origin 
and establish a prepriming complex for subsequent re- 
actions. The crucial component in the initiation process 
is the DnaA protein, a member of the AAA+ ATPase 
protein family (ATPases associated with diverse cellu- 
lar activities). Many AAA+ ATPases, including DnaA, 
form oligomers and hydrolyze ATP relatively slowly. 
This ATP hydrolysis acts as a switch mediating inter- 
conversion of the protein between two states. In the 
case of DnaA, the ATP-bound form is active and the 
ADP-bound form is inactive. 

Eight DnaA protein molecules, all in the ATP-bound 
state, assemble to form a helical complex encompassing 
the R and I sites in oriC (Fig. 25-12). DnaA has 
a higher affinity for the R sites than I sites, and binds 
R sites equally well in its ATP- or ADP-bound form. The 



Tandem array of 
three 13-bp sequences, 
consensus sequence 
GATCTNTTNTTTT 

r 



Binding sites for DnaA protein, 
five 9-bp sequences, 
consensus sequence 
TT(A/T)TNCACC 



1 

DUE 
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IHF 



R5 



133 FT 



FIGURE 25-11 Arrangement of sequences in the E. coli replication 
origin, or/C Consensus sequences (see p. 102) for key repeated ele- 
ments are shown N represents any of the four nucleotides. The hori- 
zontal arrows indicate the orientations of the nucleotide sequences 
(left-to-right arrow denotes sequence in top strand; right-to-left, bottom 
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strand). FIS and IHF are binding sites for proteins described in the text. 
R sites are bound by DnaA. I sites are additional DnaA-binding sites 
(with different sequences), bound by DnaA only when the protein is 
complexed with ATP. 
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DnaB 



DnaC -dependent 
loading of DnaB 
helicase 



DnaA-dependent 
denaturation of 
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FIGURE 25-12 Model for initiation of replication at the f. coli origin, 
oriC. Eight DnaA protein molecules, each with a bound ATP, bind at 
the R and 1 sites in the origin (see Fig. 25-11). The DNA is wrapped 
around this complex, which forms a right-handed helical structure. The 
A— T-rich DUE region is denatured as a result of the strain imparted by 
the adjacent DnaA binding. Formation of the helical DnaA complex is 



facilitated by the proteins HU, IHF, and FIS, which are not shown here 
because their detailed structural roles have not yet been defined. 
Hexamers of the DnaB protein bind to each strand, with the aid of 
DnaC protein. The DnaB helicase activity further unwinds the DNA in 
preparation for priming and DNA synthesis. 
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I sites, which bind only the ATP-bound DnaA, allow dis- 
crimination between the active and inactive forms of 
DnaA. The tight right-handed wrapping of the DNA 
around this complex introduces an effective positive su- 
percoil (see Chapter 24). The associated strain in the 
nearby DNA leads to denaturation in the A=T-rich DUE 
region. The complex formed at the replication origin 
also includes several DNA-binding proteins — HU, IHF, 
and FIS — that facilitate DNA bending. 

The DnaC protein, another AAA-h ATPase, then 
loads the DnaB protein onto the separated DNA strands 
in the denatured region. A hexamer of DnaC, each sub- 
unit bound to ATP, forms a tight complex with the hexa- 
meric, ring-shaped DnaB helicase. This DnaC-DnaB 
interaction opens the DnaB ring, the process being 
aided by a further interaction between DnaB and DnaA. 
Two of the ring-shaped DnaB hexamers are loaded in 
the DUE, one onto each DNA strand. The ATP bound to 
DnaC is hydrolyzed, releasing the DnaC and leaving the 
DnaB bound to the DNA. 

Loading of the DnaB helicase is the key step in repli- 
cation initiation. As a replicative helicase, DnaB migrates 
along the single-stranded DNA in the 5'— >3' direction, 
unwinding the DNA as it travels. The DnaB helicases 
loaded onto the two DNA strands thus travel in opposite 
directions, creating two potential replication forks. All 
other proteins at the replication fork are linked directly or 
indirectly to DnaB. The DNA polymerase III holoenzyme 
is linked through the r subunits; additional DnaB interac- 
tions are described below. As replication begins and the 
DNA strands are separated at the fork, many molecules of 
single-stranded DNA-binding protein (SSB) bind to and 
stabilize the separated strands, and DNA gyrase (DNA 
topoisomerase II) relieves the topological stress induced 
ahead of the fork by the unwinding reaction. 



Initiation is the only phase of DNA replication that is 
known to be regulated, and it is regulated such that 
replication occurs only once in each cell cycle. The 
mechanism of regulation is not yet entirely understood, 
but genetic and biochemical studies have provided in- 
sights into several separate regulatory mechanisms. 

Once DNA polymerase III has been loaded onto 
the DNA, along with the /3 subunits (signaling comple- 
tion of the initiation phase), the protein Hda binds to 
the jS subunits and interacts with DnaA to stimulate 
hydrolysis of its bound ATP. Hda is yet another AAA+ 
ATPase closely related to DnaA (its name is derived 
from homologous to DnaA). This ATP hydrolysis leads 
to disassembly of the DnaA complex at the origin. 
Slow release of ADP by DnaA and rebinding of ATP cy- 
cles the protein between its inactive (with bound 
ADP) and active (with bound ATP) forms on a time 
scale of 20 to 40 minutes. 

The timing of replication initiation is affected by 
DNA methylation and interactions with the bacterial 
plasma membrane. The oriC DNA is methylated by the 
Dam methylase (Table 25-3), which methylates the N 6 
position of adenine within the palindromic sequence 
(5')GATC. (Dam is not a biochemical expletive; it 
stands for DNA adenine methylation.) The oriC region 
of E. coli is highly enriched in GATC sequences — it has 
1 1 of them in its 245 bp, whereas the average frequency 
of GATC in the E. coli chromosome as a whole is 1 in 
256 bp. 

Immediately after replication, the DNA is hemimethy- 
lated: the parent strands have methylated oriC se- 
quences but the newly synthesized strands do not. The 
hemimethylated oriC sequences are now sequestered 
by interaction with the plasma membrane (the mecha- 
nism is unknown) and by the binding of the protein 
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Protein 




Number of 
subunits 


Function 


DnaA protein 


52,000 


1 


Recognizes ori sequence; opens duplex at specific 








sites in origin 


DnaB protein (helicase) 


300,000 


6* 


Unwinds DNA 


DnaC protein 


174,000 


6* 


Required for DnaB binding at origin 


HU 


19,000 


2 


Histonelike protein; DNA-binding protein; stimulates 








initiation 


FIS 


22,500 


2* 


DNA-binding protein; stimulates initiation 


IHF 


22,000 


2 


DNA-binding protein; stimulates initiation 


Primase (DnaG protein) 


60,000 


1 


Synthesizes RNA primers 


Single-stranded DNA-binding 








protein (SSB) 


75,600 


4* 


Binds single-stranded DNA 


DNA gyrase (DNA topoisomerase II) 


400,000 


4 


Relieves torsional strain generated by DNA unwinding 


Dam methylase 


32,000 


1 


Methylates (5')GATC sequences at oriC 



*Subunits in these cases are identical. 
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SeqA. After a time, oriC is released from the plasma 
membrane, SeqA dissociates, and the DNA must be fully 
methylated by Dam methylase before it can again bind 
DnaA and initiate a new round of replication. 

Elongation The elongation phase of replication in- 
cludes two distinct but related operations: leading 
strand synthesis and lagging strand synthesis. Several 
enzymes at the replication fork are important to the syn- 
thesis of both strands. Parent DNA is first unwound by 
DNA helicases, and the resulting topological stress is re- 
lieved by topoisomerases. Each separated strand is then 
stabilized by SSB. From this point, synthesis of leading 
and lagging strands is sharply different. 

Leading strand synthesis, the more straightforward 
of the two } begins with the synthesis by primase (DnaG 
protein) of a short (10 to 60 nucleotide) RNA primer at 
the replication origin. DnaG interacts with DnaB helicase 
to carry out this reaction, and the primer is synthesized in 
the direction opposite to that in which the DnaB helicase 
is moving. In effect, the DnaB helicase moves along the 
strand that becomes the lagging strand in DNA synthesis; 
however, the first primer laid down in the first DnaG- 
DnaB interaction serves to prime leading strand DNA 
synthesis in the opposite direction. Deoxyribonucleotides 
are added to this primer by a DNA polymerase III com- 
plex linked to the DnaB helicase tethered to the opposite 



DNA strand. Leading strand synthesis then proceeds con- 
tinuously keeping pace with the unwinding of DNA at the 
replication fork. 

Lagging strand synthesis, as we have noted, is ac- 
complished in short Okazaki fragments (Fig. 25-13a). 
First, an RNA primer is synthesized by primase and, as 
in leading strand synthesis, DNA polymerase III binds to 
the RNA primer and adds deoxyribonucleotides (Fig. 
25-13b). On this level, the synthesis of each Okazaki 
fragment seems straightforward, but the reality is quite 
complex. The complexity lies in the coordination of 
leading and lagging strand synthesis. Both strands are 
produced by a single asymmetric DNA polymerase III 
dimer; this is accomplished by looping the DNA of the 
lagging strand as shown in Figure 25-14, bringing to- 
gether the two points of polymerization. 

The synthesis of Okazaki fragments on the lagging 
strand entails some elegant enzymatic choreography. 
DnaB helicase and DnaG primase constitute a functional 
unit within the replication complex, the primosome. 
DNA polymerase III uses one set of its core subunits (the 
core polymerase) to synthesize the leading strand con- 
tinuously, while the other set of core subunits cycles 
from one Okazaki fragment to the next on the looped lag- 
ging strand. DnaB helicase, bound in front of DNA poly- 
merase III, unwinds the DNA at the replication fork (Fig. 
25-1 4a) as it travels along the lagging strand template 
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FIGURE 25-13 Synthesis of Okazaki fragments. 

(a) At intervals, primase synthesizes an RNA 
primer for a new Okazaki fragment. Note that if 
we consider the two template strands as lying 
side by side, lagging strand synthesis formally 
proceeds in the opposite direction from fork 
movement, (b) Each primer is extended by DNA 
polymerase HI. (c) DNA synthesis continues 
until the fragment extends as far as the primer 
of the previously added Okazaki fragment. A 
new primer is synthesized near the replication 
fork to begin the process again. 
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Continuous synthesis on 
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Leading as DNA is unwound fa the 
strand ° DaB 



Clamp-loading complex 
with open j3 sliding clamp 
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A new /J clamp is loaded onto 
the new template primer by 
the clamp loader. 
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Okazaki fragment 
is completed on the 
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FIGURE 25-14 DNA synthesis on the leading and lagging strands. 

Events at the replication fork are coordinated by a single DNA 
polymerase III dimer, in an integrated complex with DnaB heli- 
case. This figure shows the replication process already underway 
(parts (a) through (e) are discussed in the text). The lagging strand 
is looped so that DNA synthesis proceeds steadily on both the 
leading and lagging strand templates at the same time. Red arrows 
indicate the 3' end of the two new strands and the direction of 
DNA synthesis. The heavy black arrows show the direction of 
movement of the parent DNA through the complex. An Okazaki 
fragment is being synthesized on the lagging strand. The subunit 
colors and the functions of the clamp-loading complex are ex- 
plained in Figure 25-1 5, £ DNA Synthesis 




The next 0 clamp is readied 
83 Okazaki fragment 
synthesis is initiated. 



t 



Lagging strand core subunits 
are transferred to the new 
template primer and its 0 
clamp. The old ft clamp is 
discarded. 




in the 5'^3' direction. DnaG primase occasionally asso- 
ciates with DnaB helicase and synthesizes a short RNA 
primer (Fig. 25-14b). A new p sliding clamp is then posi- 
tioned at the primer by the clamp-loading complex of 
DNA polymerase III (Fig. 25-14c). When synthesis of an 
Okazaki fragment has been completed, replication halts, 
and the core subunits of DNA polymerase III dissociate 
from their sliding clamp (and from the completed 
Okazaki fragment) and associate with the new clamp 
(Fig. 25-14d, e). This initiates synthesis of a new 
Okazaki fragment. As noted earlier, the entire complex 
responsible for coordinated DNA synthesis at a replica- 



tion fork is known as the replisome. The proteins acting 
at the replication fork are summarized in Table 25-4. 

The clamp-loading complex of DNA polymerase III, 
consisting of parts of the two r subunits along with the 
y, 6, and 8' subunits, is also an AAA+ ATPase. This com- 
plex binds to ATP and to the new (3 sliding clamp. The 
binding imparts strain on the dimeric clamp, opening up 
the ring at one subunit interface (Fig. 25-15). The 
newly primed lagging strand is slipped into the ring 
through the resulting break. The clamp loader then hy- 
drolyzes ATP, releasing the (3 sliding clamp and allowing 
it to close around the DNA. 
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Proteins of the £ coli Replisome 






llUlllUCI Ul 




Protein 




subunits 


Function 






4 


Binding to single-stranded DNA 


DnaB protein (helicase) 


300,000 


6 


DNA unwinding; primosome constituent 


Primase (DnaG protein) 


60,000 


1 


RNA primer synthesis; primosome constituent 


DNA polymerase III 


791,500 


17 


New strand elongation 


DNA polymerase I 


103,000 


1 


Filling of gaps; excision of primers 


DNA ligase 


74,000 


1 


Ligation 


DNA gyrase (DNA topoisomerase II) 


400,000 


4 


Supercoiling 



Source: Modified from Kornberg,A. (1982) Supplement to DNA Replication , Table S11-2,W. H. Freeman and Company, New York. 



Clamp loader 




FIGURE 25-15 The DNA polymerase III clamp 
loader. The five subunits of the clamp-loading 
complex are the y, 8, and 5' subunits and the 
"5 1 amino-terminal domain of each r subunit (see 
Fig. 25-1 0). The complex binds to three molecules 
of ATP and to a dimeric (3 clamp. This binding 
forces the j8 clamp open at one of its two subunit 
interfaces. Hydrolysis of the bound ATP allows the 
jS clamp to close again around the DNA. 



The replisome promotes rapid DNA synthesis, 
adding —1,000 nucleotides/s to each strand (leading and 
lagging). Once an Okazaki fragment has been com- 
pleted, its RNA primer is removed and replaced with 
DNA by DNA polymerase I, and the remaining nick is 
sealed by DNA ligase (Fig. 25-16). 

DNA ligase catalyzes the formation of a phosphodi- 
ester bond between a 3' hydroxyl at the end of one 
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strand 



3' 
5' 



Nick 



NMPs 
dNTPs - 



DNA polymerase 1 



ATP (or NAD + ) 
AMP + PPi (or NMN) 



DNA ligase 



FIGURE 25-16 Final steps in the synthesis of lagging strand segments. 

RNA primers in the lagging strand are removed by the 5'— »3' exonu- 
clease activity of DNA polymerase I and are replaced with DNA by the 
same enzyme. The remaining nick is sealed by DNA ligase. The role of 
ATP or NAD + is shown in Figure 25-1 7. 



DNA strand and a 5' phosphate at the end of another 
strand. The phosphate must be activated by adenylyla- 
tion. DNA ligases isolated from viruses and eukaryotes 
use ATP for this purpose. DNA ligases from bacteria are 
unusual in that many use NAD + — a cofactor that nor- 
mally functions in hydride transfer reactions (see Fig. 
13-24) — as the source of the AMP activating group 
(Fig. 25-17). DNA ligase is another enzyme of DNA 
metabolism that has become an important reagent in 
recombinant DNA experiments (see Fig. 9-1). 

Termination Eventually, the two replication forks of 
the circular E. coli chromosome meet at a terminus re- 
gion containing multiple copies of a 20 bp sequence 
called Ter (Fig. 25-18). The Ter sequences are 
arranged on the chromosome to create a trap that a 
replication fork can enter but cannot leave. The Ter se- 
quences function as binding sites for the protein Tus 
(terminus utilization substance). The Tus-Ter complex 
can arrest a replication fork from only one direction. 
Only one Tus-Ter complex functions per replication 
cycle — the complex first encountered by either replica- 
tion fork. Given that opposing replication forks gener- 
ally halt when they collide, Ter sequences would not 
seem to be essential, but they may prevent overreplica- 
tion by one fork in the event that the other is delayed or 
halted by an encounter with DNA damage or some 
other obstacle. 

So, when either replication fork encounters a func- 
tional Tus-Ter complex, it halts; the other fork halts 
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DNA ligase 



(T) Adenylylation of 
DNA ligase 




Adenine 



AMP from ATP (R = PP^ 
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PPi (from ATP) 
or 
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FIGURE 25-17 Mechanism of the DNA ligase reaction. In 

each of the three steps, one phosphodiester bond is formed 
at the expense of another. Steps (T) and ® lead to activation 
of the 5' phosphate in the nick. An AMP group is transferred 
first to a Lys residue on the enzyme and then to the 5' phos- 
phate in the nick. In step (f), the 3'-hydroxyl group attacks 
this phosphate and displaces AMP, producing a phosphodi- 
ester bond to seal the nick. In the E. coli DNA ligase reaction, 
AMP is derived from NAD + . The DNA ligases isolated from 
some viral and eukaryotic sources use ATP rather than 
NAD + , and they release pyrophosphate rather than nicoti- 
namide mononucleotide (NMN) in step(T). 



Displacement of AMP seals nick 



3' 
5' 




Clockwise 
fork trap 

FIGURE 25-18 Termination of chromosome replication in f. coli 

TheTer sequences (TerA through TerF) are positioned on the chromo- 
some in two clusters with opposite orientations. 



FIGURE 25-19 Role of topoisomerases in replication termination. 

Replication of the DNA separating opposing replication forks leaves 
the completed chromosomes joined as catenanes, or topologically in- 
terlinked circles. The circles are not covalently linked, but because 
they are interwound and each is covalently closed, they cannot be sep- 
arated — except by the action of topoisomerases. In E. coli, a type II 
topoisomerase known as DNA topoisomerase IV plays the primary role 
in the separation of catenated chromosomes, transiently breaking both 
DNA strands of one chromosome and allowing the other chromosome 
to pass through the break. 



Catenated 
chromosomes 



J 



DNA topoisomerase IV 



Separated 
chromosomes 



25.1 DNA Replication I 991 



when it meets the first (arrested) fork. The final few 
hundred base pairs of DNA between these large protein 
complexes are then replicated (by an as yet unknown 
mechanism), completing two topologically interlinked 
(catenated) circular chromosomes (Fig. 25-19). DNA 
circles linked in this way are known as catenanes. Sep- 
aration of the catenated circles mE. coli requires topoi- 
somerase IV (a type II topoisomerase). The separated 
chromosomes then segregate into daughter cells at cell 
division. The terminal phase of replication of other circu- 
lar chromosomes, including many of the DNA viruses 
that infect eukaryotic cells, is similar. 

Replication in Eukaryotic Cells Is Both 
Similar and More Complex 

The DNA molecules in eukaryotic cells are considerably 
larger than those in bacteria and are organized into com- 
plex nucleoprotein structures (chromatin; p. 962). The 
essential features of DNA replication are the same in 
eukaryotes and bacteria, and many of the protein 
complexes are functionally and structurally conserved. 
However, eukaryotic replication is regulated and coordi- 
nated with the cell cycle, introducing some additional 
complexities. 

Origins of replication have a well-characterized 
structure in some lower eukaryotes, but they are much 
less defined in higher eukaryotes. In vertebrates, a vari- 
ety of A=T-rich sequences may be used for replication 
initiation, and the sites may vary from one cell division 
to the next. Yeast (Saccharomyces cerevisiae) has de- 
fined replication origins called autonomously replicating 
sequences (ARS), or replicators. Yeast replicators 
span —150 bp and contain several essential, conserved 
sequences. About 400 replicators are distributed among 
the 16 chromosomes of the haploid yeast genome. 

Regulation ensures that all cellular DNA is replicated 
once per cell cycle. Much of this regulation involves pro- 
teins called cyclins and the cyclin-dependent kinases 
(CDKs) with which they form complexes (p. 469) . The cy- 
clins are rapidly destroyed by ubiquitin-dependent prote- 
olysis at the end of the M phase (mitosis) , and the absence 
of cyclins allows the establishment of pre-replicative 
complexes (pre-RCs) on replication initiation sites. In 
rapidly growing cells, the pre-RC forms at the end of M 
phase. In slow-growing cells, it does not form until the end 
of Gl. Formation of the pre-RC renders the cell competent 
for replication, an event sometimes called licensing. 

As in bacteria, the key event in the initiation of 
replication in all eukaryotes is the loading of the replica- 
tive helicase, a heterohexameric complex of minichro- 
mosome maintenance (MCM) proteins (MCM2 to 
MCM7). The ring-shaped MCM2-7 helicase, functioning 
much like the bacterial DnaB helicase, is loaded onto 
the DNA by another six-protein complex called ORC 
(origin recognition complex) (Fig. 25-20). ORC 
has five AAA+ ATPase domains among its subunits and 
is functionally analogous to the bacterial DnaA. Two 
other proteins, CDC6 (cell division cycle) and CDTl 




FIGURE 25-20 Assembly of a pre-replicative complex at a eukaryotic 
replication origin. The initiation site (origin) is bound by ORC, CDC6, 
and CDT1 . These proteins, many of them AAA+ ATRases, promote load- 
ing of the replicative helicase, MCM2-7, in a reaction that is analogous 
to the loading of the bacterial DnaB helicase by DnaC protein. Loading 
of the MCM helicase complex onto the DNA forms the pre-replicative 
complex, or pre-RC, and is the key step in the initiation of replication. 

(CDClO-dependent transcript i), are also required to 
load the MCM2-7 complex, and the yeast CDC6 is an- 
other AAA+ ATPase. 

Commitment to replication requires the synthesis 
and activity of S-phase cyclin-CDK complexes (such as 
the cyclin E-CDK2 complex; see Fig. 12-45) and CDC7- 
DBF4. Both types of complexes help to activate replica- 
tion by binding to and phosphorylating several proteins 
in the pre-RC. Other cyclins and CDKs function to in- 
hibit the formation of more pre-RC complexes once 
replication has been initiated. For example, CDK2 binds 
to cyclin A as cyclin E levels decline during S phase, in- 
hibiting CDK2 and preventing the licensing of additional 
pre-RC complexes. 
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The rate of movement of the replication fork in eu- 
karyotes (—50 nucleotides/s) is only one-twentieth that 
observed inE. coli At this rate, replication of an aver- 
age human chromosome proceeding from a single origin 
would take more than 500 hours. Replication of human 
chromosomes in fact proceeds bidirectionally from 
many origins, spaced 30 to 300 kbp apart. Eukaryotic 
chromosomes are almost always much larger than bac- 
terial chromosomes, so multiple origins are probably a 
universal feature of eukaryotic cells. 

Like bacteria, eukaryotes have several types of DNA 
polymerases. Some have been linked to particular func- 
tions, such as the replication of mitochondrial DNA. The 
replication of nuclear chromosomes involves DNA poly- 
merase a, in association with DNA polymerase 8. DNA 
polymerase a is typically a multisubunit enzyme with 
similar structure and properties in all eukaryotic cells. 
One subunit has a primase activity, and the largest sub- 
unit (M T —180,000) contains the polymerization activity. 
However, this polymerase has no proofreading 3'— >5' 
exonuclease activity, making it unsuitable for high- 
fidelity DNA replication. DNA polymerase a is believed 
to function only in the synthesis of short primers (either 
RNA or DNA) for Okazaki fragments on the lagging 
strand. These primers are then extended by the multi- 
subunit DNA polymerase S. This enzyme is associated 
with and stimulated by proliferating cell nuclear antigen 
(PCNA; M T 29,000), a protein found in large amounts in 
the nuclei of proliferating cells. The three-dimensional 
structure of PCNA is remarkably similar to that of the f3 
subunit ofE. coli DNA polymerase III (Fig. 25-10b), al- 
though primary sequence homology is not evident. 
PCNA has a function analogous to that of the p subunit, 
forming a circular clamp that greatly enhances the pro- 
cessivity of the polymerase. DNA polymerase 8 has a 
3 '— »5' proofreading exonuclease activity and seems to 
carry out both leading and lagging strand synthesis in a 
complex comparable to the dimeric bacterial DNA poly- 
merase III. 

Yet another polymerase, DNA polymerase e, re- 
places DNA polymerase 8 in some situations, such as in 
DNA repair. DNA polymerase e may also function at the 
replication fork, perhaps playing a role analogous to that 
of the bacterial DNA polymerase I, removing the primers 
of Okazaki fragments on the lagging strand. 

Two other protein complexes also function in eu- 
karyotic DNA replication. RPA (replication protein A) is 
a eukaryotic single-stranded DNA-binding protein, 
equivalent in function to the E. coli SSB protein. RFC 
(replication factor C) is a clamp loader for PCNA and fa- 
cilitates the assembly of active replication complexes. 
The subunits of the RFC complex have significant se- 
quence similarity to the subunits of the bacterial clamp- 
loading (y) complex. 

The termination of replication on linear eukaryotic 
chromosomes involves the synthesis of special struc- 
tures called telomeres at the ends of each chromo- 
some, as discussed in the next chapter. 



Viral DNA Polymerases Provide Targets for Antiviral Therapy 

rMany DNA viruses encode their own DNA poly- 
merases, and some of these have become targets 
for pharmaceuticals. For example, the DNA polymerase 
of the herpes simplex virus is inhibited by acyclovir, 
a compound developed by Gertrude Elion (p. 894). 
Acyclovir consists of guanine attached to an incomplete 
ribose ring. 




Acyclovi 



It is phosphorylated by a virally encoded thymidine ki- 
nase; acyclovir binds to this viral enzyme with an affinity 
200-fold greater than its binding to the cellular thymi- 
dine kinase. This ensures that phosphorylation occurs 
mainly in virus-infected cells. Cellular kinases convert 
the resulting acyclo-GMP to acyclo-GTP, which is both 
an inhibitor and a substrate of DNA polymerases; acy- 
clo-GTP competitively inhibits the herpes DNA poly- 
merase more strongly than cellular DNA polymerases. 
Because it lacks a 3' hydroxyl, acyclo-GTP also acts as a 
chain terminator when incorporated into DNA. Thus 
viral replication is inhibited at several steps. ■ 

SUMMARY 25.1 DNA Replication 

■ Replication of DNA occurs with very high fidelity and 
at a designated time in the cell cycle. Replication is 
semiconservative, each strand acting as template 
for a new daughter strand. It is carried out in three 
identifiable phases: initiation, elongation, and 
termination. The process starts at a single origin in 
bacteria and usually proceeds bidirectionally. 

■ DNA is synthesized in the 5'— >3' direction by DNA 
polymerases. At the replication fork, the leading 
strand is synthesized continuously in the same 
direction as replication fork movement; the lagging 
strand is synthesized discontinuously as Okazaki 
fragments, which are subsequently ligated. 

■ The fidelity of DNA replication is maintained by 
(1) base selection by the polymerase, (2) a 3'— >5' 
proofreading exonuclease activity that is part of 
most DNA polymerases, and (3) specific repair 
systems for mismatches left behind after 
replication. 

■ Most cells have several DNA polymerases. InE. coli, 
DNA polymerase III is the primary replication enzyme. 
DNA polymerase I is responsible for special functions 
during replication, recombination, and repair. 

■ The separate initiation, elongation, and termination 
phases of DNA replication involve an array of 
enzymes and protein factors, many belonging to 
the AAA 4- ATPase family. 



25.2 DNA Repair [993] 



■ The replication proteins in bacteria are organized 
into replication factories, in which template DNA is 
spooled through two replisomes tethered to the 
bacterial plasma membrane. 

25.2 DNA Repair 

Most cells have only one or two sets of genomic DNA. 
Damaged proteins and RNA molecules can be quickly re- 
placed by using information encoded in the DNA, but DNA 
molecules themselves are irreplaceable. Maintaining the 
integrity of the information in DNA is a cellular imperative, 
supported by an elaborate set of DNA repair systems. 
DNA can become damaged by a variety of processes, some 
spontaneous, others catalyzed by environmental agents 
(Chapter 8). Replication itself can very occasionally dam- 
age the information content in DNA when errors intro- 
duce mismatched base pairs (such as G paired with T). 

The chemistry of DNA damage is diverse and com- 
plex. The cellular response to this damage includes a 
wide range of enzymatic systems that catalyze some of 
the most interesting chemical transformations in DNA 
metabolism. We first examine the effects of alterations in 
DNA sequence and then consider specific repair systems. 

Mutations Are Linked to Cancer 

0The best way to illustrate the importance of DNA 
repair is to consider the effects of unrepaired DNA 
damage (a lesion). The most serious outcome is a change 
in the base sequence of the DNA, which, if replicated and 
transmitted to future generations of cells, becomes perma- 
nent. A permanent change in the nucleotide sequence of 
DNA is called a mutation. Mutations can involve the re- 
placement of one base pair with another (substitution mu- 
tation) or the addition or deletion of one or more base pairs 
(insertion or deletion mutations). If the mutation affects 
nonessential DNA or if it has a negligible effect on the func- 
tion of a gene, it is known as a silent mutation. Rarely, a 
mutation confers some biological advantage. Most nonsi- 
lent mutations, however, are neutral or deleterious. 

In mammals there is a strong correlation between 
the accumulation of mutations and cancer. A simple test 
developed by Bruce Ames measures the potential of a 
given chemical compound to promote certain easily 
detected mutations in a specialized bacterial strain (Fig. 
25-21). Few of the chemicals that we encounter in daily 
life score as mutagens in this test. However, of the com- 
pounds known to be carcinogenic from extensive animal 
trials, more than 90% are also found to be mutagenic in 
the Ames test. Because of this strong correlation be- 
tween mutagenesis and carcinogenesis, the Ames test 
for bacterial mutagens is widely used as a rapid and inex- 
pensive screen for potential human carcinogens. 

The genome of a typical mammalian cell accumu- 
lates many thousands of lesions during a 24-hour period. 
However, as a result of DNA repair, fewer than 1 in 1 ,000 
become a mutation. DNA is a relatively stable molecule, 




FIGURE 25-21 Ames test for carcinogens, based on their mutagenicity. 

A strain of Salmonella typhimurium having a mutation that inactivates 
an enzyme of the histidine biosynthetic pathway is plated on a 
histidine-free medium. Few cells grow, (a) The few small colonies of 
5. typhimurium that do grow on a histidine-free medium carry sponta- 
neous back-mutations that permit the histidine biosynthetic pathway to 
operate. Three identical nutrient plates (b), (c), and (d) have been 
inoculated with an equal number of cells. Each plate then receives 
a disk of filter paper containing progressively lower concentrations of a 
mutagen. The mutagen greatly increases the rate of back-mutation and 
hence the number of colonies. The clear areas around the filter paper 
indicate where the concentration of mutagen is so high that it is lethal 
to the cells. As the mutagen diffuses away from the filter paper, it is 
diluted to sublethal concentrations that promote back-mutation. 
Mutagens can be compared on the basis of their effect on mutation 
rate. Because many compounds undergo a variety of chemical trans- 
formations after entering cells, compounds are sometimes tested 
for mutagenicity after first incubating them with a liver extract. Some 
substances have been found to be mutagenic only after this treatment. 

but in the absence of repair systems, the cumulative 
effect of many infrequent but damaging reactions would 
make life impossible. ■ 

All Cells Have Multiple DNA Repair Systems 

The number and diversity of repair systems reflect both the 
importance of DNA repair to cell survival and the diverse 
sources of DNA damage (Table 25-5). Some common types 
of lesions, such as pyriirridine dimers (see Fig. 8-31), can be 
repaired by several distinct systems. Many DNA repair 
processes also seem to be extraordinarily inefficient ener- 
getically — an exception to the pattern observed in the vast 
majority of metabolic pathways, where every ATP is gener- 
ally accounted for and used optimally. When the integrity of 
the genetic information is at stake, the amount of chemical 
energy invested in a repair process seems almost irrelevant. 

DNA repair is possible largely because the DNA mol- 
ecule consists of two complementary strands. DNA dam- 
age in one strand can be removed and accurately 
replaced by using the undamaged complementary strand 




DNA Metabolism 



TABLE 25-5 



Types of DNA Repair Systems in £ co 

Enzymes/proteins Type of damage 



Mismatch repair 

Dam methylase 

MutH, MutL, MutS proteins 

DNA helicase II 

SSB 

DNA polymerase III 
Exonuclease I 
Exonuclease VII 
RecJ nuclease 
Exonuclease X 
DNA ligase 

Base-excision repair 

DNA glycosylases 
AP endonucleases 
DNA polymerase I 
DNA ligase 

Nucleotide-excision repair 

ABC excinuclease 
DNA polymerase I 
DNA ligase 

Direct repair 

DNA photolyases 

0 6 -Methylguanine-DNA 
methyltransferase 

AlkB protein 



> Misrnai el n- i s 



Abnormal bases (uracil, 
hypoxanthine, xanthine); 
alkylated bases; in some 
other organisms, 
pyrinudine dimers 



DNA lesions that cause 
large structural changes 
(e.g., pyrimidine dimers) 

Pyrimidine dimers 
0 6 -Methylguanine 

1-Methylguanine, 
3-methylcytosine 



as a template. We consider here the principal types of re- 
pair systems, beginning with those that repair the rare 
nucleotide mismatches that are left behind by replication. 

Mismatch Repair Correction of the rare mismatches 
left after replication in E. coli improves the overall fidelity 
of replication by an additional factor of 10 2 to 10 3 . The 
mismatches are nearly always corrected to reflect the in- 
formation in the old (template) strand, so the repair sys- 
tem must somehow discriminate between the template 
and the newly synthesized strand. The cell accomplishes 
this by tagging the template DNA with methyl groups to 
distinguish it from newly synthesized strands. The mis- 
match repair system ofE. coli includes at least 12 protein 
components (Table 25-5) that function either in strand 
dLscrimination or in the repair process itself. 

The strand discrimination mechanism has not been 
worked out for most bacteria or eukaryotes, but is well 
understood for E. coli and some closely related bacter- 
ial species. In these bacteria, strand discrimination is 
based on the action of Dam methylase, which, as you will 
recall, methylates DNA at the N 6 position of all adenines 
within (5')GATC sequences. Immediately after passage 
of the replication fork, there is a short period (a few sec- 
onds or minutes) during which the template strand is 
methylated but the newly synthesized strand is not 
(Fig. 25-22). The transient unmethylated state of 
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FIGURE 25-22 Methylation and mismatch repair. Methylation of DNA 
strands can serve to distinguish parent (template) strands from newly 
synthesized strands in E. coli DNA, a function that is critical to 
mismatch repair (see Fig. 25-23). The methylation occurs at the N 6 of 
adenines in (5')GATC sequences. This sequence is a palindrome (see 
Fig. 8-1 8), present in opposite orientations on the two strands. 
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GATC sequences in the newly synthesized strand per- 
mits the new strand to be distinguished from the tem- 
plate strand. Replication mismatches in the vicinity of a 
hemimethylated GATC sequence are then repaired ac- 
cording to the information in the methylated parent 
(template) strand. Tests in vitro show that if both 
strands are methylated at a GATC sequence, few mis- 
matches are repaired; if neither strand is methylated, 
repair occurs but does not favor either strand. The cell's 
methyl-directed mismatch repair system efficiently re- 
pairs mismatches up to 1,000 bp from a hemimethylated 
GATC sequence. 

How is the mismatch correction process directed 
by relatively distant GATC sequences? A mechanism is 
illustrated in Figure 25-23. MutL protein forms a 
complex with MutS protein, and the complex binds to 
all mismatched base pairs (except C-C). MutH protein 
binds to MutL and to GATC sequences encountered by 
the MutL-MutS complex. DNA on both sides of the mis- 
match is threaded through the MutL-MutS complex, 
creating a DNA loop; simultaneous movement of both 
legs of the loop through the complex is equivalent to 
the complex moving in both directions at once along 
the DNA. MutH has a site-specific endonuclease activ- 
ity that is inactive until the complex encounters a 
hemimethylated GATC sequence. At this site, MutH 
catalyzes cleavage of the unmethylated strand on the 
5' side of the G in GATC, which marks the strand for re- 
pair. Further steps in the pathway depend on where 
the mismatch is located relative to this cleavage site 
(Fig. 25-24). 

When the mismatch is on the 5' side of the cleav- 
age site (Fig. 25-24, right side), the unmethylated 
strand is unwound and degraded in the 3'— >5' direc- 
tion from the cleavage site through the mismatch, and 
this segment is replaced with new DNA. This process 
requires the combined action of DNA helicase II, SSB, 
exonuclease I or exonuclease X (both of which de- 
grade strands of DNA in the 3'->5' direction), DNA 
polymerase III, and DNA ligase. The pathway for re- 
pair of mismatches on the 3' side of the cleavage site 
is similar (Fig. 25-24, left), except that the exonucle- 
ase is either exonuclease VII (which degrades single- 
stranded DNA in the 5'— >3' or 3'— >5' direction) or 
RecJ nuclease (which degrades single-stranded DNA 
in the 5'— »3' direction). 

Mismatch repair is a particularly expensive process 
for E. coli in terms of energy expended. The mismatch 
may be 1,000 bp or more from the GATC sequence. The 
degradation and replacement of a strand segment of this 
length require an enormous investment in activated de- 
oxynucleotide precursors to repair a single mismatched 
base. This again underscores the importance to the cell 
of genomic integrity. 

All eukaryotic cells have several proteins struc- 
turally and functionally analogous to the bacterial MutS 
and MutL (but not MutH) proteins. Alterations in hu- 
man genes encoding proteins of this type produce some 
of the most common inherited cancer-susceptibility 
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FIGURE 25-23 A model for the early steps of methyl-directed mis- 
match repair. The proteins involved in this process in £. coli have 
been purified (see Table 25-5). Recognition of the sequence 
(5')GATC and of the mismatch are specialized functions of the MutH 
and MutS proteins, respectively. The MutL protein forms a complex 
with MutS at the mismatch. DNA is threaded through this complex 
such that the complex moves simultaneously in both directions along 
the DNA until it encounters a MutH protein bound at a hemimethy- 
lated GATC sequence, MutH cleaves the unmethylated strand on the 
5' side of the G in this sequence. A complex consisting of DNA he- 
licase II and one of several exonucleases then degrades the un- 
methylated DNA strand from that point toward the mismatch (see 
Fig. 25-24). 
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FIGURE 25-24 Completing methyl-directed mismatch repair. The 

combined action of DNA helicase II, SSB, and one of four different 
exonucleases removes a segment of the new strand between the MutH 
cleavage site and a point just beyond the mismatch. The exonuclease 
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that is used depends on the location of the cleavage site relative to the 
mismatch, as shown by the alternative pathways here. The resulting 
gap is filled in (dashed line) by DNA polymerase III, and the nick is 
sealed by DNA ligase (not shown). 



syndromes (see Box 25-1, p. 1003), further demonstrat- 
ing the value to the organism of DNA repair systems. 
The main MutS homologs in most eukaryotes, from 
yeast to humans, are MSH2 (MutS ^omolog 2), MSH3, 
and MSH6. Heterodimers of MSH2 and MSH6 generally 
bind to single base-pair mismatches, and bind less well 
to slightly longer mispaired loops. In many organisms 
the longer mismatches (2 to 6 bp) may be bound in- 
stead by a heterodimer of MSH2 and MSH3, or are 
bound by both types of heterodimers in tandem. Ho- 
mologs of MutL, predominantly a heterodimer of MLH1 
(AfutL ^omolog i) and PMS1 (post-meiotic segrega- 
tion), bind to and stabilize the MSH complexes. Many 
details of the subsequent events in eukaryotic mis- 
match repair remain to be worked out. In particular, we 
do not know the mechanism by which newly synthe- 
sized DNA strands are identified, although research has 
revealed that this strand identification does not involve 
GATC sequences. 

Base-Excision Repair Every cell has a class of en- 
zymes called DNA glycosylases that recognize particu- 
larly common DNA lesions (such as the products of 
cytosine and adenine deamination; see Fig. 8-30a) and 
remove the affected base by cleaving the A^-glycosyl 
bond. This cleavage creates an apurinic or apyrimidinic 



site in the DNA, commonly referred to as an AP site or 
abasic site. Each DNA glycosylase is generally specific 
for one type of lesion. 

Uracil DNA glycosylases, for example, found in 
most cells, specifically remove from DNA the uracil 
that results from spontaneous deamination of cytosine. 
Mutant cells that lack this enzyme have a high rate of 
G=C to A=T mutations. This glycosylase does not re- 
move uracil residues from RNA or thymine residues 
from DNA. The capacity to distinguish thymine from 
uracil, the product of cytosine deamination — neces- 
sary for the selective repair of the latter — may be one 
reason why DNA evolved to contain thymine instead of 
uracil (p. 289). 

Most bacteria have just one type of uracil DNA glyco- 
sylase, whereas humans have at least four types, with dif- 
ferent specificities — an indicator of the importance of 
uracil removal from DNA. The most abundant human 
uracil glycosylase, UNG, is associated with the human 
replisome, where it eliminates the occasional U residue 
inserted in place of a T during replication. The deamina- 
tion of C residues is 100-fold faster in single-stranded 
DNA than in double-stranded DNA, and humans have the 
enzyme hSMUGl, which removes any U residues that oc- 
cur in single-stranded DNA during replication or tran- 
scription. Two other human DNA glycosylases, TDG and 
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MBD4, remove either U or T residues paired with G, gen- 
erated by deamination of cytosine or 5-methylcytosine, 
respectively. 

Other DNA glycosylases recognize and remove a va- 
riety of damaged bases, including formamidopyrimidine 
and 8-hydroxyguanine (both arising from purine oxida- 
tion), hypoxanthine (arising from adenine deamina- 
tion), and alkylated bases such as 3-methyladenine and 
7-methylguanine. Glycosylases that recognize other le- 
sions, including pyrimidine dimers, have also been iden- 
tified in some classes of organisms. Remember that AP 
sites also arise from the slow, spontaneous hydrolysis of 
the Af-glycosyl bonds in DNA (see Fig. 8-30b). 

Once an AP site has been formed by a DNA glyco- 
sylase, another type of enzyme must repair it. The re- 
pair is not made by simply inserting a new base and 
re-forming the A^-glycosyl bond. Instead, the deoxyri- 
bose 5 '-phosphate left behind is removed and replaced 
with a new nucleotide. This process begins with one of 
the AP endonucleases, enzymes that cut the DNA 
strand containing the AP site. The position of the inci- 
sion relative to the AP site (5' or 3' to the site) varies 
with the type of AP endonuclease. A segment of DNA 
including the AP site is then removed, DNA polymerase 
I replaces the DNA, and DNA ligase seals the remaining 
nick (Fig. 25-25). In eukaryotes, nucleotide replace- 
ment is carried out by specialized polymerases, as 
described below. 

Nucleotide-Excision Repair DNA lesions that cause 
large distortions in the helical structure of DNA generally 
are repaired by the nucleotide-excision system, a repair 
pathway critical to the survival of all free-living organ- 
isms. In nucleotide-excision repair (Fig. 25-26), a 
multisubunit enzyme (excinuclease) hydrolyzes two 
phosphodiester bonds, one on either side of the distor- 
tion caused by the lesion. In E. coli and other bacteria, 
the enzyme system hydrolyzes the fifth phosphodiester 
bond on the 3' side and the eighth phosphodiester bond 
on the 5' side to generate a fragment of 12 to 13 nu- 
cleotides (depending on whether the lesion involves one 
or two bases). In humans and other eukaryotes, the en- 
zyme system hydrolyzes the sixth phosphodiester bond 
on the 3' side and the twenty-second phosphodiester 
bond on the 5' side, producing a fragment of 27 to 29 
nucleotides. Following the dual incision, the excised 
oligonucleotides are released from the duplex and the 
resulting gap is filled — by DNA polymerase I in E. coli 
and DNA polymerase e in humans. DNA ligase seals 
the nick. 

In E. coli, the key enzymatic complex is the ABC 
excinuclease, which has three subunits, UvrA (M r 
104,000), UvrB (M T 78,000), and UvrC (M T 68,000). The 
term "excinuclease" is used to describe the unique ca- 
pacity of this enzyme complex to catalyze two specific 
endonucleolytic cleavages, distinguishing this activity 
from that of standard endonucleases. A complex of the 
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FIGURE 25-25 DNA repair by the base-excision repair pathway. 

0 A DNA glycosylase recognizes a damaged base and cleaves be- 
tween the base and deoxyribose in the backbone. @ An AP endonu- 
clease cleaves the phosphodiester backbone near the AP site. (3) DNA 
polymerase I initiates repair synthesis from the free 3' hydroxyl at the 
nick, removing (with its 5'->3' exonuclease activity) and replacing a 
portion of the damaged strand. ©The nick remaining after DNA poly- 
merase I has dissociated is sealed by DNA ligase. 



UvrA and UvrB proteins (A 2 B) scans the DNA and binds 
to the site of a lesion. The UvrA dimer then dissociates, 
leaving a tight UvrB-DNA complex. UvrC protein then 
binds to UvrB, and UvrB makes an incision at the fifth 
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FIGURE 25-26 Nucleotide-excision repair in E. coli and humans. 

The general pathway of nucleotide-excision repair is similar in all 
organisms. © An excinuclease binds to DNA at the site of a bulky 
lesion and cleaves the damaged DNA strand on either side of the 
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lesion. @ The DNA segment— of 13 nucleotides (13 mer) or 29 nu- 
cleotides (29 mer) — is removed with the aid of a helicase. ©The 
gap is filled in by DNA polymerase, and ® the remaining nick is 
sealed with DNA ligase. 



phosphodiester bond on the 3' side of the lesion. This 
is followed by a UvrC-mediated incision at the eighth 
phosphodiester bond on the 5' side. The resulting 12 
to 13 nucleotide fragment is removed by UvrD heli- 
case. The short gap thus created is filled in by DNA 
polymerase I and DNA ligase. This pathway (Fig. 
25-26, left) is a primary repair route for many types 
of lesions, including cyclobutane pyrimidine dimers, 
6-4 photoproducts (see Fig. 8-31), and several other 
types of base adducts including benzo[a]pyrene-gua- 
nine, which is formed in DNA by exposure to ciga- 
rette smoke. The nucleolytic activity of the ABC 
excinuclease is novel in the sense that two cuts are 
made in the DNA. 

The mechanism of eukaryotic excinucleases is quite 
similar to that of the bacterial enzyme, although 16 
polypeptides with no similarity to the E. coli excinucle- 
ase subunits are required for the dual incision. As de- 
scribed in Chapter 26, some of the nucleotide-excision 
repair and base-excision repair in eukaryotes is closely 
tied to transcription. Genetic deficiencies in nucleotide- 



excision repair in humans give rise to a variety of serious 
diseases (see Box 25-1). 

Direct Repair Several types of damage are re- 
paired without removing a base or nucleotide. The best- 
characterized example is direct photoreactivation of 
cyclobutane pyrimidine dimers, a reaction promoted 
by DNA photolyases. Pyrimidine dimers result from a 
UV-induced reaction, and photolyases use energy 
derived from absorbed light to reverse the damage 
(Fig. 25-27). Photolyases generally contain two cof ac- 
tors that serve as light-absorbing agents, or chro- 
mophores. One of the chromophores is always FADH". 
In E. coli and yeast, the other chromophore is a folate. 
The reaction mechanism entails the generation of free 
radicals. DNA photolyases are not present in the cells of 
placental mammals (which include humans). 

Additional examples can be seen in the repair of 
nucleotides with alkylation damage. The modified 
nucleotide 0 6 -methylguanine forms in the presence of 
alkylating agents and is a common and highly mutagenic 
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MECHANISM FIGURE 25-27 Repair of pyrimidine dimers with pho- 
tolyase. Energy derived from absorbed light is used to reverse the pho- 
toreaction that caused the lesion. The two chromophores in E, coli 
photolyase (M r 54,000), ^N 10 -methenyltetrahydrofolylpolygluta- 
mate (MTHFpolyGlu) and FADH~, perform complementary functions. 



Monomelic pyrimidines 
in repaired DNA 

MTHFpolyGlu functions as a photoantenna to absorb blue-light photons. 
The excitation energy passes to FADH", and the excited flavin 
(*FADH") donates an electron to the pyrimidine dimer, leading to the 
rearrangement as shown. 



lesion (p. 292). It tends to pair with thymine rather than 
cytosine during replication, and therefore causes G=C 
to A=T mutations (Fig. 25-28). Direct repair of O 6 - 
methylguanine is carried out by 0 6 -methylguanine-DNA 
methyltransferase, a protein that catalyzes transfer of 
the methyl group of 0 6 -methylguanine to one of its own 
Cys residues. This methyltransferase is not strictly an 
enzyme, because a single methyl transfer event perma- 
nently methylates the protein, making it inactive in this 
pathway. The consumption of an entire protein mole- 
cule to correct a single damaged base is another vivid il- 
lustration of the priority given to maintaining the 
integrity of cellular DNA. 
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FIGURE 25-28 Example of how DNA damage results in mutations. 

(a) The methylation product 0 6 -methylguanine pairs with thymine 
rather than cytosine. (b) If not repaired, this leads to a G=C to A=T 
mutation after replication. 

A very different but equally direct mechanism is 
used to repair 1-methyladenine and 3-methylcytosine. 
The amino groups of A and C residues are sometimes 
methylated when the DNA is single-stranded, and the 
methylation directly affects proper base pairing. In E. 
coli, oxidative demethylation of these alkylated nu- 
cleotides is mediated by the AlkB protein, a member of 
the a-ketoglutarate-Fe 2+ -dependent dioxygenase su- 
perfamily (Fig. 25-29). (See Box 4-3 for a description 
of another member of this enzyme family.) 

FIGURE 25-29 Direct repair of alkylated bases by AlkB. The AlkB 
protein is an a-ketoglutarate-Fe 2+ -dependent dioxygenase. It catalyzes 
the oxidative demethylation of 1-methyladenine and 3-methylcytosine 
residues. 
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The interaction of Replication Forks with DNA Damage 
Can Lead to Error-Prone Transiesion DNA Synthesis 

The repair pathways considered to this point generally 
work only for lesions in double-stranded DNA, the undam- 
aged strand providing the correct genetic information to 
restore the damaged strand to its original state. However, 
in certain types of lesions, such as double-strand breaks, 
double-strand cross-links, or lesions in a single-stranded 
DNA, the complementary strand is itself damaged or is ab- 
sent. Double-strand breaks and lesions in single-stranded 
DNA most often arise when a replication fork encounters 
an unrepaired DNA lesion (Fig. 25-30). Such lesions and 
DNA cross-links can also result from ionizing radiation and 
oxidative reactions. 

At a stalled bacterial replication fork, there are two 
avenues for repair. In the absence of a second strand, the 
information required for accurate repair must come from 
a separate, homologous chromosome. The repair system 
thus involves homologous genetic recombination. This 
recombinational DNA repair is considered in detail in 
Section 25.3. Under some conditions, a second repair 
pathway, error-prone translesion DNA synthesis 
(often abbreviated TLS), becomes available. When this 
pathway is active, DNA repair becomes significantly less 
accurate and a high mutation rate can result. In bacteria, 
error-prone translesion DNA synthesis is part of a cellu- 
lar stress response to extensive DNA damage known, ap- 
propriately enough, as the SOS response. Some SOS 
proteins, such as the UvrA and UvrB proteins already de- 
scribed (Table 25-6), are normally present in the cell but 
are induced to higher levels as part of the SOS response. 
Additional SOS proteins participate in the pathway for 
error-prone repair; these include the UmuC and UmuD 
proteins ("Umu" from immutable; lack of the umu gene 
function eliminates error-prone repair). The UmuD pro- 
tein is cleaved in an SOS-regulated process to a shorter 
form called UmuD', which forms a complex with UmuC 



to create a specialized DNA polymerase, DNA poly- 
merase V, that can replicate past many of the DNA le- 
sions that would normally block replication. Proper base 
pairing is often impossible at the site of such a lesion, so 
this translesion replication is error-prone. 

Given the emphasis on the importance of genomic in- 
tegrity throughout this chapter, the existence of a system 
that increases the rate of mutation may seem incongru- 
ous. However, we can think of this system as a despera- 
tion strategy. The umuC and umuD genes are fully 
induced only late in the SOS response, and they are not 
activated for translesion synthesis initiated by UmuD 
cleavage unless the levels of DNA damage are particularly 
high and all replication forks are blocked. The mutations 
resulting from DNA polymerase V-mediated replication 
kill some cells and create deleterious mutations in others, 
but this is the biological price a species pays to overcome 
an otherwise insurmountable barrier to replication, as it 
permits at least a few mutant daughter cells to survive. 

In addition to DNA polymerase V, translesion repli- 
cation requires the RecA protein. RecA filaments bound 
to single-stranded DNA at one chromosomal location 
can activate DNA polymerase V complexes bound at dis- 
tant sites on the chromosome. This has been described 
as acting "in trans," a phenomenon aided by looping of 
the chromosome that brings distant sites adjacent to 
each other. Yet another DNA polymerase, DNA poly- 
merase IV, is also induced during the SOS response. 
Replication by DNA polymerase IV, a product of the 
dinB gene, is also highly error-prone. The bacterial DNA 
polymerases IV and V are part of a family of TLS poly- 
merases found in all organisms. These enzymes lack a 
proofreading exonuclease activity, and the fidelity of 
base selection during replication can be reduced by a 
factor of 10 2 , lowering overall replication fidelity to one 
error in —1,000 nucleotides. 

Mammals have many low-fidelity DNA polymerases 
of the TLS polymerase family. However, the presence of 



Unrepaired 
lesion 



Unrepaired 
break 



y 



i 



1 



Single-stranded 
DNA 



i 



Double-strand 
break 



/ 



Recombinational 
DNA repair or 
error-prone repair 



Recombinational 
DNA repair 



FIGURE 25-30 DNA damage and its effect on DNA 
replication. If the replication fork encounters an un- 
repaired lesion or strand break, replication generally 
halts and the fork may collapse. A lesion is left be- 
hind in an unreplicated, single-stranded segment of 
the DNA (left); a strand break becomes a double- 
strand break (right). In each case the damage to one 
strand cannot be repaired by mechanisms described 
earlier in this chapter, because the complementary 
strand required to direct accurate repair is damaged 
or absent. There are two possible avenues for repair: 
recombinational DNA repair (described in Fig. 
25-39) or, when lesions are unusually numerous, 
error-prone repair. The latter mechanism involves a 
novel DNA polymerase (DNA polymerase V, en- 
coded by the umuC and umuD genes) that can repli- 
cate, albeit inaccurately, over many types of lesions. 
The repair mechanism is "error-prone" because mu- 
tations often result 
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TABLE 25-6 



Gene name 



Induced as Part of the SOS Response in £ coli 

Protein encoded and/or role in DNA repair 



Genes of known function 

polB ( dinA ) 

uvrA 
uvrB J 
umuC j 
umuD j 
sulA 
recA 
dinB 
ssb 
himA 

Genes involved in DNA metabolism, 
but role in DNA repair unknown 

uvrD 

recN 

Genes of unknown function 

dinD 
dinF 



Encodes polymerization subunit of DNA polymerase II, required for replication 
restart in recombinational DNA repair 

Encode ABC excinuclease subunits UvrA and UvrB 



Encode DNA polymerase V 

Encodes protein that inhibits cell division, possibly to allow time for DNA repair 
Encodes RecA protein, required for error-prone repair and recombinational repair 
Encodes DNA polymerase IV 

Encodes single-stranded DNA-binding protein (SSB) 

Encodes subunit of integration host factor (IHF), involved in site-specific 
recombination, replication, transposition, regulation of gene expression 



Encodes DNA helicase II (DNA-unwinding protein) 
Required for recombinational repair 



Note: Some of these genes and their functions are further discussed in Chapter 28. 

these enzymes does not necessarily translate into an un- 
acceptable mutational burden, because most of these en- 
zymes also have specialized functions in DNA repair. 
DNA polymerase rj (eta), for example, found in all eu- 
karyotes, promotes translesion synthesis primarily 
across cyclobutane T— T dimers. Few mutations result 
in this case, because the enzyme preferentially inserts 
two A residues across from the linked T residues. Sev- 
eral other low-fidelity polymerases, including DNA poly- 
merases /3, l (iota), and A, have specialized roles in 
eukaryotic base-excision repair. Each of these enzymes 
has a 5'-deoxyribose phosphate lyase activity in addi- 
tion to its polymerase activity. After base removal by a 
glycosylase and backbone cleavage by an AP endonucle- 
ase, these polymerases remove the abasic site (a 5'-de- 
oxyribose phosphate) and fill in the very short gap. The 
frequency of mutation due to DNA polymerase 77 activ- 
ity is minimized by the very short lengths (often one nu- 
cleotide) of DNA synthesized. 

What emerges from research into cellular DNA repair sys- 
tems is a picture of a DNA metabolism that maintains ge- 
nomic integrity with multiple and often redundant systems. 
In the human genome, more than 130 genes encode pro- 
teins dedicated to the repair of DNA. In many cases, the loss 
of function of one of these proteins results in genomic insta- 
bility and an increased occurrence of oncogenesis (Box 
25-1). These repair systems are often integrated with the 
DNA replication systems and are complemented by recom- 
bination systems, which we turn to next. 



SUMMARY 25.2 DNA Repair 

■ Cells have many systems for DNA repair. 
Mismatch repair in E, coli is directed by transient 
nonmethylation of (5')GATC sequences on the 
newly synthesized strand. 

■ Base-excision repair systems recognize and repair 
damage caused by environmental agents (such as 
radiation and alkylating agents) and spontaneous 
reactions of nucleotides. Some repair systems 
recognize and excise only damaged or incorrect 
bases, leaving an AP (abasic) site in the DNA. This 
is repaired by excision and replacement of the DNA 
segment containing the AP site. 

■ Nucleotide-excision repair systems recognize and 
remove a variety of bulky lesions and pyrimidine 
dimers. They excise a segment of the DNA strand 
including the lesion, leaving a gap that is filled in by 
DNA polymerase and ligase activities. 

■ Some DNA damage is repaired by direct reversal of 
the reaction causing the damage: pyrimidine dimers 
are directly converted to monomeric pyrimidines 
by a photolyase, and the methyl group of O 6 - 
methylguanine is removed by a methyltransferase. 

■ In bacteria, error-prone translesion DNA synthesis, 
involving TLS DNA polymerases, occurs in 
response to very heavy DNA damage. In eukaryotes, 
similar polymerases have specialized roles in DNA 
repair that minimize the introduction of mutations. 



25.2 DNA Repair [1003] 



MEDICINE 



DNA Repair and Cancer 



BOX 2 




Human cancers develop when genes that regulate nor- 
mal cell division (oncogenes and tumor suppressor 
genes; Chapter 12) fail to function, are activated at the 
wrong time, or are altered. As a consequence, cells may 
grow out of control and form a tumor. The genes con- 
trolling cell division can be damaged by spontaneous 
mutation or overridden by the invasion of a tumor virus 
(Chapter 26) . Not surprisingly, alterations in DNA repair 
genes that result in an increased rate of mutation can 
greatly increase an individual's susceptibility to cancer. 
Defects in the genes encoding the proteins involved in 
nucleotide-excision repair, mismatch repair, recombina- 
tional repair, and error-prone translesion DNA synthesis 
have all been linked to human cancers. Clearly, DNA 
repair can be a matter of life and death. 

Nucleotide-excision repair requires a larger num- 
ber of proteins in humans than in bacteria, although 
the overall pathways are very similar. Genetic defects 
that inactivate nucleotide-excision repair have been 
associated with several genetic diseases, the best- 
studied of which is xeroderma pigmentosum (XP). 
Because nucleotide-excision repair is the sole repair 
pathway for pyrimidine dimers in humans, people 
with XP are extremely sensitive to light and readily 
develop sunlight-induced skin cancers. Most people 
with XP also have neurological abnormalities, pre- 
sumably because of their inability to repair certain le- 
sions caused by the high rate of oxidative metabolism 
in neurons. Defects in the genes encoding any of at 
least seven different protein components of the nu- 
cleotide-excision repair system can result in XP, giv- 
ing rise to seven different genetic groups denoted 
XPA to XPG. Several of these proteins (notably those 
defective in XPB, XPD, and XPG) also play roles in 
transcription-coupled base-excision repair of oxida- 
tive lesions, described in Chapter 26. 

Most microorganisms have redundant pathways for 
the repair of cyclobutane pyrimidine dimers — making 
use of DNA photolyases and sometimes base-excision 
repair as alternatives to nucleotide-excision repair — but 
humans and other placental mammals do not. This lack 
of a back-up for nucleotide-excision repair for removing 
pyrimidine dimers has led to speculation that early 
mammalian evolution involved small, furry, nocturnal 
animals with little need to repair UV damage. However, 
mammals do have a pathway for the translesion bypass 
of cyclobutane pyrimidine dimers, which involves DNA 



polymerase 17. This enzyme preferentially inserts two A 
residues opposite a T-T pyrimidine dimer, minimizing 
mutations. People with a genetic condition in which 
DNA polymerase 17 function is missing exhibit an XP-like 
illness known as XP-variant or XP-V. Clinical manifesta- 
tions of XP-V are similar to those of the classic XP dis- 
eases, although mutation levels are higher in XP-V when 
cells are exposed to UV light. Apparently, the nu- 
cleotide-excision repair system works in concert with 
DNA polymerase 77 in normal human cells, repairing 
and/or bypassing pyrimidine dimers as needed to keep 
cell growth and DNA replication going. Exposure to UV 
light introduces a heavy load of pyrimidine dimers, and 
some must be bypassed by translesion synthesis to keep 
replication on track. When one system is missing, it is 
partly compensated for by the other. A loss of DNA poly- 
merase 7] activity leads to stalled replication forks and 
bypass of UV lesions by different, more mutagenic, 
translesion synthesis (TLS) polymerases. As when other 
DNA repair systems are absent, the resulting increase in 
mutations often leads to cancer. 

One of the most common inherited cancer-suscepti- 
bility syndromes is hereditary nonpolyposis colon can- 
cer (HNPCC). This syndrome has been traced to defects 
in mismatch repair. Human and other eukaryotic cells 
have several proteins analogous to the bacterial MutL 
and MutS proteins (see Fig. 25-23). Defects in at least 
five different mismatch repair genes can give rise to 
HNPCC. The most prevalent are defects in the hMLHl 
(human MutL homolog 1) and UMSH2 (human MutS 
homolog 2) genes. In individuals with HNPCC, cancer 
generally develops at an early age, with colon cancers 
being most common. 

Most human breast cancer occurs in women with no 
known predisposition. However, about 10% of cases are 
associated with inherited defects in two genes, BRCA1 
and BRCA2. Human BRCA1 and BRCA2 are large 
proteins (1,834 and 3,418 amino acid residues, respec- 
tively) that interact with a wide range of other proteins 
involved in transcription, chromosome maintenance, 
DNA repair, and control of the cell cycle. BRCA2 has 
been implicated in the recombinational DNA repair of 
double-strand breaks. However, the precise molecular 
function of BRCA1 and BRCA2 in these various cellular 
processes is not yet clear. Women with defects in either 
the BRCA1 or BRCA2 gene have a greater than 80% 
chance of developing breast cancer. 



25.3 DNA Recombination 

The rearrangement of genetic information within 
and among DNA molecules encompasses a variety 
of processes, collectively placed under the heading 
of genetic recombination. The practical applications 
of DNA rearrangements in altering the genomes of 



increasing numbers of organisms are now being explored 
(Chapter 9). 

Genetic recombination events fall into at least three 
general classes. Homologous genetic recombination 

(also called general recombination) involves genetic 
exchanges between any two DNA molecules (or seg- 
ments of the same molecule) that share an extended 
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region of nearly identical sequence. The actual sequence 
of bases is irrelevant, as long as it is similar in the two 
DNAs. In site-specific recombination the exchanges 
occur only at a particular DNA sequence. DNA trans- 
position is distinct from both other classes in that it 
usually involves a short segment 
of DNA with the remarkable ca- 
pacity to move from one location 
in a chromosome to another. 
These "jumping genes" were 
first observed in maize in the 
1940s by Barbara McClintock. 
There is in addition a wide range 
of unusual genetic rearrange- 
ments for which no mechanism 
or purpose has yet been pro- 
posed. Here we focus on the 
three general classes. 
The functions of genetic recombination systems are 
as varied as their mechanisms. They include roles in 
specialized DNA repair systems, specialized activities in 
DNA replication, regulation of expression of certain 
genes, facilitation of proper chromosome segregation 
during eukaryotic cell division, maintenance of genetic 
diversity, and implementation of programmed genetic 
rearrangements during embryonic development. In 
most cases, genetic recombination is closely integrated 
with other processes in DNA metabolism, and this be- 
comes a theme of our discussion. 

Homologous Genetic Recombination 
Has Several Functions 

In bacteria, homologous genetic recombination is prima- 
rily a DNA repair process and in this context (as noted 
in Section 25.2) is referred to as recombinational 
DNA repair. It is usually directed at the reconstruction 
of replication forks stalled at the site of DNA damage. 
Homologous genetic recombination can also occur dur- 
ing conjugation (mating), when chromosomal DNA is 
transferred from one bacterial cell (donor) to another 



(recipient). Recombination during conjugation, although 
rare in wild bacterial populations, contributes to genetic 
diversity 

In eukaryotes, homologous genetic recombination 
can have several roles in replication and cell division, 
including the repair of stalled replication forks. Recom- 
bination occurs with the highest frequency during 
meiosis, the process by which diploid germ-line cells 
with two sets of chromosomes divide to produce haploid 
gametes (sperm cells or ova) in animals (haploid spores 
in plants) — each gamete having only one member of 
each chromosome pair (Fig. 25-31). Meiosis begins 
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FIGURE 25-31 Meiosis in animal germ-line cells. The chromosomes of a hy- 
pothetical diploid germ-line cell (six chromosomes; three homologous 
pairs) replicate and are held together at their centromeres. Each replicated 
double-stranded DNA molecule is called a chromatid (sister chromatid). In 
prophase I, just before the first meiotic division, the three homologous sets 
of chromatids align to form tetrads, held together by covalent links at homol- 
ogous junctions (chiasmata). Crossovers occur within the chiasmata (see 
Fig. 25-32). These transient associations between homologs ensure that the 
two tethered chromosomes segregate properly in the next step, when they 
migrate toward opposite poles of the dividing cell in the first meiotic divi- 
sion. The products of this division are two daughter cells each with three 
pairs of chromatids. The pairs now line up across the equator of the cell in 
preparation for separation of the chromatids (now called chromosomes). 
The second meiotic division produces four haploid daughter cells that can 
serve as gametes. Each has three chromosomes, half the number of the 
diploid germ-line cell. The chromosomes have resorted and recombined. 
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with replication of the DNA in the germ-line cell so that 
each DNA molecule is present in four copies. The cell 
then goes through two rounds of cell division without an 
intervening round of DNA replication. This reduces the 
DNA content to the haploid level in each gamete. 

After the DNA is replicated during prophase of the 
first meiotic division, the resulting sister chromatids 
remain associated at their centromeres. At this stage, 
each set of four homologous chromosomes (tetrad) 
exists as two pairs of chromatids. Genetic information 
is now exchanged between the closely associated ho- 
mologous chromatids by homologous genetic recombi- 
nation, a process involving the breakage and rejoining 
of DNA (Fig. 25-32). This exchange, also referred to 
as crossing over, can be observed with the light micro- 
scope. Crossing over links the two pairs of sister chro- 
matids together at points called chiasmata (singular, 
chiasma). 

Crossing over effectively links together all four ho- 
mologous chromatids, a linkage that is essential to the 
proper segregation of chromosomes in the subsequent 
meiotic cell divisions. Crossing over is not an entirely 
random process, and "hot spots" have been identified on 
many eukaryotic chromosomes. However, the assump- 
tion that crossing over can occur with equal probability 
at almost any point along the length of two homologous 
chromosomes remains a reasonable approximation in 
many cases, and it is this assumption that permits the 
genetic mapping of genes. The frequency of homologous 

Centromere 




(a) 



FIGURE 25-32 Crossing over, (a) Crossing over often produces an ex- 
change of genetic material, (b) The homologous chromosomes of a 
grasshopper are shown during prophase I of meiosis. Many points of 



recombination in any region separating two points on 
a chromosome is roughly proportional to the distance 
between the points, and this allows determination of the 
relative positions of and distances between different 
genes. 

Homologous recombination thus serves at least 
three identifiable functions: (1) it contributes to the 
repair of several types of DNA damage; (2) it provides, 
in eukaryotic cells, a transient physical link between 
chromatids that promotes the orderly segregation of 
chromosomes at the first meiotic cell division; and (3) it 
enhances genetic diversity in a population. 

Recombination during Meiosis Is Initiated 
with Double-Strand Breaks 

A likely pathway for homologous recombination during 
meiosis is outlined in Figure 25-33a. The model has 
four key features. First, homologous chromosomes are 
aligned. Second, a double-strand break in a DNA 
molecule is enlarged by an exonuclease, leaving a single- 
strand extension with a free 3'-hydroxyl group at the 
broken end (step CD). Third, the exposed 3' ends in- 
vade the intact duplex DNA of the homolog, and this is 
followed by branch migration (Fig. 25-34) and/or by 
replication to create a pair of crossover structures, called 
Holliday intermediates (Fig. 25-33a, steps (2) to ®). 
Fourth, cleavage of the two crossovers creates either of 
two pairs of complete recombinant products (step (§)). 




joining (chiasmata) are evident between the two homologous pairs of 
chromatids. These chiasmata are the physical manifestation of prior 
homologous recombination (crossing-over) events, 
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(!) A double-strand break in one of two 
homologs is converted to a double- 
strand gap by the action of exonucleases. 
Strands with 3' ends are degraded 
less than those with 5' ends, 
producing 3' single-strand extensions. 



I 



> An exposed 3' end pairs with its 
complement in the intact homolog. 
The other strand of the duplex 
is displaced. 



© The invading 3' end is extended by 

DNA polymerase plus branch migration, 
eventually generating a DNA molecule 
with two crossovers called Holliday 
intermediates. 



J 



(4) Further DNA replication replaces the 
DNA missing from the site of the 
original double-strand break. 




Cleavage of the Holliday intermediates 
by specialized nucleases generates 
either of the two recombination products. 
In product set 2, the DNA on either 
side of the region undergoing repair 
is recombined. 



FIGURE 25-33 Recombination during meiosis. (a) Model of double- 
strand break repair for homologous genetic recombination. The two 
homologous chromosomes (one shown in red, the other blue) involved 
in this recombination event have similar sequences. Each of the two 
genes shown has different alleles on the two chromosomes. The steps 
are described in the text, (b) A Holliday intermediate formed between 
two bacterial plasmids in vivo, as seen with the electron microscope. 
The intermediates are named for Robin Holliday, who first proposed 
their existence in 1964 



In this double-strand break repair model for re- 
combination, the 3' ends are used to initiate the genetic 
exchange. Once paired with the complementary strand 
on the intact homolog, a region of hybrid DNA is created 
that contains complementary strands from two different 
parent DNAs (the product of step (2) in Fig. 25-33a). 
Each of the 3' ends can then act as a primer for DNA 
replication. The structures thus formed, Holliday in- 
termediates (Fig. 25-33b), are a feature of homolo- 
gous genetic recombination pathways in all organisms. 

Homologous recombination can vary in many details 
from one species to another, but most of the steps out- 
lined above are generally present in some form. There 
are two ways to cleave, or "resolve," the Holliday inter- 
mediate so that the two products carry genes in the 
same linear order as in the substrates — the original, un- 
recombined chromosomes (step (5) of Fig. 25-33a). If 
cleaved one way, the DNA flanking the region containing 
the hybrid DNA is not recombined; if cleaved the other 
way, the flanking DNA is recombined. Both outcomes 
are observed in vivo in eukaiyotes and bacteria. 

The homologous recombination illustrated in Fig- 
ure 25-33 is a very elaborate process with subtle molec- 
ular consequences for the generation of genetic 
diversity. To understand how this process contributes 
to diversity, we should keep in mind that the two ho- 
mologous chromosomes that undergo recombination 
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FIGURE 25-34 Branch migration. When a template strand pairs with 
two different complementary strands, a branch is formed at the point 
where the three complementary strands meet. The branch "migrates" 
when base pairing to one of the two complementary strands is broken 
and replaced with base pairing to the other complementary strand. In 
the absence of an enzyme to direct it, this process can move the 
branch spontaneously in either direction. Spontaneous branch migra- 
tion is blocked wherever one of the otherwise complementary strands 
has a sequence nonidentical to the other strand. 
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are not necessarily identical The linear array of genes 
may be the same, but the base sequences in some of the 
genes may differ slightly (in different alleles). In a hu- 
man, for example, one chromosome may contain the al- 
lele for hemoglobin A (normal hemoglobin) while the 
other contains the allele for hemoglobin S (the sickle- 
cell mutation). The difference may consist of no more 
than one base pair among millions. Homologous recom- 
bination does not change the linear array of genes, but 
it can determine which alleles become linked on a sin- 
gle chromosome, and are thereby passed to the next 
generation together. 

Recombination Requires a Host of Enzymes 
and Other Proteins 

Enzymes that promote various steps of homologous re- 
combination have been isolated from both bacteria and 
eukaryotes. InE. coli, the recB, recC, and recD genes 
encode the heterotrimeric RecBCD enzyme, which has 
both helicase and nuclease activities. The RecA protein 
promotes all the central steps in the homologous recom- 
bination process: the pairing of two DNAs, formation of 
Holliday intermediates, and branch migration (as de- 
scribed below). The RuvA and RuvB proteins (repair of 
UV damage) form a complex that binds to Holliday in- 
termediates, displaces RecA protein, and promotes 
branch migration at higher rates than does RecA. Nucle- 
ases that specifically cleave Holliday intermediates, of- 
ten called resolvases, have been isolated from bacteria 
and yeast. The RuvC protein is one of at least two such 
nucleases inE, coli. 

The RecBCD enzyme binds to linear DNA at a free 
(broken) end and moves inward along the double helix, 
unwinding and degrading the DNA in a reaction cou- 
pled to ATP hydrolysis (Fig, 25-35). The RecB and 
RecD sub units are helicase motors, with RecB moving 
3'— »5' along one strand and RecD moving 5'— »3' along 
the other strand. The activity of the enzyme is altered 
when it interacts with a sequence referred to as chi, 
(5')GCTGGTGG, which binds tightly to a site on the 
RecC subunit. From that point, degradation of the 
strand with a 3' terminus is greatly reduced, but degra- 
dation of the 5 '-terminal strand is increased. This 
process creates a single-stranded DNA with a 3' end, 
which is used during subsequent steps in recombination 
(Fig. 25-33). The 1,009 chi sequences scattered 
throughout the E. coli genome enhance the frequency 
of recombination about 5- to 10-fold within 1,000 bp of 
the chi site. The enhancement declines as the distance 
from the site increases. Sequences that enhance recom- 
bination frequency have also been identified in several 
other organisms. 

RecA is unusual among the proteins of DNA metab- 
olism in that its active form is an ordered, helical filament 
of up to several thousand subunits that assemble cooper- 
atively on DNA (Fig. 25-36). This filament normally 
forms on single-stranded DNA, such as that produced by 
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FIGURE 25-35 Helicase and nuclease activities of the RecBCD 
enzyme. Entering at a double-stranded end, RecBCD unwinds and 
degrades the DNA. A nuclease domain in the RecB subunit is posi- 
tioned so that it can alternately degrade one DNA strand then the 
other The RecB subunit travels in the 3'->5' direction on the strand 
with the free 3' end, using its ATP-dependent helicase activity. The 
RecD subunit similarly moves along the opposite strand in the 
5'->3' direction, Chi sites are recognized and bound at a site on 
the RecC subunit. The chi-containing strand is looped out of the 
enzyme, and only the strand with the free 5' end is now accessible 
to the RecB nuclease domain. The RecBCD enzyme is believed to 
help initiate homologous genetic recombination in E coli. It is also 
involved in the repair of double-strand breaks at collapsed replica- 
tion forks 



the RecBCD enzyme. The filament will also form on a du- 
plex DNA with a single-strand gap; in this case, the first 
RecA monomers bind to the single-stranded DNA in the 
gap, after which the assembled filament rapidly envelops 
the neighboring duplex. Several other proteins, including 
RecX, DinI, RecF, RecO, and RecR, regulate the assem- 
bly and disassembly of RecA filaments. 

A useful model to illustrate the recombination ac- 
tivities of the RecA filament is the in vitro DNA strand 
exchange reaction (Fig. 25-37). A single strand of 
DNA is first bound by RecA to establish the nucleopro- 
tein filament. The RecA filament then takes up a ho- 
mologous duplex DNA and aligns it with the bound 
single strand. ATP binding (but not hydrolysis) is re- 
quired for the formation of an active RecA filament 
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that can align two DNA molecules. Strands are then 
exchanged between the two DNAs to create hybrid 
DNA. The exchange occurs at a rate of 6 bp/s and pro- 
gresses in the 5'-»3' direction relative to the single- 
stranded DNA within the RecA filament. This reaction 
can involve either three or four strands (Fig. 25-37); 
in the latter case, a Holliday intermediate forms during 
the process. 



FIGURE 25-36 RecA protein, (a) Nucleoprotein filament of RecA 
protein on single-stranded DNA, as seen with the electron microscope. 
The striations indicate the right-handed helical structure of the fila- 
ment (b) Surface contour model of a 24-subunit RecA filament The 
filament has 6 subunits per turn. One subunit is colored red to provide 
perspective (derived from PDB ID 2REB). (c) After a rate-limiting 
nucleation step, RecA filaments are extended in the 5'->3' direction 
on single-stranded DNA. Disassembly proceeds, also in the 5'->3' 
direction, from the end opposite to that where extension occurs, 
(d) Filament assembly is assisted by the RecF, RecO, and RecR proteins 
(RecFOR). The RecX protein inhibits RecA filament extension. The Dinl 
protein stabilizes RecA filaments, preventing disassembly. 
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RecA protein binds to single-stranded or gapped 
DNA. The complementary strand of the linear DNA 
pairs with a circular single strand. The other 
linear strand is displaced (left) or pairs with its 
complement in the circular duplex to yield a 
Holliday intermediate (right). 
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Continued branch migration yields a circular duplex 
with a nick and either a displaced linear strand (left) 
or a partially single-stranded linear duplex (right). 

FIGURE 25-37 RecA-promoted DNA strand exchange in vitro. 

Strand exchange involves the separation of one strand of a duplex DNA 
from its complement and transfer of the strand to an alternative comple- 
mentary strand to form a new duplex (heteroduplex) DNA. The transfer 
forms a branched intermediate. Formation of the final product depends 
on branch migration, which is facilitated by RecA. The reaction can in- 
volve three strands (left) or a reciprocal exchange between two homolo- 
gous duplexes — four strands in all (right). When four strands are involved, 
a Holliday intermediate results. RecA promotes the branch-migration 
phases of these reactions, using energy derived from ATP hydrolysis. 
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As the duplex DNA is incorporated within the RecA 
filament and aligned with the bound single-stranded 
DNA over regions of hundreds of base pairs, one strand 
of the duplex switches pairing partners (Fig. 25-38, 
step (2)). Because DNA is a helical structure, continued 
strand exchange requires an ordered rotation of the 
two aligned DNAs. This brings about a spooling action 
(steps (3) and (4)) that shifts the branch point along the 
helix. ATP hydrolysis is coupled to the late stages of 
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FIGURE 25-38 Model for RecA-mediated DNA strand exchange. A 

three-strand reaction is shown. The balls representing RecA protein 
are undersized relative to the thickness of DNA to clarify the fate of 
the DNA strands. © RecA forms a filament on the single-stranded 
DNA. (5) A homologous duplex incorporates into this complex. (5) As 
spooling shifts the three-stranded region from left to right, one of the 
strands in the duplex is transferred to the single strand originally bound 
in the filament. The other strand of the duplex is displaced, and a new 
duplex forms within the filament. As rotation continues (© and ©), 
the displaced strand separates entirely. In this model/ hydrolysis of ATP 
by RecA rotates the two DNA molecules relative to each other and thus 
directs the strand exchange from left to right as shown. 



DNA strand exchange, in which the hybrid DNA created 
in the initial pairing reaction is extended. The coupling 
mechanism is not yet understood. 

Once a Holliday intermediate has formed, a host of 
enzymes— topoisomerases, the RuvAB branch migration 
protein, a resolvase, other nucleases, DNA polymerase I or 
III, and DNA ligase — are required to complete recombina- 
tion. The RuvC protein (M r 20,000) ofE. coli cleaves Hol- 
liday intermediates to generate full-length, unbranched 
chromosome products. 

All Aspects of DNA Metabolism Come Together 
to Repair Stalled Replication Forks 

Like all cells, bacteria sustain high levels of DNA dam- 
age even under normal growth conditions. Most DNA 
lesions are repaired rapidly by base-excision repair, 
nucleotide-excision repair, and the other pathways 
described earlier. Nevertheless, almost every bacter- 
ial replication fork encounters an unrepaired DNA le- 
sion or break at some point in its journey from the 
replication origin to the terminus (Fig. 25-30). For 
many types of lesions, DNA polymerase III cannot 
proceed and the encounter tends to leave the lesion 
in a single-strand gap. An encounter with a DNA 
strand break creates a double-strand break. Both sit- 
uations require recombinational DNA repair (Fig. 
25-39). Under normal growth conditions, stalled 
replication forks are reactivated by an elaborate re- 
pair pathway encompassing recombinational DNA re- 
pair, the restart of replication, and the repair of any 
lesions left behind. All aspects of DNA metabolism 
come together in this process. 

After a replication fork has been halted, it can be 
restored by at least two major paths, both of which re- 
quire the RecA protein. The repair pathway for lesion- 
containing DNA gaps also requires the RecF, RecO, 
and RecR proteins. Repair of double-strand breaks re- 
quires the RecBCD enzyme (Fig. 25-39). Additional 
recombination steps are followed by origin-inde- 
pendent restart of replication, in which the repli- 
cation fork reassembles with the aid of a complex of 
seven proteins (PriA, B, and C, and DnaB, C, G, and 
T). This complex, originally discovered as a compo- 
nent required for the replication of <£X174 DNA in 
vitro, is now termed the replication restart primo- 
some. Restart of the replication fork also requires 
DNA polymerase II, in a role not yet defined; this poly- 
merase II activity gives way to DNA polymerase III for 
the extensive replication generally required to com- 
plete the chromosome. In at least some cases, replica- 
tion restart can occur downstream of a blocking DNA 
lesion before the lesion is repaired. 

The repair of stalled replication forks entails coordi- 
nated transitions between replication and recombina- 
tion. The recombination steps function to fill the DNA 
gap or rejoin the broken DNA branch to recreate the 
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FIGURE 25-39 Models for recombinational DNA 
repair of stalled replication forks. The replication 
fork collapses on encountering a DNA lesion (left) 
or strand break (right), Recombination enzymes 
promote the DNA strand transfers needed to re- 
pair the branched DNA structure at the replication 
fork. A lesion in a single-strand gap is repaired in 
a reaction requiring the RecF, RecO, and RecR 
proteins. Double-strand breaks are repaired in a 
pathway requiring the RecBCD enzyme. Both 
pathways require RecA. Recombination interme- 
diates are processed by additional enzymes (e.g., 
RuvA, RuvB, and RuvC, which process Holliday 
intermediates). Lesions in double-stranded DNA 
are repaired by nucleotide-excision repair or 
other pathways. The replication fork re-forms with 
the aid of enzymes catalyzing origin-independent 
replication restart, and chromosomal replication 
is completed. The overall process requires an 
elaborate coordination of all aspects of bacterial 
DNA metabolism. 
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branched DNA structure at the replication fork. Lesions 
left behind in what is now duplex DNA are repaired by 
pathways such as base-excision or nucleotide-excision 
repair. Thus a wide range of enzymes encompassing 
every aspect of DNA metabolism ultimately take part in 
the repair of a stalled replication fork. This type of repair 
process is a primary function of the homologous recom- 
bination system of every cell, and defects in recombina- 
tional DNA repair play an important role in human 
disease (Box 25-1). 

Site-Specific Recombination Results in 
Precise DNA Rearrangements 

Homologous genetic recombination, the type we have 
discussed so far, can involve any two homologous 



sequences. The second general type of recombina- 
tion, site-specific recombination, is a very different 
type of process: recombination is limited to specific 
sequences. Recombination reactions of this type oc- 
cur in virtually every cell, filling specialized roles 
that vary greatly from one species to another. Exam- 
ples include regulation of the expression of certain 
genes and promotion of programmed DNA re- 
arrangements in embryonic development or in the 
replication cycles of some viral and plasmid DNAs. 
Each site-specific recombination system consists of 
an enzyme called a recombinase and a short (20 to 
200 bp), unique DNA sequence where the recombi- 
nase acts (the recombination site). One or more aux- 
iliary proteins may regulate the timing or outcome of 
the reaction. 
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There are two general classes of site-specific recom- 
bination systems, which rely on either Tyr or Ser 
residues in the active site. In vitro studies of many site- 
specific recombination systems in the tyrosine class 
have elucidated some general principles, including the 
fundamental reaction pathway (Fig. 25-40a). Several 
of these enzymes have been crystallized, revealing 
structural details of the reaction. A separate recombi- 
nase recognizes and binds to each of two recombination 
sites on two different DNA molecules or within the same 
DNA. One DNA strand in each site is cleaved at a spe- 
cific point within the site, and the recombinase becomes 
covalently linked to the DNA at the cleavage site 
through a phosphotyrosine bond (step ®). The tran- 
sient protein-DNA linkage preserves the phosphodiester 
bond that is lost in cleaving the DNA, so high-energy co- 
factors such as ATP are unnecessary in subsequent steps. 
The cleaved DNA strands are rejoined to new partners to 
form a Holliday intermediate, with new phosphodiester 
bonds created at the expense of the protein-DNA linkage 
(step (§)). To complete the reaction, the process must 
be repeated at a second point within each of the two 
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FIGURE 25-40 A site-specific recombination reaction, (a) The reac- 
tion shown here is for a common class of site-specific recombinases 
called integrase-class recombinases (named after bacteriophage A inte- 
grase, the first recombinase characterized). These enzymes use Tyr 
residues as nucleophiles at the active site. The reaction is carried out 
within a tetramer of identical subunits. Recombinase subunits bind to 
a specific sequence, the recombination site. 0 One strand in each 
DNA is cleaved at particular points in the sequence. The nucleophile 
is the — OH group of an active-site Tyr residue, and the product is a co- 
valent phosphotyrosine link between protein and DNA. ® The 
cleaved strands join to new partners, producing a Holliday intermedi- 
ate Steps (5) and ® complete the reaction by a process similar to the 
first two steps. The original sequence of the recombination site is 
regenerated after recombining the DNA flanking the site. These steps 
occur within a complex of multiple recombinase subunits that some- 
times includes other proteins not shown here, (b) Surface contour 
model of a four-subunit integrase-class recombinase called the Cre 
recombinase, bound to a Holliday intermediate (shown with light blue 
and dark blue helix strands). The protein has been rendered transpar- 
ent so that the bound DNA is visible (derived from PDB ID 3CRX). 
Another group of recombinases, called the resolvase/invertase family, 
use a Ser residue as nucleophile at the active site. 



recombination sites (steps (3) and ®). In the systems 
that employ an active-site Ser residue, both strands of 
each recombination site are cut concurrently and re- 
joined to new partners without the Holliday intermedi- 
ate. In both types of system, the exchange is always 
reciprocal and precise, regenerating the recombination 
sites when the reaction is complete. We can view a 
recombinase as a site-specific endonuclease and ligase 
in one package. 

The sequences of the recombination sites recognized 
by site-specific recombinases are partially asymmetric 
(nonpalindromic), and the two recombining sites align in 
the same orientation during the recombinase reaction. 
The outcome depends on the location and orientation of 
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(a) 

FIGURE 25-41 Effects of site-specific recombination. The outcome of 
site-specific recombination depends on the location and orientation of 
the recombination sites (red and green) in a double-stranded DNA 
molecule. Orientation here (shown by arrowheads) refers to the order 
of nucleotides in the recombination site, not the 5'->3' direction. 

the recombination sites (Fig. 25-41). If the two sites are 
on the same DNA molecule, the reaction either inverts or 
deletes the intervening DNA, determined by whether the 
recombination sites have the opposite or the same orien- 
tation, respectively. If the sites are on different DNAs, the 
recombination is intermolecular; if one or both DNAs are 
circular, the result is an insertion. Some recombinase sys- 
tems are highly specific for one of these reaction types and 
act only on sites with particular orientations. 

The first site-specific recombination system studied 
in vitro was that encoded by bacteriophage A. When A 
phage DNA enters anE. coli cell, a complex series of reg- 
ulatory events commits the DNA to one of two fates. The 
A DNA replicates and produces more bacteriophages 
(destroying the host cell), or it integrates into the host 



Deletion and insertion 




(b) 

(a) Recombination sites with opposite orientation in the same DNA 
molecule. The result is an inversion, (b) Recombination sites with the 
same orientation, either on one DNA molecule, producing a deletion, 
or on two DNA molecules, producing an insertion. 



chromosome and (as prophage) replicates passively 
along with the chromosome for many cell generations. In- 
tegration is accomplished by a phage-encoded, tyrosine- 
class recombinase (A integrase) that acts at recombination 
sites on the phage and bacterial DNAs — at attachment 
sites attP and attB, respectively (Fig. 25-42). The role of 
site-specific recombination in regulating gene expression is 
considered in Chapter 28. 

Complete Chromosome Replication Can Require 
Site-Specific Recombination 

Recombinational DNA repair of a circular bacterial 
chromosome, while essential, sometimes generates 
deleterious byproducts. The resolution of a Holliday 
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The recombinase is the A integrase (or INT protein). Integration and 
excision use different attachment sites and different auxiliary proteins. 
Excision uses the proteins XIS, encoded by the bacteriophage, and FIS, 
encoded by the bacterium. Both reactions require the protein iHF 
(integration host factor), encoded by the bacterium. 



FIGURE 25-42 Integration and excision of bacteriophage A DNA at 
the chromosomal target site. The attachment site on the A phage DNA 
{attP) shares only 15 bp of complete homology with the bacterial site 
{attB) in the region of the crossover. The reaction generates two new 
attachment sites (attR and attL) flanking the integrated phage DNA. 
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FIGURE 25-43 DNA deletion to undo a deleterious effect of recombi- 
national DNA repair. The resolution of a Holliday intermediate during 
recombi national DNA repair (if cut at the points indicated by red arrows) 
can generate a contiguous dimeric chromosome. A specialized site-spe- 
cific recombi nase in E coli, XerCD, converts the dimer to monomers, al- 
lowing chromosome segregation and cell division to proceed. 



intermediate at a replication fork by a nuclease such as 
RuvC, followed by completion of replication, can give 
rise to one of two products: the usual two monomeric 
chromosomes or a contiguous dimeric chromosome 
(Fig. 25-43). In the latter case, the covalently linked 
chromosomes cannot be segregated to daughter cells at 
cell division and the dividing cells become "stuck." A 
specialized site-specific recombination system mE. coli, 
the XerCD system, converts the dimeric chromosomes 
to monomeric chromosomes so that cell division can 
proceed. The reaction is a site-specific deletion reaction 
(Fig. 25-41b). This is another example of the close coor- 
dination between DNA recombination processes and 
other aspects of DNA metabolism. 

Transposable Genetic Elements Move from 
One Location to Another 

We now consider the third general type of recombina- 
tion system: recombination that allows the movement of 
transposable elements, or transposons. These seg- 
ments of DNA, found in virtually all cells, move, or 



"jump," from one place on a chromosome (the donor 
site) to another on the same or a different chromosome 
(the target site). DNA sequence homology is not usually 
required for this movement, called transposition; the 
new location is determined more or less randomly. In- 
sertion of a transposon in an essential gene could kill the 
cell, so transposition is tightly regulated and usually 
very infrequent. Transposons are perhaps the simplest 
of molecular parasites, adapted to replicate passively 
within the chromosomes of host cells. In some cases 
they carry genes that are useful to the host cell, and thus 
exist in a kind of symbiosis with the host. 

Bacteria have two classes of transposons. Inser- 
tion sequences (simple transposons) contain 
only the sequences required for transposition and the 
genes for the proteins (transposases) that promote the 
process. Complex transposons contain one or more 
genes in addition to those needed for transposition. 
These extra genes might, for example, confer resistance 
to antibiotics and thus enhance the survival chances of 
the host cell. The spread of antibiotic-resistance ele- 
ments among disease-causing bacterial populations that 
is rendering some antibiotics ineffectual (p. 949) is 
mediated in part by transposition. ■ 

Bacterial transposons vary in structure, but most have 
short repeated sequences at each end that serve as binding 
sites for the transposase. When transposition occurs, a 
short sequence at the target site (5 to 10 bp) is duplicated 
to form an additional short repeated sequence that flanks 
each end of the inserted transposon (Fig. 25^4). These 
duplicated segments result from the cutting mechanism 
used to insert a transposon into the DNA at a new location. 
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FIGURE 25-44 Duplication of the DNA sequence at a target site when 
a transposon is inserted. The sequences that are duplicated following 
transposon insertion are shown in red. These sequences are generally 
only a few base pairs long, so their size relative to that of a typical 
transposon is greatly exaggerated in this drawing. 
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There are two general pathways for transposition in 
bacteria. In direct (or simple) transposition (Fig. 25-45, 
left), cuts on each side of the transposon excise it, and 
the transposon moves to a new location. This leaves a 
double-strand break in the donor DNA that must be re- 
paired. At the target site, a staggered cut is made (as in 
Fig. 25-44), the transposon is inserted into the break, 
and DNA replication fills in the gaps to duplicate the 
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target site sequence. In replicative transposition (Fig. 
25-45, right), the entire transposon is replicated, leav- 
ing a copy behind at the donor location. A cointe- 
grate is an intermediate in this process, consisting of 
the donor region covalently linked to DNA at the tar- 
get site. Two complete copies of the transposon are 
present in the cointegrate, both having the same rela- 
tive orientation in the DNA. In some well-character- 
ized transposons, the cointegrate intermediate is 
converted to products by site-specific recombination, 
in which specialized recombinases promote the re- 
quired deletion reaction. 

Eukaryotes also have transposons, structurally sim- 
ilar to bacterial transposons, and some use similar trans- 
position mechanisms. In other cases, however, the 
mechanism of transposition seems to involve an RNA 
intermediate. Evolution of these transposons is inter- 
twined with the evolution of certain classes of RNA 
viruses. Both are described in the next chapter. 

immunoglobulin Genes Assemble by 
Recombination 

Some DNA rearrangements are a programmed part of 
development in eukaryotic organisms. An important 
example is the generation of complete immunoglobulin 
genes from separate gene segments in vertebrate 
genomes. A human (like other mammals) is capable of 
producing millions of different immunoglobulins (anti- 
bodies) with distinct binding specificities, even though 
the human genome contains only -29,000 genes. Re- 
combination allows an organism to produce an extraor- 
dinary diversity of antibodies from a limited DNA-coding 
capacity. Studies of the recombination mechanism re- 
veal a close relationship to DNA transposition and sug- 
gest that this system for generating antibody diversity 
may have evolved from an ancient cellular invasion of 
transposons. 

We can use the human genes that encode proteins of 
the immunoglobulin G (IgG) class to illustrate how anti- 
body diversity is generated. Immunoglobulins consist 



FIGURE 25-45 Two general pathways for transposition: direct (sim- 
ple) and replicative. 0 The DNA is first cleaved on each side of the 
transposon, at the sites indicated by arrows. @ The liberated 3'- 
hydroxyl groups at the ends of the transposon act as nucleophiles in a 
direct attack on phosphodiester bonds in the target DNA. The target 
phosphodiester bonds are staggered (not directly across from each 
other) in the two DNA strands. ©The transposon is now linked to the 
target DNA. In direct transposition (left), replication fills in gaps at 
each end to complete the process. In replicative transposition (right), 
the entire transposon is replicated to create a cointegrate intermedi- 
ate. 0The cointegrate is often resolved later, with the aid of a separate 
site-specific recombination system. The cleaved host DNA left behind 
after direct transposition is either repaired by DNA end-joining or de- 
graded (not shown). The latter outcome can be lethal to an organism. 
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of two heavy and two light polypeptide chains (see Fig. 
5-21). Each chain has two regions, a variable region, with 
a sequence that differs greatly from one immunoglobulin 
to another, and a region that is virtually constant within a 
class of immunoglobulins. There are also two distinct 
families of light chains, kappa and lambda, which differ 
somewhat in the sequences of their constant regions. For 
all three types of polypeptide chain (heavy chain, and 
kappa and lambda light chains), diversity in the variable 
regions is generated by a similar mechanism. The genes 
for these polypeptides are divided into segments, and the 
genome contains clusters with multiple versions of each 
segment. The joining of one version of each gene segment 
creates a complete gene. 

Figure 25-46 depicts the organization of the DNA 
encoding the kappa light chains of human IgG and 
shows how a mature kappa light chain is generated. In 
undifferentiated cells, the coding information for this 
polypeptide chain is separated into three segments. The 
V (variable) segment encodes the first 95 amino acid 
residues of the variable region, the J (joining) segment 
encodes the remaining 12 residues of the variable 
region, and the C segment encodes the constant region. 



The genome contains —300 different V segments, 4 dif- 
ferent J segments, and 1 C segment. 

As a stem cell in the bone marrow differentiates to 
form a mature B lymphocyte, one V segment and one J 
segment are brought together by a specialized recombi- 
nation system (Fig. 25-46). During this programmed 
DNA deletion, the intervening DNA is discarded. There 
are about 300 X 4 = 1,200 possible V-J combinations. 
The recombination process is not as precise as the site- 
specific recombination described earlier, so additional 
variation occurs in the sequence at the V-J junction. 
This increases the overall variation by a factor of at least 
2.5, so the cells can generate about 2.5 X 1,200 = 3,000 
different V-J combinations. The final joining of the V-J 
combination to the C region is accomplished by an RNA- 
splicing reaction after transcription, a process described 
in Chapter 26. 

The recombination mechanism for joining the V and 
J segments is illustrated in Figure 25-47. Just beyond 
each V segment and just before each J segment lie recom- 
bination signal sequences (RSS). These are bound by 
proteins called RAG1 and RAG2 (products of the recom- 
bination activating gene). The RAG proteins catalyze the 
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FIGURE 25-46 Recombination of the V and J 
gene segments of the human IgG kappa light 
chain. This process is designed to generate anti- 
body diversity. At the top is shown the arrange- 
ment of IgG-coding sequences in a stem cell of 
the bone marrow. Recombination deletes the 
DNA between a particular V segment and a J 
segment. After transcription, the transcript is 
processed by RNA splicing, as described in 
Chapter 26; translation produces the light-chain 
polypeptide. The light chain can combine with 
any of 5,000 possible heavy chains to produce 
an antibody molecule. 
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FIGURE 25-47 Mechanism of immunoglobulin gene rearrangement. 

The RAG1 and RAG2 proteins bind to the recombination signal se- 
quences (RSS) and cleave one DNA strand between the RSS and theV (or 
J) segments to be joined. The liberated 3' hydroxyl then acts as a 
nucieophile, attacking a phosphodi ester bond in the other strand to cre- 
ate a double-strand break. The resulting hairpin bends on theV and J seg- 
ments are cleaved, and the ends are covalently linked by a complex of 
proteins specialized for end-joining repair of double-strand breaks. The 
steps in the generation of the double-strand break catalyzed by RAG1 
and RAG2 are chemically related to steps in transposition reactions, 

formation of a double-strand break between the signal 
sequences and the V (or J) segments to be joined. The V 
and J segments are then joined with the aid of a second 
complex of proteins. 

The genes for the heavy chains and the lambda light 
chains form by similar processes. Heavy chains have more 
gene segments than light chains, with more than 5,000 
possible combinations. Because any heavy chain can 
combine with any light chain to generate an immunoglob- 
ulin, each human has at least 3,000 X 5,000 = 1.5 X 10 7 
possible IgGs. And additional diversity is generated by 
high mutation rates (of unknown mechanism) in the V 
sequences during B-lymphocyte differentiation. Each 
mature B lymphocyte produces only one type of antibody, 
but the range of antibodies produced by the B lympho- 
cytes of an individual organism is clearly enormous. 

Did the immune system evolve in part from ancient 
transposons? The mechanism for generation of the 
double-strand breaks by RAG1 and RAG2 does mirror 
several reaction steps in transposition (Fig. 25-47). In 
addition, the deleted DNA, with its terminal RSS, has a 



sequence structure found in most transposons. In the test 
tube, RAG1 and RAG2 can associate with this deleted 
DNA and insert it, transposonlike, into other DNA mole- 
cules (probably a rare reaction in B lymphocytes). Al- 
though we cannot know for certain, the properties of the 
immunoglobiilin gene rearrangement system suggest an 
intriguing origin in which the distinction between host 
and parasite has become blurred by evolution. 



SUMMARY 25,3 DNA Recombination 

■ DNA sequences are rearranged in recombination 
reactions, usually in processes tightly coordinated 
with DNA replication or repair. 

■ Homologous genetic recombination can take place 
between any two DNA molecules that share 
sequence homology. In meiosis (in eukaryotes), 
this type of recombination helps to ensure accurate 
chromosomal segregation and create genetic 
diversity In both bacteria and eukaryotes it 
serves in the repair of stalled replication forks. 

A Holliday intermediate forms during homologous 
recombination. 

■ Site-specific recombination occurs only at specific 
target sequences, and this process can also involve 
a Holliday intermediate. Recombinases cleave the 
DNA at specific points and ligate the strands to 
new partners. This type of recombination is found 
in virtually all cells, and its many functions include 
DNA integration and regulation of gene expression. 

■ In virtually all cells, transposons use recombination 
to move within or between chromosomes. In 
vertebrates, a programmed recombination reaction 
related to transposition joins immunoglobulin gene 
segments to form immunoglobulin genes during 
B-lymphocyte differentiation. 
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Problems 

1. Conclusions from the Meselson-Stahl Experiment 

The Meselson-Stahl experiment (see Fig. 25-2) proved that 
DNA undergoes semiconservative replication in E. coli In the 
"dispersive" model of DNA replication, the parent DNA 
strands are cleaved into pieces of random size, then joined 
with pieces of newly replicated DNA to yield daughter du- 
plexes. Explain how the results of Meselson and Stahl's exper- 
iment ruled out such a model 

2. Heavy Isotope Analysis of DNA Replication A culture 
of E coli growing in a medium containing 15 NH 4 C1 is switched 
to a medium containing 14 NH 4 C1 for three generations (an 
eightfold increase in population) What is the molar ratio of hy- 
brid DNA ( 15 N- 14 N) to light DNA ( 14 N- 14 N) at this point? 

3. Replication of the E. coli Chromosome The E coli 

chromosome contains 4,639,221 bp. 

(a) How many turns of the double helix must be unwound 
during replication of the E. coli chromosome? 

(b) From the data in this chapter, how long would it take 
to replicate the E. coli chromosome at 37 °C if two replication 
forks proceeded from the origin? Assume replication occurs at 
a rate of 1 ,000 bp/s. Under some conditions E. coli cells can di- 
vide every 20 min. How might this be possible? 

(c) In the replication of the E. coli chromosome, about 
how many Okazaki fragments would be formed? What factors 
guarantee that the numerous Okazaki fragments are assem- 
bled in the correct order in the new DNA? 

4. Base Composition of DNAs Made from Single- 
Stranded Templates Predict the base composition of the to- 
tal DNA synthesized by DNA polymerase on templates 
provided by an equimolar mixture of the two complementary 
strands of bacteriophage 0X174 DNA (a circular DNA mole- 
cule). The base composition of one strand is A, 24.7%; G, 
24,1%; C, 18.5%; and T, 32.7%. What assumption is necessary 
to answer this problem? 

5. DNA Replication Kornberg and his colleagues incubated 
soluble extracts of E. coli with a mixture of dATP, dTTP, dGTP, 
and dCTP, all labeled with 32 P in the a-phosphate group. After 
a time, the incubation mixture was treated with trichloroacetic 
acid, which precipitates the DNA but not the nucleotide 
precursors. The precipitate was collected, and the extent of 
precursor incorporation into DNA was determined from the 
amount of radioactivity present in the precipitate. 



(a) If any one of the four nucleotide precursors were omit- 
ted from the incubation mixture, would radioactivity be found 
in the precipitate? Explain. 

(b) Would 32 P be incorporated into the DNA if only dTTP 
were labeled? Explain. 

(c) Would radioactivity be found in the precipitate if 32 P 
labeled the or y phosphate rather than the a phosphate of 
the deoxyribonucleotides? Explain. 

6. The Chemistry of DNA Replication All DNA poly 
merases synthesize new DNA strands in the 5'— »3' direction. 
In some respects, replication of the antiparallel strands of du- 
plex DNA would be simpler if there were also a second type of 
polymerase, one that synthesized DNA in the 3'-»5' direction. 
The two types of polymerase could, in principle, coordinate 
DNA synthesis without the complicated mechanics required 
for lagging strand replication. However, no such 3'^5'- 
synthesizing enzyme has been found. Suggest two possible 
mechanisms for 3'^5' DNA synthesis. Pyrophosphate should 
be one product of both proposed reactions. Could one or both 
mechanisms be supported in a cell? Why or why not? (Hint: 
You may suggest the use of DNA precursors not actually pres- 
ent in extant cells.) 

7. Leading and Lagging Strands Prepare a table that lists 
the names and compares the functions of the precursors, en- 
zymes, and other proteins needed to make the leading strand 
versus the lagging strand during DNA replication inE. coli. 

8. Function of DNA Ligase Some E. coli mutants contain 
defective DNA ligase. When these mutants are exposed to 3 H- 
labeled thymine and the DNA produced is sedimented on an 
alkaline sucrose density gradient, two radioactive bands ap- 
pear. One corresponds to a high molecular weight fraction, the 
other to a low molecular weight fraction. Explain. 

9. Fidelity of Replication of DNA What factors promote 
the fidelity of replication during synthesis of the leading strand 
of DNA? Would you expect the lagging strand to be made with 
the same fidelity? Give reasons for your answers. 

10. Importance of DNA Topoisomerases in DNA Repli- 
cation DNA unwinding, such as that occurring in replication, 
affects the superhelical density of DNA. In the absence of 
topoisomerases, the DNA would become overwound ahead of 
a replication fork as the DNA is unwound behind it. A bacter- 
ial replication fork will stall when the superhelical density (<x) 
of the DNA ahead of the fork reaches +0.14 (see Chapter 24). 

Bidirectional replication is initiated at the origin of a 6,000 
bp plasmid in vitro, in the absence of topoisomerases. The 
plasmid initially has a a of -0.06. How many base pairs will be 
unwound and replicated by each replication fork before the 
forks stall? Assume that each fork travels at the same rate and 
that each includes all components necessary for elongation ex- 
cept topoisomerase. 

11. The Ames Test In a nutrient medium that lacks histi- 
dine, a thin layer of agar containing ~10 9 Salmonella ty- 
phimurium histidine auxotrophs (mutant cells that require 
histidine to survive) produces -13 colonies over a two-day 
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incubation period at 37 °C (see Fig. 25-21). How do these 
colonies arise in the absence of histidine? The experiment is 
repeated in the presence of 0.4 fig of 2-aminoanthracene. The 
number of colonies produced over two days exceeds 10,000, 
What does this indicate about 2-aminoanthracene? What can 
you surmise about its carcinogenicity? 

12. DNA Repair Mechanisms Vertebrate and plant cells 
often methylate cytosine in DNA to form 5-methylcytosine 
(see Fig. 8-5a). In these same cells, a specialized repair sys- 
tem recognizes G-T mismatches and repairs them to G=C 
base pairs. How might this repair system be advantageous to 
the cell? (Explain in terms of the presence of 5-methylcytosine 
in the DNA.) 

13. DNA Repair in People with Xeroderma Pig- 
mentosum The condition known as xeroderma pig- 
mentosum (XP) arises from mutations in at least seven 
different human genes (see Box 25-1), The deficiencies are 
generally in genes encoding enzymes involved in some part of 
the pathway for human nucleotide-excision repair. The vari- 
ous types of XP are denoted A through G (XPA, XPB, etc ), 
with a few additional variants lumped under the label XP-V. 

Cultures of fibroblasts from healthy individuals and from 
patients with XPG are irradiated with ultraviolet light The DNA 
is isolated and denatured, and the resulting single-stranded 
DNA is characterized by analytical ultracentrifugation. 

(a) Samples from the normal fibroblasts show a significant 
reduction in the average molecular weight of the single- 
stranded DNA after irradiation, but samples from the XPG fi- 
broblasts show no such reduction. Why might this be? 

(b) If you assume that a nucleotide-excision repair system 
is operative in fibroblasts, which step might be defective in the 
cells from the patients with XPG? Explain. 

14. Holliday Intermediates How does the formation of 
Holliday intermediates in homologous genetic recombination 
differ from their formation in site-specific recombination? 

15. A Connection between Replication and Site-Specific 
Recombination Most wild strains of Saccharomyces cere- 
visiae have multiple copies of the circular plasmid 2fx (named 
for its contour length of about 2 /xm), which has —6,300 bp of 
DNA. For its replication the plasmid uses the host replication 
system, under the same strict control as the host cell chromo- 
somes, replicating only once per cell cycle. Replication of the 
plasmid is bidirectional, with both replication forks initiating at 
a single, well-defined origin. However, one replication cycle of 
a 2/x plasmid can result in more than two copies of the plas- 
mid, allowing amplification of the plasmid copy number (num- 
ber of plasmid copies per cell) whenever plasmid segregation 
at cell division leaves one daughter cell with fewer than the 
normal complement of plasmid copies. Amplification requires 
a site-specific recombination system encoded by the plasmid, 
which serves to invert one part of the plasmid relative to the 
other. Explain how a site-specific inversion event could result 
in amplification of the plasmid copy number. (Hint: Consider 
the situation when replication forks have duplicated one re- 
combination site but not the other.) 
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16. Mutagenesis in Escherichia coli Many mutagenic 
compounds act by alkylating the bases in DNA. The alkylating 
agent R7000 (7-methoxy-2-nitronaptho[2,l-6]furan) is an 
extremely potent mutagen. 




NO, 



R7000 



In vivo, R7000 is activated by the enzyme nitroreductase, and 
this more reactive form covalently attaches to DNA— primarily, 
but not exclusively, to G=C base pairs. 

In a 1996 study, Quillardet, Touati, and Hofnung explored 
the mechanisms by which R7000 causes mutations inE. coli. 
They compared the genotoxic activity of R7000 in two strains 
of E coli: the wild-type (uvr + ) and mutants lacking uvrA ac- 
tivity (uvr~; see Table 25-6). They first measured rates of mu- 
tagenesis. Rifampicin is an inhibitor of RNA polymerase (see 
Chapter 26). In its presence, cells will not grow unless certain 
mutations occur in the gene encoding RNA polymerase; the 
appearance of rifampicin-resistant colonies thus provides a 
useful measure of mutagenesis rates. 

The effects of different concentrations of R7000 were de- 
termined, with the results shown in the graph below. 




0.4 0.6 
R7000 (jLtg/mL) 

(a) Why are some mutants produced even when no R7000 
is present? 

Quillardet and colleagues also measured the survival rate 
of bacteria treated with different concentrations of R7000. 
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(b) Explain how treatment with R7000 is lethal to cells. 

(c) Explain the differences in the mutagenesis curves and 
in the survival curves for the two types of bacteria, uvr + and 
uvr~, as shown in the graphs. 

The researchers then went on to measure the amount of 
R7000 covalently attached to the DNA in uvr + and uvr" 
E. coli. They incubated bacteria with [ 3 H]R7000 for 10 or 
70 minutes, extracted the DNA, and measured its 3 H content 
in counts per minute (cpm) per /xg of DNA. 



3 H in DNA (cpm/jig) 



Time (min) uvr 4 



uvr 



10 
70 



76 
69 



159 
228 



(d) Explain why the amount of 3 H drops over time in the 
uvr + strain and rises over time in the uvr" strain. 

Quillardet and colleagues then examined the particular 
DNA sequence changes caused by R7000 in the uvr + and uvr" 
bacteria. For this, they used six different strains of E. coli, 
each with a different point mutation in the lacZ gene, which 
encodes 0-galactosidase (this enzyme catalyzes the same re- 
action as lactase; see Fig, 14-10). Cells with any of these mu- 
tations have a nonfunctional jS-galactosidase and are unable to 
metabolize lactose (i.e., a Lac" phenotype). Each type of point 
mutation required a specific reverse mutation to restore lacZ 
gene function and Lac + phenotype. By plating cells on a 
medium containing lactose as the sole carbon source, it was 
possible to select for these reverse-mutated, Lac + cells. And 
by counting the number of Lac + cells following mutagenesis of 
a particular strain the researchers could measure the fre- 
quency of each type of mutation. 

First, they looked at the mutation spectrum in uvr" cells. 
The following table shows the results for the six strains, CC101 
through CC106 (with the point mutation required to produce 
Lac + cells indicated in parentheses). 



Number of Lac + cells (average ± SD) 

CC101 CC102 CC103 CC104 CC105 CC106 

(A=T (G^C (G=C (G=C (A=T (A=T 
R7000 to to to to to to 

(Atg/mL) C=G) A=T) C=G) T=A) T=A) G=C) 

0 6 ± 3 11 ± 9 2 ± 1 5 ± 3 2 ± 1 1 ± 1 

0.075 24 ± 19 34 ± 3 8 ± 4 82 ± 23 40 ± 14 4 ± 2 
0.15 24 ± 4 26 ± 2 9 ± 5 180 ± 71 130 ± 50 3 ± 2 

(e) Which types of mutation show significant increases 
above the background rate due to treatment with R7000? Pro- 
vide a plausible explanation for why some have higher fre- 
quencies than others. 

(f) Can all of the mutations you listed in (e) be explained 
as resulting from covalent attachment of R7000 to a G=C base 
pair? Explain your reasoning. 

(g) Figure 2En-28b shows how methylation of guanine residues 
can lead to a G=C to A=T mutation. Using a similar pathway, show 
how a G-R7000 adduct could lead to the G=C to A=T or T=A mu- 
tations shown above. Which base pairs with the G — R7000 adduct? 

The results for the uvr + bacteria are shown in the table 
below. 

Number of Lac + cells (average ± SD) 

CC101 CC102 CC103 CC104 CC105 CC106 

(A=T (G=C (G=C (G=C (A=T (A=T 
R7000 to to to to to to 

(fig/mL) C=G) A=T) C=G) T— A) T=A) G=C) 

0 2 + 2 10 + 9 3 ± 3 4 ± 2 6 + 1 0.5 ±1 

1 7 + 6 21 + 9 8 ± 3 23 ± 15 13 ± 1 1 ± 1 
5 4 ± 3 15 ± 7 22 ± 2 68 ± 25 67 ± 14 1 + 1 

(h) Do these results show that all mutation types are repaired 
with equal fidelity? Provide a plausible explanation for your answer 

Reference 

Quillardet, R, Touati, E., & Hofnung, M. (1996) Influence of the 
z^r-dependent nucleotide excision repair on DNA adducts formation 
and mutagenic spectrum of a potent genotoxic agent: 7-methoxy-2- 
nitronaptho[2,l-6]furan (R7000) Mutat Res 358,113-122. 



The RNA of the ceil is partly in the nucleus, partly in particles in the 
cytoplasm and partly as the "soluble" RNA of the eel! sap; many 
workers have shown that all these three fractions turn over differently. 
It is very important to realize in any discussion of the role of RNA in 
I he cell thai it is very in homogeneous metabolicaJly and probably of 
more than one type. 

— Francis H. C. Crick, article in Symposia of the Society for 

Experimental Biology, 1958 
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26.1 DNA-Dependent Synthesis of RNA 1 022 

26.2 RNA Processing 1033 

26.3 RNA-Dependent Synthesis of RNA and DNA 1050 

Expression of the information in a gene generally in- 
volves production of an RNA molecule transcribed 
from a DNA template. Strands of RNA and DNA may 
seem quite similar at first glance, differing only in that 
RNA has a hydroxyl group at the 2' position of the al- 
dopentose, and uracil instead of thymine. However, un- 
like DNA, most RNAs carry out their functions as single 
strands, strands that fold back on themselves and have 
the potential for much greater structural diversity than 
DNA (Chapter 8) . RNA is thus suited to a variety of cel- 
lular functions. 

RNA is the only macromolecule known to have a 
role both in the storage and transmission of information 
and in catalysis, which has led to much speculation 
about its possible role as an essential chemical interme- 
diate in the development of life on this planet. The dis- 
covery of catalytic RNAs, or ribozymes, has changed the 
very definition of an enzyme, extending it beyond the 
domain of proteins. Proteins nevertheless remain essen- 
tial to RNA and its functions. In the modern cell, all 
nucleic acids, including RNAs, are complexed with pro- 
teins. Some of these complexes are quite elaborate, and 
RNA can assume both structural and catalytic roles 
within complicated biochemical machines. 

All RNA molecules except the RNA genomes of cer- 
tain viruses are derived from information permanently 
stored in DNA. During transcription, an enzyme sys- 
tem converts the genetic information in a segment of 
double-stranded DNA into an RNA strand with a base 



sequence complementary to one of the DNA strands. 
Three major kinds of RNA are produced. Messenger 
RNAs (mRNAs) encode the amino acid sequence of 
one or more polypeptides specified by a gene or set of 
genes. Transfer RNAs (tRNAs) read the information 
encoded in the mRNA and transfer the appropriate 
amino acid to a growing polypeptide chain during pro- 
tein synthesis. Ribosomal RNAs (rRNAs) are con- 
stituents of ribosomes, the intricate cellular machines 
that synthesize proteins. Many additional specialized 
RNAs have regulatory or catalytic functions or are pre- 
cursors to the three main classes of RNA. These special- 
function RNAs are no longer thought of as minor species 
in the catalog of cellular RNAs. In vertebrates, RNAs 
that do not fit into one of the classical categories 
(mRNA, tRNA, rRNA) seem to vastly outnumber those 
that do. 

During replication the entire chromosome is usually 
copied, but transcription is more selective. Only partic- 
ular genes or groups of genes are transcribed at any one 
time, and some portions of the DNA genome are never 
transcribed. The cell restricts the expression of genetic 
information to the formation of gene products needed at 
any particular moment. Specific regulatory sequences 
mark the beginning and end of the DNA segments to be 
transcribed and designate which strand in duplex DNA 
is to be used as the template. The transcript itself may 
interact with other RNA molecules as part of the overall 
regulatory program. The regulation of transcription is 
described in detail in Chapter 28. 

The sum of all the RNA molecules produced in a cell 
under a given set of conditions is called the cellular 
transcriptome. Given the relatively small fraction of 
the human genome devoted to protein-encoding genes, 
we might have expected that only a small part of the hu- 
man genome is transcribed. This is not the case. Modern 
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microarray analysis of transcription patterns has re- 
vealed that much of the genome of humans and other 
mammals is transcribed into RNA. The products are pre- 
dominantly not mRNAs, tRNAs, or rRNAs, but rather 
special-function RNAs, a host of which are being discov- 
ered. Many of these seem to be involved in regulation of 
gene expression; however, the rapid pace of discovery 
has forced the realization that we do not yet know what 
many of these RNAs do. 

In this chapter we examine the synthesis of RNA on 
a DNA template and the postsynthetic processing and 
turnover of RNA molecules. In doing so we encounter 
many of the specialized functions of RNA, including cat- 
alytic functions. Interestingly, the substrates for RNA 
enzymes are often other RNA molecules. We also de- 
scribe systems in which RNA is the template and DNA 
the product, rather than vice versa. The information 
pathways thus come full circle, and reveal that template- 
dependent nucleic acid synthesis has standard rules 
regardless of the nature of template or product (RNA 
or DNA). This examination of the biological intercon- 
version of DNA and RNA as information carriers leads 
to a discussion of the evolutionary origin of biological 
information. 

26.1 DNA-Dependent Synthesis of RNA 

Our discussion of RNA synthesis begins with a compari- 
son between transcription and DNA replication (Chap- 
ter 25). Transcription resembles replication in its 
fundamental chemical mechanism, its polarity (direc- 
tion of synthesis), and its use of a template. And like 
replication, transcription has initiation, elongation, and 
termination phases — though in the literature on tran- 
scription, initiation is further divided into discrete 
phases of DNA binding and initiation of RNA synthesis. 
Transcription differs from replication in that it does not 
require a primer and, generally, involves only limited 
segments of a DNA molecule. Additionally, within tran- 
scribed segments only one DNA strand serves as a tem- 
plate for a particular RNA molecule. 

RNA Is Synthesized by RNA Polymerases 

The discovery of DNA polymerase and its dependence 
on a DNA template spurred a search for an enzyme 
that synthesizes RNA complementary to a DNA strand. 
By 1960, four research groups had independently de- 
tected an enzyme in cellular extracts that could form 
an RNA polymer from ribonucleoside 5 '-triphosphates. 
Subsequent work on the purified Escherichia coli 
RNA polymerase helped to define the fundamental 
properties of transcription (Fig. 26-1). DNA-de- 
pendent RNA polymerase requires, in addition to a 
DNA template, all four ribonucleoside 5 '-triphosphates 
(ATP, GTP, UTP, and CTP) as precursors of the nu- 
cleotide units of RNA, as well as Mg 2+ . The protein also 



binds one Zn 2+ . The chemistry and mechanism of RNA 
synthesis closely resemble those used by DNA poly- 
merases (see Fig. 25-5). RNA polymerase elongates an 
RNA strand by adding ribonucleotide units to the 3'- 
hydroxyl end, building RNA in the 5'— >3' direction. 
The 3'-hydroxyl group acts as a nucleophile, attacking 
the a phosphate of the incoming ribonucleoside 
triphosphate (Fig. 26-lb) and releasing pyrophos- 
phate. The overall reaction is 

(NMP)„ + NTP > (NMP) n + 1 + PPi 

RNA Lengthened RNA 

RNA polymerase requires DNA for activity and is most 
active when bound to a double-stranded DNA. As noted 
above, only one of the two DNA strands serves as a tem- 
plate. The template DNA strand is copied in the 3'— >5' 
direction (antiparallel to the new RNA strand), just as 
in DNA replication. Each nucleotide in the newly 
formed RNA is selected by Watson-Crick base-pairing 
interactions; U residues are inserted in the RNA to pair 
with A residues in the DNA template, G residues are 
inserted to pair with C residues, and so on. Base-pair 
geometry (see Fig. 25-6) may also play a role in base 
selection. 

Unlike DNA polymerase, RNA polymerase does not 
require a primer to initiate synthesis. Initiation occurs 
when RNA polymerase binds at specific DNA sequences 
called promoters (described below). The 5 '-triphosphate 
group of the first residue in a nascent (newly formed) 
RNA molecule is not cleaved to release PP i; but instead 
remains intact throughout the transcription process. 
During the elongation phase of transcription, the grow- 
ing end of the new RNA strand base-pairs temporarily 
with the DNA template to form a short hybrid RNA-DNA 
double helix, estimated to be 8 bp long (Fig. 26-la). The 
RNA in this hybrid duplex "peels off shortly after its 
formation, and the DNA duplex reforms. 

To enable RNA polymerase to synthesize an RNA 
strand complementary to one of the DNA strands, the 
DNA duplex must unwind over a short distance, form- 
ing a transcription "bubble." During transcription, the 
E. coli RNA polymerase generally keeps about 17 bp 
unwound. The 8 bp RNA-DNA hybrid occurs in this 
unwound region. Elongation of a transcript by E. coli 
RNA polymerase proceeds at a rate of 50 to 90 nu- 
cleotides/s. Because DNA is a helix, movement of a 
transcription bubble requires considerable strand ro- 
tation of the nucleic acid molecules. DNA strand rota- 
tion is restricted in most DNAs by DNA-binding 
proteins and other structural barriers. As a result, a 
moving RNA polymerase generates waves of positive 
supercoils ahead of the transcription bubble and neg- 
ative supercoils behind (Fig. 26-1 c). This has been ob- 
served both in vitro and in vivo (in bacteria). In the 
cell, the topological problems caused by transcription 
are relieved through the action of topoisomerases 
(Chapter 24). 
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MECHANISM FIGURE 26-1 Transcription by RNA polymerase in E. coll 

For synthesis of an RNA strand complementary to one of two DNA 
strands in a double helix, the DNA is transiently unwound, (a) About 1 7 
bp are unwound at any given time. RNA polymerase and the transcrip- 
tion bubble move from left to right along the DNA as shown, facilitating 
RNA synthesis. The DNA is unwound ahead and rewound behind as 
RNA is transcribed. Red arrows show the direction in which the DNA 
must rotate to permit this process. As the DNA is rewound, the RNA-DNA 
hybrid is displaced and the RNA strand extruded The RNA polymerase is 
in close contact with the DNA ahead of the transcription bubble, as well 
as with the separated DNA strands and the RNA within and immediately 
behind the bubble. A channel in the protein funnels new nucleoside 
triphosphates (NTPs) to the polymerase active site. The polymerase foot- 
print encompasses about 35 bp of DNA during elongation. 
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(b) Catalytic mechanism of RNA synthesis by RNA polymerase. 
Note that this is essentially the same mechanism used by DNA poly- 
merases (see Fig 25-5b). The reaction involves two Mg 2+ ions, coor- 
dinated to the phosphate groups of the incoming NTP and to three Asp 
residues (Asp 460 , Asp 462 , and Asp 464 in the jS' subunit of the E. coli 
RNA polymerase), which are highly conserved in the RNA poly- 
merases of all species. One Mg 2+ ion facilitates attack by the 3'-hy- 
droxyl group on the a phosphate of the NTP; the other Mg 2+ ion 
facilitates displacement of the pyrophosphate; and both metal ions 
stabilize the pentacovalent transition state. 

(c) Changes in the supercoiling of DNA brought about by tran- 
scription. Movement of an RNA polymerase along DNA tends to cre- 
ate positive supercoils (overwound DNA) ahead of the transcription 
bubble and negative supercoils (underwound DNA) behind it. In a cell, 
topoisomerases rapidly eliminate the positive supercoils and regulate 
the level of negative supercoiling (Chapter 24). 
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FIGURE 26-2 Template and nontemplate (coding) DNA strands. 

The two complementary strands of DNA are defined by their 
function in transcription. The RNA transcript is synthesized on the 
template strand and is identical in sequence (with U in place of T) 
to the nontemplate strand, or coding strand. 



(5') CGCTATAGCGTTT(3') DNA nontemplate (coding) strand 
(3') GCGAT ATCGC AAA(5') DNA template strand 

(5 r ) CGCUAUAGCGUUUO') RNA transcript 
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FIGURE 26-3 Organization of coding information in the adenovirus 
genome. The genetic information of the adenovirus genome (a conve- 
niently simple example) is encoded by a double-stranded DNA mole- 
cule of 36,000 bp, both strands of which encode proteins. The 
information for most proteins is encoded by (that is, identical to) the top 
strand— by convention, the strand oriented 5' to 3' from left to right. The 
bottom strand acts as template for these transcripts. However, a few 



proteins are encoded by the bottom strand, which is transcribed in the 
opposite direction (and uses the top strand as template). Synthesis of 
mRNAs in adenovirus is actually much more complex than shown 
here. Many of the mRNAs shown for the upper strand are initially syn- 
thesized as a single, long transcript (25,000 nucleotides), which is then 
extensively processed to produce the separate mRNAs. Adenovirus 
causes upper respiratory tract infections in some vertebrates. 



KEY CONVENTION: The two complementary DNA strands 
have different roles in transcription. The strand that 
serves as template for RNA synthesis is called the tem- 
plate strand. The DNA strand complementary to the 
template, the nontemplate strand, or coding strand, 
is identical in base sequence to the RNA transcribed 
from the gene, with U in the RNA in place of T in the 
DNA (Fig. 26-2). The coding strand for a particular 
gene may be located in either strand of a given chromo- 
some (as shown in Fig. 26-3 for a virus). By conven- 
tion, the regulatory sequences that control transcription 
(described later in this chapter) are designated by the 
sequences in the coding strand. ■ 

The DNA-dependent RNA polymerase ofE. coli is a 
large, complex enzyme with five core subunits (a 2 /3j3'o>; 
M T 390,000) and a sixth subunit, one of a group desig- 
nated cr, with variants designated by size (molecular 
weight). The a subunit binds transiently to the core and 
directs the enzyme to specific binding sites on the DNA 
(described below). These six subunits constitute the 
RNA polymerase holoenzyme (Fig. 26-4). The RNA 
polymerase holoenzyme of E. coli thus exists in several 
forms, depending on the type of a subunit. The most 
common subunit is a 70 (M r 70,000), and the upcoming 
discussion focuses on the corresponding RNA poly- 
merase holoenzyme. 

RNA polymerases lack a separate proofreading 3 >5' 
exonuclease active site (such as that of many DNA poly- 
merases), and the error rate for transcription is higher 
than that for chromosomal DNA replication — approxi- 
mately one error for every 10 4 to 10 5 ribonucleotides in- 
corporated into RNA. Because many copies of an RNA are 
generally produced from a single gene and all RNAs are 



eventually degraded and replaced, a mistake in an RNA 
molecule is of less consequence to the cell than a mistake 
in the permanent information stored in DNA. Many RNA 
polymerases, including bacterial RNA polymerase and the 
eukaryotic RNA polymerase II (discussed below), do 
pause when a mispaired base is added during transcrip- 
tion, and they can remove mismatched nucleotides from 
the 3' end of a transcript by direct reversal of the poly- 
merase reaction. But we do not yet know whether this ac- 
tivity is a true proofreading function and to what extent it 
may contribute to the fidelity of transcription. 




FIGURE 26-4 Structure of the RNA polymerase holoenzyme of the 
bacterium Thermus aquaticus. (Derived from PDB ID 1 1W7) The over- 
all structure of this enzyme is very similar to that of the E. coli RNA 
polymerase; no DNA or RNA is shown here. The /3 subunit is in gray, 
the jS' subunit is white; the two a subunits are different shades of red; 
the (o subunit is yellow; the a subunit is orange. The image on the left 
is oriented as in Figure 26-6. When the structure is rotated 1 80° about 
the y axis (right) the small o> subunit is visible. 



26.1 DNA-Dependent Synthesis of RNA 1025 



RNA Synthesis Begins at Promoters 

Initiation of RNA synthesis at random points in a DNA 
molecule would be an extraordinarily wasteful process. In- 
stead, an RNA polymerase binds to specific sequences in 
the DNA called promoters, which direct the transcrip- 
tion of adjacent segments of DNA (genes). The sequences 
where RNA polymerases bind can be quite variable, and 
much research has focused on identifying the particular 
sequences that are critical to promoter function. 

In E. coli, RNA polymerase binding occurs within a 
region stretching from about 70 bp before the transcrip- 
tion start site to about 30 bp beyond it. By convention, 
the DNA base pairs that correspond to the beginning of 
an RNA molecule are given positive numbers, and those 
preceding the RNA start site are given negative num- 
bers. The promoter region thus extends between posi- 
tions — 70 and +30. Analyses and comparisons of the 
most common class of bacterial promoters (those recog- 
nized by an RNA polymerase holoenzyme containing 
<t 70 ) have revealed similarities in two short sequences 
centered about positions -10 and -35 (Fig. 26-5). 
These sequences are important interaction sites for the 
a 70 subunit. Although the sequences are not identical 
for all bacterial promoters in this class, certain nu- 
cleotides that are particularly common at each position 
form a consensus sequence (recall the E. coli oriC 
consensus sequence; see Fig. 25-11). The consensus se- 
quence at the -10 region is (5')TATAAT(3'); the con- 
sensus sequence at the -35 region is (5')TTGACA(3'). 
A third AT-rich recognition element, called the UP 



(upstream promoter) element, occurs between posi- 
tions -40 and -60 in the promoters of certain highly 
expressed genes. The UP element is bound by the a sub- 
unit of RNA polymerase. The efficiency with which an 
RNA polymerase containing a 70 binds to a promoter and 
initiates transcription is determined in large measure by 
these sequences, the spacing between them, and their 
distance from the transcription start site. 

Many independent lines of evidence attest to the func- 
tional importance of the sequences in the -35 and - 10 re- 
gions. Mutations that affect the function of a given 
promoter often involve a base pair in these regions. Varia- 
tions in the consensus sequence also affect the efficiency 
of RNA polymerase binding and transcription initiation. A 
change in only one base pair can decrease the rate of bind- 
ing by several orders of magnitude. The promoter se- 
quence thus establishes a basal level of expression that can 
vary greatly from one E. coli gene to the next. A method 
that provides information about the interaction between 
RNA polymerase and promoters is illustrated in Box 26-1. 

The pathway of transcription initiation and the fate 
of the a subunit are becoming much better defined 
(Fig. 26-(>a). The pathway consists of two major parts, 
binding and initiation, each with multiple steps. First, the 
polymerase, directed by its bound a factor, binds to the 
promoter. A closed complex (in which the bound DNA is 
intact) and an open complex (in which the bound DNA 
is intact and partially unwound near the -10 sequence) 
form in succession. Second, transcription is initiated 
within the complex, leading to a conformational change 
that converts the complex to the elongation form, 
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FIGURE 26-5 Typical E. coli promoters recognized by an RNA poly- 
merase holoenzyme containing a 70 . Sequences of the nontemplate 
strand are shown, read in the 5'— >3' direction, as is the convention for 
representations of this kind The sequences vary from one promoter to the 
next, but comparisons of many promoters reveal similarities, particularly 
in the —10 and -35 regions. The sequence element UP, not present in all 
E. coli promoters, is shown in the P1 promoter for the highly expressed 



rRNA gene rrnB. UP elements, generally occurring in the region between 
-40 and -60, strongly stimulate transcription at the promoters that con- 
tain them. The UP element in the rrnB P1 promoter encompasses the re- 
gion between -38 and -59. The consensus sequence for E, coli 
promoters recognized by a 70 is shown second from the top. Spacer re- 
gions contain slightly variable numbers of nucleotides (N). Only the first 
nucleotide coding the RNA transcript (at position + 1 ) is shown. 
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BOX 26- 



METHODS 



RNA Polymerase Leaves Its Footprint on a Promoter 



Footprinting, a technique derived from principles 
used in DNA sequencing, identifies the DNA sequences 
bound by a particular protein. Researchers isolate a 
DNA fragment thought to contain sequences recognized 
by a DNA-binding protein and radiolabel one end of one 
strand (Fig. 1). They then use chemical or enzymatic 
reagents to introduce random breaks in the DNA frag- 
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ment (averaging about one per molecule). Separation of 
the labeled cleavage products (broken fragments of var- 
ious lengths) by high-resolution electrophoresis pro- 
duces a ladder of radioactive bands. In a separate tube, 
the cleavage procedure is repeated on copies of the 
same DNA fragment in the presence of the DNA-binding 
protein. The researchers then subject the two sets of 
cleavage products to electrophoresis and 
compare them side by side. A gap ("foot- 
print") in the series of radioactive bands 
derived from the DNA-protein sample, at- 
tributable to protection of the DNA by the 
bound protein, identifies the sequences 
that the protein binds. 

The precise location of the protein- 
binding site can be determined by directly 
sequencing (see Fig. 8-34) copies of the 
same DNA fragment and including the se- 
quencing lanes (not shown here) on the 
same gel with the footprint. Figure 2 shows 
footprinting results for the binding of RNA 
polymerase to a DNA fragment containing 
a promoter. The polymerase covers 60 to 
80 bp; protection by the bound enzyme 
includes the -10 and -35 regions. 
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- + C 



Separate fragments by polyacrylamide gel electrophoresis 
and visualize radiolabeled bands on x-ray film. 
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FIGURE 1 Footprint analysis of the RNA polymerase-binding site on a 
DNA fragment. Separate experiments are carried out in the presence 
(+) and absence (-) of the polymerase. 




Regions bound by 
RNA polymerase 



-10 
-20 

-30 
-40 



~t--50 



FIGURE 2 Footprinting results of RNA polymerase 
binding to the lac promoter (see Fig. 26-5). In this 
experiment, the 5' end of the nontemplate strand 
was radioactively labeled. Lane C is a control in 
which the labeled DNA fragments were cleaved 
with a chemical reagent that produces a more 
uniform banding pattern . 
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followed by movement of the transcription complex 
away from the promoter (promoter clearance). Any of 
these steps can be affected by the specific makeup of 
the promoter sequences. The a subunit dissociates sto- 



chastically (at random) as the polymerase enters the 
elongation phase of transcription. A model for RNA poly- 
merase engaged in elongation is shown in Figure 26-6b. 
The protein NusA (M T 54,430) binds to the elongating 
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FIGURE 26-6 Transcription initiation and elongation by E. coli RNA 
polymerase, (a) Initiation of transcription requires several steps generally 
divided into two phases, binding and initiation. In the binding phase, the 
initial interaction of the RNA polymerase with the promoter leads to for- 
mation of a closed complex, in which the promoter DNA is stably bound 
but not unwound. A 12 to 15 bp region of DNA — from within the -10 
region to position +2 or +3 — is then unwound to form an open com- 
plex. Additional intermediates (not shown) have been detected in the 
pathways leading to the closed and open complexes, along with several 
changes in protein conformation. The initiation phase encompasses tran- 
scription initiation and promoter clearance. Once elongation com- 
mences, the <t subunit is released and the polymerase leaves the 
promoter and becomes committed to elongation of the RNA. 






(b) The RNA core polymerase of E. coli in the elongation phase. 
Subunit coloring matches Figure 26^: the j8 and jS' subunits are light 
gray and white; the a subunits, shades of red. The a> subunit is not vis- 
ible in this view. The a subunit is absent, having dissociated after the 
initiation steps. The top panel shows the entire complex, with DNA 
and RNA. The active site for transcription is in a cleft between the (3 
and p' subunits. In the middle panel, the (3 subunit has been removed, 
exposing the active site and the DNA-RNA hybrid region. The active 
site is marked in part by a Mg 2+ ion (red). In the bottom panel, all the 
protein has been removed to reveal the circuitous path taken by the 
DNA and RNA through the complex. 
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RNA polymerase, competitively with the a subunit. 
Once transcription is complete, NusA dissociates from 
the enzyme, the RNA polymerase dissociates from the 
DNA, and a a factor (<x 70 or another ) can now bind to 
the enzyme to initiate transcription, in a cycle some- 
times called the <r cycle (Fig. 26-7). 

E coli has other classes of promoters, bound by 
RNA polymerase holoenzymes with different a subunits 
(Table 26-1), An example is the promoters of the heat 
shock genes. The products of this set of genes are made 
at higher levels when the cell has received an insult, 
such as a sudden increase in temperature. RNA poly- 
merase binds to the promoters of these genes only when 
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a 70 is replaced with the a 32 (M r 32,000) subunit, which 
is specific for the heat shock promoters (see Fig. 28-3). 
By using different a subunits the cell can coordinate the 
expression of sets of genes, permitting major changes in 
cell physiology. Which sets of genes are expressed is de- 
termined by the availability of the various a subunits, 
which is determined by several factors: regulated rates of 
synthesis and degradation, postsynthetic modifications 
that switch individual a subunits between active and 
inactive forms, and a specialized class of anti-cr proteins, 
each type binding to and sequestering a particular a sub- 
unit (rendering it unavailable for transcription initiation) . 

Transcription Is Regulated at Several Levels 

Requirements for any gene product vary with cellular 
conditions or developmental stage, and transcription of 
each gene is carefully regulated to form gene products 
only in the proportions needed. Regulation can occur at 
any step in transcription, including elongation and ter- 
mination. However, much of the regulation is directed at 
the polymerase binding and transcription initiation 
steps outlined in Figure 26-6a. Differences in promoter 
sequences are just one of several levels of control. 

The binding of proteins to sequences both near to 
and distant from the promoter can also affect levels of 
gene expression. Protein binding can activate transcrip- 
tion by facilitating either RNA polymerase binding or 
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FIGURE 26-7 The a- cycle. RNA polymerase, guided by a bound a sub- 
unit, binds to DNA at a promoter sequence. Once RNA synthesis is ini- 
tiated, the a subunit dissociates stochastically and is replaced by 
NusA, When RNA polymerase reaches a terminator sequence, RNA 



synthesis halts, NusA dissociates from the polymerase, and the RNA 
polymerase dissociates from the DNA. The free polymerase can, in 
principle, bind any cr subunit. The type bound determines the promoter 
to which the RNA polymerase will bind in the next round of synthesis. 
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The Seven a Su bunits of Escherichia coli 
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Source: Adapted from Maeda, H., Fujita, N., & Ishihama, A. (2000) Nucleic. Acids Res. 28, 3500. 

Note: a factors are widely distributed in bacteria; the number varies from a single cr factor in Mycoplasma genitalium to 63 distinct cr factors in Streptomyces coelicoior. 

* Approximate number of each a subunit per cell and the fraction of RNA polymerase holoenzyme complexed with each a subunit during exponential growth. The numbers change 
as growth conditions change. The fraction of RNA polymerase complexed with each cr subunit reflects both the amount of the particular subunit and its affinity for the enzyme. 



steps further along in the initiation process, or it can 
repress transcription by blocking the activity of the 
polymerase. In E r coli, one protein that activates tran- 
scription is the cAMP receptor protein (CRP), which 
increases the transcription of genes coding for enzymes 
that metabolize sugars other than glucose when cells are 
grown in the absence of glucose. Repressors are pro- 
teins that block the synthesis of RNA at specific genes. 
In the case of the Lac repressor (Chapter 28), transcrip- 
tion of the genes for the enzymes of lactose metabolism 
is blocked when lactose is unavailable. 

Transcription is the first step in the complicated and 
energy-intensive pathway of protein synthesis, so much 
of the regulation of protein levels in both bacterial and 
eukaryotic cells is directed at transcription, particularly 
its early stages. In Chapter 28 we describe many mecha- 
nisms by which this regulation is accomplished. 

Specific Sequences Signal Termination of RNA Synthesis 

RNA synthesis is processive (that is, the RNA poly- 
merase has high processivity; p. 980) — necessarily so, 
because if an RNA polymerase released an RNA tran- 
script prematurely, it could not resume synthesis of the 
same RNA but instead would have to start again. How- 
ever, an encounter with certain DNA sequences results 
in a pause in RNA synthesis, and at some of these se- 
quences transcription is terminated. The process of ter- 
mination is not yet well understood in eukaryotes, so 
our focus is again on bacteria. E, coli has at least two 
classes of termination signals: one class relies on a pro- 
tein factor called p (rho) and the other is p-independent. 

Most p-independent terminators have two distin- 
guishing features. The first is a region that produces an 
RNA transcript with self-complementary sequences, 
permitting the formation of a hairpin structure (see 
Fig. 8- 19a) centered 15 to 20 nucleotides before the 
projected end of the RNA strand. The second feature is 
a highly conserved string of three A residues in the 
template strand that are transcribed into U residues 
near the 3' end of the hairpin. When a polymerase 



arrives at a termination site with this structure, it 
pauses (Fig. 26-8). Formation of the hairpin struc- 
ture in the RNA disrupts several A=U base pairs in the 
RNA-DNA hybrid segment and may disrupt important 
interactions between RNA and the RNA polymerase, 
facilitating dissociation of the transcript. 



5' 




FIGURE 26-8 Model for p-independent termination of transcription 
in f. coli. RNA polymerase pauses at a variety of DNA sequences, 
some of which are terminators. One of two outcomes is then possible: 
the polymerase bypasses the site and continues on its way, or the com- 
plex undergoes a conformational change (isomerization). In the latter 
case, intramolecular pairing of complementary sequences in the newly 
formed RNA transcript may form a hairpin that disrupts the RNA-DNA 
hybrid and/or the interactions between RNA and polymerase, resulting 
in isomerization. An A=U hybrid region at the 3' end of the new 
transcript is relatively unstable, and the RNA dissociates completely, 
leading to termination and dissociation of the RNA molecule. This is 
the usual outcome at terminators. At other pause sites, the complex 
may escape after the isomerization step to continue RNA synthesis. 
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The p-dependent terminators lack the sequence of 
repeated A residues in the template strand but usually 
include a CA-rich sequence called a rut (rho -utilization) 
element. The p protein associates with the RNA at spe- 
cific binding sites and migrates in the 5'->3' direction 
until it reaches the transcription complex that is paused 
at a termination site. Here it contributes to release of the 
RNA transcript. The p protein has an ATP-dependent 
RNA-DNA helicase activity that promotes translocation 
of the protein along the RNA, and ATP is hydrolyzed by 
p protein during the termination process. The detailed 
mechanism by which the protein promotes the release 
of the RNA transcript is not known, 

Eukaryotic Cells Have Three Kinds of 
Nuclear RNA Polymerases 

The transcriptional machinery in the nucleus of a 
eukaryotic cell is much more complex than that in bacte- 
ria. Eukaryotes have three RNA polymerases, designated 
I, II, and III, which are distinct complexes but have certain 
subunits in common. Each polymerase has a specific 
function and is recruited to a specific promoter sequence. 

RNA polymerase I (Pol I) is responsible for the 
synthesis of only one type of RNA, a transcript called pre- 
ribosomal RNA (or pre-rRNA), which contains the precur- 
sor for the 18S, 5.8S, and 28S rRNAs (see Fig. 26-25). 
Pol I promoters vary greatly in sequence from one 
species to another. The principal function of RNA poly- 
merase II (Pol II) is synthesis of mRNAs and some spe- 
cialized RNAs. This enzyme can recognize thousands of 
promoters that vary greatly in sequence. Many Pol II 
promoters have a few sequence features in common, 
including a TATA box (eukaryotic consensus sequence 
TATAAA) near base pair -30 and an Inr sequence (ini- 
tiator) near the RNA start site at +1 (Fig. 26-9). 

RNA polymerase III (Pol III) makes tRNAs, the 5S 
rRNA, and some other small specialized RNAs. The pro- 
moters recognized by Pol III are well characterized. In- 
terestingly, some of the sequences required for the 
regulated initiation of transcription by Pol III are lo- 
cated within the gene itself, whereas others are in more 



conventional locations upstream of the RNA start site 
(Chapter 28). 

RNA Polymerase li Requires Many Other Protein 
Factors for Its Activity 

RNA polymerase II is central to eukaryotic gene expres- 
sion and has been studied extensively. Although this 
polymerase is strikingly more complex than its bacterial 
counterpart, the complexity masks a remarkable con- 
servation of structure, function, and mechanism. Pol II 
isolated from yeast is a huge enzyme with 12 subunits. 
The largest subunit (RBP1) exhibits a high degree of 
homology to the fi' subunit of bacterial RNA poly- 
merase. Another subunit (RBP2) is structurally similar 
to the bacterial /3 subunit, and two others (RBP3 and 
RBP1 1) show some structural homology to the two bac- 
terial a subunits. Pol II must function with genomes that 
are more complex and with DNA molecules more elabo- 
rately packaged than in bacteria. The need for protein- 
protein contacts with the numerous other protein 
factors required to navigate this labyrinth accounts in 
large measure for the added complexity of the eukary- 
otic polymerase. 

The largest subunit of Pol II also has an unusual fea- 
ture, a long carboxyl-terminal tail consisting of many re- 
peats of a consensus heptad amino acid sequence 
-YSPTSPS-. There are 27 repeats in the yeast enzyme 
(18 exactly matching the consensus) and 52 (21 exact) in 
the mouse and human enzymes. This carboxyl-terminal 
domain (CTD) is separated from the main body of the en- 
zyme by an unstructured linker sequence. The CTD has 
many important roles in Pol II function, as outlined below 

RNA polymerase II requires an array of other pro- 
teins, called transcription factors, in order to form the 
active transcription complex. The general transcrip- 
tion factors required at every Pol II promoter (factors 
usually designated TFII with an additional identifier) are 
highly conserved in all eukaryotes (Table 26-2). The 
process of transcription by Pol II can be described in 
terms of several phases — assembly, initiation, elonga- 
tion, termination — each associated with characteristic 
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FIGURE 26-9 Common sequences in promoters recognized by eu- 
karyotic RNA polymerase II. The TATA box is the major assembly point 
for the proteins of the preinitiation complexes of Pol II. The DNA is 
unwound at the initiator sequence (Inr), and the transcription start site 
is usually within or very near this sequence. In the Inr consensus 
sequence shown here, N represents any nucleotide; Y, a pyrimidine 
nucleotide. Many additional sequences serve as binding sites for a wide 
variety of proteins that affect the activity of Pol II. These sequences are 
important in regulating Pol 11 promoters and vary greatly in type and 



Inr 



number, and in general the eukaryotic promoter is much more com- 
plex than suggested here (see Fig. 15-23). Many of the sequences are 
located within a few hundred base pairs of the TATA box on the 5' side; 
others may be thousands of base pairs away. The sequence elements 
summarized here are more variable among the Pol II promoters of eu- 
karyotes than among the E. coli promoters (see Fig. 26-5). Many Pol 11 
promoters lack a TATA box or a consensus Inr element or both. Addi- 
tional sequences around the TATA box and downstream (to the right as 
drawn) of Inr may be recognized by one or more transcription factors. 
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T i6— 0 Transcription at RNA polymerase II promoters, (a) The 

sequential assembly of TBP (often with TFIIA), TFIIB, TFIIF plus Pol II, 
TFIIE, and TFIIH results in a closed complex. Within the complex, the 
DNA is unwound at the Inr region by the helicase activity of TFIIH and 
perhaps of TFIIE, creating an open complex. The carboxyl-terminal 
domain of the largest Pol II subunit is phosphorylated by TFIIH, and 
the polymerase then escapes the promoter and begins transcription, 
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Elongation is accompanied by the release of many transcription factors 
and is also enhanced by elongation factors (see Table 26-2). After termi- 
nation, Pol II is released, dephosphorylated, and recycled, (b) Human 
TBP (gray) bound to DNA (blue and white) (PDB ID 1TGH). (c) A cut- 
away view of transcription elongation promoted by the Pol II core 
enzyme. The drawing is based on the structure featured on the cover of 
this book. 



proteins ( pig, 26-10). The step-by-step pathway de- 
scribed below leads to active transcription in vitro. In the 
cell, many of the proteins may be present in larger, pre- 
assembled complexes, simplifying the pathways for as- 
sembly on promoters. As you read about this process, 
consult Figure 26-10 and Table 26-2 to help keep track 
of the many participants. 

Assembly of RNA Polymerase and Transcription 
Factors at a Promoter The formation of a closed 
complex begins when the TATA-binding protein (TBP) 



binds to the TATA box (Fig. 26-10b). TBP is bound in 
turn by the transcription factor TFIIB, which also binds 
to DNA on either side of TBP. TFIIA binding, although 
not always essential, can stabilize the TFIIB-TBP com- 
plex on the DNA and can be important at nonconsensus 
promoters where TBP binding is relatively weak. The 
TFIIB-TBP complex is next bound by another complex 
consisting of TFIIF and Pol II. TFIIF helps target Pol II to 
its promoters, both by interacting with TFIIB and by re- 
ducing the binding of the polymerase to nonspecific sites 
on the DNA. Finally, TFIIE and TFIIH bind to create the 
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^^^^^3| Proteins Required for Initiation of Transcription at the RNA Polymerase II (Pol II) Promoters of Eukaryotes 
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protein 


subunits 


Subunit(s) M T 


Function(s) 


Initiation 








Pol II 


12 


10,000-220,000 


Catalyzes RNA synthesis 


TBP (TATA-binding protein) 


1 


38,000 


Specifically recognizes the TATA box 


TFIIA 


3 


12,000, 19,000, 35,000 


Stabilizes binding of TFIIB and TBP to the promoter 


TFIIB 


1 


35,000 


Binds to TBP; recruits Pol II-TFIIF complex 


TFIIE 


2 


34,000, 57,000 


Recruits TFIIH; has ATPase and helicase activities 


TFIIF 


2 


30,000, 74,000 


Binds tightly to Pol II; binds to TFIIB and prevents 
binding of Pol II to nonspecific DNA sequences 


TFIIH 


12 


35,000-89,000 


Unwinds DNA at promoter (helicase activity); 
phosphorylates Pol II (within the CTD); 
recruits nucleotide-excision repair proteins 


Elongation* 








ELL T 


1 


80,000 




pTEFb 


2 


43,000, 124,000 


Phosphorylates Pol II (within the CTD) 


SII (TFIIS) 


1 


38,000 




Elongin (SIII) 


3 


15,000, 18,000, 110,000 





*The function of all elongation factors is to suppress the pausing or arrest of transcription by the Pol II-TFIIF complex. 

t Name derived from eleven-nineteen /ysine-rich /eukemia.The gene for ELL is the site of chromosomal recombination events frequently associated with acute myeloid leukemia. 



closed complex. TFIIH has DNA helicase activity that 
promotes the unwinding of DNA near the RNA start site 
(a process requiring the hydrolysis of ATP), thereby cre- 
ating an open complex. Counting all the subunits of the 
various essential factors (excluding TFIIA), this minimal 
active assembly has more than 30 polypeptides. Struc- 
tural studies by Roger Romberg and his collaborators 
has provided a more detailed look at the core structure 
of RNA polymerase II during elongation (Fig. 26-10c). 

RNA Strand Initiation and Promoter Clearance 

TFIIH has an additional function during the initiation 
phase. A kinase activity in one of its subunits phospho- 
rylates Pol II at many places in the CTD (Fig. 26-10a). 
Several other protein kinases, including CDK9 (cyclin- 
dependent kinase 9), which is part of the complex 
pTEFb (positive franscription elongation/actor 6), also 
phosphorylate the CTD, primarily on the Ser residues of 
the CTD repeat sequence. This causes a conformational 
change in the overall complex, initiating transcription. 
Phosphorylation of the CTD is also important during the 
subsequent elongation phase, with the phosphorylation 
state of the CTD changing as transcription proceeds. 
The changes affect the interactions between the tran- 
scription complex and other proteins and enzymes, such 
that different sets of proteins are bound at initiation 
than at later stages. Some of these proteins are involved 
in processing the transcript (as described below). 

During synthesis of the initial 60 to 70 nucleotides 
of RNA, first TFIIE and then TFIIH is released, and Pol 
II enters the elongation phase of transcription. 



Elongation, Termination, and Release TFIIF re- 
mains associated with Pol II throughout elongation. Dur- 
ing this stage, the activity of the polymerase is greatly 
enhanced by proteins called elongation factors (Table 
26-2). The elongation factors, some bound to the phos- 
phorylated CTD, suppress pausing during transcription 
and also coordinate interactions between protein com- 
plexes involved in the posttranscriptional processing 
of mRNAs. Once the RNA transcript is completed, 
transcription is terminated. Pol II is dephosphorylated 
and recycled, ready to initiate another transcript 
(Fig. 26-1 0a). 

Regulation of RNA Polymerase II Activity Regu- 
lation of transcription at Pol II promoters is quite elabo- 
rate. It involves the interaction of a wide variety of other 
proteins with the preinitiation complex. Some of these 
regulatory proteins interact with transcription factors, 
others with Pol II itself. The regulation of transcription 
is described in more detail in Chapter 28. 

Diverse Functions of TFIIH In eukaryotes, the re- 
pair of damaged DNA (see Table 25-5) is more efficient 
within genes that are actively being transcribed than for 
other damaged DNA, and the template strand is re- 
paired somewhat more efficiently than the nontemplate 
strand, These remarkable observations are explained by 
the alternative roles of the TFIIH subunits. Not only 
does TFIIH participate in formation of the closed 
complex during assembly of a transcription complex 
(as described above), but some of its subunits are also 



26.2 RNA Processing 




essential components of the separate nucleotide-exci- 
sion repair complex (see Fig. 25-26). 

When Pol II transcription halts at the site of a 
DNA lesion, TFIIH can interact with the lesion 
and recruit the entire nucleotide-excision repair com- 
plex. Genetic loss of certain TFIIH subunits can produce 
human diseases. Some examples are xeroderma pig- 
mentosum (see Box 25-1) and Cockayne's syndrome, 
which is characterized by arrested growth, photosensi- 
tivity, and neurological disorders. ■ 

DNA-Dependent RNA Polymerase Undergoes 
Selective Inhibition 

The elongation of RNA strands by RNA polymerase in both 
bacteria and eukaryotes is inhibited by the antibiotic acti- 
nomycin D (Fig. 26-1 1). The planar portion of this mol- 
ecule inserts (intercalates) into the double-helical DNA 
between successive G=C base pairs, deforming the DNA. 
This prevents movement of the polymerase along the tem- 
plate. Because actinomycin D inhibits RNA elongation in 
intact cells as well as in cell extracts, it is used to identify 
cell processes that depend on RNA synthesis. Acridine in- 
hibits RNA synthesis in a similar fashion (Fig. 26-11). 

Rifampicin inhibits bacterial RNA synthesis by 
binding to the /3 subunit of bacterial RNA polymerases, 
preventing the promoter clearance step of transcription 
(Fig. 26-6). It is sometimes used as an antibiotic. 

The mushroom Amanita phalloides has evolved a 
very effective defense mechanism against predators. It 
produces a-amanitin, which disrupts mRNA formation 
in animal cells by blocking Pol II and, at higher concen- 
trations, Pol III. Neither Pol I nor bacterial RNA poly- 
merase is sensitive to cn-amanitin — nor is the RNA 
polymerase II of A phalloides itself! 
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DNA-Dependent 
Synthesis of RNA 



■ Transcription is catalyzed by DNA-dependent 
RNA polymerases, which use ribonucleoside 5'- 
triphosphates to synthesize RNA complementary to 
the template strand of duplex DNA. Transcription 
occurs in several phases: binding of RNA polymerase 
to a DNA site called a promoter, initiation of 
transcript synthesis, elongation, and termination. 

■ Bacterial RNA polymerase requires a special 
subunit to recognize the promoter. As the first 
committed step in transcription, binding of RNA 
polymerase to the promoter and initiation of 
transcription are closely regulated. Transcription 
stops at sequences called terminators. 

■ Eukaryotic cells have three types of RNA 
polymerases. Binding of RNA polymerase II to its 
promoters requires an array of proteins called 
transcription factors. Elongation factors participate 
in the elongation phase of transcription. The largest 
subunit of Pol II has a long carboxyl-terminal 
domain, which is phosphorylated during the 
initiation and elongation phases. 

26.2 RNA Processing 

Many of the RNA molecules in bacteria and virtually all 
RNA molecules in eukaryotes are processed to some de- 
gree after synthesis. Some of the most interesting mo- 
lecular events in RNA metabolism occur during this 
postsynthetic processing. Intriguingly, several of the en- 
zymes that catalyze these reactions consist of RNA 
rather than protein. The discovery of these catalytic 
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FIGURE 26-11 Actinomycin D and acridine, 
inhibitors of DNA transcription, (a) The 

shaded portion of actinomycin D is planar and 
intercalates between two successive G=C 
base pairs in duplex DNA. The two cyclic pep- 
tide moieties of actinomycin D bind to the 
minor groove of the double helix. Sarcosine 
(Sar) is N-methylglycine; meVal is methylva- 
line. Acridine also acts by intercalation in 
DNA. (b) A complex of actinomycin D with 
DNA (PDB ID 1 DSC). The DNA backbone is 
shown in blue, the bases are white, the inter- 
calated part of actinomycin (shaded in (a)) is 
orange, and the remainder of the actinomycin 
is red. The DNA is bent as a result of the 
actinomycin binding. 
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RNAs, or ribozymes, has brought a revolution in think- 
ing about RNA function and about the origin of life. 

A newly synthesized RNA molecule is called a pri- 
mary transcript. Perhaps the most extensive process- 
ing of primary transcripts occurs in eukaryotic mRNAs 
and in tRNAs of both bacteria and eukaryotes. Special- 
function RNAs are also processed. 

The primary transcript for a eukaryotic mRNA typ- 
ically contains sequences encompassing one gene, al- 
though the sequences encoding the polypeptide may 
not be contiguous. Noncoding tracts that break up the 
coding region of the transcript are called introns, and 
the coding segments are called exons (see the discus- 
sion of introns and exons in DNA in Chapter 24). In a 
process called splicing, the introns are removed from 
the primary transcript and the exons are joined to form 
a continuous sequence that specifies a functional 
polypeptide. Eukaryotic mRNAs are also modified at 
each end. A modified residue called a 5' cap is added at 
the 5' end. The 3' end is cleaved, and 80 to 250 A 
residues are added to create a poly (A) "tail." The some- 
times elaborate protein complexes that carry out each 
of these three mRNA-processing reactions do not oper- 
ate independently. They seem to be organized in associ- 
ation with each other and with the phosphorylated CTD 
of Pol II; each complex affects the function of the others. 
Proteins involved in mRNA transport to the cytoplasm 
are also associated with the mRNA in the nucleus, 
and the processing of the transcript is coupled to its 
transport. In effect, a eukaryotic mRNA, as it is synthe- 
sized, is ensconced in an elaborate complex involving 
dozens of proteins. The composition of the complex 
changes as the primary transcript is processed, trans- 
ported to the cytoplasm, and delivered to the ribosome 
for translation. The associated proteins modulate all 
aspects of the function and fate of the mRNA. These 
processes are outlined in Figure 20-12 and described 
in more detail below. 



The primary transcripts of bacterial and eukaryotic 
tRNAs are processed by the removal of sequences from 
each end (cleavage) and in a few cases by the removal of 
introns (splicing). Many bases and sugars in tRNAs are 
also modified; mature tRNAs are replete with unusual 
bases not found in other nucleic acids (see Fig. 26-23). 
Many of the special-function RNAs also undergo elabo- 
rate processing, often involving the removal of segments 
from one or both ends. 

The ultimate fate of any RNA is its complete and 
regulated degradation. The rate of turnover of RNAs 
plays a critical role in determining their steady-state lev- 
els and the rate at which cells can shut down expression 
of a gene whose product is no longer needed. During the 
development of multicellular organisms, for example, 
certain proteins must be expressed at one stage only, 
and the mRNA encoding such a protein must be made 
and destroyed at the appropriate times. 

Eukaryotic mRNAs Are Capped at the 5 ' End 

Most eukaryotic mRNAs have a 5' cap, a residue of 7- 
methylguanosine linked to the 5 '-terminal residue of the 
mRNA through an unusual 5',5'-triphosphate linkage 
(Fig. 26-13). The 5' cap helps protect mRNA from ri- 
bonucleases. It also binds to a specific cap-binding com- 
plex of proteins and participates in binding of the mRNA 
to the ribosome to initiate translation (Chapter 27). 

The 5' cap is formed by condensation of a mole- 
cule of GTP with the triphosphate at the 5' end of the 
transcript. The guanine is subsequently methylated at 
N-7, and additional methyl groups are often added at 
the 2' hydroxyls of the first and second nucleotides 
adjacent to the cap (Fig. 26-13a). The methyl groups 
are derived from S-adenosylmethionine. All these re- 
actions occur very early in transcription, after the first 
20 to 30 nucleotides of the transcript have been 
added. All three of the capping enzymes, and through 



FIGURE 26-12 Formation of the primary transcript 
and its processing during maturation of mRNA in a 
eukaryotic cell. The 5' cap (red) is added before syn- 
thesis of the primary transcript is complete. A noncod- 
ing sequence (intron) following the last exon is shown 
in orange. Splicing can occur either before or after the 
cleavage and polyadenylation steps. All the processes 
shown here take place in the nucleus. 
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FIGURE 26-13 The 5' cap of mRNA. (a) 7-Methylguanosine (m 7 G) is generally found only in RNAs from vertebrate cells, (b) Generation of 

joined to the 5' end of almost all eukaryotic mRNAs in an unusual the 5' cap involves four to five separate steps (adoHcy is 5-adenosylho- 

5',5'-triphosphate linkage. Methyl groups (pink) are often found at the mocysteine), (c) Synthesis of the cap is carried out by enzymes 

T position of the first and second nucleotides. RNAs in yeast cells lack tethered to the CTD of Pol II. The cap remains tethered to the CTD 

the 2 '-methyl groups. The 2 '-methyl group on the second nucleotide is through an association with the cap-binding complex (CBC). 



them the 5' end of the transcript itself, are associated 
with the RNA polymerase II CTD until the cap is syn- 
thesized. The capped 5' end is then released from the 
capping enzymes and bound by the cap-binding com- 
plex (Fig. 26-13c). 

Both Introns and Exons Are Transcribed from 
DNA into RNA 

In bacteria, a polypeptide chain is generally encoded by 
a DNA sequence that is colinear with the amino acid se- 
quence, continuing along the DNA template without in- 
terruption until the information needed to specify the 
polypeptide is complete. However, the notion that all 
genes are continuous was disproved in 1977 when 
Phillip Sharp and Richard Roberts independently dis- 
covered that many genes for polypeptides in eukaryotes 
are interrupted by noncoding sequences (introns). 



The vast majority of genes in vertebrates contain in- 
trons; among the few exceptions are those that encode 
histones. The occurrence of introns in other eukaryotes 
varies. Many genes in the yeast Saccharomyces cere- 
visiae lack introns, although in some other yeast 
species introns are more common. Introns are also 
found in a few bacterial and archaeal genes. Introns in 
DNA are transcribed along with the rest of the gene by 
RNA polymerases. The introns in the primary RNA tran- 
script are then spliced, and the exons are joined to form 
a mature, functional RNA. In eukaryotic mRNAs, most 
exons are less than 1,000 nucleotides long, with many in 
the 100 to 200 nucleotide size range, encoding stretches 
of 30 to 60 amino acids within a longer polypeptide. In- 
trons vary in size from 50 to 20,000 nucleotides. Genes 
of higher eukaryotes, including humans, typically have 
much more DNA devoted to introns than to exons. Many 
genes have introns; some genes have dozens of them. 
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FIGURE 26-14 Transesterification re- 
action. Shown here is the first step in 
the two-step splicing of group I in- 
trons. In this example, the 3' OH of a 
guanosine molecule acts as nucle- 
ophile, attacking the phosphodiester 
linkage between U and A residues at 
an exon-intron junction of an mRNA 
molecule (see Fig. 26-15). 
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RNA Catalyzes the Splicing of Introns 

There are four classes of introns. The first two, the 
group I and group II introns, differ in the details of their 
splicing mechanisms but share one surprising character- 
istic: they are self-splicing — no protein enzymes are in- 
volved. Group I introns are found in some nuclear, 
mitochondrial, and chloroplast genes that code for 
rRNAs, mRNAs, and tRNAs. Group II introns are gener- 
ally found in the primary transcripts of mitochondrial or 
chloroplast mRNAs in fungi, algae, and plants. Group I 
and group II introns are also found among the rare exam- 
ples of introns in bacteria. Neither class requires a high- 
energy cofactor (such as ATP) for splicing. The splicing 
mechanisms in both groups involve two transesterifica- 
tion reaction steps (Fig. 26-14), in which a ribose 2'- or 



Intron 




3'-hydroxyl group makes a nucleophilic attack on a phos- 
phorus and a new phosphodiester bond is formed at the 
expense of the old, maintaining the balance of energy. 
These reactions are very similar to the DNA breaking and 
rejoining reactions promoted by topoisomerases (see Fig. 
24-21) and site-specific recombinases (see Fig. 25-40). 

The group I splicing reaction requires a guanine nu- 
cleoside or nucleotide cofactor, but the cofactor is not 
used as a source of energy; instead, the 3'-hydroxyl 
group of guanosine is used as a nucleophile in the first 
step of the splicing pathway. The guanosine 3'-hydroxyl 
group forms a normal 3 ', 5 '-phosphodiester bond with 
the 5' end of the intron (Fig. 26-15). The 3' hydroxyl 
of the exon that is displaced in this step then acts as a 
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FIGURE 26-15 Splicing mechanism of group 
I introns. The nucleophile in the first step may 
be guanosine, GMP, GDP, or GTP. The spliced intron 
is eventually degraded 
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The 2' OH of a specific adenosine 
in the intron acts as a nucleophile, 
attacking the 5 ' splice site to form 
a lariat structure. 
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The 3' OH of the 5' exon acts 
as a nucleophile, completing 
the reaction. 
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FIGURE 26-16 Splicing mechanism of 
group II introns. The chemistry is similar 
to that of group I intron splicing, except 
for the identity of the nucleophile in the 
first step and formation of a lariatlike in- 
termediate, in which one branch is a 
2 ',5 '-phosphodiester bond. 



nucleophile in a similar reaction at the 3' end of the 
intron. The result is precise excision of the intron and 
ligation of the exons. 

In group II introns the reaction pattern is similar ex- 
cept for the nucleophile in the first step, which in this 
case is the 2'-hydroxyl group of an A residue within the 
intron (Fig. 26-16). A branched lariat structure is 
formed as an intermediate. 

Self-splicing of introns was first revealed in 1982 in 
studies of the splicing mechanism of the group I rRNA in- 
tron from the ciliated protozoan Tetrahymena ther- 
mophila, conducted by Thomas 
Cech and colleagues. These 
workers transcribed isolated 
Tetrahymena DNA (including 
the intron) in vitro using purified 
bacterial RNA polymerase. The 
resulting RNA spliced itself 
accurately without any protein 
enzymes from Tetrahymena. 
The discovery that RNAs could 
have catalytic functions was a 
milestone in our understanding 
of biological systems. 




Thomas Cech 



Most introns are not self-splicing, and these types 
are not designated with a group number. The third and 
largest class of introns includes those found in nuclear 
mRNA primary transcripts. These are called spliceoso- 
mal introns, because their removal occurs within and 
is catalyzed by a large protein complex called a 
spliceosome. Within the spliceosome, the introns un- 
dergo splicing by the same lariat-forming mechanism as 
the group II introns. The spliceosome is made up of spe- 
cialized RNA-protein complexes, small nuclear riborai- 
cleoproteins (snRNPs, often pronounced "snurps"). 
Each snRNP contains one of a class of eukaryotic RNAs, 
100 to 200 nucleotides long, known as small nuclear 
RNAs (snRNAs). Five snRNAs (Ul, U2, U4, U5, and 
U6) involved in splicing reactions are generally found in 
abundance in eukaryotic nuclei. The RNAs and proteins 
in snRNPs are highly conserved in eukaryotes from 
yeasts to humans. 9 mRNA Splicing 

Spliceosomal introns generally have the dinu- 
cleotide sequence GU at the 5' end and AG at the 3' end, 
and these sequences mark the sites where splicing oc- 
curs. The Ul snRNA contains a sequence complemen- 
tary to sequences near the 5' splice site of nuclear 
mRNA introns (Fig. 26-17a), and the Ul snRNP binds 
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FIGURE 26-17 Splicing mechanism in mRNA primary transcripts, (a) RNA 

pairing interactions in the formation of spliceosome complexes. The U1 snRNA 
has a sequence near its 5' end that is complementary to the splice site at the 5' 
end of the intron. Base pairing of U1 to this region of the primary transcript 
helps define the 5' splice site during spliceosome assembly (if/ is pseudouri- 
dine; see Fig. 26-23). U2 is paired to the intron at a position encompassing the 
A residue (shaded pink) that becomes the nucleophile during the splicing reac- 
tion. Base pairing of U2 snRNA causes a bulge that displaces and helps to 
activate the adenylate, whose 2' OH will form the lariat structure through a 
2',5'-phosphodiester bond. 

(b) Assembly of spliceosomes. The Ul and U2 snRNPs bind, then the re- 
maining snRNPs (the U4-U6 complex and U5) bind to form an inactive 
spliceosome. Internal rearrangements convert this species to an active spliceo- 
some in which U1 and U4 have been expelled and U6 is paired with both the 
5' splice site and U2. This is followed by the catalytic steps, which parallel 
those of the splicing of group II introns (see Fig. 26-1 6). 

(c) Coordination of splicing and transcription provides an attractive 
mechanism for bringing the two splice sites together. See the text for details. 
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to this region in the primary transcript. Addition of the 
U2, U4, U5, and U6 snRNPs leads to formation of the 
spliceosome (Fig. 26-1 7b). The snRNPs together con- 
tribute five RNAs and about 50 proteins to the core 
spliceosome, a supramolecular assembly nearly as com- 
plex as the ribosome (described in Chapter 27). Perhaps 
50 additional proteins are associated with the spliceo- 
some at different stages in the splicing process, with 
some of these proteins having multiple functions: in splic- 
ing, mRNA transport to the cytoplasm, translation, and 
eventual mRNA degradation. ATP is required for assem- 
bly of the spliceosome, but the RNA cleavage-ligation re- 
actions do not seem to require ATP. Some mRNA introns 
are spliced by a less common type of spliceosome, in 
which the Ul and U2 snRNPs are replaced by the Ul 1 and 
U12 snRNPs. Whereas Ul- and U2 -containing spliceo- 
somes remove introns with (5')GU and AG(3') terminal 
sequences, as shown in Figure 26-17, the Ull- and U12- 
containing spliceosomes remove a rare class of introns 
that have (5')AU and AC(3') terminal sequences to mark 
the intronic splice sites. The spliceosomes used in nuclear 
RNA splicing may have evolved from more ancient group 
II introns, with the snRNPs replacing the catalytic do- 
mains of their self-splicing ancestors. 

Some components of the splicing apparatus seem 
to be tethered to the CTD of RNA polymerase II, sug- 
gesting an interesting model for the splicing reaction 
(Fig. 26-1 7c). As the first splice junction is synthesized, 
it is bound by a tethered spliceosome. The second splice 
junction is then captured by this complex as it passes, 
facilitating the juxtaposition of the intron ends and the 
subsequent splicing process. After splicing, the intron 
remains in the nucleus and is eventually degraded. 

The fourth class of introns, found in certain tRNAs, 
is distinguished from the group I and II introns in that 
the splicing reaction requires ATP and an endonuclease. 
The splicing endonuclease cleaves the phosphodiester 
bonds at both ends of the intron, and the two exons are 
joined by a mechanism similar to the DNA ligase reac- 
tion (see Fig. 25-17). 

Although spliceosomal introns seem to be limited to 
eukaryotes, the other intron classes are not. Genes with 
group I and II introns have now been found in both bac- 
teria and bacterial viruses. Bacteriophage T4, for exam- 
ple, has several protein-encoding genes with group I 
introns. Introns may be more common in archaea than in 
bacteria. 

Eukaryotic mRNAs Have a Distinctive 3 ' End Structure 

At their 3' end, most eukaryotic mRNAs have a string of 
80 to 250 A residues, making up the poly(A) tail. This 
tail serves as a binding site for one or more specific pro- 
teins. The poly (A) tail and its associated proteins prob- 
ably help protect mRNA from enzymatic destruction. 
Many bacterial mRNAs also acquire poly(A) tails, but 
these tails stimulate decay of mRNA rather than pro- 
tecting it from degradation. 



The poly(A) tail is added in a multistep process. 
The transcript is extended beyond the site where the 
poly(A) tail is to be added, then is cleaved at the 
poly(A) addition site by an endonuclease component of 
a large enzyme complex, again associated with the CTD 
of RNA polymerase II (Fig. 26-18). The mRNA site 
where cleavage occurs is marked by two sequence ele- 
ments: the highly conserved sequence (5')AAUAAA(3'), 
10 to 30 nucleotides on the 5' side (upstream) of the 
cleavage site, and a less well-defined sequence rich in G 
and U residues, 20 to 40 nucleotides downstream of the 
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FIGURE 26-18 Addition of the poly(A) tail to the primary RNA tran- 
script of eukaryotes. Pol II synthesizes RNA beyond the segment of 
the transcript containing the cleavage signal sequences, including the 
highly conserved upstream sequence (5')AAUAAA. ©The cleavage 
signal sequence is bound by an enzyme complex that includes an 
endonuclease, a polyadenylate polymerase, and several other multi- 
subunit proteins involved in sequence recognition, stimulation of 
cleavage, and regulation of the length of the poly(A) tail. ©The RNA 
is cleaved by the endonuclease at a point 10 to 30 nucleotides 3' to 
(downstream of) the sequence AAUAAA. ©The polyadenylate poly- 
merase synthesizes a poly(A) tail 80 to 250 nucleotides long, begin- 
ning at the cleavage site. 
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FIGURE 26-19 Overview of the processing of a eukaryotic mRNA. 

The ovalbumin gene, shown here, has introns A to G and exons 1 to 7 
and L (L encodes a signal peptide sequence that targets the protein for 
export from the cell; see Fig. 27-38). About three-quarters of the RNA 



is removed during processing. Pol II extends the primary transcript well 
beyond the cleavage and polyadenylation site {"extra RNA") before 
terminating transcription. Termination signals for Pol II have not yet 
been defined. 



cleavage site. Cleavage generates the free 3'-hydroxyl 
group that defines the end of the mRNA, to which A 
residues are immediately added by polyadenylate 
polymerase, which catalyzes the reaction 

RNA + nATP ► RNA-(AMP) n + nPP, 

where n = 80 to 250. This enzyme does not require a 
template but does require the cleaved mRNA as a primer. 

The overall processing of a typical eukaryotic mRNA is 
summarized in Figure 26-19. In some cases the polypep- 
tide-coding region of the mRNA is also modified by RNA 
"editing" (see Section 27.1 for details). This editing in- 
cludes processes that add or delete bases in the coding re- 
gions of primary transcripts or that change the sequence 
(by, for example, enzymatic deamination of a C residue to 
create a U residue). A particularly dramatic example oc- 
curs in tiypanosomes, which are parasitic protozoa: large 
regions of an mRNA are synthesized without any uridyl- 
ate, and the U residues are inserted later by RNA editing. 

A Gene Can Give Rise to Multiple Products by Differential 
RNA Processing 

One of the paradoxes of modern genomics is that the ap- 
parent complexity of organisms does not correlate with 
the number of protein-encoding genes or even with the 
amount of genomic DNA (p. 323). However, the tradi- 
tional focus on protein-encoding genes ignores the com- 
plexity of an organism's transcriptome. As the functions 
of RNA are better appreciated, new genomic complexi- 
ties become apparent. 



Some eukaryotic mRNA transcripts produce only 
one mature mRNA and one corresponding polypeptide, 
but others can be processed in more than one way to 
produce different mRNAs and thus different polypep- 
tides. The primary transcript contains molecular signals 
for all the alternative processing pathways, and the 
pathway favored in a given cell is determined by pro- 
cessing factors, RNA-binding proteins that promote one 
particular path. 

Complex transcripts can have either more than one 
site for cleavage and polyadenylation or alternative splic- 
ing patterns, or both. If there are two or more sites for 
cleavage and polyadenylation, use of the one closest to 
the 5' end will remove more of the primary transcript se- 
quence (Fig. 26-20a), This mechanism, called poly(A) 
site choice, generates diversity in the variable domains 
of immunoglobulin heavy chains (see Fig. 25-46) . Alter- 
native splicing patterns (Fig. 26-20b) produce, from a 
common primary transcript, three different forms of the 
myosin heavy chain at different stages of fruit fly devel- 
opment. Both mechanisms come into play when a single 
RNA transcript is processed differently to produce two 
different hormones: the calcium-regulating hormone 
calcitonin in rat thyroid and calcitonin-gene-related 
peptide (CGRP) in rat brain (Fig. 20-21). There 
are many additional patterns of alternative splicing 
(Fig. 26-22). Many, perhaps most, of the genes in 
mammalian genomes are subject to alternative splicing, 
substantially increasing the number of proteins encoded 
by the genes. The same processes play a much smaller 
role in lower eukaryotes, with only a few genes subject 
to alternative splicing in yeast. 
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FIGURE 26-20 Two mechanisms for the alternative processing of 
complex transcripts in eukaryotes. (a) Alternative cleavage and 
polyadenylation patterns. Two poly(A) sites, At and A 2 , are shown. 
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(b) Alternative splicing patterns. Two different 3' splice sites are 
shown. In both mechanisms, different mature mRNAs are produced 
from the same primary transcript. 
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FIGURE 26-21 Alternative processing of the calcitonin gene tran- 
script in rats. The primary transcript has two poly(A) sites; one pre- 
dominates in the brain, the other in the thyroid. In the brain, splicing 
eliminates the calcitonin exon (exon 4); in the thyroid, this exon is 
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retained. The resulting peptides are processed further to yield the final 
hormone products: calcitonin-gene-related peptide (CGRP) in the 
brain and calcitonin in the thyroid. 
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FIGURE 26-22 Summary of splicing patterns. Exons are shown in 
shades of green, and introns/untranslated regions as yellow lines. Posi- 
tions where polyadenosine is to be added are marked with asterisks. 
Exons joined in a particular splicing scheme are linked with black 
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lead to the top and bottom spliced mRNA products, respectively. In the 
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Ribosomal RNAs and tRNAs Also Undergo 
Processing 

Posttranscriptional processing is not limited to mRNA. 
Ribosomal RNAs of bacterial, archaeal, and eukaryotic 
cells are made from longer precursors called preribo- 
somal RNAs, or pre-rRNAs. Transfer RNAs are simi- 
larly derived from longer precursors. These RNAs may 
also contain a variety of modified nucleosides; some ex- 
amples are shown in Figure 26-23. 

Ribosomal RNAs In bacteria, 16S, 23S, and 5S rRNAs 
(and some tRNAs, although most tRNAs are encoded 
elsewhere) arise from a single 30S RNA precursor of 
about 6,500 nucleotides. RNA at both ends of the 30S 
precursor and segments between the rRNAs are re- 
moved during processing (Fig. 20-24). The 16S and 
23S rRNAs contain modified nucleosides. InE. coli, the 
11 modifications in the 16S rRNA include a pseudouri- 
dine and 10 nucleosides methylated on the base or the 
2'-hydroxyl group, or both. The 23S rRNA has 10 
pseudouridines, 1 dihydrouridine, and 12 methylated 
nucleosides. In bacteria, each modification is generally 
catalyzed by a distinct enzyme. Methylation reactions 
use S-adenosylmethionine as cofactor. No cofactor is re- 
quired for pseudouridine formation. 

The genome of E. coli encodes seven pre-rRNA 
molecules. All of these genes have essentially identical 
rRNA-coding regions, but they differ in the segments 
between these regions. The segment between the 16S 
and 23S rRNA genes generally encodes one or two 
tRNAs, with different tRNAs produced from different 
pre-rRNA transcripts. Coding sequences for tRNAs are 
also found on the 3' side of the 5S rRNA in some precur- 
sor transcripts. 
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FIGURE 26-23 Some modified bases of rRNAs and tRNAs, produced 
in posttranscriptional reactions. The standard symbols are shown in 
parentheses. Note the unusual ribose attachment point in pseudouri- 
dine. This is just a small sampling of the 96 modified nucleosides 



known to occur in different RNA species, with 81 different types 
known in tRNAs and 30 observed to date in rRNAs. A complete listing 
of these modified bases can be found in the RNA modification 
database (http://library.med.utah.edu/RNAmods/). 



26.2 RNA Processing [I043J 



Pre-rRNA 
transcript 
(30S) 



16S 



Intermediates 



17S 



tRNA 

(4S) 



23S 



® 

12 I 



: 



methylation and pseudouridine or 
dihydrouridine conversion 

1 3 



® 



cleavage 



QJ E 

tRNA 



=3 



25S 

1 1 1 1 



Mature RNAs 



16S rRNA 



tRNA 



23S rRNA 



5S 



5S rRNA 



FIGURE 26-24 Processing of pre-rRNA tran- 
scripts in bacteria. © Before cleavage, the 30S 
RNA precursor is methylated at specific bases 
(red tick marks), and some uridine residues are 
converted to pseudouridine (blue tick marks) or 
dihydrouridine (black tick mark) residues. The 
methylation reactions are of multiple types, some 
occurring on bases and some on 2'-hydroxyl 
groups. @ Cleavage liberates precursors of rRNAs 
and tRNA(s). Cleavage at the points labeled 1, 2, 
and 3 is carried out by the enzymes RNase III, 
RNase P, and RNase E, respectively. As discussed 
later in the text, RNase P is a ribozyme. ©The 
final 1 6S, 23S, and 5S rRNA products result from 
the action of a variety of specific nucleases. The 
seven copies of the gene for pre-rRNA in the £. 
coli chromosome differ in the number, location, 
and identity of tRNAs included in the primary 
transcript. Some copies of the gene have addi- 
tional tRNA gene segments between the 1 6S and 
23S rRNA segments and at the far 3' end of the 
primary transcript. 



The situation in eukaryotes is more complicated. A 
45S pre-rRNA transcript is synthesized by RNA poly- 
merase I and processed in the nucleolus to form the 18S, 
28S, and 5.8S rRNAs characteristic of eukaryotic ribo- 
somes (Fig. 26-25). As in bacteria, the processing 
includes cleavage reactions mediated by endo- or exori- 
bonucleases and nucleoside modification reactions. Some 
pre-rRNAs also include introns that must be spliced. The 
entire process is initiated in the nucleolus, in large com- 
plexes that assemble on the rRNA precursor as it is syn- 
thesized by Pol I. There is a tight coupling between rRNA 
transcription, rRNA maturation, and ribosome assembly in 
the nucleolus. Each complex includes the ribonucleases 
that cleave the rRNA precursor, the enzymes that modify 
particular bases, large numbers of small nucleolar RNAs, 
or snoRNAs, that guide nucleoside modification and 
some cleavage reactions, and ribosomal proteins. In yeast, 
the entire process involves the pre-rRNA, more than 170 
nonribosomal proteins, snoRNAs for each nucleoside 
modification (about 70 in all, since some snoRNAs guide 



FIGURE 26-25 Processing of pre-rRNA transcripts in vertebrates. 

During transcription, the 45S primary transcript is incorporated into a 
nucleolar 90S preribosomal complex, in which rRNA processing and 
ribosome assembly are tightly coupled. ® The 45S precursor is methy- 
lated at more than 1 00 of its 1 4,000 nucleotides, either on the bases or 
on the 2'-OH groups, some uridines are converted to pseudouridine, 
and a few other modifications occur. (2) A series of enzymatic cleav- 
ages of the 45S precursor produces the 18S, 5.8S, and 28S rRNAs, and 
the ribosomal subunits gradually take shape with the assembling ribo- 
somal proteins. The cleavage reactions and all of the modifications re- 
quire small nucleolar RNAs (snoRNAs) found in protein complexes 
(snoRNPs) in the nucleolus that are reminiscent of spliceosomes. The 
5S rRNA is produced separately. 
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two types of modification), and the 78 ribosomal proteins. 
Humans have an even greater number of modified nucle- 
osides, about 200, and a greater number of associated 
snoRNAs. The composition of the complexes may change 
as the ribosomes are assembled, and many of the inter- 
mediate complexes may rival the ribosome itself, and the 
snRNPs, in complexity. The 5S rRNA of most eukaryotes 
is made as a completely separate transcript by a different 
polymerase (Pol III). 

The most common nucleoside modifications in 
eukaryotic rRNAs are, again, conversion of uridine to 
pseudouridine and adoMet-dependent nucleoside methy- 
lation (often at 2'-hydroxyl groups). These reactions rely 
on snoRNA-protein complexes, or snoRNPs, each consist- 
ing of a snoRNA and four or five proteins, which include 
the enzyme that carries out the modification. There are 
two classes of snoRNPs, both defined by key conserved se- 
quence elements referred to as lettered boxes. The box 
H/ACA snoRNPs are involved in pseudouridylylation, and 
box C/D snoRNPs function in 2'-0-methylations. Unlike 
the situation in bacteria, the same enzyme may participate 
in modifications at many sites, guided by the snoRNAs. 

The snoRNAs are 60 to 300 nucleotides long. Many 
are encoded within the introns of other genes and co- 
transcribed with those genes. Each snoRNA includes a 
10 to 21 nucleotide sequence that is perfectly comple- 
mentary to some site on an rRNA. The conserved se- 
quence elements in the remainder of the snoRNA fold 
into structures that are bound by the snoRNP proteins 
(Fig. 26-26). 



Transfer RNAs Most cells have 40 to 50 distinct 
tRNAs, and eukaryotic cells have multiple copies of 
many of the tRNA genes. Transfer RNAs are derived 
from longer RNA precursors by enzymatic removal of 
nucleotides from the 5' and 3' ends (Fig. 26-27). In eu- 
karyotes, introns are present in a few tRNA transcripts 
and must be excised. Where two or more different 
tRNAs are contained in a single primary transcript, they 
are separated by enzymatic cleavage. The endonuclease 
RNase P, found in all organisms, removes RNA at the 5' 
end of tRNAs. This enzyme contains both protein and 
RNA. The RNA component is essential for activity, and 
in bacterial cells it can carry out its processing function 
with precision even without the protein component. 
RNase P is therefore another example of a catalytic 
RNA, as described in more detail below. The 3' end of 
tRNAs is processed by one or more nucleases, including 
the exonuclease RNase D. 

Transfer RNA precursors may undergo further post- 
transcriptional processing. The 3'-terminal trinu- 
cleotide CCA(3') to which an amino acid is attached 
during protein synthesis (Chapter 27) is absent from 
some bacterial and all eukaryotic tRNA precursors and 
is added during processing (Fig. 26-27). This addition is 
carried out by tRNA nucleotidyltransferase, an unusual 
enzyme that binds the three ribonucleoside triphos- 
phate precursors in separate active sites and catalyzes 
formation of the phosphodiester bonds to produce the 
CCA(3') sequence. The creation of this defined se- 
quence of nucleotides is therefore not dependent on a 
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FIGURE 26-26 The function of snoRNAs in guiding rRNA modifica- 
tion, (a) RNA pairing with box C/D snoRNAs to guide methylation 
reactions. The methylation sites in the target rRNA (dark green) are in 
the regions paired with the C/D snoRNA. The highly conserved C and 
D (and C and D') box sequences are binding sites for proteins that 
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make up the larger snoRNP. (b) RNA pairing with box H/ACA snoRNAs 
to guide pseudouridylylations. The pseudouridine conversion sites in 
the target rRNA (green segments) are again in the regions paired with 
the snoRNA, and the conserved H/ACA box sequences are protein- 
binding sites. 
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FIGURE 26-27 Processing of tRNAs in bacteria and eukaryotes. The 

yeast tRNA Tyr (the tRNA specific for tyrosine binding; see Chapter 27) 
is used to illustrate the important steps. The nucleotide sequences 
shown in yellow are removed from the primary transcript. The ends are 
processed first the 5' end before the 3' end. CCA is then added to the 
3' end, a necessary step in processing eukaryotic tRNAs and those bac- 
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terial tRNAs that lack this sequence in the primary transcript. While the 
ends are being processed, specific bases in the rest of the transcript 
are modified (see Fig. 26-23). For the eukaryotic tRNA shown here, the 
final step is splicing of the 14 nucleotide intron, Introns are found in 
some eukaryotic tRNAs but not in bacterial tRNAs. 



DNA or RNA template— the template is the binding site 
of the enzyme. 

The final type of tRNA processing is the modifica- 
tion of some bases by methylation, deamination, or re- 
duction (Fig. 26-23). In the case of pseudouridine, the 
base (uracil) is removed and reattached to the sugar 
through C-5. Some of these modified bases occur at 
characteristic positions in all tRNAs (Fig. 26-27). 

Special-Function RNAs Undergo Several 
Types of Processing 

The number of known classes of special-function RNAs 
is expanding rapidly, as is the variety of functions known 
to be associated with them. Many of these RNAs 
undergo processing. 

The snRNAs and snoRNAs not only facilitate 
RNA processing reactions but are themselves syn- 
thesized as larger precursors, and then processed. 
Many snoRNAs are encoded within the introns of 
other genes. As the introns are spliced from the pre- 
mRNA, the snoRNP proteins bind to the snoRNA se- 
quences and ribonucleases remove the extra RNA at 
the 5' and 3' ends. The snRNAs destined for spliceo- 
somes are synthesized as pre-snRNAs by RNA poly- 
merase II, and ribonucleases remove the extra RNA 
at each end. Particular nucleosides in snRNAs are 
also subject to 11 types of modification, with 2'-0- 
methylation and conversion of uridine to pseudouri- 
dine predominating. 

Micro RNAs (miRNAs) are a special class of RNAs 
involved in gene regulation. They are noncoding RNAs, 



about 22 nucleotides long, complementary in sequence 
to particular regions of mRNAs. They regulate mRNA 
function by cleaving the mRNA or suppressing its trans- 
lation. The miRNAs are found in multicellular eukary- 
otes ranging from worms and fruit flies to plants and 
mammals. Up to 1% of the human genome may encode 
miRNAs, and miRNAs may target up to one-third of 
human mRNAs. Their function in gene regulation is 
described in Chapter 28. 

The miRNAs are synthesized from much larger pre- 
cursors, in several steps (Fig. 26-28). The primary 
transcripts for miRNAs (pri-miRNAs) vary greatly in 
size; some are encoded in the introns of other genes and 
are coexpressed with these host genes. Their roles in 
gene regulation also are detailed in Chapter 28. 

RNA Enzymes Are the Catalysts of Some 
Events in RNA Metabolism 

The study of posttranscriptional processing of RNA mol- 
ecules led to one of the most exciting discoveries in 
modern biochemistry— the existence of RNA enzymes. 
The best-characterized ribozymes are the self-splicing 
group I introns, RNase P, and the hammerhead ribozyme 
(discussed below). Most of the activities of these ri- 
bozymes are based on two fundamental reactions: trans- 
esterification (Fig. 26-14) and phosphodiester bond 
hydrolysis (cleavage). The substrate for ribozymes is 
often an RNA molecule, and it may even be part of the 
ribozyme itself. When its substrate is RNA, the RNA cat- 
alyst can make use of base-pairing interactions to align 
the substrate for the reaction. 
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FIGURE 26-28 Synthesis and processing of miRNAs. The primary 
transcript of miRNAs is larger RNA of variable length, a pri-miRNA. 
Much of its processing is mediated by two endoribonucleases in the 
RNase III family, Drosha and Dicer. First, in the nucleus, the pri- 
miRNA is reduced to a 70 to 80 nucleotide precursor miRNA (pre- 
miRNA) by a protein complex including Drosha and another protein, 
DGCR8. The pre-miRNA is then exported to the cytoplasm, where it 
is acted on by Dicer to produce the nearly mature miRNA paired 
with a short RNA complement. The complement is removed by an 
RNA helicase, and the mature miRNA is incorporated into protein 
complexes, such as the RNA-induced silencing complex (RISC), 
which then bind a target mRNA. If the complementarity between 
miRNA and its target is nearly perfect, the target mRNA is cleaved. If 
the complementarity is only partial, the complex blocks translation of 
the target mRNA. 



Ribozymes vary greatly in size. A self-splicing group 
I intron may have more than 400 nucleotides. The ham- 
merhead ribozyme consists of two RNA strands with 
only 41 nucleotides in all (Fig. 26-29). As with protein 
enzymes, the three-dimensional structure of ribozymes 
is important for function. Ribozymes are inactivated by 
heating above their melting temperature or by addition 
of denaturing agents or complementary oligonu- 
cleotides, which disrupt normal base-pairing patterns. 
Ribozymes can also be inactivated if essential nu- 
ll A 




FIGURE 26-29 Hammerhead ribozyme. Certain viruslike elements, 
or virusoids, have small RNA genomes and usually require another 
virus to assist in their replication and/or packaging. Some virusoid 
RNAs include small segments that promote site-specific RNA cleav- 
age reactions associated with replication. These segments are called 
hammerhead ribozymes, because their secondary structures are 
shaped like the head of a hammer. Hammerhead ribozymes have 
been defined and studied separately from the much larger viral 
RNAs. (a) The minimal sequences required for catalysis by the ri- 
bozyme. The boxed nucleotides are highly conserved and are re- 
quired for catalytic function. The arrow indicates the site of 
self-cleavage, (b) Three-dimensional structure (PDB ID 1MME; see 
Fig. 8-25b for a space-filling view). The strands are colored as in (a). 
The hammerhead ribozyme is a metalloenzyme; Mg 2+ ions are re- 
quired for activity. The phosphodiester bond at the site of self-cleav- 
age is indicated by an arrow. f| Hammerhead Ribozyme 
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cleotides are changed. The secondary structure of a 
self-splicing group I intron from the 26S rRNA precursor 

of Tetrahymena is shown in detail in Figure 26-30. 

Enzymatic Properties of Group I Introns Self- 
splicing group I introns share several properties with en- 
zymes besides accelerating the reaction rate, including 
their kinetic behavior and their specificity. Binding of 
the guanosine cofactor (Fig. 26-14) to the Tetrahy- 
mena group I rRNA intron (Fig. 26-30) is saturable 
(K m ~ 30 /xm) and can be competitively inhibited by 3'- 
deoxyguanosine. The intron is very precise in its exci- 
sion reaction, largely due to a segment called the 
internal guide sequence that can base-pair with exon 
sequences near the 5' splice site (Fig. 26-30). This pair- 
ing promotes the alignment of specific bonds to be 
cleaved and rejoined. 

Because the intron itself is chemically altered during 
the splicing reaction — its ends are cleaved — it may seem 
to lack one key enzymatic property: the ability to catalyze 
multiple reactions. Closer inspection has shown that after 



excision, the 414 nucleotide intron from Tetrahymena 
rRNA can, in vitro, act as a true enzyme (but in vivo it is 
quickly degraded). A series of intramolecular cyclization 
and cleavage reactions in the excised intron leads to the 
loss of 19 nucleotides from its 5' end. The remaining 395 
nucleotide, linear RNA— referred to as L-19 IVS— pro- 
motes nucleotidyl transfer reactions in which some 
oligonucleotides are lengthened at the expense of others 
(Fig. 26-31). The best substrates are oligonucleotides, 
such as a synthetic (C) 5 oligomer, that can base-pair with 
the same guanylate-rich internal guide sequence that 
held the 5' exon in place for self-splicing. 

The enzymatic activity of the L-19 IVS ribozyme re- 
sults from a cycle of transesterihcation reactions mech- 
anistically similar to self-splicing. Each ribozyme 
molecule can process about 100 substrate molecules per 
hour and is not altered in the reaction; therefore the in- 
tron acts as a catalyst. It follows Michaelis-Menten kinet- 
ics, is specific for RNA oligonucleotide substrates, and 
can be competitively inhibited. The k cat /K m (specificity 
constant) is 10 3 m" 1 s~\ lower than that of many 
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FIGURE 26-30 Secondary structure of the self-splicing rRNA 
intron of Tetrahymena. Intron sequences are shaded yellow, 
exon sequences green. Each thick yellow line represents a 
bond between neighboring nucleotides in a continuous se- 
quence (a device necessitated by showing this complex mol- 
ecule in two dimensions; similarly, an oversize blue line 
between a C and G residue indicates normal base pairing); all 
nucleotides are shown. The catalytic core of the self-splicing 
activity is shaded. Some base-paired regions are labeled (P1, 
P3, P2.1, P5a, and so forth) according to an established con- 
vention for this RNA molecule. The P1 region, which con- 
tains the internal guide sequence (boxed), is the location of 
the 5' splice site (red arrow). Part of the internal guide se- 
quence pairs with the end of the 3' exon, bringing the 5' and 
3' splice sites (red and blue arrows) into close proximity. The 
three-dimensional structure of a large segment of this intron 
is illustrated in Figure 8-2 5c. 
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FIGURE 26-31 In vitro catalytic activity of L-1 9 IVS. (a) L-19 IVS is 
generated by the autocatalytic removal of 19 nucleotides from the 5' 
end of the spliced Tetrahymena intron. The cleavage site is indicated 
by the arrow in the internal guide sequence (boxed). The G residue 
(shaded pink) added in the first step of the splicing reaction (see Fig. 
26-15) is part of the removed sequence. A portion of the internal 
guide sequence remains at the 5' end of L-19 IVS. (b) L-19 IVS 
lengthens some RNA oligonucleotides at the expense of others in a 
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cycle of transesterification reactions (steps ® through @). The 3' 
OH of the G residue at the 3' end of L-19 IVS plays a key role in this 
cycle (note that this is not the G residue added in the splicing reac- 
tion). {CU is one of the ribozyme's better substrates because it can 
base-pair with the guide sequence remaining in the intron. Although 
this catalytic activity is probably irrelevant to the cell, it has impor- 
tant implications for current hypotheses on evolution, discussed at 
the end of this chapter. 



enzymes, but the ribozyme accelerates hydrolysis by a 
factor of 10 10 relative to the uncatalyzed reaction. It 
makes use of substrate orientation, covalent catalysis, 
and metal-ion catalysis— strategies used by protein 
enzymes. 

Characteristics of Other Ribozymes E. coli RNase 
P has both an RNA component (the Ml RNA, with 377 
nucleotides) and a protein component (M r 17,500). In 
1983 Sidney Altman and Norman Pace and their 
coworkers discovered that under some conditions, the 
Ml RNA alone is capable of catalysis, cleaving tRNA pre- 
cursors at the correct position. The protein component 
apparently serves to stabilize the RNA or facilitate its 
function in vivo. The RNase P ribozyme recognizes the 
three-dimensional shape of its pre-tRNA substrate, 
along with the CCA sequence, and thus can cleave the 
5' leaders from diverse tRNAs (Fig. 26-27). 

The known catalytic repertoire of ribozymes contin- 
ues to expand. Some virusoids, small RNAs associated 
with plant RNA viruses, include a structure that pro- 



motes a self-cleavage reaction; the hammerhead ri- 
bozyme illustrated in Figure 26-29 is in this class, cat- 
alyzing the hydrolysis of an internal phosphodiester 
bond. The splicing reaction that occurs in a spliceosome 
seems to rely on a catalytic center formed by the U2, U5, 
and U6 snRNAs (Fig. 26-17). And perhaps most impor- 
tant, an RNA component of ribosomes catalyzes the syn- 
thesis of proteins (Chapter 27). 

Exploring catalytic RNAs has provided new insights 
into catalytic function in general and has important impli- 
cations for our understanding of the origin and evolution 
of life on this planet, a topic discussed in Section 26.3. 

Cellular mRNAs Are Degraded at Different Rates 

The expression of genes is regulated at many levels. A 
crucial factor governing a gene's expression is the cel- 
lular concentration of its associated mRNA. The con- 
centration of any molecule depends on two factors: its 
rate of synthesis and its rate of degradation. When syn- 
thesis and degradation of an mRNA are balanced, the 
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concentration of the mRNA remains in a steady state. 
A change in either rate will lead to net accumulation or 
depletion of the mRNA. Degradative pathways ensure 
that mRNAs do not build up in the cell and direct the 
synthesis of unnecessary proteins. 

The rates of degradation vary greatly for mRNAs 
from different eukaryotic genes. For a gene product 
that is needed only briefly, the half-life of its mRNA may 
be only minutes or even seconds. Gene products 
needed constantly by the cell may have mRNAs that are 
stable over many cell generations. The average half-life 
of the mRNAs of a vertebrate cell is about 3 hours, with 
the pool of each type of mRNA turning over about 
10 times per cell generation. The half-life of bacterial 
mRNAs is much shorter — only about 1.5 min — perhaps 
because of regulatory requirements. 

Messenger RNA is degraded by ribonucleases pres- 
ent in all cells. In E. coli, the process begins with one or 
several cuts by an endoribonuclease, followed by 3'— »5' 
degradation by exoribonucleases. In lower eukaryotes, 
the major pathway involves first shortening the poly(A) 
tail, then decapping the 5' end and degrading the mRNA 
in the 5'— »3' direction. A 3'— >5' degradative pathway 
also exists and may be the major path in higher eukary- 
otes. All eukaryotes have a complex of up to 10 con- 
served 3'— »5' exoribonucleases, called the exosome, 
which is involved in the processing of the 3' end of 
rRNAs, tRNAs, and some special-function RNAs (in- 
cluding snRNAs and snoRNAs), as well as the degrada- 
tion of mRNAs. 

A hairpin structure in bacterial mRNAs with a p- 
independent terminator (Fig. 26-8) confers stability 
against degradation. Similar hairpin structures can 
make some parts of a primary transcript more stable, 
leading to nonuniform degradation of transcripts. In eu- 
karyotic cells, both the 3' poly(A) tail and the 5' cap are 
important to the stability of many mRNAs. 0 Life Cycle of 
an mRNA 

Polynucleotide Phosphorylase Makes Random 
RNA-like Polymers 

In 1955, Marianne Grunberg-Manago and Severo Ochoa 
discovered the bacterial enzyme polynucleotide phos- 
phorylase, which in vitro catalyzes the reaction 

(NMP)„ + NDP (NMP) n + 1 + Pi 

Lengthened 
polynucleotide 

Polynucleotide phosphorylase was the first nucleic 
acid-synthesizing enzyme discovered (Arthur Romberg's 
discovery of DNA polymerase followed soon thereafter). 
The reaction catalyzed by polynucleotide phosphorylase 
differs fundamentally from the polymerase activities dis- 
cussed so far in that it is not template-dependent. The 
enzyme uses the 5'-diphosphates of ribonucleosides as 
substrates and cannot act on the homologous 5'-triphos- 
phates or on deoxyribonucleoside 5 '-diphosphates. The 
RNA polymer formed by polynucleotide phosphorylase 
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contains the usual 3',5'-phosphodiester linkages, which 
can be hydrolyzed by ribonuclease. The reaction is read- 
ily reversible and can be pushed in the direction of 
breakdown of the polyribonucleotide by increasing the 
phosphate concentration. The probable function of this 
enzyme in the cell is the degradation of mRNAs to nucle- 
oside diphosphates. 

Because the polynucleotide phosphorylase reac- 
tion does not use a template, the polymer it forms does 
not have a specific base sequence. The reaction pro- 
ceeds equally well with any or all of the four nucleoside 
diphosphates, and the base composition of the result- 
ing polymer reflects nothing more than the relative 
concentrations of the 5 '-diphosphate substrates in the 
medium. 

Polynucleotide phosphorylase can be used in the 
laboratory to prepare RNA polymers with many differ- 
ent base sequences and frequencies. Synthetic RNA 
polymers of this sort were critical for deducing the ge- 
netic code for the amino acids (Chapter 27). 

SUMMARY 26.2 RNA Processing 

■ Eukaryotic mRNAs are modified by addition of a 
7-methylguanosine residue at the 5' end and by 
cleavage and polyadenylation at the 3' end to form 
a long poly(A) tail. 

■ Many primary mRNA transcripts contain introns 
(noncoding regions), which are removed by 
splicing. Excision of the group I introns found in 
some rRNAs requires a guanosine cofactor. Some 
group I and group II introns are capable of self- 
splicing; no protein enzymes are required. Nuclear 
mRNA precursors have a third (the largest) class 
of introns, which are spliced with the aid of RNA- 
protein complexes called snRNPs, assembled into 
spliceosomes. A fourth class of introns, found in 
some tRNAs, consists of the only introns known 
to be spliced by protein enzymes. 

■ The function of many eukaryotic mRNAs is 
regulated by complementary microRNAs 
(miRNAs). The miRNAs are themselves derived 
from larger precursors through a series of 
processing reactions. 
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■ Ribosomal RNAs and transfer RNAs are derived 
from longer precursor RNAs, trimmed by 
nucleases. Some bases are modified enzymatically 
during the maturation process. Some nucleoside 
modifications are guided by snoRNAs, within 
protein complexes called snoRNPs, 

■ The self-splicing introns and the RNA component 
of RNase P (which cleaves the 5' end of tRNA 
precursors) are two examples of ribozymes. These 
biological catalysts have the properties of true 
enzymes. They generally promote hydrolytic 
cleavage and transesterification, using RNA as 
substrate. Combinations of these reactions can 

be promoted by the excised group I intron of 
Tetrahymena rRNA, resulting in a type of RNA 
polymerization reaction. 

■ Polynucleotide phosphorylase reversibly forms 
RNA-like polymers from ribonucleoside 

5 '-diphosphates, adding or removing ribonucleotides 
at the 3'-hydroxyl end of the polymer. The enzyme 
degrades RNA in vivo. 

26.3 RNA-Dependent Synthesis of 
RNA and DN A 

In our discussion of DNA and RNA synthesis up to this 
point, the role of the template strand has been reserved 
for DNA. However, some enzymes use an RNA template 
for nucleic acid synthesis. With the very important ex- 
ception of viruses with an RNA genome, these enzymes 
play only a modest role in information pathways. RNA 
viruses are the source of most RNA-dependent 
polymerases characterized so far. 

The existence of RNA replication requires an elabo- 
ration of the central dogma (Fig. 26-32; contrast this 
with the diagram on p. 945). The enzymes involved in 
RNA replication have profound implications for investi- 
gations into the nature of self-replicating molecules that 
may have existed in prebiotic times. 
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Reverse Transcriptase Produces DNA from Viral RNA 

Certain RNA viruses that infect animal cells carry within 
the viral particle an RNA-dependent DNA polymerase 
called reverse transcriptase. On infection, the single- 
stranded RNA viral genome (-10,000 nucleotides) and 
the enzyme enter the host cell. The reverse transcrip- 
tase first catalyzes the synthesis of a DNA strand com- 
plementary to the viral RNA (Fig. 26-33), then 
degrades the RNA strand of the viral RNA-DNA hybrid 
and replaces it with DNA. The resulting duplex DNA 
often becomes incorporated into the genome of the 
eukaryotic host cell. These integrated (and dormant) vi- 
ral genes can be activated and transcribed, and the gene 
products— viral proteins and the viral RNA genome 
itself— packaged as new viruses. The RNA viruses that 
contain reverse transcriptases are known as retro- 
viruses (retro is the Latin prefix for "backward"). 

The existence of reverse transcriptases in RNA viruses 
was predicted by Howard Temin in 1962, and the enzymes 
were ultimately detected by Temin and, independently, by 
David Baltimore in 1970. Their discovery aroused much at- 
tention as dogma-shaking proof that genetic information 
can flow "backward" from RNA to DNA. 



RNA genome 




Protein 



FIGURE 26-32 Extension of the central dogma to include RNA- 
dependent synthesis of RNA and DNA. 
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FIGURE 26-33 Retroviral infection of a mammalian cell and integra- 
tion of the retrovirus into the host chromosome. Viral particles enter- 
ing the host cell carry viral reverse transcriptase and a cellular tRNA 
(picked up from a former host cell) already base-paired to the viral 
RNA The tRNA facilitates immediate conversion of viral RNA to 
double-stranded DNA by the action of reverse transcriptase, as de- 
scribed in the text. Once converted to double-stranded DNA, the 
DNA enters the nucleus and is integrated into the host genome. The 
integration is catalyzed by a viral ly encoded integrase. Integration of 
viral DNA into host DNA is mechanistically similar to the insertion of 
transposons in bacterial chromosomes (see Fig, 25-45). For example, 
a few base pairs of host DNA become duplicated at the site of integra- 
tion, forming short repeats of 4 to 6 bp at each end of the inserted 
retroviral DNA (not shown). 
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Retroviruses typically have three genes: gag (de- 
rived from the historical designation group associated 
antigen), pol, and env (Fig. 26-34). The transcript 
that contains gag and pol is translated into a long 
"polyprotein," a single large polypeptide that is 
cleaved into six proteins with distinct functions. The 
proteins derived from the gag gene make up the inte- 
rior core of the viral particle. The pol gene encodes 
the protease that cleaves the long polypeptide, an in- 
tegrase that inserts the viral DNA into the host chro- 
mosomes, and reverse transcriptase. Many reverse 
transcriptases have two sub units, a and /3. The pol 
gene specifies the /3 subunit (M r 90,000), and the a 
subunit (M T 65,000) is simply a proteolytic fragment of 
the subunit. The env gene encodes the proteins of 
the viral envelope. At each end of the linear RNA 
genome are long terminal repeat (LTR) sequences of a 
few hundred nucleotides. Transcribed into the duplex 
DNA, these sequences facilitate integration of the viral 
chromosome into the host DNA and contain promot- 
ers for viral gene expression. 

Reverse transcriptases catalyze three different 
reactions: (1) RNA-dependent DNA synthesis, (2) 
RNA degradation, and (3) DNA-dependent DNA syn- 
thesis. Like many DNA and RNA polymerases, reverse 
transcriptases contain Zn 2+ . Each transcriptase is 
most active with the RNA of its own virus, but each 
can be used experimentally to make DNA comple- 
mentary to a variety of RNAs. The DNA and RNA syn- 
thesis and RNA degradation activities use separate 
active sites on the protein. For DNA synthesis to be- 
gin, the reverse transcriptase requires a primer, a cel- 
lular tRNA obtained during an earlier infection and 
carried in the viral particle. This tRNA is base-paired 
at its 3' end with a complementary sequence in the vi- 
ral RNA. The new DNA strand is synthesized in the 
5'— »3' direction, as in all RNA and DNA polymerase 
reactions. Reverse transcriptases, like RNA poly- 
merases, do not have 3'— >5' proofreading exonucle- 
ases. They generally have error rates of about 1 per 
20,000 nucleotides added. An error rate this high is 
extremely unusual in DNA replication and seems to 
be a feature of most enzymes that replicate the 
genomes of RNA viruses. A consequence is a higher 
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FIGURE 26-34 Structure and gene products of an integrated retroviral 
genome. The long terminal repeats (LTRs) have sequences needed for 
the regulation and initiation of transcription. The sequence denoted ijj 
is required for packaging of retroviral RNAs into mature viral particles. 
Transcription of the retroviral DNA produces a primary transcript en- 
compassing the gag, pol, and env genes. Translation (Chapter 27) pro- 
duces a polyprotein, a single long polypeptide derived from the gag 
and pol genes, which is cleaved into six distinct proteins. Splicing of 
the primary transcript yields an mRNA derived largely from the env 
gene, which is also translated into a polyprotein, then cleaved to 
generate viral envelope proteins. 



mutation rate and faster rate of viral evolution, which 
is a factor in the frequent appearance of new strains 
of disease-causing retroviruses. 

Reverse transcriptases have become important 
reagents in the study of DNA-RNA relationships and in 
DNA cloning techniques. They make possible the syn- 
thesis of DNA complementary to an mRNA template, 
and synthetic DNA prepared in this manner, called com- 
plementary DNA (cDNA), can be used to clone cellu- 
lar genes (see Fig. 9-14). 

Some Retroviruses Cause Cancer and AIDS 

Retroviruses have featured prominently in recent 
advances in the molecular understanding of can- 
cer. Most retroviruses do not kill their host cells but re- 
main integrated in the cellular DNA, replicating when 
the cell divides. Some retroviruses, classified as RNA tu- 
mor viruses, contain an oncogene that can cause the cell 
to grow abnormally. The first retrovirus of this type to be 
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FIGURE 26-35 Rous sarcoma virus genome. The src gene encodes a 
tyrosine kinase, one of a class of enzymes that function in systems 
affecting cell division, cell-cell interactions, and intercellular commu- 
nication (Chapter 12). The same gene is found in chicken DNA (the 
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usual host for this virus) and in the genomes of many other eukaryotes, 
including humans. When associated with the Rous sarcoma virus, this 
oncogene is often expressed at abnormally high levels, contributing to 
unregulated cell division and cancer. 
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FIGURE 26-36 The genome of HIV, the virus that causes AIDS. In ad- 
dition to the typical retroviral genes, HIV contains several small genes 
with a variety of functions (not identified here, and not all known). 
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Some of these genes overlap (p. 1072). Alternative splicing mechanisms 
produce many different proteins from this small (9.7 X 1 0 3 nucleotides) 
genome. 



studied was the Rous sarcoma virus (also called avian 
sarcoma virus; Fig. 26-35), named for F. Peyton Rous, 
who studied chicken tumors now known to be caused by 
this virus. Since the initial discovery of oncogenes by 
Harold Varmus and Michael Bishop, many dozens of 
such genes have been found in retroviruses. 

The human immunodeficiency virus (HIV), which 
causes acquired immune deficiency syndrome (AIDS), 
is a retrovirus. Identified in 1983, HIV has an RNA 
genome with standard retroviral genes along with sev- 
eral other unusual genes (Fig. 26-36). Unlike many 
other retroviruses, HIV kills many of the cells it infects 
(principally T lymphocytes) rather than causing tumor 
formation. This gradually leads to suppression of the 
immune system in the host organism. The reverse 
transcriptase of HIV is even more error prone than 
other known reverse transcriptases — 10 times more 
so — resulting in high mutation rates in this virus. One or 
more errors are generally made every time the viral 
genome is replicated, so any two viral RNA molecules 
are likely to differ. 

Many modern vaccines for viral infections consist of 
one or more coat proteins of the virus, produced by 
methods described in Chapter 9. These proteins are not 
infectious on their own but stimulate the immune sys- 
tem to recognize and resist subsequent viral invasions 
(Chapter 5). Because of the high error rate of the HIV 
reverse transcriptase, the env gene in this virus (along 
with the rest of the genome) undergoes very rapid mu- 
tation, complicating the development of an effective 
vaccine. However, repeated cycles of cell invasion and 
replication are needed to propagate an HIV infection, so 
inhibition of viral enzymes offers the most effective 
therapy currently available. The HIV protease is tar- 
geted by a class of drugs called protease inhibitors (see 



Box 6-3). Reverse transcriptase is the target of some 
additional drugs widely used to treat HIV-infected indi- 
viduals (Box 26-2). ■ 

Many Transposons, Retroviruses, and Introns May Have a 
Common Evolutionary Origin 

Some well-characterized eukaryotic DNA transposons 
from sources as diverse as yeast and fruit flies have a 
structure very similar to that of retroviruses; these 
are sometimes called retrotransposons (Fig. 26-37). 
Retrotransposons encode an enzyme homologous to the 
retroviral reverse transcriptase, and their coding re- 
gions are flanked by LTR sequences. They transpose 
from one position to another in the cellular genome by 
means of an RNA intermediate, using reverse transcrip- 
tase to make a DNA copy of the RNA, followed by inte- 
gration of the DNA at a new site. Most transposons in 
eukaryotes use this mechanism for transposition, distin- 
guishing them from bacterial transposons, which move 
as DNA directly from one chromosomal location to an- 
other (see Fig. 25-45). 
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FIGURE 26-37 Eukaryotic transposons. The Ty element of the yeast 
Saccharomyces and the copia element of the fruit fly Drosophila serve 
as examples of eukaryotic transposons, which often have a structure 
similar to retroviruses but lack the env gene. The 8 sequences of the 
Ty element are functionally equivalent to retroviral LTRs. In the copia 
element, //Viand RT are homologous to the integrase and reverse tran- 
scriptase segments, respectively, of the pol gene. 
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BOX 26-2 \\{ . Fighting AIDS with Inhibitors of HIV Reverse Transcriptase 



Research into the chemistry of template-dependent nu- 
cleic acid biosynthesis, combined with modern tech- 
niques of molecular biology, has elucidated the life cycle 
and structure of the human immunodeficiency virus, the 
retrovirus that causes AIDS. A few years after the isola- 
tion of HIV, this research resulted in the development of 
drugs capable of prolonging the lives of people infected 
by HIV. 

The first drug to be approved for clinical use was 
AZT, a structural analog of deoxythymidine. AZT was 
first synthesized in 1964 by Jerome P. Horwitz. It failed 
as an anticancer drug (the purpose for which it was 
made) , but in 1985 it was found to be a useful treatment 
for AIDS. AZT is taken up by T lymphocytes, immune 
system cells that are particularly vulnerable to HIV 
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infection, and converted to AZT triphosphate. (AZT 
triphosphate taken directly would be ineffective, be- 
cause it cannot cross the plasma membrane.) HFV's re- 
verse transcriptase has a higher affinity for AZT 
triphosphate than for dTTP, and binding of AZT triphos- 
phate to this enzyme competitively inhibits dTTP bind- 
ing. When AZT is added to the 3' end of the growing 
DNA strand, lack of a 3' hydroxyl means that the DNA 
strand is terminated prematurely and viral DNA synthe- 
sis grinds to a halt. 

AZT triphosphate is not as toxic to the T lympho- 
cytes themselves, because cellular DNA polymerases 
have a lower affinity for this compound than for dTTP. 
At concentrations of 1 to 5 /jm, AZT affects HIV reverse 
transcription but not most cellular DNA replication. Un- 
fortunately, AZT seems to be toxic to the bone marrow 
cells that are the progenitors of erythrocytes, and many 
individuals taking AZT develop anemia. AZT can in- 
crease the survival time of people with advanced AIDS 
by about a year, and it delays the onset of AIDS in those 
who are still in the early stages of HIV infection. Some 
other AIDS drugs, such as dideoxyinosine (DDI), have a 
similar mechanism of action. Newer drugs target and in- 
activate the HIV protease. Because of the high error rate 
of HIV reverse transcriptase and the resulting rapid evo- 
lution of HIV, the most effective treatments of HIV infec- 
tion use a combination of drugs directed at both the 
protease and the reverse transcriptase. 



Retrotransposons lack an env gene and so cannot 
form viral particles. They can be thought of as defective 
viruses, trapped in cells. Comparisons between retro- 
viruses and eukaryotic transposons suggest that reverse 
transcriptase is an ancient enzyme that predates the 
evolution of multicellular organisms. 

Interestingly, many group I and group II introns are 
also mobile genetic elements. In addition to their self- 
splicing activities, they encode DNA endonucleases that 
promote their movement. During genetic exchanges be- 
tween cells of the same species, or when DNA is intro- 
duced into a cell by parasites or by other means, these 
endonucleases promote insertion of the intron into an 
identical site in another DNA copy of a homologous gene 
that does not contain the intron, in a process termed 
homing (Fig. 26-38). Whereas group I intron homing 
is DNA-based, group II intron homing occurs through an 
RNA intermediate. The endonucleases of the group II in- 
trons have associated reverse transcriptase activity. The 
proteins can form complexes with the intron RNAs 
themselves, after the introns are spliced from the pri- 
mary transcripts. Because the homing process involves 



insertion of the RNA intron into DNA and reverse tran- 
scription of the intron, the movement of these introns 
has been called retrohoming. Over time, every copy of a 
particular gene in a population may acquire the intron. 
Much more rarely, the intron may insert itself into a new 
location in an unrelated gene. If this event does not kill 
the host cell, it can lead to the evolution and distribution 
of an intron in a new location. The structures and 
mechanisms used by mobile introns support the idea 
that at least some introns originated as molecular para- 
sites whose evolutionary past can be traced to retro- 
viruses and transposons. 

Telomerase Is a Specialized Reverse Transcriptase 

Telomeres, the structures at the ends of linear eukary- 
otic chromosomes (see Fig. 24-9), generally consist of 
many tandem copies of a short oligonucleotide se- 
quence. This sequence usually has the form T x G y in one 
strand and C y A x in the complementary strand, where x 
and y are typically in the range of 1 to 4 (p. 953) . Telom- 
eres vary in length from a few dozen base pairs in some 
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FIGURE 26-38 Introns that move: homing and retrohoming. Certain 
introns include a gene (shown in red) for enzymes that promote hom- 
ing (certain group I introns) or retrohoming (certain group II introns). 
(a) The gene in the spliced intron is bound by a ribosome and trans- 
lated Group I homing introns specify a site-specific endonuclease, 
called a homing endonuclease. Group II retrohoming introns specify a 
protein with both endonuclease and reverse transcriptase activities. 

(b) Homing. Allele a of a gene X containing a group I homing in- 
tron is present in a cell containing allele b of the same gene, which 
lacks the intron. The homing endonuclease produced by a cleaves b at 
the position corresponding to the intron in a, and double-strand break 
repair {recombination with allele a; see Fig. 25-33) then creates a new 
copy of the intron in b. (c) Retrohoming. Allele a of gene Y contains a 
retrohoming group II intron; allele b lacks the intron. The spliced intron 
inserts itself into the coding strand of b in a reaction that is the reverse 
of the splicing that excised the intron from the primary transcript (see 
Fig. 26-16), except that here the insertion is into DNA rather than 
RNA. The noncoding DNA strand of b is then cleaved by the intron- 
encoded endonuclease/reverse transcriptase. This same enzyme uses 
the inserted RNA as a template to synthesize a complementary DNA 
strand. The RNA is then degraded by cellular ribonucleases and re- 
placed with DNA. 
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ciliated protozoans to tens of thousands of base pairs in 
mammals. The TG strand is longer than its complement, 
leaving a region of single-stranded DNA of up to a few 
hundred nucleotides at the 3' end. 

The ends of a linear chromosome are not readily 
replicated by cellular DNA polymerases. DNA replica- 
tion requires a template and primer, and beyond the end 
of a linear DNA molecule no template is available for the 
pairing of an RNA primer. Without a special mechanism 
for replicating the ends, chromosomes would be short- 
ened somewhat in each cell generation. The enzyme 
telomerase solves this problem by adding telomeres to 
chromosome ends. 

Although the existence of this enzyme may not be 
surprising, the mechanism by which it acts is remark- 
able and unprecedented. Telomerase, like some other 
enzymes described in this chapter, contains both RNA 
and protein components. The RNA component is about 
150 nucleotides long and contains about 1.5 copies of 
the appropriate G y K x telomere repeat. This region of the 
RNA acts as a template for synthesis of the T x G y strand 
of the telomere. Telomerase thereby acts as a cellular 
reverse transcriptase that provides the active site for 
RNA-dependent DNA synthesis. Unlike retroviral re- 
verse transcriptases, telomerase copies only a small seg- 
ment of RNA that it carries within itself. Telomere 
synthesis requires the 3' end of a chromosome as primer 
and proceeds in the usual 5'-»3' direction. Having syn- 
thesized one copy of the repeat, the enzyme repositions 
to resume extension of the telomere (Fig. 26-39a). 

After extension of the T x G y strand by telomerase, 
the complementary G y K x strand is synthesized by cellu- 
lar DNA polymerases, starting with an RNA primer (see 
Fig. 25-13). The single-stranded region is protected by 
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FIGURE 26-39 The TG strand and T loop of telomeres. The internal 
template RNA of telomerase binds to and base-pairs with the TG 
primer (T X G K ) of DNA. ® Telomerase adds more T and G residues 
to the TG primer, then (2) repositions the internal template RNA to 
allow (3) the addition of more T and G residues. The complementary 
strand is synthesized by cellular DNA polymerases (not shown), (b) Pro- 
posed structure of T loops in telomeres. The single-stranded tail synthe- 
sized by telomerase is folded back and paired with its complement in 
the duplex portion of the telomere. The telomere is bound by several 
telomere-binding proteins, including TRF1 andTRF2 (telomere repeat 
binding /actors), (c) Electron micrograph of a T loop at the end of a 
chromosome isolated from a mouse hepatocyte. The bar at the bottom 
of the micrograph represents a length of 5,000 bp. 
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specific binding proteins in many lower eukaryotes, es- 
pecially those species with telomeres of less than a few 
hundred base pairs. In higher eukaryotes (including 
mammals) with telomeres many thousands of base pairs 
long, the single-stranded end is sequestered in a special- 
ized structure called a T loop (Fig. 26-39b). The single- 
stranded end is folded back and paired with its 
complement in the double-stranded portion of the 
telomere. The formation of a T loop involves invasion of 
the 3' end of the telomere's single strand into the duplex 
DNA, perhaps by a mechanism similar to the initiation of 
homologous genetic recombination (see Fig. 25-33). In 
mammals, the looped DNA is bound by two proteins, 
TRF1 and TRF2, with the latter protein involved in for- 
mation of the T loop. T loops protect the 3' ends of chro- 
mosomes, making them inaccessible to nucleases and 
the enzymes that repair double-strand breaks. 



In protozoans (such as Tetrahymend) , loss of 
telomerase activity results in a gradual shortening of 
telomeres with each cell division, ultimately leading 
to the death of the cell line. A similar link between 
telomere length and cell senescence (cessation of 
cell division) has been observed in humans. In germ- 
line cells, which contain telomerase activity, telom- 
ere lengths are maintained; in somatic cells, which 
lack telomerase, they are not. There is a linear, in- 
verse relationship between the length of telomeres 
in cultured fibroblasts and the age of the individual 
from whom the fibroblasts were taken: telomeres in 
human somatic cells gradually shorten as an individ- 
ual ages. If the telomerase reverse transcriptase is 
introduced into human somatic cells in vitro, telom- 
erase activity is restored and the cellular life span in- 
creases markedly. 
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Is the gradual shortening of telomeres a key to the 
aging process? Is our natural life span determined by 
the length of the telomeres we are born with? Further 
research in this area should yield some fascinating 
insights. 

Some Viral RNAs Are Replicated by RNA-Dependent 
RNA Polymerase 

Some E, coli bacteriophages, including f2, MS2, R17, 
and Q/3, as well as some eukaryotic viruses (including 
influenza and Sindbis viruses, the latter associated with 
a form of encephalitis) have RNA genomes. The single- 
stranded RNA chromosomes of these viruses, which also 
function as mRNAs for the synthesis of viral proteins, 
are replicated in the host cell by an RNA-dependent 
RNA polymerase (RNA replicase). All RNA viruses— 
with the exception of retroviruses — must encode a 
protein with RNA-dependent RNA polymerase activity 
because the host cells do not possess this enzyme. 

The RNA replicase of most RNA bacteriophages has 
a molecular weight of -210,000 and consists of four sub- 
units. One subunit (M r 65,000) is the product of the 
replicase gene encoded by the viral RNA and has the ac- 
tive site for replication. The other three subunits are host 
proteins normally involved in host-cell protein synthesis: 
the E. coli elongation factors Tu (M r 45,000) and Ts 
(Mr 34,000) (which ferry amino acyl-tRNAs to the 
ribosomes) and the protein SI (an integral part of the 
30S ribosomal subunit). These three host proteins may 
help the RNA replicase locate and bind to the 3' ends of 
the viral RNAs. 

The RNA replicase isolated from Q/3-infected E. coli 
cells catalyzes the formation of an RNA complementary 
to the viral RNA, in a reaction equivalent to that cat- 
alyzed by DNA-dependent RNA polymerases. New RNA 
strand synthesis proceeds in the 5'-»3' direction by a 
chemical mechanism identical to that used in all other 
nucleic acid synthetic reactions that require a template. 
RNA replicase requires RNA as its template and will not 
function with DNA. It lacks a separate proofreading en- 
donuclease activity and has an error rate similar to that 
of RNA polymerase. Unlike the DNA and RNA poly- 
merases, RNA replicases are specific for the RNA of 
their own virus; the RNAs of the host cell are generally 
not replicated. This explains how RNA viruses are pref- 
erentially replicated in the host cell, which contains 
many other types of RNA. 

RNA Synthesis Offers Important Clues to 
Biochemical Evolution 

The extraordinary complexity and order that distinguish 
living from inanimate systems are key manifestations of 
fundamental life processes. Maintaining the living state 
requires that selected chemical transformations occur 
very rapidly — especially those that use environmental 
energy sources and synthesize elaborate or specialized 
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cellular macromolecules. Life 
depends on powerful and selec- 
tive catalysts — enzymes — and 
on informational systems capa- 
ble of both securely storing the 
blueprint for these enzymes 
and accurately reproducing the 
blue- print for generation after 
generation. Chromosomes en- 
code the blueprint not for the 
cell but for the enzymes that 
construct and maintain the cell. 
The parallel demands for infor- 
mation and catalysis present a 
classic conundrum: what came 
first, the information needed to 
specify structure or the en- 
zymes needed to maintain and 
transmit the information? 

The unveiling of the struc- 
tural and functional complexity 
of RNA led Carl Woese, Francis 
Crick, and Leslie Orgel to pro- 
pose in the 1960s that this 
macromolecule might serve as 
both information carrier and cat- 
alyst, The discovery of catalytic 
RNAs took this proposal from 
conjecture to hypothesis and 
has led to widespread specula- 
tion that an "RNA world" might 
have been important in the tran- 
sition from prebiotic chemistry 
to life (see Fig. 1-34) The par- 
ent of all life on this planet, in the 
sense that it could reproduce it- 
self across the generations from 
the origin of life to the present, 
might have been a self-replicating RNA or a polymer with 
equivalent chemical characteristics. 

How might a self-replicating polymer come to be? 
How might it maintain itself in an environment where 
the precursors for polymer synthesis are scarce? How 
could evolution progress from such a polymer to the 
modern DNA-protein world? These difficult questions 
can be addressed by careful experimentation, providing 
clues about how life on Earth began and evolved. 

The probable origin of purine and pyrimidine bases 
is suggested by experiments designed to test hypotheses 
about prebiotic chemistry (pp. 30-31). Beginning with 
simple molecules thought to be present in the early at- 
mosphere (CH 4 , NH 3 , H 2 0, H 2 ), electrical discharges 
such as lightning generate, first, more reactive molecules 
such as HCN and aldehydes, then an array of amino acids 
and organic acids (see Fig. 1-33). When molecules such 
as HCN become abundant, purine and pyrimidine bases 
are synthesized in detectable amounts. Remarkably, a 
concentrated solution of ammonium cyanide, refluxed 
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for a few days, generates adenine in yields of up to 0.5% 
(Fig. 26-40). Adenine may well have been the first and 
most abundant nucleotide constituent to appear on 
Earth. Intriguingly, most enzyme cofactors contain 
adenosine as part of their structure, although it plays no 
direct role in the cofactor function (see Fig. 8-38). This 
may suggest an evolutionary relationship, based on the 
simple synthesis of adenine from cyanide. 

The RNA world hypothesis requires a nucleotide 
polymer to reproduce itself. Can a ribozyme bring about 
its own synthesis in a template-directed manner? The 
self-splicing rRNA intron of Tetrahymena (Fig. 26-30) 
catalyzes the reversible attack of a guanosine residue 
on the 5' splice junction (Fig. 26-41). If the 5' splice 
site and the internal guide sequence are removed from 
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FIGURE 26-40 Possible prebiotic synthesis of adenine from ammo- 
nium cyanide. Adenine is derived from five molecules of cyanide, 
denoted by shading. 

the intron, the rest of the intron can bind RNA strands 
paired with short oligonucleotides. Part of the remain- 
ing intact intron effectively acts as a template for the 
alignment and ligation of the short oligonucleotides. 
The reaction is in essence a reversal of the attack of 
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FIGURE 26-41 RNA-dependent synthesis of an RNA polymer from 
oligonucleotide precursors, (a) The first step in the removal of the self- 
splicing group I intron of the rRNA precursor of Tetrahymena is re- 
versible attack of a guanosine residue on the 5' splice site. Only P1, 
the region of the ribozyme that includes the internal guide sequence 
(boxed) and the 5' splice site, is shown in detail; the rest of the ri- 
bozyme is represented as a green blob. The complete secondary struc- 
ture of the ribozyme is shown in Figure 26-30. (b) If P1 is removed 
(shown as the darker green "hole"), the ribozyme retains both its three- 



dimensional shape and its catalytic capacity. A new RNA molecule 
added in vitro can bind to the ribozyme in the same manner as the in- 
ternal guide sequence of PI in (a). This provides a template for further 
RNA polymerization reactions when oligonucleotides complementary 
to the added RNA base-pair with it. The ribozyme can link these 
oligonucleotides in a process equivalent to the reversal of the reaction 
in (a). Although only one such reaction is shown in (b), repeated bind- 
ing and catalysis can result in the RNA-dependent synthesis of long 
RNA polymers. 
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The SELEX Method for Generating RNA Polymers with 
New Functions 



SELEX (systematic evolution of Jigands by exponential 
enrichment) is used to generate aptamers, oligonu- 
cleotides selected to tightly bind a specific molecular 
target. The process is generally automated to allow 
rapid identification of one or more aptamers with the de- 
sired binding specificity. 

Figure 1 illustrates how SELEX is used to select an 
RNA species that binds tightly to ATP. In step ®, a random 
mixture of RNA polymers is subjected to "unnatural selec- 
tion" by passing it through a resin to which ATP is attached. 
The practical limit for the complexity of an RNA mixture in 
SELEX is about 10 15 different sequences, which allows for 
the complete randomization of 25 nucleotides (4 25 = 10 15 ). 
For longer RNAs, the RNA pool used to initiate the search 
does not include all possible sequences. ® RNA polymers 
that pass through the column are discarded; ® those that 
bind to ATP are washed from the column with salt solution 
and collected. @The collected RNA polymers are amplified 
by reverse transcriptase to make many DNA complements 
to the selected RNAs; then an RNA polymerase makes 
many RNA complements of the resulting DNA molecules. 
©This new pool of RNA is subjected to the same selection 
procedure, and the cycle is repeated a dozen or more times. 
At the end, only a few aptamers — in this case, RNA se- 
quences with considerable affinity for ATP — remain. 
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FIGURE 1 The SELEX procedure. 
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guanosine on the 5' splice junction, but the result is 
the synthesis of long RNA polymers from short ones, 
with the sequence of the product defined by an RNA 
template. 

A self-replicating polymer would quickly use up 
available supplies of precursors provided by the rela- 
tively slow processes of prebiotic chemistry. Thus, from 
an early stage in evolution, metabolic pathways would 
be required to generate precursors efficiently, with the 
synthesis of precursors presumably catalyzed by ri- 
bozymes. The extant ribozymes found in nature have a 
limited repertoire of catalytic functions, and of the ri- 
bozymes that may once have existed, no trace is left. To 
explore the RNA world hypothesis more deeply, we 
need to know whether RNA has the potential to catalyze 
the many different reactions needed in a primitive sys- 
tem of metabolic pathways. 

The search for RNAs with new catalytic functions 
has been aided by the development of a method that 
rapidly searches pools of random polymers of RNA and 
extracts those with particular activities: SELEX is 
nothing less than accelerated evolution in a test tube 
(Box 26-3). It has been used to generate RNA molecules 
that bind to amino acids, organic dyes, nucleotides, 
cyanocobalamin, and other molecules. Researchers 
have isolated ribozymes that catalyze ester and amide 



bond formation, S N 2 reactions, metallation of (addition 
of metal ions to) porphyrins, and carbon-carbon bond 
formation. The evolution of enzymatic cofactors with 
nucleotide "handles" that facilitate their binding to 
ribozymes might have further expanded the repertoire 
of chemical processes available to primitive metabolic 
systems. 

As we shall see in the next chapter, some natural 
RNA molecules catalyze the formation of peptide bonds, 
offering an idea of how the RNA world might have been 
transformed by the greater catalytic potential of pro- 
teins. The synthesis of proteins would have been a ma- 
jor event in the evolution of the RNA world, but would 
also have hastened its demise. The information-carrying 
role of RNA may have passed to DNA because DNA is 
chemically more stable. RNA replicase and reverse tran- 
scriptase may be modern versions of enzymes that once 
played important roles in making the transition to the 
modern DNA-based system. 

Molecular parasites may also have originated in an 
RNA world. With the appearance of the first inefficient 
self-replicators, transposition could have been a po- 
tentially important alternative to replication as a strat- 
egy for successful reproduction and survival. Early 
parasitic RNAs would simply hop into a self-replicat- 
ing molecule via catalyzed transesterification, then 
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FIGURE 2 RNA aptamer that binds ATP. The shaded nucleotides are 
those required for the binding activity. 



plex pool of sequences makes this a promising approach 
for the generation of new therapies. For example, one 
could select an RNA that binds tightly to a receptor pro- 
tein prominent in the plasma membrane of cells in a par- 
ticular cancerous tumor. Blocking the activity of the 
receptor, or targeting a toxin to the tumor cells by at- 
taching it to the aptamer, would kill the cells. SELEX 
also has been used to select DNA aptamers that detect 
anthrax spores. Many other promising applications are 
under development. ■ 



Critical sequence features of an RNA aptamer that 
binds ATP are shown in Figure 2; molecules with this 
general structure bind ATP (and other adenosine nu- 
cleotides) within < 50 fjM. Figure 3 presents the three- 
dimensional structure of a 36 nucleotide RNA aptamer 
(shown as a complex with AMP) generated by SELEX. 
This RNA has the backbone structure shown in Figure 2. 

I In addition to its use in exploring the potential 
HH functionality of RNA, SELEX has an important 
practical side in identifying short RNAs with pharma- 
ceutical uses. Finding an aptamer that binds specifically 
to every potential therapeutic target may be impossible, 
but the capacity of SELEX to rapidly select and amplify 
a specific oligonucleotide sequence from a highly com- 




FIGURE 3 (Derived from PDB ID 1 RAW) RNA aptamer bound to AMP. 
The bases of the conserved nucleotides (forming the binding pocket) 
are white; the bound AMP is red. 



passively undergo replication. Natural selection would 
have driven transposition to become site-specific, tar- 
geting sequences that did not interfere with the cat- 
alytic activities of the host RNA. Replicators and RNA 
transposons could have existed in a primitive symbi- 
otic relationship, each contributing to the evolution of 
the other. Modern introns, retroviruses, and trans- 
posons may all be vestiges of a "piggy-back" strategy 
pursued by early parasitic RNAs. These elements con- 
tinue to make major contributions to the evolution of 
their hosts. 

Although the RNA world remains a hypothesis, with 
many gaps yet to be explained, experimental evidence 
supports a growing list of its key elements. Further ex- 
perimentation should increase our understanding. Im- 
portant clues to the puzzle will be found in the workings 
of fundamental chemistry, in living cells, and perhaps on 
other planets. Meanwhile, the extant RNA universe con- 
tinues to expand (Box 26^1). 

SUMMARY 26.3 RNA-Dependent 

Synthesis of RNA and 
DNA 

■ RNA-dependent DNA polymerases, also called 
reverse transcriptases, were first discovered in 



retroviruses, which must convert their RNA 
genomes into double-stranded DNA as part of their 
life cycle. These enzymes transcribe the viral RNA 
into DNA, a process that can be used experimentally 
to form complementary DNA. 

a Many eukaryotic transposons are related to 

retroviruses, and their mechanism of transposition 
includes an RNA intermediate. 

■ Telomerase, the enzyme that synthesizes the 
telomere ends of linear chromosomes, is a 
specialized reverse transcriptase that contains 
an internal RNA template. 

■ RNA-dependent RNA polymerases, such as the 
replicases of RNA bacteriophages, are template- 
specific for the viral RNA. 

i The existence of catalytic RNAs and pathways 
for the interconversion of RNA and DNA 
has led to speculation that an important stage 
in evolution was the appearance of an RNA 
(or an equivalent polymer) that could 
catalyze its own replication. The biochemical 
potential of RNAs can be explored by 
SELEX, a method for rapidly selecting RNA 
sequences with particular binding or catalytic 
properties. 



BOX 26-4 An Expanding RNA Universe Filled with TUF RNAs 



Current estimates for the number of genes in the human 
genome, and in many other genomes, are mentioned in 
multiple places throughout this text. The estimates pre- 
sume that scientists know a gene when they see it, 
based on our current understanding of DNA, RNA, and 
proteins. Is the presumption correct? 

As noted in Chapter 9, less than 2% of the human 
genome seems to encode proteins. Even when introns 
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are factored in, one might expect only a tiny fraction of 
the genome to be transcribed into RNA, mostly mRNA 
to encode those proteins. The remainder of the genome 
has sometimes been referred to as junk DNA. The "junk" 
moniker simply reflects our ignorance, which is slowly 
giving way to the realization that most of the genome is 
fully functional. 

In an effort to better map the boundaries of the hu- 
man transcriptome, researchers have invented new 
tools to determine with higher accuracy which genomic 
sequences are transcribed into RNA. The answers are 
surprising. Much more of our genome is transcribed into 
RNA than anyone supposed. Much of this RNA seems 
not to encode proteins. Much of it lacks some of the 
structures (for example, the 3' poly(A) tail) that char- 
acterize mRNA. So what is this RNA doing? 

Most of the methods for looking into this matter fall 
into two broad categories: cDNA cloning and microar- 
rays. The creation of a cDNA library to study the genes 
transcribed in a particular eukaryotic genome is de- 
scribed in Chapter 9 (see Fig. 9-14). However, the clas- 
sical methods for generating cDNA often lead to the 
cloning of only part of the sequence of a given tran- 
script. Because reverse transcriptase may stall at re- 
gions of secondary structure in mRNA, or may simply 
dissociate, often 20% or less of the clones in a cDNA 
library are full-length DNAs. This makes it difficult to 
use the library to map transcription start sites (TSSs) 
and to study the part of a gene that encodes the amino- 
terminal sequence of a protein. One of the many 
approaches developed to overcome this problem is illus- 
trated in Figure 1. Such refinements in technology have 
resulted in the creation of cDNA libraries in which more 
than 95% of the clones are full-length, providing an 
enriched source of information about cellular RNAs. 
However, cDNAs are generally created from RNA tran- 
scripts that have poly(A) tails. The use of microarrays, 
coupled to methods of cDNA preparation that do not 
rely on poly(A) tails (Fig. 2), has revealed that much of 
the RNA in eukaryotic cells lacks the common end 
structures. 

A complete picture has not yet emerged, but some 
conclusions are already clear. If one excludes the repeti- 
tive sequences (transposons, for example) that can 



FIGURE 1 A strategy for cloning full-length cDNAs. A pool of mRNAs 
is isolated from a tissue sample. In some cases, mRNAs bound by a 
particular protein may be targeted by immunoprecipitating the protein 
and isolating the associated mRNAs. Biotin is covalently attached to 
the 5' ends of the mRNAs, making use of unique features of the 5' cap. 
A poly(dT) primer is used to prime reverse transcription of the mRNAs. 
Rnase I degrades RNA that is not part of a DNA-RNA hybrid and thus 
destroys the incomplete cDNA-RNA pairs. The full-length cDNA-RNA 
hybrids are collected with streptavidin beads (which bind biotin), con- 
verted to duplex DNA, and cloned. 




make up half of a mammalian genome, at least 40% — and 
perhaps the vast majority— of the remaining genomic 
DNA is transcribed into RNA. There seem to be more 
RNAs lacking poly(A) tails than RNAs with them. Much 
of this RNA is not transported to the cytoplasm, but re- 
mains exclusively in the nucleus. Many segments of the 
genome are transcribed on both strands; one transcript 
is the complement of the other, a relationship referred to 
as antisense. Many of the antisense RNAs 
may be involved in regulation of the RNAs 
with which they pair. Many RNAs are pro- 
duced in only one or a few tissues, and new 
transcripts are discovered every time a new 
tissue source is analyzed. Thus ; the com- 
plete transcriptome has not yet been de- 
fined for any organism. Most important, 
many of the novel RNAs are transcribed 
from genomic segments, such as those illus- 
trated in Figure 9-20, that share synteny in 
more than one organism. This evolutionary 
conservation strongly suggests that these 
RNAs have an important function. 

Some of the novel RNAs are snoRNAs, 
snRNAs, or miRNAs, types of RNA recog- 
nized only in the past two decades. New TSS 
sequences are being discovered. New 
classes of RNA molecules are being defined. 
New patterns of alternative splicing are be- 
ing elucidated. And all of these findings are 
challenging our definitions of a gene. In 
the mouse and human genomes, even the 
familiar protein-encoding mRNA transcripts 
may be much more numerous than initially 



FIGURE 2 Defining the transcriptome with microar- 
rays. (a) Tiled microarrays are synthesized, represent- 
ing the nonrepetitive parts of the genome. In a tiled 
array, the successive oligonucleotides in the individual 
spots overlap in sequence, so that each nucleotide 
(such as the T shown in blue) is represented multiple 
times, (b) A tissue sample is fractionated to separate 
nuclear and cytoplasmic samples, and RNA is isolated 
from each. The RNA containing poly(A) tails is sepa- 
rated from RNA lacking the tails (by passing the RNA 
over a column with bound poly(dT)). RNA with a 
poIy(A) tail is converted to cDNA using methods de- 
scribed in Figure 1. RNA lacking a poly(A) tail is con- 
verted to cDNA by using primers with randomized 
sequences. The resulting DNA fragments do not corre- 
spond in length precisely to the RNAs from which they 
are derived, but the overall DNA pool includes most of 
the sequences present in the original RNAs. The cDNA 
samples are then labeled and used to probe the mi- 
croarrays. The signals from the microarrays define the 
sequences that were transcribed into RNA. 



thought, and the number of known protein-encoding 
genes may soon increase. However, the function of most 
of the newly discovered transcripts is unknown, and they 
are simply called TUFs (transcripts of unknown func- 
tion). This new RNA universe is a frontier that promises 
further insights into the workings of eukaryotic cells and 
perhaps a new glimpse of our origins in the RNA world of 
the distant past. 
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Key Terms 



Terms in bold are defined in the glossary. 



transcription 1021 
messenger RNA 

(mRNA) 1021 
transfer RNA 

(tRNA) 1021 
ribosomal RNA 

(rRNA) 1021 
transcriptome 1021 
DNA-dependent RNA 

polymerase 1022 
promoter 1025 
consensus 

sequence 1025 
footprinting 1026 
cAMP receptor protein 

(CRP) 1029 
repressor 1029 
transcription factors 1030 
ribozymes 1034 
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primary transcript 1034 
RNA splicing 1034 
5' cap 1034 
spliceosome 1037 
snRNA 1037 
poly(A) tail 1039 
snoRNA 1043 
miRNA 1045 
reverse transcriptase 
1050 

retrovirus 1050 
complementary DNA 

(cDNA) 1051 
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telomerase 1054 
RNA-dependent RNA 
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Problems 

1. RNA Polymerase (a) How long would it take for the E. 
coli RNA polymerase to synthesize the primary transcript for 
theE. coli genes encoding the enzymes for lactose metabolism 
(the 5,300 bp lac operon, considered in Chapter 28)? (b) How 
far along the DNA would the transcription "bubble" formed by 
RNA polymerase move in 10 seconds? 

2. Error Correction by RNA Polymerases DNA poly- 
merases are capable of editing and error correction, whereas 
the capacity for error correction in RNA polymerases seems to 
be quite limited. Given that a single base error in either replica- 
tion or transcription can lead to an error in protein synthesis, 
suggest a possible biological explanation for this difference. 

3. RNA Posttranscriptional Processing Predict the likely 
effects of a mutation in the sequence (5')AAUAAA in a eu- 
karyotic mRNA transcript. 

4. Coding versus Template Strands The RNA genome of 
phage Q/3 is the nontemplate strand, or coding strand, and 
when introduced into the cell it functions as an mRNA, Sup- 
pose the RNA replicase of phage QjS synthesized primarily 
template-strand RNA and uniquely incorporated this, rather 
than nontemplate strands, into the viral particles. What would 
be the fate of the template strands when they entered a new 
cell? What enzyme would have to be included in the viral par- 
ticles for successful invasion of a host cell? 

5. The Chemistry of Nucleic Acid Biosynthesis Describe 
three properties common to the reactions catalyzed by DNA 
polymerase, RNA polymerase, reverse transcriptase, and RNA 
replicase. How is the enzyme polynucleotide phosphorylase 
similar to and different from these four enzymes? 

6. RNA Splicing What is the minimum number of transes- 
terification reactions needed to splice an intron from an mRNA 
transcript? Explain. 

7. RNA Processing If the splicing of mRNA in a vertebrate 
cell is blocked, the rRNA modification reactions are also 
blocked. Suggest a reason for this. 

8. RNA Genomes The RNA viruses have relatively small 
genomes. For example, the single-stranded RNAs of retro- 
viruses have about 10,000 nucleotides and the Q/3 RNA is only 
4,220 nucleotides long. Given the properties of reverse tran- 
scriptase and RNA replicase described in this chapter, can you 
suggest a reason for the small size of these viral genomes? 

9. Screening RNAs by SELEX The practical limit for the 
number of different RNA sequences that can be screened in a 



SELEX experiment is 10 . (a) Suppose you are working with 
oligonucleotides 32 nucleotides long. How many sequences ex- 
ist in a randomized pool containing every sequence possible? 
(b) What percentage of these can be screened in a SELEX ex- 
periment? (c) Suppose you wish to select an RNA molecule that 
catalyzes the hydrolysis of a particular ester. From what you 
know about catalysis, propose a SELEX strategy that might 
allow you to select the appropriate catalyst. 

10. Slow Death The death cap mushroom, Amanita phal- 
loides, contains several dangerous substances, including the 
lethal a-amanitin. This toxin blocks RNA elongation in con- 
sumers of the mushroom by binding to eukaryotic RNA poly- 
merase II with very high affinity; it is deadly in concentrations 
as low as 10~ 8 m. The initial reaction to ingestion of the mush- 
room is gastrointestinal distress (caused by some of the other 
toxins) . These symptoms disappear, but about 48 hours later, 
the mushroom-eater dies, usually from liver dysfunction. 
Speculate on why it takes this long for a-amanitin to kill. 

K9 11- Detection of Rifampicin-Resistant Strains of 
|] Tuberculosis Rifampicin is an important antibiotic 
used to treat tuberculosis and other mycobacterial diseases. 
Some strains of Mycobacterium tuberculosis, the causative 
agent of tuberculosis, are resistant to rifampicin. These 
strains become resistant through mutations that alter the 
rpoB gene, which encodes the /3 sub unit of the RNA poly- 
merase. Rifampicin cannot bind to the mutant RNA poly- 
merase and so is unable to block the initiation of transcription. 
DNA sequences from a large number of rifampicin-resistant 
M. tuberculosis strains have been found to have mutations in 
a specific 69 bp region of rpoB. One well-characterized ri- 
fampicin-resistant strain has a single base pair alteration in 
rpoB that results in a His residue being replaced by an Asp 
residue in the /3 subunit. 

(a) Based on your knowledge of protein chemistry, sug- 
gest a technique that would allow detection of the rifampicin- 
resistant strain containing this particular mutant protein. 

(b) Based on your knowledge of nucleic acid chemistry, 
suggest a technique to identify the mutant form of rpoB. 

Biochemistry on the Internet 

12. The Ribonuclease Gene Human pancreatic ribonucle- 
ase has 128 amino acid residues. 

(a) What is the minimum number of nucleotide pairs 
required to code for this protein? 

(b) The mRNA expressed in human pancreatic cells was 
copied with reverse transcriptase to create a "library" of hu- 
man DNA. The sequence of the mRNA coding for human pan- 
creatic ribonuclease was determined by sequencing the 
complementary DNA (cDNA) from this library that included 
an open reading frame for the protein. Use the Entrez data- 
base system (www.ncbi.nlm.nih.gov/Entrez) to find the pub- 
lished sequence of this mRNA (search the CoreNucleotide 
database for accession number D26129). What is the length of 
this mRNA? 

(c) How can you account for the discrepancy between the 
size you calculated in (a) and the actual length of the mRNA? 
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Data Analysis Problem 

13. A Case of RNA Editing The AMPA (a-amino-3- 
hydroxy-5-methyl-4-isoxazolepropionic acid) receptor is an 
important component of the human nervous system. It is pres- 
ent in several forms, in different neurons, and some of this 
variety results from posttranscriptional modification. This 
problem explores research on the mechanism of this RNA 
editing. 

An initial report by Sommer and coauthors (1991) looked 
at the sequence encoding a key Arg residue in the AMPA re- 
ceptor. The sequence of the cDNA (see Fig, 9-14) for the 
AMPA receptor showed a CGG (Arg; see Fig. 27-7) codon for 
this amino acid. Surprisingly, the genomic DNA showed a CAG 
(Gin) codon at this position. 

(a) Explain how this result is consistent with posttran- 
scriptional modification of the AMPA receptor mRNA. 

Rueter and colleagues (1995) explored this mechanism in 
detail. They first developed an assay to differentiate between 
edited and unedited transcripts, based on the Sanger method 
of DNA sequencing (see Fig 8-33) They modified the tech- 
nique to determine whether the base in question was an A (as 
in CAG) or not. They designed two DNA primers based on the 
genomic DNA sequence of this region of the AMPA gene. 
These primers, and the genomic DNA sequence of the non- 
template strand for the relevant region of the AMPA receptor 
gene, are shown below; the A residue that is edited is in red. 

To detect whether this A was present or had been edited 
to another base, Rueter and coworkers used the following 
procedure: 

1 . Prepared cDNA complementary to the mRNA, using primer 
1, reverse transcriptase, dATP, dGTP, dCTP, and dTTP 

2. Removed the mRNA. 

3. Annealed 32 P-labeled primer 2 to the cDNA, and reacted 
this with DNA polymerase, dGTP, dCTP, dTTP, and ddATP 
(dideoxy ATP; see Fig. 8-33). 

4. Denatured the resulting duplexes and separated them with 
polyacrylamide gel electrophoresis (PAGE; p. 88). 

5. Detected the 32 P-labeled DNA species with autoradiography 

They found that edited mRNA produced a 22 nucleotide 
[ 32 P]DNA, whereas unedited mRNA produced a 19 nucleotide 
[ 32 P]DNA. 

(b) Using the sequences below, explain how the edited 
and unedited mRNAs resulted in these different products. 

Using the same procedure, to measure the fraction of tran- 
scripts edited under different conditions, the researchers 
found that extracts of cultured epithelial cells (a common cell 
line called HeLa) could edit the mRNA at a high level. To de- 
termine the nature of the editing machinery, they pretreated 
an active HeLa cell extract as described in the table and meas- 



ured its ability to edit AMPA mRNA. Proteinase K degrades 
only proteins; micrococcal nuclease, only DNA. 



Sample 


Pretreatment 


% mRNA edited 


1 


None 


18 


2 


Proteinase K 


5 


3 


Heat to 65 °C 


3 


4 


Heat to 85 °C 


3 


5 


Micrococcal nuclease 


17 



(c) Use these data to argue that the editing machinery 
consists of protein. What is a key weakness in this argument? 

To determine the exact nature of the edited base, Rueter 
and colleagues used the following procedure: 

1. Produced mRNA, using [a- 32 P]ATP in the reaction mixture. 

2. Edited the labeled mRNA by incubating with HeLa extract. 

3. Hydrolyzed the edited mRNA to single nucleotide 
monophosphates with nuclease PI. 

4. Separated the nucleotide monophosphates with thin-layer 
chromatography (TLC; see Fig. 10-24). 

5. Identified the resulting 32 P-labeled nucleotide monophos- 
phates with autoradiography. 

In unedited mRNA, they found only [ 32 P]AMP; in edited 
mRNA, they found mostly [ 32 P]AMP with some [ 32 P]IMP (ino- 
sine monophosphate; see Fig. 22-34). 

(d) Why was it necessary to use [a- 32 P]ATP rather than 
[/3- 32 P]ATP or [y- 32 P]ATP in this experiment? 

(e) Why was it necessary to use [a- 32 P]ATP rather than 
[a- 32 P]GTP, [a- 32 P]CTP, or [a- 32 P]UTP? 

(f) How does the result exclude the possibility that the 
entire A nucleotide (sugar, base, and phosphate) was removed 
and replaced by an I nucleotide during the editing process? 

The researchers next edited mRNA that was labeled with 
[2,8- 3 H]ATP, and repeated the above procedure. The only 3 H- 
labeled mononucleotides produced were AMP and IMP. 

(g) How does this result exclude removal of the A base 
(leaving the sugar-phosphate backbone intact) followed by 
replacement with an I base as a mechanism of editing? What, 
then, is the most likely mechanism of editing in this case? 

(h) How does changing an A to an I residue in the mRNA 
explain the Gin to Arg change in protein sequence in the two 
forms of AMPA receptor protein? (Hint: See Fig. 27-8). 
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(5') ...GTCTCTGGTTTTCCTTGGGTGCCTTTATGCAGC AAGGATGCGATATTTCGCCAAG.., 
Primer 1: C GTT C CT AC GCT AT AAAGC GGT T C (5') 

Primer 2: (5') CCTTGGGTGCCTTTA 



Obviously, Harry [Noller]'s finding doesn't speak to how life started, 
and it doesn't explain what came before RNA. But as part of the con- 
tinually growing body of circumstantial evidence that there was a life 
form before us on this planet, from which we emerged — boy, it's very 
strong! 

—Gerald Joyce, quoted in commentary in Science, 1992 
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Proteins are the end products of most information 
pathways. A typical cell requires thousands of dif- 
ferent proteins at any given moment. These must 
be synthesized in response to the cell's current needs, 
transported (targeted) to their appropriate cellular lo- 
cations, and degraded when no longer needed. 

An understanding of protein synthesis, the most 
complex biosynthetic process, has been one of the 
greatest challenges in biochemistry. Eukaryotic protein 
synthesis involves more than 70 different ribosomal pro- 
teins; 20 or more enzymes to activate the amino acid 
precursors; a dozen or more auxiliary enzymes and 
other protein factors for the initiation, elongation, and 
termination of polypeptides; perhaps 100 additional en- 
zymes for the final processing of different proteins; and 
40 or more kinds of transfer and ribosomal RNAs. Over- 
all, almost 300 different macromolecules cooperate to 
synthesize polypeptides. Many of these macromolecules 
are organized into the complex three-dimensional struc- 
ture of the ribosome. 

To appreciate the central importance of protein syn- 
thesis, consider the cellular resources devoted to this 
process. Protein synthesis can account for up to 90% of 
the chemical energy used by a cell for all biosynthetic 
reactions. Every bacterial, archaeal, and eukaryotic cell 
contains from several to thousands of copies of many 
different proteins and RNAs. The 15,000 ribosomes, 
100,000 molecules of protein synthesis-related protein 
factors and enzymes, and 200,000 tRNA molecules in a 
typical bacterial cell can account for more than 35% of 
the cell's dry weight. 

Despite the great complexity of protein synthesis, pro- 
teins are made at exceedingly high rates. A polypeptide 



of 100 residues is synthesized in an Escherichia coli 
cell (at 37 °C) in about 5 seconds. Synthesis of the thou- 
sands of different proteins in a cell is tightly regulated, 
so that just enough copies are made to match the cur- 
rent metabolic circumstances. To maintain the appropri- 
ate mix and concentration of proteins, the targeting and 
degradative processes must keep pace with synthesis. 
Research is gradually uncovering the finely coordinated 
cellular choreography that guides each protein to its 
proper cellular location and selectively degrades it when 
it is no longer required. 

The study of protein synthesis offers another impor- 
tant reward: a look at a world of RNA catalysts that may 
have existed before the dawn of life "as we know it." Re- 
searchers have elucidated the structure of bacterial ribo- 
somes, revealing the workings of cellular protein 
synthesis in beautiful molecular detail. And what did they 
find? Proteins are synthesized by a gigantic RNA enzyme! 

27.1 The Genetic Code 

Three major advances set the stage for our present 
knowledge of protein biosynthesis. First, in the early 
1950s, Paul Zamecnik and his colleagues designed a set 
of experiments to investigate where in the cell proteins 
are synthesized. They injected radioactive amino acids 
into rats and, at different time 
intervals after the injection, re- 
moved the liver, homogenized 
it, fractionated the homogenate 
by centrifugation, and examined 
the subcellular fractions for the 
presence of radioactive protein. 
When hours or days were al- 
lowed to elapse after injection of 
the labeled amino acids, all the 
subcellular fractions contained 
labeled proteins. However, when 
only minutes had elapsed, labeled Pa u I Za mec n i k 
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FIGURE 27-1 Ribosomes and endoplasmic reticulum. Electron micro- 
graph and schematic drawing of a portion of a pancreatic cell, show- 
ing ribosomes attached to the outer (cytosolic) face of the endoplasmic 
reticulum (ER). The ribosomes are the numerous small dots bordering 
the parallel layers of membranes. 



protein appeared only in a fraction containing small ribonu- 
cleoprotein particles. These particles, visible in animal tis- 
sues by electron microscopy, were therefore identified as 
the site of protein synthesis from amino acids, and later 
were named ribosomes (Fig. 27-1). 

The second key advance was made by Mahlon 
Hoagland and Zamecnik, when they found that amino 
acids were "activated" when incubated with ATP and 
the cytosolic fraction of liver cells. The amino acids be- 
came attached to a heat-stable soluble RNA of the type 
that had been discovered and characterized by Robert 
Holley and later called transfer RNA (tRNA), to form 
aminoacyl-tRNAs. The enzymes that catalyze this 
process are the aminoacyl-tRNA synthetases. 

The third advance resulted from Francis Crick's rea- 
soning on how the genetic information encoded in the 4- 
letter language of nucleic acids could be translated into 
the 20-letter language of proteins. A small nucleic acid 
(perhaps RNA) could serve the role of an adaptor, one 
part of the adaptor molecule binding a specific amino acid 
and another part recognizing the nucleotide sequence 
encoding that amino acid in an mRNA (Fig. 27-2). This 
idea was soon verified. The tRNA adaptor "translates" 
the nucleotide sequence of an mRNA into the amino 
acid sequence of a polypeptide. The overall process of 
mRNA-guided protein synthesis is often referred to sim- 
ply as translation. 

These three developments soon led to recognition 
of the major stages of protein synthesis and ultimately 
to the elucidation of the genetic code that specifies each 
amino acid. 
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FIGURE 27-2 Crick's adaptor hypothesis. Today we know that the 
amino acid is covalently bound at the 3' end of a tRNA molecule and 
that a specific nucleotide triplet elsewhere in the tRNA interacts with a 
particular triplet codon in mRNA through hydrogen bonding of com- 
plementary bases. 

The Genetic Code Was Cracked Using Artificial 
mRNA Templates 

By the 1960s it was apparent that at least three nu- 
cleotide residues of DNA are necessary to encode each 
amino acid. The four code letters of DNA (A, T, G, and 
C) in groups of two can yield only 4 2 = 16 different com- 
binations, insufficient to encode 20 amino acids. Groups 
of three, however, yield 4 3 = 64 different combinations. 

Several key properties of the genetic code were es- 
tablished in early genetic studies (Figs 27-3, 27-4). A 
codon is a triplet of nucleotides that codes for a specific 
amino acid. Translation occurs in such a way that these 
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FIGURE 27-3 Overlapping versus nonoverlapping genetic codes. In a 

nonoverlapping code, codons (numbered consecutively) do not share 
nucleotides. In an overlapping code, some nucleotides in the mRNA 
are shared by different codons. In a triplet code with maximum over- 
lap, many nucleotides, such as the third nucleotide from the left (A), 
are shared by three codons. Note that in an overlapping code, the 
triplet sequence of the first codon limits the possible sequences for the 
second codon. A nonoverlapping code provides much more flexibility 
in the triplet sequence of neighboring codons and therefore in the pos- 
sible amino acid sequences designated by the code. The genetic code 
used in all living systems is now known to be nonoverlapping. 
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FIGURE 27-4 The triplet, nonoverlapping code. Evidence for the 
general nature of the genetic code came from many types of ex- 
periments, including genetic experiments on the effects of dele- 
tion and insertion mutations. Inserting or deleting one base pair 
(shown here in the mRNA transcript) alters the sequence of 
triplets in a nonoverlapping code; all amino acids coded by the 
mRNA following the change are affected, Combining insertion 
and deletion mutations affects some amino acids but can eventu- 
ally restore the correct amino acid sequence. Adding or subtract- 
ing three nucleotides (not shown) leaves the remaining triplets 
intact, providing evidence that a codon has three, rather than four 
or five, nucleotides. The triplet codons shaded in gray are those 
transcribed from the original gene; codons shaded in blue are 
new codons resulting from the insertion or deletion mutations. 
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FIGURE 27-5 Reading frames in the genetic code. In a triplet, nonoverlapping code, all mRNAs have three potential 
reading frames, shaded here in different colors. The triplets, and hence the amino acids specified, are different in each 
reading frame, 



nucleotide triplets are read in a successive, nonoverlap- 
ping fashion. A specific first codon in the sequence es- 
tablishes the reading frame, in which a new codon 
begins every three nucleotide residues. There is no 
punctuation between codons for successive amino acid 
residues. The amino acid sequence of a protein is de- 
fined by a linear sequence of contiguous triplets. In prin- 
ciple, any given single-stranded DNA or mRNA 
sequence has three possible reading frames. Each read- 
ing frame gives a different sequence of codons (Fig. 
27-5), but only one is likely to encode a given protein. A 
key question remained: what were the three-letter code 
words for each amino acid? 

In 1961 Marshall Niren- 
berg and Heinrich Matthaei 
reported the first break- 
through. They incubated syn- 
thetic polyuridylate, poly(U), 
with an E. coli extract, GTP, 
ATP, and a mixture of the 20 
amino acids in 20 different 
tubes, each tube containing a 
different radioactively labeled 
amino acid. Because poly(U) 
mRNA is made up of many suc- 
cessive UUU triplets, it should 
promote the synthesis of a 
polypeptide containing only the amino acid encoded by 
the triplet UUU. A radioactive polypeptide was indeed 
formed in only one of the 20 tubes, the one containing 
radioactive phenylalanine. Nirenberg and Matthaei 
therefore concluded that the triplet codon UUU encodes 




Marshall Nirenberg 



phenylalanine. The same approach soon revealed that 
polycytidylate, poly(C), encodes a polypeptide contain- 
ing only proline (polyproline) , and polyadenylate, 
poly (A), encodes polylysine. Polyguanylate did not gen- 
erate any polypeptide in this experiment because it 
spontaneously forms tetraplexes (see Fig. 8-20) that 
cannot be bound by ribosomes. 

The synthetic polynucleotides used in such experi- 
ments were prepared with polynucleotide phosphory- 
lase (p. 1049), which catalyzes the formation of RNA 
polymers starting from ADP, UDP, CDP, and GDP. This 
enzyme, discovered by Severo Ochoa, requires no 
template and makes polymers with a base composition 
that directly reflects the relative concentrations of the 
nucleoside 5' -diphosphate precursors in the medium. 
If polynucleotide phosphorylase is presented with 
UDP only, it makes only poly(U). If it is presented with 
a mixture of five parts ADP and one part CDP, it makes 
a polymer in which about five-sixths of the residues 
are adenylate and one-sixth are cytidylate. This ran- 
dom polymer is likely to have many triplets of the se- 
quence AAA, smaller numbers of AAC, AC A, and CAA 
triplets, relatively few ACC, CCA, and CAC triplets, 
and very few CCC triplets (Table 27-1). Using a vari- 
ety of artificial mRNAs made by polynucleotide phos- 
phorylase from different starting mixtures of ADP, 
GDP, UDP, and CDP, the Nirenberg and Ochoa groups 
soon identified the base compositions of the triplets 
coding for almost all the amino acids. Although these 
experiments revealed the base composition of the 
coding triplets, they usually could not reveal the se- 
quence of the bases. 
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1 TABLE 27-1 


Incorporation of Amino Acids into Polypeptides in Response to Random 
Polymers of RNA 


Amino acid 


Observed 
frequency of 

iiicurpurtt null 

(Lys = 100) 


Tentative assignment 
for nucleotide 
composition oi 

corresponding codon* 


Expected frequency 
of incorporation 
based on 
assignment 
(Lys = 100) 


Asparagine 


24 


A 2 C 


20 


Glutamine 


24 


A 2 C 


20 


Histidine 


6 


AC 2 


4 


Lysine 


100 


AAA 


100 


Proline 


7 


AC 2) CCC 


4.8 


Threonine 


26 


A 2 C, AC 2 


24 



Note: Presented here is a summary of data from one of the early experiments designed to elucidate the genetic code. A synthetic RNA 
containing only A and C residues in a 5:1 ratio directed polypeptide synthesis, and both the identity and the quantity of incorporated 
amino acids were determined. Based on the relative abundance of A and C residues in the synthetic RNA, and assigning the codon AAA 
(the most likely codon) a frequency of 100, there should be three different codons of composition A 2 C, each at a relative frequency of 20; 
three of composition AC 2 , each at a relative frequency of 4.0; and CCC at a relative frequency of 0.8. The CCC assignment was based on 
information derived from prior studies with poly(C). Where two tentative codon assignments are made, both are proposed to code for the 
same amino acid 

These designations of nucleotide composition contain no information on nucleotide sequence (except, of course, AAA and CCC). 



KEY CONVENTION: Much of the following discussion deals 
with tRNAs. The amino acid specified by a tRNA is indi- 
cated by a superscript, such as tRNA Ala , and the amino- 
acylated tRNA by a hyphenated name: alanyl-tRNA^ or 
Ala-tRNA Ala . ■ 

In 1964 Nirenberg and Philip Leder achieved an- 
other experimental breakthrough. Isolated E. coli ribo- 
somes would bind a specific aminoacyl-tRNA in the 
presence of the corresponding synthetic polynucleotide 
messenger. For example, ribosomes incubated with 
poly(U) and phenylalanyl-tRNA Phe (Phe-tRNA Phe ) bind 
both RNAs, but if the ribosomes are incubated with 
poly(U) and some other aminoacyl-tRNA, the aminoacyl- 
tRNA is not bound, because it does not recognize the 
UUU triplets in poly(U) (Table 27-2). Even trinu- 
cleotides could promote specific binding of appropriate 
tRNAs, so these experiments could be carried out with 

^^fflff^P' Trinucleotides That Induce Specific Binding 

BlttHHBHfliH of Aminoacyl-tRNAs to Ribosomes 

Relative increase in 14 C-labeled 
aminoacyl-tRNA bound to ribosome* 

Trinucleotide Phe-tRNA Phc Ly^tRNA 1 * 8 Pro-tRNA^ 0 



UUU 


4.6 


0 


0 


AAA 


0 


7.7 


0 


CCC 


0 


0 


3.1 



Source: Modified from Nirenberg, M. & Leder, R (1964) RNA code words and protein 
synthesis. Science 145, 1399. 

*Each number represents the factor by which the amount of bound 14 C increased when 
the indicated trinucleotide was present, relative to a control with no trinucleotide 



chemically synthesized small oligonucleotides. With this 
technique researchers determined which aminoacyl- 
tRNA bound to 54 of the 64 possible triplet codons. For 
some codons, either no aminoacyl-tRNA or more than 
one would bind. Another method was needed to com- 
plete and confirm the entire genetic code. 



At about this time, a com- 
plementary approach was pro- 
vided by H. Gobind Khorana, 
who developed chemical meth- 
ods to synthesize polyribonu- 
cleotides with defined, repeating 
sequences of two to four bases. 
The polypeptides produced by 
these mRNAs had one or a few 
amino acids in repeating pat- 
terns. These patterns, when 




combined with information from H Gobind Khorana 
the random polymers used by 

Nirenberg and colleagues, permitted unambiguous codon 
assignments. The copolymer (AC) W for example, has alter- 
nating ACA and CAC codons: ACACACACACACACA. The 
polypeptide synthesized on this messenger contained equal 
amounts of threonine and histidine. Given that a histidine 
codon has one A and two Cs (Table 27-1), CAC must code 
for histidine and ACA for threonine. 

Consolidation of the results from many experi- 
ments permitted the assignment of 61 of the 64 possible 
codons. The other three were identified as termination 
codons, in part because they disrupted amino acid 
coding patterns when they occurred in a synthetic 
RNA polymer (Fig. 27-6). Meanings for all the triplet 
codons (tabulated in Fig. 27-7) were established by 
1966 and have been verified in many different ways. 
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Reading frame 1 5' - -fflflMM W^0M fegt&j U A -A-| 118111 A A- 3' 

Reading frame 2 G [U A AjG U A||A G Uj[A A G ■[U A A !iG U~aT| A 

Reading frame 3 --- G U [A^A^ttTA^^U^l A G U| |A A G ;U~A"a!-- 

FIGURE 27-6 Effect of a termination codon in a repeating tetranucleotide. Termination codons (pink) are 
encountered every fourth codon in three different reading frames (shown in different colors), Dipeptides or 
tripeptides are synthesized, depending on where the ribosome initially binds. 



The cracking of the genetic code is regarded as one of 
the most important scientific discoveries of the twenti- 
eth century. 

Codons are the key to the translation of genetic in- 
formation, directing the synthesis of specific proteins. 
The reading frame is set when translation of an mRNA 
molecule begins, and it is maintained as the synthetic 
machinery reads sequentially from one triplet to the 
next. If the initial reading frame is off by one or two 
bases, or if translation somehow skips a nucleotide in 
the mRNA, all the subsequent codons will be out of reg- 
ister; the result is usually a "missense" protein with a 
garbled amino acid sequence. 

Several codons serve special functions (Fig. 27-7). 
The initiation codon AUG is the most common signal 
for the beginning of a polypeptide in all cells, in addition 
to coding for Met residues in internal positions of 

First letter of codon (5' end) 



Second letter 
of codon 



U 



c 



A 



G 



u 




C 




A 






G 


uuu 


Phe 


ucu 


Ser 


UAU 


Tyr 


UGU 


Cys 


uuc 


Phe 


ucc 


Ser 


UAC 


Tyr 


UGC 


Cys 


UUA 


Leu 


UCA 


Ser 


UAA 


Stop 


UGA 


Stop 


UUG 


Leu 


UCG 


Ser 


UAG 


Stop 


UGG 


Trp 


cuu 


Leu 


ecu 


Pro 


CAU 


His 


CGU 


Arg 


cue 


Leu 


ccc 


Pro 


CAC 


His 


CGC 


Arg 


CUA 


Leu 


CCA 


Pro 


CAA 


Gin 


CGA 


Arg 


CUG 


Leu 


CCG 


Pro 


CAG 


Gin 


CGG 


Arg 


AUU 


He 


ACU 


Thr 


AAU 


Asn 


AGU 


Ser 


1 AUC 


lie 


ACC 


Thr 


AAC 


Asn 


AGC 


Ser 


AUA 


He 


ACA 


Thr 


AAA 


Lys 


AGA 


Arg 


AUG 


Met 


ACG 


Thr 


AAG 


Lys 


AGG 


Arg 


GUU 


Val 


GCU 


Ala 


GAU 


Asp 


GGU 


Gly 


GUC 


Val 


GCC 


Ala 


GAC 


Asp 


GGC 


Gly 


GUA 


Val 


GCA 


Ala 


GAA 


Glu 


GGA 


Gly 


GUG 


Val 


GCG 


Ala 


GAG 


Glu 


GGG 


Gly 



FIGURE 27-7 "Dictionary" of amino acid code words in mRNAs. The 

codons are written in the 5'— >3 f direction. The third base of each 
codon (in bold type) plays a lesser role in specifying an amino acid 
than the first two. The three termination codons are shaded in pink, the 
initiation codon AUG in green. All the amino acids except methionine 
and tryptophan have more than one codon. In most cases, codons that 
specify the same amino acid differ only at the third base. 



polypeptides. The termination codons (UAA, UAG, 
and UGA), also called stop codons or nonsense codons, 
normally signal the end of polypeptide synthesis and do 
not code for any known amino acids. Some deviations 
from these rules are discussed in Box 27-1. 

As described in Section 27.2, initiation of protein 
synthesis in the cell is an elaborate process that relies on 
initiation codons and other signals in the mRNA. In retro- 
spect, the experiments of Nirenberg, Khorana, and oth- 
ers to identify codon function should not have worked in 
the absence of initiation codons. Serendipitously, exper- 
imental conditions caused the normal initiation require- 
ments for protein synthesis to be relaxed. Diligence 
combined with chance to produce a breakthrough — a 
common occurrence in the history of biochemistry. 

In a random sequence of nucleotides, 1 in every 20 
codons in each reading frame is, on average, a termina- 
tion codon. In general, a reading frame without a termi- 
nation codon among 50 or more codons is referred to as 
an open reading frame (ORF). Long open reading 
frames usually correspond to genes that encode pro- 
teins. In the analysis of sequence databases, sophisti- 
cated programs are used to search for open reading 
frames in order to find genes among the often huge 
background of nongenic DNA. An uninterrupted gene 
coding for a typical protein with a molecular weight of 
60,000 would require an open reading frame with 500 or 
more codons. 

A striking feature of the genetic code is that an 
amino acid may be specified by more than one codon, so 
the code is described as degenerate. This does not sug- 
gest that the code is flawed: although an amino acid may 
have two or more codons, each codon specifies only one 
amino acid. The degeneracy of the code is not uniform. 
Whereas methionine and tryptophan have single 
codons, for example, three amino acids (Arg, Leu, Ser) 
have six codons, five amino acids have four, isoleucine 
has three, and nine amino acids have two (Table 27-3). 

The genetic code is nearly universal. With the in- 
triguing exception of a few minor variations in mitochon- 
dria, some bacteria, and some single-celled eukaryotes 
(Box 27-1), amino acid codons are identical in all species 
examined so far. Human beings, E. coli, tobacco plants, 
amphibians, and viruses share the same genetic code. 
Thus it would appear that all life forms have a common 
evolutionary ancestor, whose genetic code has been pre- 
served throughout biological evolution. Even the varia- 
tions reinforce this theme. 
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BOX 27-1 Exceptions That Prove the Ruler 

In biochemistry, as in other disciplines, exceptions to 
general rules can be problematic for instructors and 
frustrating for students. At the same time, though, they 
teach us that life is complex and inspire us to search for 
more surprises. Understanding the exceptions can even 
reinforce the original rule in surprising ways. 

One would expect little room for variation in the ge- 
netic code. Even a single amino acid substitution can have 
profoundly deleterious effects on the structure of a pro- 
tein. Nevertheless, variations in the code do occur in some 
organisms, and they are both interesting and instructive. 
The types of variation and their rarity provide powerful ev- 
idence for a common evolutionary origin of all living things. 

To alter the code, changes must occur in the 
gene(s) encoding one or more tRNAs, with the obvious 
target for alteration being the anticodon. Such a change 
would lead to the systematic insertion of an amino acid 
at a codon that, according to the normal code (see 
Fig. 27-7), does not specify that amino acid. The genetic 
code, in effect, is defined by two elements: (1) the anti- 
codons on tRNAs (which determine where an amino acid 
is placed in a growing polypeptide) and (2) the specificity 
of the enzymes— the aminoacyl-tRNA synthetases— that 
charge the tRNAs, which determines the identity of the 
amino acid attached to a given tRNA. 

Most sudden changes in the code would have cata- 
strophic effects on cellular proteins, so code alterations 
are more likely to persist where relatively few proteins 
would be affected — such as in small genomes encoding 
only a few proteins. The biological consequences of a 
code change could also be limited by restricting changes 
to the three termination codons, which do not generally 
occur within genes (see Box 27-4 for exceptions to this 
rule). This pattern is in fact observed. 

Of the very few variations in the genetic code that we 
know of, most occur in mitochondrial DNA (mtDNA), which 



Natural Variations in the Genetic Code 




encodes only 10 to 20 proteins. Mitochondria have their own 
tRNAs, so their code variations do not affect the much larger 
cellular genome. The most common changes in mitochon- 
dria (and the only code changes that have been observed in 
cellular genomes) involve termination codons. These 
changes affect termination in the products of only a subset 
of genes, and sometimes the effects are minor because the 
genes have multiple (redundant) termination codons. 

Vertebrate mtDNAs have genes that encode 13 pro- 
teins, 2 rRNAs, and 22 tRNAs (see Fig. 19-38). The 
small number of codon reassignments, along with an un- 
usual set of wobble rules (p. 1072), makes the 22 tRNAs 
sufficient to decode the protein genes, as opposed to the 
32 tRNAs required for the normal code. In mitochon- 
dria, these changes can be viewed as a kind of genomic 
streamlining, as a smaller genome confers a replication 
advantage on the organelle. Four codon families (in 
which the amino acid is determined entirely by the first 
two nucleotides) are decoded by a single tRNA with a U 
residue in the first (or wobble) position in the anti- 
codon. Either the U pairs somehow with any of the four 
possible bases in the third position of the codon or a 
"two out of three" mechanism is used — that is, no base 
pairing is needed at the third position. Other tRNAs rec- 
ognize codons with either A or G in the third position, 
and yet others recognize U or C, so that virtually all the 
tRNAs recognize either two or four codons. 

In the normal code, only two amino acids are specified 
by single codons: methionine and tryptophan (see Table 
27-3). If all mitochondrial tRNAs recognize two codons, we 
would expect additional Met and Trp codons in mitochon- 
dria. And we find that the single most common code varia- 
tion is the normal termination codon UGA specifying 
tryptophan. The tRNA 1 ^ recognizes and inserts a Trp 
residue at either UGA or the normal Trp codon, UGG. The 
second most common variation is conversion of AUA from 



| Degeneracy of the Genetic Code : 

Number of Number of 



Amino acid 


codons 


Amino acid 


codons 


Met 


1 


Tyr 


2 


Trp 


1 


He 


3 


Asn 


2 


Ala 


4 


Asp 


2 


Gly 


4 


Cys 


2 


Pro 


4 


Gin 


2 


Thr 


4 


Glu 


2 


Val 


4 


His 


2 


Arg 


6 


Lys 


2 


Leu 


6 


Phe 


2 


Ser 


6 



Wobble Allows Some tRNAs to Recognize More than 
One Codon 

When several different codons specify one amino acid, 
the difference between them usually lies at the third 
base position (at the 3' end). For example, alanine is 
coded by the triplets GCU, GCC, GCA, and GCG. The 
codons for most amino acids can be symbolized by XY£ 
or XYq. The first two letters of each codon are the pri- 
mary determinants of specificity, a feature that has some 
interesting consequences. 

Transfer RNAs base-pair with mRNA codons at a 
three-base sequence on the tRNA called the anticodon. 
The first base of the codon in mRNA (read in the 5'^3' 
direction) pairs with the third base of the anticodon 
(Fig. 27-8a). If the anticodon triplet of a tRNA recog- 
nized only one codon triplet through Watson-Crick base 
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Codons* 



AGA 





UGA 


AUA 


AGG 


CUN 


CGG 


Normal code assignment 


Stop 


lie 


Arg 


Leu 


Arg 


Animals 












Vertebrates 


Trp 


Met 


Stop 


+ 


+ 


Drosophila 


Trp 


Met 


Ser 


+ 


+ 


Yeasts 












Saccharomyces cerevisiae 


Trp 


Met 


+ 


Thr 


+ 


Torulopsis glabrata 


Trp 


Met 


+ 


Thr 


9 


Schizosaccharomyces pombe 


Trp 


+ 


+ 


+ 


+ 


Filamentous fungi 


Trp 


+ 


+ 


+ 


+ 


Trypanosomes 


Trp 


+ 


+ 


+ 


+ 


Higher plants 


+ 


+ 


+ 


+ 


Trp 


Chlamydomonas reinhardtii 


? 


+ 


+ 


+ 


9 



*N indicates any nucleotide; + , codon has the same meaning as in the normal code; ?, codon not observed in this mitochondrial genome. 



an lie codon to a Met codon; the normal Met codon is AUG, 
and a single tRNA recognizes both codons. The known cod- 
ing variations in mitochondria are summarized in Table 1. 

Turning to the much rarer changes in the codes for 
cellular (as distinct from mitochondrial) genomes, we find 
that the only known variation in a bacterium is again the 
use of UGA to encode Trp residues, occurring in the sim- 
plest free-living cell, Mycoplasma capricolum. Among 
eukaryotes, the only known extramitochondrial coding 
changes occur in a few species of ciliated protists, in which 
both temtination codons UAA and UAG can specify gluta- 
mine. There are also rare but interesting cases where stop 
codons have been adapted to encode amino acids that are 
not among the standard 20, as detailed in Box 27-3. 



Changes in the code need not be absolute; a codon 
might not always encode the same amino acid. For example, 
in many bacteria— including E. coli—GUG (Val) is some- 
times used as an initiation codon that specifies Met. This oc- 
curs only for those genes in which the GUG is properly 
located relative to particular mRNA sequences that affect 
the initiation of translation (as discussed in Section 27.2). 

These variations tell us that the code is not quite as uni- 
versal as once believed, but that its flexibility is severely 
constrained. The variations are obviously derivatives of 
the normal code, and no example of a completely different 
code has been found. The limited scope of code variants 
strengthens the principle that all life on this planet evolved 
on the basis of a single (slightly flexible) genetic code. 



mRNA 5 




(a) 



Codon 



pairing at all three positions, cells would have a different 
tRNA for each amino acid codon. This is not the case, 
however, because the anticodons in some tRNAs include 
the nucleotide inosinate (designated I), which contains 
the uncommon base hypoxanthine (see Fig. 8-5b). 



3 2 1 3 2 1 3 2 1 

Anticodon (3') G^C-I G-C-I 9~9~l < 5 ') 

Codon (5')C-G-A C-G-U C-G^C (3') 

123 123 123 

(b) 

FIGURE 27-8 Pairing relationship of codon and anticodon. (a) Align- 
ment of the two RNAs is antiparallel. The tRNA is shown in the tradi- 
tional cloverleaf configuration, (b) Three different codon pairing 
relationships are possible when the tRNA anticodon contains inosinate. 
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Inosinate can form hydrogen bonds with three different 
nucleotides (U, C, and A; Fig. 27-8b), although these 
pairings are much weaker than the hydrogen bonds of 
Watson-Crick base pairs (G=C and A=U). In yeast, one 
tRNA Arg has the anticodon (5')ICG, which recognizes 
three arginine codons: (5')CGA, (5')CGU, and (5')CGC. 
The first two bases are identical (CG) and form strong 
Watson-Crick base pairs with the corresponding bases of 
the anticodon, but the third base (A, U, or C) forms 
rather weak hydrogen bonds with the I residue at the 
first position of the anticodon. 

Examination of these and other codon-anticodon 
pairings led Crick to conclude that the third base of 
most codons pairs rather loosely with the corresponding 
base of its anticodon; to use his picturesque word, the 
third base of such codons (and the first base of their cor- 
responding anticodons) "wobbles." Crick proposed a set 
of four relationships called the wobble hypothesis: 

1. The first two bases of an mRNA codon always form 
strong Watson-Crick base pairs with the correspon- 
ding bases of the tRNA anticodon and confer most 
of the coding specificity. 

2. The first base of the anticodon (reading in the 
5'-»3' direction; this pairs with the third base of 
the codon) determines the number of codons rec- 
ognized by the tRNA. When the first base of the an- 
ticodon is C or A, base pairing is specific and only 
one codon is recognized by that tRNA. When the 
first base is U or G, binding is less specific and two 
different codons may be read. When inosine (I) is 
the first (wobble) nucleotide of an anticodon, three 
different codons can be recognized— the maximum 
number for any tRNA. These relationships are sum- 
marized in Table 27-4. 

3. When an amino acid is specified by several different 
codons, the codons that differ in either of the first 
two bases require different tRNAs. 

4. A minimum of 32 tRNAs are required to translate 
all 61 codons (31 to encode the amino acids and 1 
for initiation). 

The wobble (or third) base of the codon contributes 
to specificity, but, because it pairs only loosely with its 
corresponding base in the anticodon, it permits rapid 
dissociation of the tRNA from its codon during protein 
synthesis. If all three bases of a codon engaged in strong 
Watson-Crick pairing with the three bases of the anti- 
codon, tRNAs would dissociate too slowly and this 
would severely limit the rate of protein synthesis. 
Codon-anticodon interactions balance the requirements 
for accuracy and speed. 

The genetic code tells us how protein sequence in- 
formation is stored in nucleic acids and provides some 
clues about how that information is translated into pro- 
tein. We now turn to the molecular mechanisms of the 
translation process. 



TABLE 27-4 



How the Wobble Base of the Anticodon 
Determines the Number of Codons a 
tRNA Can Recognize 

One codon recognized: 

Anticodon (3') X - Y- § (5') (3') X-Y- A (5') 



Codon (5') X'- Y'-G (3') (5') X'- Y'-U (3') 

2 Two codons recognized: 

Anticodon (3') X - Y- X] (5') (3') X - Y- G (5') 

H z 2 a 

Codon (5') X'-Y'-& (3') (5') X'- Y'-g (3') 

3, Three codons recognized: 
Anticodon (3') X - Y- I (5') 



Codon 



(5') X'-Y'-tH3') 



Note: X and Y denote bases complementary to and capable of strong Watson-Crick 
base pairing with X' and Y', respectively. Wobble bases-in the 3' position of codons 
and 5' position of anticodons-are shaded in pink. 

Translational Frameshifting and RNA Editing Affect How 
the Code Is Read 

Once the reading frame has been set during protein syn- 
thesis, codons are translated without overlap or punctu- 
ation until the ribosomal complex encounters a 
termination codon. The other two possible reading 
frames usually contain no useful genetic information, 
but a few genes are structured so that ribosomes "hic- 
cup" at a certain point in the translation of their mRNAs, 
changing the reading frame from that point on. This ap- 
pears to be a mechanism either to allow two or more re- 
lated but distinct proteins to be produced from a single 
transcript or to regulate the synthesis of a protein. 

One of the best-documented examples of transla- 
tional frameshifting occurs during translation of the 
mRNA for the overlapping gag and pol genes of the Rous 
sarcoma virus (see Fig. 26-35). The reading frame for 
pol is offset to the left by one base pair (-1 reading 
frame) relative to the reading frame for gag (Fig. 27-9). 

The product of the pol gene (reverse transcrip- 
tase) is translated as a larger polyprotein, on the same 
mRNA that is used for the gag protein alone (see Fig. 
26-34). The polyprotein, or gag-pol protein, is then 
trimmed to the mature reverse transcriptase by prote- 
olytic digestion. Production of the polyprotein requires 
a translational frameshift in the overlap region to allow 
the ribosome to bypass the UAG termination codon at 
the end of the gag gene (shaded pink in Fig. 27-9). 

Frameshifts occur during about 5% of translations 
of this mRNA, and the gag-pol polyprotein (and ulti- 
mately reverse transcriptase) is synthesized at about 
one-twentieth the frequency of the gag protein, a level 
that suffices for efficient reproduction of the virus. In 
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gag reading frame 
Leu — Gly — Leu — Arg — Leu 

5' - - ,C U A | G -idir g| e t). -c k 



Thr — Asn — Leu Stop 

glllll il& 1 '11 ir '$t-:Wi §]ggagggcca 



— C UAGG GC UC C G C UUGAC AAA UUU [A U Aj| G G G[|A G G||G C C| A 
pal reading frame He — Gly — Arg — Ala 

FIGURE 27-9 Translational frameshifting in a retroviral transcript. The gag-pol overlap region in Rous 
sarcoma virus RNA is shown. 



some retroviruses, another translational frameshift allows 
translation of an even larger polyprotein that includes the 
product of the env gene fused to the gag and pol gene 
products (see Fig. 26-34). A similar mechanism produces 
both the r and y subunits ofE. coli DNA polymerase III 
from a single dnaX gene transcript (see Table 25-2). 

Some mRNAs are edited before translation. RNA 
editing can involve the addition, deletion, or alteration 
of nucleotides in the RNA in a manner that affects the 
meaning of the transcript when it is translated. Addition 
or deletion of nucleotides has been most commonly ob- 
served in RNAs originating from the mitochondrial and 
chloroplast genomes of eukaryotes. The reactions re- 
quire a special class of RNA molecules encoded by these 
same organelles, with sequences complementary to the 
edited mRNAs. These guide RNAs (gRNAs; Fig. 27-10) 
act as templates for the editing process. 



The initial transcripts of the genes that encode cy- 
tochrome oxidase subunit II in some protist mitochon- 
dria provide an example of editing by insertion. These 
transcripts do not correspond precisely to the sequence 
needed at the carboxyl terminus of the protein product. 
A posttranscriptional editing process inserts four U 
residues that shift the translational reading frame of the 
transcript. Figure 27-10 shows the added U residues in 
the small part of the transcript that is affected by editing. 
Note that the base pairing between the initial transcript 
and the guide RNA involves a number of G = U base pairs 
(blue dots), which are common in RNA molecules. 

RNA editing by alteration of nucleotides most com- 
monly involves the enzymatic deamination of adenosine 
or cytidine residues, forming inosine or uridine, respec- 
tively (Fig. 27-11), although other base changes have 
been described. Inosine is interpreted as a G residue 
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FIGURE 27-10 RNA editing of the transcript 
of the cytochrome oxidase subunit II gene 
from Trypanosoma brucei mitochondria, (a) 

Insertion of four U residues (pink) produces a 
revised reading frame, (b) A special class of 
guide RNAs, complementary to the edited 
product act as templates for the editing 
process. Note the presence of two G=U base 
pairs, signified by a blue dot to indicate non- 
Watson-Crick pairing. 
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FIGURE 27-11 Deamination reactions that result in RNA editing, (a) 

The conversion of adenosine nucleotides to inosine nucleotides is 




catalyzed by ADAR enzymes, (b) Cytidine to uridine conversions are 
catalyzed by the APOBEC family of enzymes. 
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Residue number 2,146 2,148 2,150 2,152 2,154 2,156 

Human liver 5 ' - - ^M^^MElUM^MB^B^a^BlC A All-UjMj^^^'A^^Tj 
(apoB-100) _ Gln - Leu - Gln Thr - T yr - Met - He - Gin - Phe - Asp - Gin - Tyr 

Human intestme--4fii|^ 

(apoB-48) - Gln - Leu - Gln - Thr - Tyr - Met - He Stop 

FIGURE 27-12 RNA editing of the transcript of the gene for the apoB-100 component of LDL Deamina- 
tion, which occurs only in the intestine, converts a specific cytidine to uridine, changing a Gln codon to 
a stop codon and producing a truncated protein 



during translation. The adenosine deamination reac- 
tions are carried out by adenosine deaminases that act 
onit!NA (ADAEs). The cytidine deaminations are car- 
ried out by the apoB mRNA editing catalytic peptide 
(APOBEC) family of enzymes, which includes the re- 
lated activation-induced deaminase (AID) enzymes. 
Both groups of deaminase enzymes have a homologous 
zinc-coordinating catalytic domain. 

A well-studied example of RNA editing by deamina- 
tion occurs in the gene for the apolipoprotein B compo- 
nent of low-density lipoprotein in vertebrates. One form 
of apolipoprotein B, apoB-100 (M T 513,000), is synthe- 
sized in the liver; a second form, apoB-48 (M r 250,000), 
is synthesized in the intestine. Both are encoded by an 
mRNA produced from the gene for apoB-100. An 
APOBEC cytidine deaminase found only in the intestine 
binds to the mRNA at the codon for amino acid residue 
2,153 (CAA = Gln) and converts the C to a U, to create 
the termination codon UAA. The apoB-48 produced in 
the intestine from this modified mRNA is simply an ab- 
breviated form (corresponding to the amino-terminal 
half) of apoB-100 (Fig. 27-12). This reaction permits 
tissue-specific synthesis of two different proteins from 
one gene. 

The ADAR-promoted A to I editing is particularly 
common in transcripts derived from the genes of pri- 
mates, and perhaps 90% or more of the editing occurs in 
the short interspersed elements (SINEs) called Alu ele- 
ments (see Fig. 24-8). There are over a million of the 
300 bp Alu elements in human DNA, making up about 
10% of the genome. These are concentrated near 
protein-encoding genes, often appearing in introns and 
untranslated regions at the 3' and 5' ends of transcripts. 
When it is first synthesized (prior to processing), the 
average human mRNA includes 10 to 20 Alu elements. 
The ADAR enzymes bind to and promote A to I editing 
only in duplex regions of RNA. The abundant Alu ele- 
ments offer many opportunities for intramolecular base 
pairing within the transcripts, providing the duplex tar- 
gets required by the ADARs. Some of the editing affects 
the coding sequences of genes. Defects in ADAR func- 
tion have been associated with a variety of human neu- 
rological conditions, including amyotrophic lateral 
sclerosis (ALS), epilepsy, and major depression. 



The genomes of all vertebrates are replete with 
SINEs, but there are many different types of SINES 
present in most of these organisms. The Alu elements 
predominate only in the primates. Careful screening of 
genes and transcripts indicates that A to I editing is 30 
to 40 times more prevalent in humans than in mice, 
largely due to the presence of many Alu elements. 
Large-scale A to I editing and an increased level of alter- 
native splicing (see Fig. 26-22) are two features that set 
primate genomes apart from those of other mammals. It 
is not yet clear whether these reactions are incidental, 
or whether they played key roles in the evolution of 
primates and, ultimately, humans. 

SUMMARY 27.1 The Genetic Code 

■ The particular amino acid sequence of a protein is 
constructed through the translation of information 
encoded in mRNA. This process is carried out by 
ribosomes. 

■ Amino acids are specified by mRNA codons 
consisting of nucleotide triplets. Translation 
requires adaptor molecules, the tRNAs, that 
recognize codons and insert amino acids into 
their appropriate sequential positions in the 
polypeptide. 

■ The base sequences of the codons were deduced 
from experiments using synthetic mRNAs of known 
composition and sequence. 

■ The codon AUG signals initiation of translation. 
The triplets UAA, UAG, and UGA are signals for 
termination. 

■ The genetic code is degenerate: it has multiple 
codons for almost every amino acid. 

■ The standard genetic code is universal in all 
species, with some minor deviations in mitochondria 
and a few single-celled organisms. 

■ The third position in each codon is much less 
specific than the first and second and is said to 
wobble. 

■ Translational frameshifting and RNA editing affect 
how the genetic code is read during translation. 
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27.2 Protein Synthesis 



As we have seen for DNA and RNA (Chapters 25 and 26), 
the synthesis of polymeric biomolecules can be consid- 
ered in terms of initiation, elongation, and termination 
stages. These fundamental processes are typically brack- 
eted by two additional stages: activation of precursors be- 
fore synthesis and postsynthetic processing of the 
completed polymer. Protein synthesis follows the same 
pattern. The activation of amino acids before their incor- 
poration into polypeptides and the posttranslational pro- 
cessing of the completed polypeptide play particularly 
important roles in ensuring both the fidelity of synthesis 
and the proper function of the protein product. The cellu- 
lar components involved in the five stages of protein syn- 
thesis in E. coli and other bacteria are listed in Table 27-5; 
the requirements in eukaryotic cells are quite similar, al- 
though the components are in some cases more numer- 
ous. An initial overview of the stages of protein synthesis 
provides a useful outline for the discussion that follows. 



Protein Biosynthesis Takes Place in 
Five Stages 

Stage 1: Activation of Amino Acids For the syn- 
thesis of a polypeptide with a defined sequence, two 
fundamental chemical requirements must be met: (1) 
the carboxyl group of each amino acid must be acti- 
vated to facilitate formation of a peptide bond, and (2) 
a link must be established between each new amino 
acid and the information in the mRNA that encodes it. 
Both these requirements are met by attaching the 
amino acid to a tRNA in the first stage of protein syn- 
thesis. Attaching the right amino acid to the right tRNA 
is critical. This reaction takes place in the cytosol, not 
on the ribosome. Each of the 20 amino acids is cova- 
lently attached to a specific tRNA at the expense of ATP 
energy, using Mg 2+ -dependent activating enzymes 
known as aminoacyl-tRNA synthetases. When attached 
to their amino acid (aminoacylated) the tRNAs are said 
to be "charged." 



TABLE 27-5 



Stage 



Components Required for the Five Major Stages of Protein 
Synthesis intra// 

Essential components 




1. Activation of amino acids 



2. Initiation 



3. Elongation 



4 . Termination and ribosome 
recycling 



5. Folding and posttranslational 
processing 



20 amino acids 

20 aminoacyl-tRNA synthetases 

32 or more tRNAs 

ATP 

Mg 2+ 

mRNA 

A^Formylmetmonyl-tRNA™ 6 * 

Initiation codon in mRNA (AUG) 

30S ribosomal subunit 

508 ribosomal subunit 

Initiation factors (IF-1, IF-2, IF-3) 

GTP 

Mg 2+ 

Functional 70S ribosome (initiation complex) 
Aminoacyl-tRNAs specified by codons 
Elongation factors (EF-Tu, EF-Ts, EF-G) 
GTP 

Mg 2+ 

Termination codon in mRNA 

Release factors (RF-1, RP-2, RF-3, RRF) 

EF-G 

IF-3 

Specific enzymes, cofactors, and other components 
for removal of initiating residues and signal 
sequences, additional proteolytic processing, 
modification of terminal residues, and attachment 
of acetyl, phosphoryl, methyl, carboxyl, 
carbohydrate, or prosthetic groups 
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TABLE 27-6 


RNA and Protein Components of the £, coli Ribosome 




Subunit 


iNumoer oi lotai numoer rrotein 
different proteins of proteins designations 


Number and 
type of rRNAs 


30S 
60S 


21 21 S1-S21 
33 36 L1-L36* 


1 (16S rRNA) 

2 (5S and 23S rRNAs) 



"The LI to L36 protein designations do not correspond to 36 different proteins. The protein originally designated L7 is in fact a 
modified form of L12, and L8 is a complex of three other proteins. Also, L26 proved to be the same protein as S20 (and not part 
of the SOS subunit). This gives 33 different proteins in the large subunit. There are four copies of the L7/L12 protein, with the three 
extra copies bringing the total protein count to 36. 



Stage 2: Initiation The mRNA bearing the code for 
the polypeptide to be synthesized binds to the smaller of 
two ribosomal subunits and to the initiating aminoacyl- 
tRNA. The large ribosomal subunit then binds to form an 
initiation complex. The initiating aminoacyl-tRNA base- 
pairs with the mRNA codon AUG that signals the begin- 
ning of the polypeptide. This process, which requires 
GTP, is promoted by cytosolic proteins called initiation 
factors. 

Stage 3: Elongation The nascent polypeptide is 
lengthened by covalent attachment of successive amino 
acid units, each carried to the ribosome and correctly 
positioned by its tRNA, which base-pairs to its corre- 
sponding codon in the mRNA. Elongation requires cy- 
tosolic proteins known as elongation factors. The 
binding of each incoming aminoacyl-tRNA and the 
movement of the ribosome along the mRNA are facili- 
tated by the hydrolysis of GTP as each residue is added 
to the growing polypeptide. 

Stage 4: Termination and Ribosome Recycling 

Completion of the polypeptide chain is signaled by a ter- 
mination codon in the mRNA. The new polypeptide is 
released from the ribosome, aided by proteins called re- 
lease factors, and the ribosome is recycled for another 
round of synthesis. 

Stage 5: Folding and Posttranslational Processing 

In order to achieve its biologically active form, the new 
polypeptide must fold into its proper three-dimensional 
conformation. Before or after folding, the new polypep- 
tide may undergo enzymatic processing, including re- 
moval of one or more amino acids (usually from the 
amino terminus); addition of acetyl, phosphoryl, methyl, 
carboxyl, or other groups to certain amino acid residues; 
proteolytic cleavage; and/or attachment of oligosaccha- 
rides or prosthetic groups. 

Before looking at these five stages in detail, we must ex- 
amine two key components in protein biosynthesis: the 
ribosome and tRNAs. 



The Ribosome Is a Complex Supramolecular 
Machine 

Eachi?. coli cell contains 15,000 or more ribosomes, ac- 
counting for almost a quarter of the dry weight of the 
cell. Bacterial ribosomes contain about 65% rRNA and 
35% protein; they have a diameter of about 18 nm and 
are composed of two unequal subunits with sedimenta- 
tion coefficients of 30S and 50S and a combined sedi- 
mentation coefficient of 70S. Both subunits contain 
dozens of ribosomal proteins and at least one large 
rRNA (Table 27-6). 

Following Zamecnik's dis- 
covery that ribosomes are 
the complexes responsible for 
protein synthesis, and following 
elucidation of the genetic code, 
the study of ribosomes acceler- 
ated. In the late 1960s Masayasu 
Nomura and colleagues demon- 
strated that both ribosomal sub- 
units can be broken down into 
their RNA and protein compo- 
nents, then reconstituted in 
vitro. Under appropriate ex- 
perimental conditions, the RNA and protein sponta- 
neously reassemble to form 30S or 50S subunits nearly 
identical in structure and activity to native subunits. 
This breakthrough fueled decades of research into the 
function and structure of ribosomal RNAs and proteins. 
At the same time, increasingly sophisticated structural 
methods revealed more and more details about ribo- 
some structure. 

The dawn of a new millennium brought with it the 
elucidation of the first high-resolution structures of bac- 
terial ribosomal subunits, providing a wealth of sur- 
prises (Fig. 27-13). First, the traditional focus on the 
protein components of ribosomes was shifted. The ribo- 
somal subunits are huge RNA molecules. In the 508 sub- 
unit, the 5S and 23S rRNAs form the structural core. 
The proteins are secondary elements in the complex, 
decorating the surface. Second and most important, 
there is no protein within 18 A of the active site for 




Masayasu Nomura 
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FIGURE 27-13 The bacterial ribosome. Our understanding of ribo- 
some structure has been greatly enhanced by multiple high-resolution 
images of the bacterial ribosome and its subunits, contributed by 
several research groups. A sampling is presented here, (a) The SOS and 
30S bacterial subunits, split apart to visualize the surfaces that inter- 
act in the active ribosome. The structure on the left is the 50S subunit 
(derived from PDB ID 20W8, 1VSA, and 1CIX), with tRNAs (dis- 
played as green backbone structures) bound to sites E, P, and A, de- 
scribed later in the text; the tRNA anticodons are in red. Proteins 
appear as blue wormlike structures representing the peptide back- 
bone; the rRNA as a gray rendering of the surface features. The struc- 
ture on the right is the 30S subunit (derived from PDB ID 20W8). 
Protein backbones are brown wormlike structures and the rRNA is a 
lighter tan surface rendering. The part of the mRNA that interacts with 
the tRNA anticodons is shown in red. The rest of the mRNA (not 



peptide bond formation. The high-resolution structure 
thus confirms what many had suspected for more than a 
decade: the ribosome is a ribozyme. In addition to the in- 
sight they provide into the mechanism of protein syn- 
thesis (as elaborated below), the detailed structures of 




(d) 50S 



shown) winds through grooves or channels on the 30S subunit 
surface, (b) The assembled active bacterial ribosome, viewed down 
into the groove separating the subunits (derived from PDB ID 20W8, 
1VSA, and 1GIX). All components are colored as in (a), (c) A pair of 
ribosome images in the same orientation as in (b), but with all com- 
ponents shown as surface renderings to emphasize the mass of the 
entire structure. In the structure on the right, the tRNAs have been 
omitted to give a better sense of the cleft where protein synthesis 
occurs, (d) The SOS bacterial ribosome subunit (PDB ID 1Q7Y). The 
subunit is again viewed from the side that attaches to the 30S subunit, 
but tilted down slightly compared with its orientation in (a). The active 
site for peptide bond formation (the peptidyl transferase activity), deep 
within a surface groove and far away from any protein, is marked by a 
bound inhibitor, puromycin (red). 



the ribosome and its subunits have stimulated a new 
look at the evolution of life (Box 27-2). 

The bacterial ribosome is complex, with a combined 
molecular weight of -2.7 million. The two irregularly 
shaped ribosomal subunits fit together to form a cleft 
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BOX 27-2 From an RNA World to a Protein World 



Extant ribozymes generally promote one of two types 
of reactions: hydrolytic cleavage of phosphodiester 
bonds or phosphoryl transfers (Chapter 26). In both 
cases, the substrates of the reactions are also RNA mol- 
ecules. The ribosomal RNAs provide an important ex- 
pansion of the catalytic range of known ribozymes. 
Coupled to the laboratory exploration of potential RNA 
catalytic function (see Box 26-3), the idea of an RNA 
world as a precursor to current life forms becomes in- 
creasingly attractive. 

A viable RNA world would require an RNA capable 
of self-replication, a primitive metabolism to generate 
the needed ribonucleotide precursors, and a cell bound- 
ary to aid in concentrating the precursors and seques- 
tering them from the environment. The requirements 



*5 





FIGURE 1 The SOS subunit of a bacterial ribosome (PDB ID 1NKW). 
The protein backbones are shown as blue wormlike structures; the 
rRNA components are transparent. The unstructured extensions of 
many of the ribosomal proteins snake into the rRNA structures, helping 
to stabilize them. 



for catalysis of reactions involving a growing range of 
metabolites and macromolecules could have led to 
larger and more complex RNA catalysts. The many neg- 
atively charged phosphoryl groups in the RNA backbone 
limit the stability of very large RNA molecules. In an 
RNA world, divalent cations or other positively charged 
groups could be incorporated into the structures to aug- 
ment stability. 

Certain peptides could stabilize large RNA mole- 
cules. For example, many ribosomal proteins in modern 
eukaryotic cells have long extensions, lacking second- 
ary structure, that snake into the rRNAs and help stabi- 
lize them (Fig. 1). Ribozyme-catalyzed synthesis of 
peptides could thus initially have evolved as part of a 
general solution to the structural maintenance of large 
RNA molecules. The synthesis of peptides may have 
helped stabilize large ribozymes, but this advance also 
marked the beginning of the end for the RNA world. 
Once peptide synthesis was possible, the greater cat- 
alytic potential of proteins would have set in motion an 
irreversible transition to a protein-dominated metabolic 
system. 

Most enzymatic processes, then, were eventually 
surrendered to the proteins — but not all. In every organ- 
ism, the critical task of synthesizing the proteins re- 
mains, even now, a ribozyme-catalyzed process. There 
appears to be only one good arrangement (or just a very 
few) of nucleotide residues in a ribozyme active site that 
can catalyze peptide synthesis. The rRNA residues that 
seem to be involved in the peptidyl transferase activity 
of ribosomes are highly conserved in the large-subunit 
rRNAs of all species. Using in vitro evolution (SELEX; 
see Box 26-3), investigators have isolated artificial ri- 
bozymes that promote peptide synthesis. Intriguingly, 
most of them include the ribonucleotide octet 
(5')AUAACAGG(3')> a highly conserved sequence 
found at the peptidyl transferase active site in the ribo- 
somes of all cells. There may be just one optimal solution 
to the overall chemical problem of ribozyme-catalyzed 
synthesis of proteins of defined sequence. Evolution 
found this solution once, and no life form has notably 
improved on it. 



through which the mRNA passes as the ribosome moves 
along it during translation (Fig. 2 7-1 3b). The 57 pro- 
teins in bacterial ribosomes vary enormously in size and 
structure. Molecular weights range from about 6,000 to 
75,000. Most of the proteins have globular domains 
arranged on the ribosome surface. Some also have 
snakelike extensions that protrude into the rRNA core 
of the ribosome, stabilizing its structure. The functions 
of some of these proteins have not yet been elucidated 



in detail, although a structural role seems evident for 
many of them. 

The sequences of the rRNAs of many organisms 
are now known. Each of the three single-stranded 
rRNAs of E, coli has a specific three-dimensional con- 
formation featuring extensive intrachain base pairing. 
The predicted secondary structure of the rRNAs 
(Fig. 27-14) has largely been confirmed in the high- 
resolution models, but fails to convey the extensive 
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Bacterial ribosome 
70S 



5S rRNA 

FIGURE 27-14 Bacterial rRNAs. Diagrams of the secondary structure 
off. coli 1 6S and 5S rRNAs. The first (5' end) and final (3' end) ribonu- 
cleotide residues of the 1 6S rRNA are numbered. 

network of tertiary interactions apparent in the com- 
plete structure. 

The ribosomes of eukaryotic cells (other than mito- 
chondrial and chloroplast ribosomes) are larger and 
more complex than bacterial ribosomes (Fig. 27-15 )> 
with a diameter of about 23 nm and a sedimentation 
coefficient of about 80S. They also have two subunits, 
which vary in size among species but on average are 60S 
and 40S. Altogether, eukaryotic ribosomes contain more 
than 80 different proteins. The ribosomes of mitochon- 
dria and chloroplasts are somewhat smaller and simpler 
than bacterial ribosomes. Nevertheless, ribosomal struc- 
ture and function are strikingly similar in all organisms 
and organelles. 

Transfer RNAs Have Characteristic Structural Features 

To understand how tRNAs can serve as adaptors in 
translating the language of nucleic acids into the lan- 
guage of proteins, we must first examine their struc- 
ture in more detail. Transfer RNAs are relatively small 
and consist of a single strand of RNA folded into a 
precise three-dimensional structure (see Fig. 8-25a). 
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FIGURE 27-15 Summary of the composition and mass of ribosomes in 
bacteria and eukaryotes. Ribosomal subunits are identified by their 
S (Svedberg unit) values, sedimentation coefficients that refer to their 
rate of sedimentation in a centrifuge. The S values are not necessarily 
additive when subunits are combined, because rates of sedimentation 
are affected by shape as well as mass. 



The tRNAs in bacteria and in the cytosol of eukary- 
otes have between 73 and 93 nucleotide residues, cor- 
responding to molecular weights of 24,000 to 31,000. 
Mitochondria and chloroplasts contain distinctive, 
somewhat smaller tRNAs. Cells have at least one kind 
of tRNA for each amino acid; at least 32 tRNAs are re- 
quired to recognize all the amino acid codons (some 
recognize more than one codon), but some cells use 
more than 32. 

Yeast alanine tRNA (tRNA Ala ), the first nucleic 
acid to be completely sequenced (Fig. 27-16), con- 
tains 76 nucleotide residues, 10 of which have 
modified bases. Comparisons of tRNAs from various 
species have revealed many common structural features 
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FIGURE 27-16 Nucleotide sequence of yeast tRNA Ala . This structure 
was deduced in 1965 by Robert W. Holley and his colleagues; it is 
shown in the cloverleaf conformation in which intrastrand base pairing 
is maximal. The following symbols are used for the modified nu- 
cleotides (shaded pink): ip, pseudouridine; I, inosine;T, ribothymidine; 
D, 5,6-dihydrouridine; m 1 l, 1-methylinosine; m 1 G, 1 -methylguanosine; 
m 2 G, N 2 -dimethylguanosine (see Fig. 26-23) Blue lines between 
parallel sections indicate Watson-Crick base pairs. In RNAs, guanosine 
is often base-paired with uridine, although the G=U pair is not as 
stable as the Watson-Crick G=C pair (Chapter 8). The anticodon can 
recognize three codons for alanine (GCA, GCU, and GCC). Other 
features of tRNA structure are shown in Figures 27-1 7 and 27-1 8. 

(Fig. 27-17). Eight or more of the nucleotide residues 
have modified bases and sugars, many of which are 
methylated derivatives of the principal bases. Most 
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FIGURE 27-17 General cloverleaf secondary structure of tRNAs. The 

large dots on the backbone represent nucleotide residues; the blue 
lines represent base pairs. Characteristic and/or invariant residues 
common to all tRNAs are shaded in pink. Transfer RNAs vary in length 
from 73 to 93 nucleotides. Extra nucleotides occur in the extra arm or 
in the D arm. At the end of the anticodon arm is the anticodon loop, 
which always contains seven unpaired nucleotides. The D arm contains 
two or three D (5,6-dihydrouridine) residues, depending on the tRNA. In 
some tRNAs, the D arm has only three hydrogen-bonded base pairs. In 
addition to the symbols explained in Figure 27-1 6: Pu, purine nucleotide; 
Py, pyrimidine nucleotide; G*, guanylate o/"2'-0-methylguanylate. 

tRNAs have a guanylate (pG) residue at the 5' end, and 
all have the trinucleotide sequence CCA(3') at the 3' 
end. When drawn in two dimensions, the hydrogen- 
bonding pattern of all tRNAs forms a cloverleaf struc- 
ture with four arms; the longer tRNAs have a short fifth 
arm, or extra arm (Fig. 27-17). In three dimensions, a 
tRNA has the form of a twisted L (Fig. 27-18). 

Amino 
acid arm 



FIGURE 27-18 Three-dimensional struc- 
ture of yeast tRNA phe deduced from 
x-ray diffraction analysis. The shape 
resembles a twisted L. (a) Schematic dia- 
gram with the various arms identified in 
Figure 27-17 shaded in different colors, 
(b) A space-filling model, with the same 
color coding (PDB ID 4TRA), The CCA 
sequence at the 3' end (orange) is the at- 
tachment point for the amino acid. 
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Two of the arms of a tRNA are critical for its adap- 
tor function. The amino acid arm can carry a specific 
amino acid esterified by its carboxyl group to the 2'- or 
3'-hydroxyl group of the A residue at the 3' end of the 
tRNA. The anticodon arm contains the anticodon. The 
other major arms are the D arm, which contains the un- 
usual nucleotide dihydrouridine (D), and the T^rC 
arm, which contains ribothymidine (T), not usually 
present in RNAs, and pseudouridine (#), which has an 
unusual carbon-carbon bond between the base and ri- 
bose (see Fig. 26-23). The D and T^C arms contribute 
important interactions for the overall folding of tRNA 
molecules, and the Tij/G arm interacts with the large- 
subunit rRNA. 

Having looked at the structures of ribosomes and tRNAs, 
we now consider in detail the five stages of protein 
synthesis. 

Stage 1 ; Aminoacyl-tRN A Synthetases Attach the Correct 
Amino Acids to Their tRNAs 

During the first stage of protein synthesis, taking place 
in the cytosol, aminoacyl-tRNA synthetases esterify the 
20 amino acids to their corresponding tRNAs. Each 
enzyme is specific for one amino acid and one or more 
corresponding tRNAs. Most organisms have one 
aminoacyl-tRNA synthetase for each amino acid. For 
amino acids with two or more corresponding tRNAs, the 
same enzyme usually aminoacylates all of them. 

The structures of all the aminoacyl-tRNA syn- 
thetases of E. coli have been determined. Researchers 
have divided them into two classes (Table 27-7) based 
on substantial differences in primary and tertiary struc- 
ture and in reaction mechanism (Fig. 27-19); these 
two classes are the same in all organisms. There is no ev- 
idence for a common ancestor, and the biological, chem- 
ical, or evolutionary reasons for two enzyme classes for 
essentially identical processes remain obscure. 

The reaction catalyzed by an aminoacyl-tRNA syn- 
thetase is 

Amino acid + tRNA 4- ATP Mg2+ > 

aminoacyl-tRNA + AMP + PPi 
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This reaction occurs in two steps in the enzyme's ac- 
tive site. In step ® (Fig. 27-19) an enzyme-bound in- 
termediate, aminoacyl adenylate (aminoacyl-AMP), is 
formed. In the second step the aminoacyl group is trans- 
ferred from enzyme-bound aminoacyl-AMP to its corre- 
sponding specific tRNA. The course of this second step 
depends on the class to which the enzyme belongs, as 
shown by pathways © and (§) in Figure 27-19. The re- 
sulting ester linkage between the amino acid and the 
tRNA (Fig. 27-20) has a highly negative standard free 
energy of hydrolysis (AG' 0 = -29 kJ/mol). The py- 
rophosphate formed in the activation reaction undergoes 
hydrolysis to phosphate by inorganic pyrophosphatase. 
Thus two high-energy phosphate bonds are ultimately 
expended for each amino acid molecule activated, ren- 
dering the overall reaction for amino acid activation es- 
sentially irreversible: 

Amino acid + tRNA + ATP Mg2+ > 

aminoacyl-tRNA + AMP + 2P A 

AG' 0 * -29kJ/mol 

Proofreading by Aminoacyl-tRNA Synthetases 

The aminoacylation of tRNA accomplishes two ends: 
(1) it activates an amino acid for peptide bond forma- 
tion and (2) it ensures appropriate placement of the 
amino acid in a growing polypeptide. The identity of 
the amino acid attached to a tRNA is not checked on 
the ribosome, so attachment of the correct amino 
acid to the tRNA is essential to the fidelity of protein 
synthesis. 

As you will recall from Chapter 6, enzyme speci- 
ficity is limited by the binding energy available from 
enzyme-substrate interactions. Discrimination between 
two similar amino acid substrates has been studied in 
detail in the case of Ile-tRNA synthetase, which distin- 
guishes between valine and isoleucine, amino acids that 
differ by only a single methylene group ( — CH 2 — ): 
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Note: Here, Arg represents arginyl-tRNA synthetase, and so forth. The classification applies 
to all organisms for which tRNA synthetases have been analyzed and is based on protein 
structural distinctions and on the mechanistic distinction outlined in Figure 27-19. 



Ile-tRNA synthetase favors activation of isoleucine (to 
form Ile-AMP) over valine by a factor of 200 — as we 
would expect, given the amount by which a methylene 
group (in He) could enhance substrate binding. Yet va- 
line is erroneously incorporated into proteins in positions 
normally occupied by an He residue at a frequency of only 
about 1 in 3,000. How is this greater than 10-fold increase 
in accuracy brought about? Ile-tRNA synthetase, like 
some other aminoacyl-tRNA synthetases, has a proofread- 
ing function. 
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MECHANISM FIGURE 27-19 Aminoacylation 
of tRNA by aminoacyl-tRNA synthetases. Step 
(T) is formation of an aminoacyl adenylate, 
which remains bound to the active site. In the 
second step the aminoacyl group is trans- 
ferred to the tRNA. The mechanism of this step 
is somewhat different for the two classes of 
aminoacyl-tRNA synthetases (see Table 27-7). 
For class I enzymes, (2a) the aminoacyl group 
is transferred initially to the 2'-hydroxyl group 
of the 3'-terminal A residue, then (3a) to the 
3 '-hydroxy I group by a transesterification re- 
action. For class II enzymes, @) the aminoa- 
cyl group is transferred directly to the 
3'-hydroxyl group of the terminal adenylate. 
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Aminoacyl group is transferred directly to the 
(25) 3' -OH of the 3 '-terminal A residue of tRNA, 
generating the aminoacyl-tRNA product. 
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FIGURE 27-20 General structure of aminoacyl-tRNAs. The aminoacy! 
group is esterified to the 3' position of the terminal A residue. The es- 
ter linkage that both activates the amino acid and joins it to the tRNA 
is shaded pink, 

Recall a general principle from the discussion of 
proofreading by DNA polymerases (p. 982): if available 
binding interactions do not provide sufficient discrimi- 
nation between two substrates, the necessary specificity 
can be achieved by substrate-specific binding in two 
successive steps. The effect of forcing the system 
through two successive filters is multiplicative. In the 
case of Ile-tRNA synthetase, the first filter is the initial 
binding of the amino acid to the enzyme and its activa- 
tion to aminoacyl-AMR The second is the binding of any 
incorrect aminoacyl-AMP products to a separate active 
site on the enzyme; a substrate that binds in this second 
active site is hydrolyzed. The R group of valine is slightly 
smaller than that of isoleucine, so Val-AMP fits the hy- 
drolytic (proofreading) site of the Ile-tRNA synthetase 
but Ile-AMP does not. Thus Val-AMP is hydrolyzed to va- 
line and AMP in the proofreading active site, and tRNA 
bound to the synthetase does not become aminoacyl- 
ated to the wrong amino acid. 

In addition to proofreading after formation of the 
aminoacyl-AMP intermediate, most aminoacyl-tRNA 
synthetases can also hydrolyze the ester linkage be- 
tween amino acids and tRNAs in the aminoacyl-tRNAs. 
This hydrolysis is greatly accelerated for incorrectly 
charged tRNAs, providing yet a third filter to enhance 
the fidelity of the overall process. The few aminoacyl- 
tRNA synthetases that activate amino acids with no 
close structural relatives (Cys-tRNA synthetase, for ex- 
ample) demonstrate little or no proofreading activity; in 
these cases, the active site for aminoacylation can suffi- 
ciently discriminate between the proper substrate and 
any incorrect amino acid. 



The overall error rate of protein synthesis (~1 mis- 
take per 10 4 amino acids incorporated) is not nearly as 
low as that of DNA replication. Because flaws in a pro- 
tein are eliminated when the protein is degraded and are 
not passed on to future generations, they have less bio- 
logical significance. The degree of fidelity in protein syn- 
thesis is sufficient to ensure that most proteins contain 
no mistakes and that the large amount of energy re- 
quired to synthesize a protein is rarely wasted. One de- 
fective protein molecule is usually unimportant when 
many correct copies of the same protein are present. 

Interaction between an Aminoacyl-tRNA Syn- 
thetase and a tRNA: A "Second Genetic Code" 

An individual aminoacyl-tRNA synthetase must be spe- 
cific not only for a single amino acid but for certain tRNAs 
as well. Discriminating among dozens of tRNAs is just as 
important for the overall fidelity of protein biosynthesis 
as is distinguishing among amino acids. The interaction 
between aminoacyl-tRNA synthetases and tRNAs has 
been referred to as the "second genetic code," reflecting 
its critical role in maintaining the accuracy of protein 
synthesis. The "coding" rules appear to be more complex 
than those in the "first" code. 

Figure 27-21 summarizes what we know about the 
nucleotides involved in recognition by some aminoacyl- 
tRNA synthetases. Some nucleotides are conserved in 



3' 




Anticodon 



FIGURE 27-21 Nucleotide positions in tRNAs that are recognized by 
aminoacyl-tRNA synthetases. Some positions (blue dots) are the same 
in all tRNAs and therefore cannot be used to discriminate one from an- 
other. Other positions are known recognition points for one (orange) or 
more (green) aminoacyl-tRNA synthetases. Structural features other 
than sequence are important for recognition by some of the synthetases. 
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FIGURE 27-22 Aminoacyl-tRNA synthetases. Both synthetases are complexed with their cognate 
tRNAs (green stick structures). Bound ATP (red) pinpoints the active site near the end of the aminoa- 
cyl arm. (a) Cln-tRNA synthetase from E. co//, a typical monomeric class I synthetase (PDB ID 
1QRT). (b) Asp-tRNA synthetase from yeast, a typical dimeric class II synthetase (PDB ID 1ASZ). 



all tRNAs and therefore cannot be used for discrimina- 
tion. By observing changes in nucleotides that alter sub- 
strate specificity, researchers have identified nucleotide 
positions that are involved in discrimination by the 
aminoacyl-tRNA synthetases. These nucleotide positions 
seem to be concentrated in the amino acid arm and the 
anticodon arm, including the nucleotides of the anti- 
codon itself, but are also located in other parts of the 
tRNA molecule. Determination of the crystal structures 
of aminoacyl-tRNA synthetases complexed with their 
cognate tRNAs and ATP has added a great deal to our 
understanding of these interactions (Fig. 27-22), 

Ten or more specific nucleotides may be involved in 
recognition of a tRNA by its specific aminoacyl-tRNA 
synthetase. But in a few cases the recognition mecha- 
nism is quite simple. Across a range of organisms from 
bacteria to humans, the primary determinant of tRNA 
recognition by the Ala-tRNA synthetases is a single G = U 
base pair in the amino acid arm of tRNA^ (Fig. 27-23a). 
A short synthetic RNA with as few as 7 bp arranged in 
a simple hairpin minihelix is efficiently aminoacylated 



FIGURE 27-23 Structural elements of tRNA Ala 
that are required for recognition by Ala-tRNA 
synthetase, (a) The tRNA Ala structural elements 
recognized by the Ala-tRNA synthetase are un- 
usually simple. A single G=U base pair (pink) is 
the only element needed for specific binding and 
aminoacylation. (b) A short synthetic RNA mini- 
helix, with the critical G=U base pair but lacking 
most of the remaining tRNA structure. This is 
aminoacylated specifically with alanine almost as 
efficiently as the complete tRNA Ala . 



by the Ala-tRNA synthetase, as long as the RNA contains 
the critical G = U (Fig. 27-23b). This relatively simple 
alanine system may be an evolutionary relic of a period 
when RNA oligonucleotides, ancestors to tRNA, were 
aminoacylated in a primitive system for protein synthesis. 

The interaction of aminoacyl-tRNA synthetases and 
their cognate tRNAs is critical to accurate reading of the 
genetic code. Any expansion of the code to include new 
amino acids would necessarily require a new aminoacyl- 
tRNA synthetase:tRNA pair. A limited expansion of the 
genetic code has been observed in nature; a more exten- 
sive expansion has been accomplished in the laboratory 
(Box 27-3). 
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BOX 27-3 Natural and Unnatural Expansion of the Genetic Code 



As we have seen, the 20 amino acids commonly found in 
proteins offer only limited chemical functionality Living 
systems generally overcome these limitations by using 
enzymatic cofactors or by modifying particular amino 
acids after they have been incorporated into proteins. In 
principle, expansion of the genetic code to introduce 
new amino acids into proteins offers another route to 
new functionality, but it is a very difficult route to ex- 
ploit. Such a change might just as easily result in the in- 
activation of thousands of cellular proteins. 

Expanding the genetic code to include a new amino 
acid requires several cellular changes. A new aminoacyl- 
tRNA synthetase must generally be present, along with 
a cognate tRNA. Both of these components must be 
highly specific, interacting only with each other and the 
new amino acid. Significant concentrations of the new 
amino acid must be present in the cell, which may entail 
the evolution of new metabolic pathways. As outlined in 
Box 27-1, the anticodon on the tRNA would most likely 
pair with a codon that normally specifies termination. 
Making all of this work in a living cell seems unlikely, but 
it has happened both in nature and in the laboratory. 

There are actually 22 rather than 20 amino acids 
specified by the known genetic code. The two extra ones 
are selenocysteine and pyrrolysine, each found in only a 
very few proteins but both offering a glimpse into the in- 
tricacies of code evolution. 
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A few proteins in all cells (such as formate dehydro- 
genase in bacteria and glutathione peroxidase in mam- 
mals) require selenocysteine for their activity. InE. coli 
selenocysteine is introduced into the enzyme formate 
dehydrogenase during translation, in response to an in- 
frame UGA codon. A special type of Ser-tRNA, present at 
lower levels than other Ser-tRNAs, recognizes UGA and 
no other codons. This tRNA is charged with serine by the 
normal serine aminoacyl-tRNA synthetase, and the ser- 
ine is enzymatically converted to selenocysteine by a 
separate enzyme before its use at the ribosome. The 
charged tRNA does not recognize just any UGA codon; 



some contextual signal in the mRNA, still to be identified, 
ensures that this tRNA recognizes only the few UGA 
codons, within certain genes, that specify selenocysteine. 
In effect, UGA doubles as a codon for both termination 
and (very occasionally) selenocysteine. This particular 
code expansion has a dedicated tRNA as described 
above, but it lacks a dedicated cognate aminoacyl-tRNA 
synthetase. The process works for selenocysteine, but 
one might consider it an intermediate step in the evolu- 
tion of a complete new codon definition. 

Pyrrolysine is found in a group of anaerobic archaea 
called methanogens (see Box 22-1). These organisms 
produce methane as a required part of their metabolism, 
and the Methanosarcinaceae group can use methyl- 
amines as substrates for methanogenesis. Producing 
methane from monomethylamine requires the enzyme 
monomethylamine methyltransferase. The gene encod- 
ing this enzyme has an in-frame UAG termination codon. 
The structure of the methyltransferase was elucidated 
in 2002, revealing the presence of the novel amino acid 
pyrrolysine at the position specified by the UAG codon. 
Subsequent experiments demonstrated that — unlike se- 
lenocysteine — pyrrolysine was attached directly to a 
dedicated tRNA by a cognate pyrrolysyl-tRNA syn- 
thetase. These cells produce pyrrolysine via a metabolic 
pathway that remains to be elucidated. The overall sys- 
tem has all the hallmarks of an established codon assign- 
ment, although it only works for UAG codons in this 
particular gene. As in the case of selenocysteine, there 
are probably contextual signals that direct this tRNA to 
the correct UAG codon. 

Can scientists match this evolutionary feat? Modifi- 
cation of proteins with various functional groups can 
provide important insights into the activity and/or struc- 
ture of the proteins. However, protein modification is of- 
ten quite laborious. For example, an investigator who 
wishes to attach a new group to a particular Cys residue 
will have to somehow block the other Cys residues that 
may be present on the same protein. If one could instead 
adapt the genetic code to enable a cell to insert a modi- 
fied amino acid at a particular location in a protein, the 
process could be rendered much more convenient. Peter 
Schultz and coworkers have done just that. 

To develop a new codon assignment, one again 
needs a new aminoacyl-tRNA synthetase and a novel 
cognate tRNA, both adapted to work only with a par- 
ticular new amino acid. Efforts to create such an "un- 
natural" code expansion initially focused on E, coli. 
The codon UAG was chosen as the best target for en- 
coding a new amino acid. UAG is the least used of the 
three termination codons, and strains with tRNAs se- 
lected to recognize UAG (see Box 27-4) do not ex- 
hibit growth defects. To create the new tRNA and 

(continued on next page) 
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1-i Natural and Unnatural Expansion of the Genetic Code {continued from previous page) 




tRNA 



M/tRNATyr gene 



Randomize M/tRNA T > r sequence 
at 1 1 positions, transform cells 
to create library. 



Add plasmid encoding 
engineered barnase gene. 



Library 




I 

<8> 



Cells containing Ai/tRNATyr variants 
aminoacylated by endogenous tRNA 
synthetases die. Survivors have My'tRNA 1 ^ 
variant that are not aminoacylated. 

M/TyrRS gene 



i -lactamase gene 



Grow in medium 
containing ampicillin 



Add plasmid 

encoding 

M/TyrRS and 

engineered 

/^-lactamase 

gene. 




C ® 



Cells live only if they contain M/'tRNA^ 1 " 
variants aminoacylated by M/TyrRS. 



FIGURE 1 Selecting M/'tRNA Tyr variants that function only with the tyro- 
syl tRNA synthetase M/TyrRS. The sequence of the gene encoding Mj- 
tRNA Tyr , on a plasmid, is randomized at 1 1 positions that do not interact 
with M/TyrRS (red dots). The mutagenized plasmids are introduced 
into E. coli cells to create a library of millions of My'tRNA Tyr variants, rep- 
resented by the six cells shown here. The toxic barnase gene, engi- 
neered to include the sequence TAG so that its transcript includes UAG 
codons, is provided on a separate plasmid, providing a negative selec- 
tion. If this gene is expressed, the cells die. It can only be expressed if 
the M/tRNA Tyr variant expressed by that particular cell is aminoacylated 



by endogenous (E. coli) aminoacyl-tRNA synthetases, inserting an 
amino acid instead of stopping translation. Another gene, encoding 
/3-lactamase, and also engineered with TAG sequences to produce UAG 
stop codons, is provided on yet another plasmid that also expresses the 
gene encoding M/TyrRS. This serves as a means of positive selection for 
the remaining /WytRNA Tyr variants. Those variants that are aminoacy- 
lated by M/TyrRS allow expression of the /3-lactamase gene, which al- 
lows cells to grow on ampicillin. Multiple rounds of negative and 
positive selection yield the best M/tRNA Tyr variants that are aminoacyl- 
ated uniquely by M/TyrRS and used efficiently in translation. 
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tRNA synthetase, the genes for a tyrosyl-tRNA and its 
cognate tyrosyl-tRNA synthetase were taken from the 
archaean Methanococcus jannaschii (My'tRNA 1 ^ 
and MfTyrRS). ATj'TyrRS does not bind to the anti- 
codon loop of Af/'tRNA^ 1 ", allowing the anticodon loop 
to be modified to CUA (complementary to UAG) with- 
out affecting the interaction. Because the archaeal 
and bacterial systems are orthologous, the modified 
archaeal components could be transferred to E. coli 
cells without disrupting the intrinsic translation sys- 
tem of the cells. 

First, the gene encoding MjtRNA^ had to be mod- 
ified to generate an ideal product tRNA — one that was 
not recognized by any aminoacyl-tRNA synthetases en- 
dogenous to E. coli, but was aminoacylated by Mj- 
TyrRS. Finding such a variant could be accomplished 
via a series of negative and positive selection cycles de- 
signed to efficiently sift through variants of the tRNA 
gene (Fig. 1). Parts of the MjtRNA^ 1 " sequence were 
randomized, allowing creation of a library of cells that 
each expressed a different version of the tRNA. A gene 
encoding barnase (a ribonuclease toxic to E. coli) was 
engineered so that its mRNA transcript contained sev- 
eral UAG codons, and this gene was also introduced 
into the cells on a plasmid. If the MytRNA 1 ^ variant ex- 
pressed in a particular cell in the library was aminoacyl- 
ated by an endogenous tRNA synthetase it would 
express the barnase gene and that cell would die (a 
negative selection). Surviving cells would contain tRNA 
variants that were not aminoacylated by endogenous 
tRNA synthetases, but could potentially be aminoacyl- 
ated by My'TyrRS. A positive selection (Fig. 1) was then 
set up by engineering the /3-lactamase gene (which 
confers resistance to the antibiotic ampicillin) so that 



its transcript contained several UAG codons and intro- 
ducing this gene into the cells along with the gene en- 
coding M/IVrRS. Those MjtRNA 1 ^ variants that could 
be aminoacylated by MfTyrRS allowed growth on ampi- 
cillin only when M/TyrRS was also expressed in the cell. 
Several rounds of this negative and positive selection 
scheme identified a newMy'tRNA^ variant that was not 
affected by endogenous enzymes, was aminoacylated 
by M?TyrRS, and functioned well in translation. 

Second, the M/'TyrRS had to be modified to recog- 
nize the new amino acid. The gene encoding M/TyrRS 
was now mutagenized to create a large library of 
variants. Variants that would aminoacylate the new 
Mj/tRNA^ 1 " variant with endogenous amino acids were 
eliminated using the barnase gene selection. A second 
positive selection (similar to the ampicillin selection 
above) was carried out so that cells would survive only if 
the MjtWA^ T variant was aminoacylated only in the 
presence of the unnatural amino acid. Several rounds of 
negative and positive selection generated a cognate 
tRNA synthetase-tRNA pair that recognized only the un- 
natural amino acid. These components were then re- 
named to reflect the unnatural amino acid used in the 
selection. 

Using this approach, many£ f . coli strains have been 
constructed, each capable of incorporating one particular 
unnatural amino acid into a protein in response to a 
UAG codon. The same approach has been used to artifi- 
cially expand the genetic code of yeast and even mam- 
malian cells. Over 30 different amino acids (Fig. 2) can 
be introduced site-specifically and efficiently into 
cloned proteins in this way. The result is an increasingly 
useful and flexible toolkit with which to advance the 
study of protein structure and function. 
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FIGURE 2 A sampling of unnatural amino acids that have been added 
to the genetic code. These unnatural amino acids add uniquely reac- 
tive chemical groups such as (a) a ketone/ (b) an azide, (c) a pho- 
tocrosslinker (a functional group designed to form a covalent bond 



(d) 



(f) 



with a nearby group when activated by light), (d) a highly fluorescent 
amino acid, (e) an amino acid with a heavy atom (Br) for use in crys- 
tallography/ and (f) a long-chain cysteine analog that can form ex- 
tended disulfide bonds. 
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Stage 2: A Specific Amino Acid initiates Protein Synthesis 

Protein synthesis begins at the amino-terminal end and 
proceeds by the stepwise addition of amino acids to the 
carboxyl-terminal end of the growing polypeptide, as 
determined by Howard Dintzis in 1961 (Fig. 27-24). 
The AUG initiation codon thus specifies an amino- 
terminal methionine residue. Although methionine has 
only one codon, (S')AUG, all organisms have two tRNAs 
for methionine. One is used exclusively when (5') AUG 
is the initiation codon for protein synthesis. The other is 
used to code for a Met residue in an internal position in 
a polypeptide. 

The distinction between an initiating (5') AUG and an 
internal one is straightforward. In bacteria, the two types 
of tRNA specific for methionine are designated tRNA Met 
and tRNA™ 6 * The amino acid incorporated in response 
to the (S')AUG initiation codon is AT-formylmethionine 
(Met). It arrives at the ribosome as A^-formylmethionyl- 
tRNA Met (fMet-tRNA met ) , which is formed in two succes- 
sive reactions. First, methionine is attached to tRNA met 
by the Met-tRNA synthetase (which in E. coli amino- 
acylates both tRNA™ et and tRNA Met ): 

Methionine + tRNA met + ATP ► 

Met-tRNA™** + AMP + PPi 

Next, a transformylase transfers a formyl group from 
A/^-formyltetrahydrofolate to the amino group of the 
Met residue: 

AT 10 -Formyltetrahydrofolate + Met-tRNA™ 6 * ► 

tetrahydrofolate + fMet-tRNA™** 

The transformylase is more selective than the Met- 
tRNA synthetase; it is specific for Met residues attached 
to tRNA™ 61 , presumably recognizing some unique struc- 
tural feature of that tRNA. By contrast, Met-tRNA Met 
inserts methionine in interior positions in polypeptides. 
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Addition of the AT-formyl group to the amino group 
of methionine by the transformylase prevents fMet from 
entering interior positions in a polypeptide while also al- 
lowing fMet-tRNA™ 61 to be bound at a specific ribosomal 
initiation site that accepts neither Met-tRNA Met nor any 
other aminoacyl-tRNA. 

In eukaryotic cells, all polypeptides synthesized by 
cytosolic ribosomes begin with a Met residue (rather 
than fMet), but, again, the cell uses a specialized initiat- 
ing tRNA that is distinct from the tRNA Met used at 
(S')AUG codons at interior positions in the mRNA. 
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FIGURE 27-24 Proof that polypeptides grow by addition of amino 
acid residues to the carboxyl end: the Dintzis experiment. Reticulo- 
cytes (immature erythrocytes) actively synthesizing hemoglobin were 
incubated with radioactive leucine (selected because it occurs fre- 
quently in both the a- and /3-globin chains). Samples of completed 
a chains were isolated from the reticulocytes at various times afterward, 
and the distribution of radioactivity was determined. The dark red 
zones show the portions of completed a-globin chains containing ra- 
dioactive Leu residues. At 4 min, only a few residues at the carboxyl 
end of a-globin were labeled, because the only complete globin 
chains with incorporated label after 4 min were those that had nearly 
completed synthesis at the time the label was added. With longer incu- 
bation times, successively longer segments of the polypeptide con- 
tained labeled residues, always in a block at the carboxyl end of the 
chain. The unlabeled end of the polypeptide (the amino terminus) was 
thus defined as the initiating end, which means that polypeptides grow 
by successive addition of amino acids to the carboxyl end. 

Polypeptides synthesized by mitochondrial and chloroplast 
ribosomes, however, begin with A^-formylmethionine. This 
strongly supports the view that mitochondria and 
chloroplasts originated from bacterial ancestors that 
were symbiotically incorporated into precursor eukary- 
otic cells at an early stage of evolution (see Fig. 1-36). 

How can the single (5')AUG codon determine 
whether a starting A/'-formylmethionine (or methionine, 
in eukaryotes) or an interior Met residue is ultimately 
inserted? The details of the initiation process provide 
the answer. 

The Three Steps of Initiation The initiation of 

polypeptide synthesis in bacteria requires (1) the 30S ri- 
bosomal subunit, (2) the mRNA coding for the polypep- 
tide to be made, (3) the initiating Met-tRNA™ 6 *, (4) a 
set of three proteins called initiation factors (IF-1, IF-2, 
and IP-3), (5) GTP, (6) the SOS ribosomal subunit, and 
(7) Mg 2+ . Formation of the initiation complex takes 
place in three steps (Fig. 27-25). 

In step (D the 30S ribosomal subunit binds two initi- 
ation factors, IF-1 and IF-3. Factor IF-3 prevents the 
30S and SOS subunits from combining prematurely. The 
mRNA then binds to the 30S subunit. The initiating 
(S')AUG is guided to its correct position by the Shine- 
Dalgarno sequence (named for Australian researchers 
John Shine and Lynn Dalgarno, who identified it) in the 
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FIGURE 27-25 Formation of the initiation complex in bacteria. The com- 
plex forms in three steps (described in the text) at the expense of the hy- 
drolysis of GTP to GDP and Pi. IF-1, IF-2, and IF-3 are initiation factors. P 
designates the peptidyl site, A the aminoacyl site, and E the exit site. Here 
the anticodon of the tRNA is oriented 3' to 5', left to right, as in Figure 
27-8 but opposite to the orientation in Figures 27-21 and 27-23. 



mRNA. This consensus sequence is an initiation signal 
of four to nine purine residues, 8 to 13 bp to the 5' side 
of the initiation codon (Fig. 27-26a). The sequence 
base-pairs with a complementary pyrimidine-rich se- 
quence near the 3' end of the 16S rRNA of the 30S riboso- 
mal subunit (Fig. 27-26b). This mRNA-rRNA interaction 
positions the initiating (5') AUG sequence of the mRNA in 
the precise position on the 30S subunit where it is re- 
quired for initiation of translation. The particular (5') AUG 
where fMet-tRNA^ 161 is to be bound is distinguished from 
other methionine codons by its proximity to the Shine- 
Dalgarno sequence in the mRNA. 

Bacterial ribosomes have three sites that bind tRNAs, 
the aminoacyl (A) site, the peptidyl (P) site, and 
the exit (E) site. The A and P sites bind to aminoacyl- 
tRNAs, whereas the E site binds only to uncharged tRNAs 
that have completed their task on the ribosome. Both 
the 30S and the 60S subunits contribute to the charac- 
teristics of the A and P sites, whereas the E site is 
largely confined to the 60S subunit. The initiating 
(5') AUG is positioned at the P site, the only site to 
which fMet-tRNA™ 61 can bind (Fig. 27-25). The fMet- 
tRNA^^ is the only aminoacyl-tRNA that binds first to 
the P site; during the subsequent elongation stage, all 
other incoming aminoacyl-tRNAs (including the Met- 
tRNA Met that binds to interior AUG codons) bind first to 
the A site and only subsequently to the P and E sites. 
The E site is the site from which the "uncharged" tRNAs 
leave during elongation. Factor IF-1 binds at the A site 
and prevents tRNA binding at this site during initiation. 

In step (D of the initiation process (Fig. 27-25), the 
complex consisting of the 30S ribosomal subunit, IF-3, 
and mRNA is joined by both GTP-bound IF-2 and the ini- 
tiating fMet-tRNA™**. The anticodon of this tRNA now 
pairs correctly with the mRNA's initiation codon. 

In step © this large complex combines with the 508 
ribosomal subunit; simultaneously, the GTP bound to 
IF-2 is hydrolyzed to GDP and P i; which are released 
from the complex. All three initiation factors depart 
from the ribosome at this point. 

Completion of the steps in Figure 27-25 produces a 
functional 70S ribosome called the initiation complex, 
containing the mRNA and the initiating fMet-tRNA^. 
The correct binding of the fMet-tRNA Met to the P site in 
the complete 70S initiation complex is assured by at 
least three points of recognition and attachment: the 
codon-anticodon interaction involving the initiation 
AUG fixed in the P site; interaction between the Shine- 
Dalgarno sequence in the mRNA and the 16S rRNA; and 
binding interactions between the ribosomal P site and 
the fMet-tRNA fMet . The initiation complex is now ready 
for elongation. 

Initiation in Eukaryotic Cells Translation is gener- 
ally similar in eukaryotic and bacterial cells; most of the 
significant differences are in the mechanism of initia- 
tion. Eukaryotic mRNAs are bound to the ribosome as a 
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FIGURE 27-26 Messenger RNA sequences that serve as signals for initi- 
ation of protein synthesis in bacteria, (a) Alignment of the initiating 
AUG (shaded in green) at its correct location on the 30S ribosomal sub- 
unit depends in part on upstream Shine-Dalgarno sequences (pink). Por- 



tions of the mRNA transcripts of five bacterial genes are shown. Note the 
unusual example of the E. coli Lacl protein, which initiates with a GUG 
(Val) codon (see Box 27-1). (b) The Shine-Dalgarno sequence of the 
mRNA pairs with a sequence near the 3' end of the 1 6S rRNA, 



complex with a number of specific binding proteins. 
Several of these tie together the 5' and 3' ends of the 
message. At the 3' end, the mRNA is bound by the 
poly (A) binding protein (PAB). Eukaryotic cells have at 
least nine initiation factors. A complex called eIF4F, 
which includes the proteins eIF4E, eIF4G, and eIF4A, 
binds to the 5' cap (see Fig. 26-13) through eIF4E. The 
protein eIF4G binds to both eIF4E and PAB, effectively 
tying them together (Fig. 27-27). The protein eIF4A 
has an RNA helicase activity. It is the eIF4F complex 
that associates with another factor, eIF3, and with the 
40S ribosomal subunit. The efficiency of translation is 
affected by many properties of the mRNA and proteins 
in this complex, including the length of the 3' poly(A) 



tract (in most cases, longer is better). The end-to-end 
arrangement of the eukaryotic mRNA facilitates transla- 
tional regulation of gene expression, considered in 
Chapter 28. 

The initiating (5')AUG is detected within the 
mRNA not by its proximity to a Shine-Dalgarno-like se- 
quence but by a scanning process: a scan of the mRNA 
from the 5' end until the first AUG is encountered, sig- 
naling the beginning of the reading frame. The eIF4F 
complex is probably involved in this process, perhaps 
using the RNA helicase activity of eIF4A to eliminate 
secondary structure in the 5' untranslated portion of 
the mRNA. Scanning is also facilitated by another pro- 
tein, eIF4B. 



FIGURE 27-27 Protein complexes in the for- 
mation of a eukaryotic initiation complex. 

The 3' and 5' ends of eukaryotic mRNAs are 
linked by a complex of proteins that includes 
several initiation factors and the poly(A) 
binding protein (PAB). The factors elF4E and 
elF4G are part of a larger complex called 
elF4F. This complex binds to the 40S riboso- 
mal subunit. 
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TABLE 27-8 



Factor 



Protein Factors Required for Initiation of Translation in Bacterial and Eukaryotic Cells 
Function 



Bacterial 

IF-1 
IF-2 
IF-3 

Eukaryotic 

eIF2 

eIF2B ; eIF3 
eIF4A 

eIF4B 
eIF4E 
eIF4G 
eIF5 

eIF6 



Prevents premature binding of tRNAs to A site 

Facilitates binding of fMet-tRNA™ 61 to 30S ribosomal subunit 

Binds to 30S subunit; prevents premature association of 50S 
subunit; enhances specificity of P site for fMet-tRNA™ 6 * 

Facilitates binding of initiating Met-tRNA Met to 40S ribosomal subunit 

First factors to bind 40S subunit; facilitate subsequent steps 

RNA helicase activity removes secondary structure in the mRNA to permit binding 
to 40S subunit; part of the eIF4F complex 

Binds to mRNA; facilitates scanning of mRNA to locate the first AUG 

Binds to the 5' cap of mRNA; part of the eIF4F complex 

Binds to eIF4E and to poly (A) binding protein (PAB); part of the eIF4F complex 

Promotes dissociation of several other initiation factors from 40S subunit as a 
prelude to association of 60S subunit to form 80S initiation complex 

Facilitates dissociation of inactive 80S ribosome into 40S and 60S subunits 



The roles of the various bacterial and eukaryotic ini- 
tiation factors in the overall process are summarized in 
Table 27-8. The mechanism by which these proteins act 
is an important area of investigation. 

Stage 3: Peptide Bonds Are Formed in the 
Elongation Stage 

The third stage of protein synthesis is elongation. 
Again, our initial focus is on bacterial cells. Elongation 
requires (1) the initiation complex described above, (2) 
aminoacyl-tRNAs, (3) a set of three soluble cytosolic 
proteins called elongation factors (EF-Tu, EF-Ts, and 
EF-G in bacteria), and (4) GTP. Cells use three steps to 
add each amino acid residue, and the steps are repeated 
as many times as there are residues to be added. 

Elongation Step 1: Binding of an Incoming 
Aminoacyl-tRNA In the first step of the elongation 
cycle (Fig. 27-28), the appropriate incoming aminoacyl- 
tRNA binds to a complex of GTP-bound EF-Tu. The 
resulting arrunoacyl-tRNA-EF-Tu-GTP complex binds 
to the A site of the 70S initiation complex. The GTP is 
hydrolyzed and an EF-Tu-GDP complex is released 
from the 70S ribosome. The EF-Tu-GTP complex is 
regenerated in a process involving EF-Ts and GTP. 

Elongation Step 2: Peptide Bond Formation A 

peptide bond is now formed between the two amino 
acids bound by their tRNAs to the A and P sites on the 
ribosome. This occurs by the transfer of the initiating 
A^-formylmethionyl group from its tRNA to the amino 



group of the second amino acid, now in the A site 
(Fig. 27-29). The a-amino group of the amino acid in 
the A site acts as a nucleophile, displacing the tRNA in 
the P site to form the peptide bond. This reaction 
produces a dipeptidyl-tRNA in the A site, and the now 
"uncharged" (deacylated) tRNA^ 61 remains bound to 
the P site. The tRNAs then shift to a hybrid binding 
state, with elements of each spanning two different sites 
on the ribosome, as shown in Figure 27-29. 

The enzymatic activity that catalyzes peptide 
bond formation has historically been referred to as 
peptidyl transferase and was widely assumed to be 
intrinsic to one or more of the proteins in the large ri- 
bosomal subunit. We now know that this reaction is 
catalyzed by the 23S rRNA (Fig. 27-13d), adding to 
the known catalytic repertoire of ribozymes. This dis- 
covery has interesting implications for the evolution of 
life (see Box 27-2). 

Elongation Step 3: Translocation In the final step 
of the elongation cycle, translocation, the ribosome 
moves one codon toward the 3' end of the mRNA 
(Fig. 27-30a). This movement shifts the anticodon of the 
dipeptidyl-tRNA, which is still attached to the second 
codon of the mRNA, from the A site to the P site, and shifts 
the deacylated tRNA from the P site to the E site, from 
where the tRNA is released into the cytosol. The third 
codon of the mRNA now lies in the A site and the second 
codon in the P site. Movement of the ribosome along the 
mRNA requires EF-G (also known as translocase) and 
the energy provided by hydrolysis of another molecule of 
GTP. A change in the three-dimensional conformation of 
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FIGURE 27-28 First elongation step in bacteria: binding of the second 
aminoacyl-tRNA. The second aminoacyl-tRNA (AA 2 ) enters the A site 
of the ribosome bound to EF-Tu (shown here as Tu), which also con- 
tains GTP. Binding of the second aminoacyl-tRNA to the A site is ac- 
companied by hydrolysis of the GTP to GDP and P, and release of the 
EF-Tu-GDP complex from the ribosome. The bound GDP is released 
when the EF-Tu-GDP complex binds to EF-Ts, and EF-Ts is subse- 
quently released when another molecule of GTP binds to EF-Tu. This 
recycles EF-Tu and makes it available to repeat the cycle. 

the entire ribosome results in its movement along the 
mRNA. Because the structure of EF-G mimics the structure 
of the EF-Tu-tRNA complex (Fig. 27-30b), EF-G can bind 
the A site and presumably displace the peptidyl-tRNA. 
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FIGURE 27-29 Second elongation step in bacteria: formation of the 
first peptide bond. The peptidyl transferase catalyzing this reaction is 
the 23S rRNA ribozyme. The N-formylmethionyl group is transferred to 
the amino group of the second aminoacyl-tRNA in the A site, forming 
a dipeptidyl-tRNA. At this stage, both tRNAs bound to the ribosome 
shift position in the SOS subunit to take up a hybrid binding state. The 
uncharged tRNA shifts so that its 3' and 5' ends are in the E site. Simi- 
larly, the 3' and 5' ends of the peptidyl tRNA shift to the P site. The an- 
ticodons remain in the A and P sites. 

After translocation, the ribosome, with its attached 
dipeptidyl-tRNA and mRNA, is ready for the next elon- 
gation cycle and attachment of a third amino acid 
residue. This process occurs in the same way as addition 
of the second residue (as shown in Figs 27-28, 27-29, 
and 27-30). For each amino acid residue correctly 
added to the growing polypeptide, two GTPs are hy- 
drolyzed to GDP and as the ribosome moves from 
codon to codon along the mRNA toward the 3' end. 
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The polypeptide remains attached to the tRNA of 
the most recent amino acid to be inserted. This associa- 
tion maintains the functional connection between the 
information in the mRNA and its decoded polypeptide 
output. At the same time, the ester linkage between this 
tRNA and the carboxyl terminus of the growing polypep- 
tide activates the terminal carboxyl group for nucle- 
ophilic attack by the incoming amino acid to form a new 
peptide bond (Fig. 27-29). As the existing ester linkage 
between the polypeptide and tRNA is broken during 
peptide bond formation, the linkage between the polypep- 
tide and the information in the mRNA persists, because 
each newly added amino acid is still attached to its tRNA. 

The elongation cycle in eukaryotes is quite similar 
to that in bacteria. Three eukaryotic elongation factors 





FIGURE 27-30 Third elongation step in bacteria: translocation, (a) The 

ribosome moves one codon toward the 3' end of the mRNA, using en- 
ergy provided by hydrolysis of GTP bound to EF-G (translocase). The 
dipeptidyl-tRNA is now entirely in the P site, leaving the A site open 
for the incoming (third) aminoacyl-tRNA. The uncharged tRNA dissoci- 
ates from the E site, and the elongation cycle begins again. (b)The struc- 
ture of EF-G mimics the structure of EF-Tu complexed with tRNA. 
Shown here are (left) EF-Tu complexed with tRNA (green) (PDB ID 
\ B23) and (right) EF-G complexed with GDP (red) (PDB ID 1 DAR). The 
carboxyl-terminal part of EF-G (dark gray) mimics the structure of the 
anticodon loop of tRNA in both shape and charge distribution. 

(eEFla, eEFljSy, and eEF2) have functions analogous 
to those of the bacterial elongation factors (EF-Tu, EF- 
Ts, and EF-G, respectively). Eukaryotic ribosomes do 
not have an E site; uncharged tRNAs are expelled di- 
rectly from the P site. 

Proofreading on the Ribosome The GTPase activity 
of EF-Tu during the first step of elongation in bacterial cells 
(Fig. 27-28) makes an important contribution to the rate 
and fidelity of the overall biosynthetic process. Both the 
EF-Tu-GTP and EF-Tu-GDP complexes exist for a few 
milliseconds before they dissociate. These two intervals 
provide opportunities for the codon-anticodon interactions 
to be proofread. Incorrect aminoacyl-tRNAs normally dis- 
sociate from the A site during one of these periods. If the 
GTP analog guanosine 5'-0-(3-thiotriphosphate) (GTPyS) 
is used in place of GTP, hydrolysis is slowed, improving the 
fidelity (by increasing the proofreading intervals) but re- 
ducing the rate of protein synthesis. 
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BOX 27-4 Induced Variation in the Geneti 

When a mutation produces a termination codon in the 
interior of a gene, translation is prematurely halted and 
the incomplete polypeptide is usually inactive. These 
are called nonsense mutations. The gene can be re- 
stored to normal function if a second mutation either 
(1) converts the misplaced termination codon to a 
codon specifying an amino acid or (2) suppresses the ef- 
fects of the termination codon. Such restorative muta- 
tions are called nonsense suppressors; they generally 
involve mutations in tRNA genes to produce altered 
(suppressor) tRNAs that can recognize the termination 
codon and insert an amino acid at that position. Most 
known suppressor tRNAs have single base substitutions 
in their anticodons. 

Suppressor tRNAs constitute an experimentally in- 
duced variation in the genetic code to allow the reading 
of what are usually termination codons, much like the 
naturally occurring code variations described in Box 
27-1. Nonsense suppression does not completely disrupt 
normal information transfer in a cell, because the cell 
usually has several copies of each tRNA gene; some of 
these duplicate genes are weakly expressed and account 
for only a minor part of the cellular pool of a particular 
tRNA, Suppressor mutations usually involve a "minor" 
tRNA, leaving the major tRNA to read its codon normally. 

For example, E. coli has three identical genes for 
tRNA^ 1 ", each producing a tRNA with the anticodon 



Code: Nonsense Suppressio 




(5')GUA. One of these genes is expressed at relatively 
high levels and thus its product represents the major 
tRNA^ 1 * species; the other two genes are transcribed in 
only small amounts. A change in the anticodon of the 
tRNA product of one of these duplicate tRNA^ genes, 
from (5')GUA to (5')CUA, produces a minor tRNA 1 ^ 
species that will insert tyrosine at UAG stop codons. 
This insertion of tyrosine at UAG is carried out ineffi- 
ciently, but it can produce enough full-length protein 
from a gene with a nonsense mutation to allow the cell 
to survive. The major tRNA^ continues to translate the 
genetic code normally for the majority of proteins. 

The mutation that leads to creation of a suppressor 
tRNA does not always occur in the anticodon. The sup- 
pression of UGA nonsense codons generally involves the 
tRNA 1 ^ that normally recognizes UGG. The alteration 
that allows it to read UGA (and insert Trp residues at 
these positions) is a G to A change at position 24 (in an 
arm of the tRNA somewhat removed from the anti- 
codon); this tRNA can now recognize both UGG and 
UGA. A similar change is found in tRNAs involved in the 
most common naturally occurring variation in the ge- 
netic code (UGA = Trp; see Box 27-1), 

Suppression should lead to many abnormally long 
proteins, but this does not always occur. We understand 
only a few details of the molecular events in translation 
termination and nonsense suppression. 



The process of protein synthesis (including the 
characteristics of codon-anticodon pairing already de- 
scribed) has clearly been optimized through evolution 
to balance the requirements for speed and fidelity. Im- 
proved fidelity might diminish speed, whereas in- 
creases in speed would probably compromise fidelity. 
And, recall that the proofreading mechanism on the ri- 
bosome establishes only that the proper codon-anti- 
codon pairing has taken place, not that the correct 
amino acid is attached to the tRNA. If a tRNA is suc- 
cessfully aminoacylated with the wrong amino acid (as 
can be done experimentally), this incorrect amino 
acid is efficiently incorporated into a protein in re- 
sponse to whatever codon is normally recognized by 
the tRNA. 

Stage 4: Termination of Polypeptide Synthesis Requires a 
Special Signal 

Elongation continues until the ribosome adds the last 
amino acid coded by the mRNA. Termination, the fourth 
stage of polypeptide synthesis, is signaled by the pres- 
ence of one of three termination codons in the mRNA 
(UAA, UAG, UGA), immediately following the final 



coded amino acid. Mutations in a tRNA anticodon that 
allow an amino acid to be inserted at a termination 
codon are generally deleterious to the cell (Box 27-4). 

In bacteria, once a termination codon occupies the 
ribosomal A site, three termination factors, or re- 
lease factors — the proteins RF-1, RF-2, and RF-3 — 
contribute to (1) hydrolysis of the terminal peptidyl- 
tRNA bond; (2) release of the free polypeptide and the 
last tRNA, now uncharged, from the P site; and (3) dis- 
sociation of the 70S ribosome into its 30S and SOS sub- 
units, ready to start a new cycle of polypeptide synthesis 
(Fig. 27-31). RF-1 recognizes the termination codons 
UAG and UAA, and RF-2 recognizes UGA and UAA. Ei- 
ther RF-1 or RF-2 (depending on which codon is pres- 
ent) binds at a termination codon and induces peptidyl 
transferase to transfer the growing polypeptide to a wa- 
ter molecule rather than to another amino acid. The re- 
lease factors have domains thought to mimic the 
structure of tRNA, as shown for the elongation factor 
EF-G in Figure 27-30b. The specific function of RF-3 
has not been firmly established, although it is thought to 
release the ribosomal subunit. In eukaryotes, a single 
release factor, eRF, recognizes all three termination 
codons. 
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Ribosome recycling leads to dissociation of the 
translation components. The release factors dissociate 
from the posttermination complex (with an uncharged 
tRNA in the P site), and are replaced by EF-G and a 
protein called ribosome recycling factor (RRF; M r 
20,300). Hydrolysis of GTP by EF-G leads to dissocia- 
tion of the 50S subunit from the 30S tRNA-mRNA com- 
plex. EF-G and RRF are replaced by IF-3, which 
promotes the dissociation of the tRNA. The mRNA is 
then released. The complex of IF-3 and the 30S subunit 
is then ready to initiate another round of protein syn- 
thesis (Fig. 27-25). 



FIGURE 27-31 Termination of protein synthesis in bacteria. Termi- 
nation occurs in response to a termination codon in the A site. First, 
a release factor, RF (RF-1 or RF-2, depending on which termination 
codon is present), binds to the A site. This leads to hydrolysis of the 
ester linkage between the nascent polypeptide and the tRNA in the 
P site and release of the completed polypeptide. Finally, the mRNA, 
deacylated tRNA, and release factor leave the ribosome, which dis- 
sociates into its 30S and SOS subunits, aided by ribosome recycling 
factor (RRF), IF-3, and energy provided by EF-G-mediated GTP 
hydrolysis. The 30S subunit complex with IF-3 is ready to begin 
another cycle of translation (see Fig. 27-25). 



Energy Cost of Fidelity in Protein Synthesis Syn- 
thesis of a protein true to the information specified in its 
mRNA requires energy. Formation of each aminoacyl- 
tRNA uses two high-energy phosphate groups. An 
additional ATP is consumed each time an incorrectly 
activated amino acid is hydrolyzed by the deacylation 
activity of an aminoacyl-tRNA synthetase, as part of its 
proofreading activity. A GTP is cleaved to GDP and Pj 
during the first elongation step, and another during the 
translocation step. Thus, on average, the energy derived 
from the hydrolysis of more than four NTPs to NDPs is 
required for the formation of each peptide bond of a 
polypeptide. 

This represents an exceedingly large thermody- 
namic "push" in the direction of synthesis: at least 4 X 
30.5 kJ/mol = 122 kJ/mol of phosphodiester bond en- 
ergy to generate a peptide bond, which has a standard 
free energy of hydrolysis of only about -21 kJ/mol. The 
net free-energy change during peptide bond synthesis is 
thus -101 kJ/mol. Proteins are information-containing 
polymers. The biochemical goal is not simply the forma- 
tion of a peptide bond but the formation of a peptide 
bond between two specified amino acids. Each of the 
high-energy phosphate compounds expended in this 
process plays a critical role in maintaining proper align- 
ment between each new codon in the mRNA and its as- 
sociated amino acid at the growing end of the 
polypeptide. This energy permits very high fidelity in 
the biological translation of the genetic message of 
mRNA into the amino acid sequence of proteins. 

Rapid Translation of a Single Message by Poly- 
somes Large clusters of 10 to 100 ribosomes that are 
very active in protein synthesis can be isolated from 
both eukaryotic and bacterial cells. Electron micro- 
graphs show a fiber between adjacent ribosomes in the 
cluster, which is called a polysome (Fig. 27-32). The 
connecting strand is a single molecule of mRNA that is 
being translated simultaneously by many closely spaced 
ribosomes, allowing the highly efficient use of the 
mRNA. 

In bacteria, transcription and translation are tightly 
coupled. Messenger RNAs are synthesized and trans- 
lated in the same 5'— »3' direction. Ribosomes begin 
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FIGURE 27-32 Polysome, (a) Four ribosomes translating a eukaryotic 
mRNA molecule simultaneously, moving from the 5' end to the 3' end 
and synthesizing a polypeptide from the amino terminus to the car- 
boxyl terminus, (b) Electron micrograph and explanatory diagram of a 



polysome from the silk gland of a silkworm larva. The mRNA is being 
translated by many ribosomes simultaneously. The nascent polypep- 
tides become longer as the ribosomes move toward the 3' end of the 
mRNA. The final product of this process is silk fibroin. 



FIGURE 27-33 Coupling of transcription and translation in bacteria. 

The mRNA is translated by ribosomes while it is still being transcribed 
from DNA by RNA polymerase. This is possible because the mRNA in 
bacteria does not have to be transported from a nucleus to the cyto- 
plasm before encountering ribosomes. In this schematic diagram the 
ribosomes are depicted as smaller than the RNA polymerase. In reality 
the ribosomes (M T 2.7 X 10 6 ) are an order of magnitude larger than 
the RNA polymerase {M r 3.9 X 10 5 ). 

translating the 5' end of the mRNA before transcription 
is complete (Fig. 27-33). The situation is quite differ- 
ent in eukaryotic cells, where newly transcribed mRNAs 
must leave the nucleus before they can be translated. 

Bacterial mRNAs generally exist for just a few min- 
utes (p. 1049) before they are degraded by nucleases. In 
order to maintain high rates of protein synthesis, the 
mRNA for a given protein or set of proteins must be made 
continuously and translated with maximum efficiency. 
The short lifetime of mRNAs in bacteria allows a rapid ces- 
sation of synthesis when the protein is no longer needed. 

Stage 5: Newly Synthesized Polypeptide Chains Undergo 
Folding and Processing 

In the final stage of protein synthesis, the nascent 
polypeptide chain is folded and processed into its 
biologically active form. During or after its synthesis, the 
polypeptide progressively assumes its native conforma- 
tion, with the formation of appropriate hydrogen bonds 
and van der Waals, ionic, and hydrophobic interactions. 
In this way the linear, or one-dimensional, genetic 
message in the mRNA is converted into the three- 
dimensional structure of the protein. Some newly made 
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proteins, bacterial, archaeal, and eukaryotic, do not at- 
tain their final biologically active conformation until 
they have been altered by one or more processing reac- 
tions called posttranslational modifications. 

Ammo-Terminal and Carboxyl-Terminal Modifica- 
tions The first residue inserted in all polypeptides is 
A^-formylmethionine (in bacteria) or methionine (in eukary- 
otes). However, the formyl group, the ammo-terminal Met 
residue, and often additional amino-terminal (and, in some 
cases, carboxyl-terminal) residues may be removed enzy- 
matically in formation of the final functional protein. In as 
many as 50% of eukaryotic proteins, the amino group of the 
amino-terminal residue is TV-acetylated after translation. 
Carboxyl-terminal residues are also sometimes modified. 

Loss of Signal Sequences As we shall see in Sec- 
tion 27.3, the 15 to 30 residues at the amino-terminal end 
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FIGURE 27-34 Some modified amino acid residues, (a) Phosphory- 
lated amino acids, (b) A carboxylated amino acid, (c) Some methy- 
lated amino acids. 

of some proteins play a role in directing the protein to its 
ultimate destination in the cell. Such signal sequences 

are eventually removed by specific peptidases. 

Modification of Individual Amino Acids The hy- 

droxyl groups of certain Ser, Thr, and Tyr residues of 
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some proteins are enzymatically phosphorylated by ATP 
(Fig. 27-34a); the phosphate groups add negative 
charges to these polypeptides. The functional signifi- 
cance of this modification varies from one protein to the 
next. For example, the milk protein casein has many 
phosphoserine groups that bind Ca 2+ . Calcium ; phos- 
phate, and amino acids are all valuable to suckling 
young, so casein efficiently provides three essential nu- 
trients. And as we have seen in numerous instances, 
phosphorylation-dephosphorylation cycles regulate the 
activity of many enzymes and regulatory proteins. 

Extra carboxyl groups may be added to Glu residues 
of some proteins. For example, the blood-clotting protein 
prothrombin contains a number of y-carboxyglutamate 
residues (Fig. 27-34b) in its amino-terminal region, intro- 
duced by an enzyme that requires vitamin K. These car- 
boxyl groups bind Ca 2+ , which is required to initiate the 
clotting mechanism. 

Monomethyl- and dimethyllysine residues (Fig. 
27-34c) occur in some muscle proteins and in cy- 
tochrome c, The calmodulin of most species contains one 
trimethyllysine residue at a specific position. In other 
proteins, the carboxyl groups of some Glu residues un- 
dergo methylation, removing their negative charge. 

Attachment of Carbohydrate Side Chains The 

carbohydrate side chains of glycoproteins are attached 
covalently during or after synthesis of the polypeptide. 
In some glycoproteins, the carbohydrate side chain is 
attached enzymatically to Asn residues (AMinked 
oligosaccharides), in others to Ser or Thr residues (O- 
linked oligosaccharides) (see Fig. 7-29). Many proteins 
that function extracellularly, as well as the lubricating 
proteoglycans that coat mucous membranes, contain 
oligosaccharide side chains (see Fig. 7-27). 

Addition of Isoprenyl Groups A number of eukary- 
otic proteins are modified by the addition of groups 
derived from isoprene (isoprenyl groups). A thioether 
bond is formed between the isoprenyl group and a Cys 
residue of the protein (see Fig. 11-14). The isoprenyl 
groups are derived from pyrophosphorylated intermedi- 
ates of the cholesterol biosynthetic pathway (see Fig. 
21-35), such as farnesyl pyrophosphate (Fig. 27-35). 
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Farnesylated Ras protein 

FIGURE 27-35 Farnesylation of a Cys residue. The thioether linkage is 
shown in red. The Ras protein is the product of the ras oncogene. 
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Proteins modified in this way include the Ras proteins, 
products of the ras oncogenes and proto-oncogenes, 
and G proteins (both discussed in Chapter 12), and 
lamins, proteins found in the nuclear matrix. The iso- 
prenyl group helps to anchor the protein in a membrane. 
The transforming (carcinogenic) activity of the ras 
oncogene is lost when isoprenylation of the Ras protein 
is blocked, a finding that has stimulated interest in iden- 
tifying inhibitors of this posttranslational modification 
pathway for use in cancer chemotherapy. 

Addition of Prosthetic Groups Many proteins 
require for their activity covalently bound prosthetic 
groups. Two examples are the biotin molecule of acetyl- 
CoA carboxylase and the heme group of hemoglobin or 
cytochrome c. 

Proteolytic Processing Many proteins are initially 
synthesized as large, inactive precursor polypeptides 
that are proteolytically trimmed to form their smaller, 
active forms. Examples include proinsulin, some viral 
proteins, and proteases such as chymotrypsinogen and 
trypsinogen (see Fig. 6-38). 

Formation of Disulfide Cross-Links After folding 
into their native conformations, some proteins form in- 
trachain or interchain disulfide bridges between Cys 
residues. In eukaryotes, disulfide bonds are common in 
proteins to be exported from cells. The cross-links 
formed in this way help to protect the native conforma- 
tion of the protein molecule from denaturation in the ex- 
tracellular environment, which can differ greatly from 
intracellular conditions and is generally oxidizing. 

Protein Synthesis Is Inhibited by Many 
Antibiotics and Toxins 

Protein synthesis is a central function in cellular physiol- 
ogy and is the primary target of many naturally occurring 
antibiotics and toxins. Except as noted, these antibiotics 
inhibit protein synthesis in bacteria. The differences be- 
tween bacterial and eukaryotic protein synthesis, though 
in some cases subtle, are sufficient that most of the 
compounds discussed below are relatively harmless to 
eukaryotic cells. Natural selection has favored the evolu- 
tion of compounds that exploit minor differences in 
order to affect bacterial systems selectively, such that 
these biochemical weapons are synthesized by some 
microorganisms and are extremely toxic to others. Be- 
cause nearly every step in protein synthesis can be 
specifically inhibited by one antibiotic or another, antibi- 
otics have become valuable tools in the study of protein 
biosynthesis. 

Puromycin, made by the mold Streptomyces al- 
boniger, is one of the best-understood inhibitory an- 
tibiotics. Its structure is very similar to the 3' end of 
an aminoacyl-tRNA, enabling it to bind to the riboso- 
mal A site and participate in peptide bond formation, 



producing peptidyl-puromycin (Fig. 27-36). How- 
ever, because puromycin resembles only the 3' end of 
the tRNA, it does not engage in translocation and dis- 
sociates from the ribosome shortly after it is linked to 
the carboxyl terminus of the peptide. This prema- 
turely terminates polypeptide synthesis. 

Tetracyclines inhibit protein synthesis in bacteria 
by blocking the A site on the ribosome, preventing the 
binding of aminoacyl-tRNAs. Chloramphenicol inhibits 
protein synthesis by bacterial (and mitochondrial and 
chloroplast) ribosomes by blocking peptidyl transfer; it 
does not affect cytosolic protein synthesis in eukaryotes. 
Conversely, cycloheximide blocks the peptidyl trans- 
ferase of 80S eukaryotic ribosomes but not that of 70S 
bacterial (and mitochondrial and chloroplast) ribosomes. 
Streptomycin, a basic trisaccharide, causes misreading 
of the genetic code (in bacteria) at relatively low concen- 
trations and inhibits initiation at higher concentrations. 
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Several other inhibitors of protein synthesis are no- 
table because of their toxicity to humans and other 
mammals. Diphtheria toxin (M T 58,330) catalyzes 
the ADP-ribosylation of a diphthamide (a modified 
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FIGURE 27-36 Disruption of peptide bond formation by puromycin. 

(a) The antibiotic puromycin resembles the aminoacyl end of a charged 
tRNA, and it can bind to the ribosomal A site and participate in peptide 
bond formation (see Fig. 27-1 3d). The product of this reaction, instead 
of being translocated to the P site, dissociates from the ribosome, caus- 
ing premature chain termination, (b) Peptidyl puromycin. 



histidine) residue of eukaryotic elongation factor eEF2, 
thereby inactivating it. Ricin (M r 29,895), an extremely 
toxic protein of the castor bean, inactivates the 60S sub- 
unit of eukaryotic ribosomes by depurinating a specific 
adenosine in 23S rRNA. 



SUMMARY 27.2 Protein Synthesis 

■ Protein synthesis occurs on the ribosomes, which 
consist of protein and rRNA. Bacteria have 70S 
ribosomes, with a large (SOS) and a small (30S) 
subunit. Eukaryotic ribosomes are significantly 
larger (80S) and contain more proteins. 

■ Transfer RNAs have 73 to 93 nucleotide residues, 
some of which have modified bases. Each tRNA has 
an amino acid arm with the terminal sequence 
CCA(3') to which an amino acid is esterified, an 
anticodon arm, a Ti/jG arm, and a D arm; some tRNAs 
have a fifth arm. The anticodon is responsible for 
the specificity of interaction between the aminoacyl- 
tRNA and the complementary mRNA codon. 

■ The growth of polypeptides on ribosomes begins 
with the amino-terminal amino acid and proceeds 
by successive additions of new residues to the 
carboxyl-terminal end. 

■ Protein synthesis occurs in five stages. 

1. Amino acids are activated by specific aminoacyl- 
tRNA synthetases in the cytosol. These enzymes 
catalyze the formation of aminoacyl-tRNAs, with 
simultaneous cleavage of ATP to AMP and PP it 
The fidelity of protein synthesis depends on the 
accuracy of this reaction, and some of these enzymes 
carry out proofreading steps at separate active sites. 

2. In bacteria, the initiating aminoacyl-tRNA in all 
proteins is TV-formylmethionyl-tRNA^ 61 . Initiation 
of protein synthesis involves formation of a 
complex between the 30S ribosomal subunit, mRNA, 
GTP, fMet-tRNA™ 6 *, three initiation factors, and the 
SOS subunit; GTP is hydrolyzed to GDP and Pi. 

3. In the elongation steps, GTP and elongation 
factors are required for binding the incoming 
aminoacyl-tRNA to the A site on the ribosome. In 
the first peptidyl transfer reaction, the fMet residue 
is transferred to the amino group of the incoming 
aminoacyl-tRNA. Movement of the ribosome along 
the mRNA then translocates the dipeptidyl-tRNA 
from the A site to the P site, a process requiring 
hydrolysis of GTP. Deacylated tRNAs dissociate 
from the ribosomal E site. 

4. After many such elongation cycles, synthesis of 
the polypeptide is terminated with the aid of 
release factors. At least four high-energy phosphate 
equivalents (from ATP and GTP) are required to 
generate each peptide bond, an energy investment 
required to guarantee fidelity of translation. 
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5. Polypeptides fold into their active, three- 
dimensional forms. Many proteins are further 
processed by posttranslational modification 
reactions. 

■ Many well-studied antibiotics and toxins inhibit 
some aspect of protein synthesis. 



27.3 Protein Targeting and Degradation 

The eukaryotic cell is made up of many structures, com- 
partments, and organelles, each with specific functions 
that require distinct sets of proteins and enzymes. 
These proteins (with the exception of those produced 
in mitochondria and plastids) are synthesized on 
ribosomes in the cytosol, so how are they directed to 
their final cellular destinations? 

We are now beginning to understand this complex 
and fascinating process. Proteins destined for secretion, 
integration in the plasma membrane, or inclusion in 
lysosomes generally share the first few steps of a 
pathway that begins in the endoplasmic reticulum. Pro- 
teins destined for mitochondria, chloroplasts, or the 
nucleus use three separate mechanisms. And proteins 
destined for the cytosol simply remain where they are 
synthesized. 

The most important element in many of these tar- 
geting pathways is a short sequence of amino acids 
called a signal sequence, whose function was first pos- 




tulated by Giinter Blobel and 
colleagues in 1970. The signal 
sequence directs a protein to 
its appropriate location in the 
cell and, for many proteins, is 
removed during transport or 
after the protein has reached 
its final destination. In proteins 
slated for transport into mito- 
chondria, chloroplasts, or the 
ER, the signal sequence is at 



Gunter Blobel the amino terminus of a newly 



synthesized polypeptide. In many cases, the targeting 
capacity of particular signal sequences has been con- 
firmed by fusing the signal sequence from one protein 
to a second protein and showing that the signal directs 
the second protein to the location where the first pro- 
tein is normally found. The selective degradation of 
proteins no longer needed by the cell also relies largely 
on a set of molecular signals embedded in each pro- 
tein's structure. 

In this concluding section we examine protein tar- 
geting and degradation, emphasizing the underlying sig- 
nals and molecular regulation that are so crucial to 
cellular metabolism. Except where noted, the focus is 
now on eukaryotic cells. 

Posttransiational Modification of Many Eukaryotic 
Proteins Begins in the Endoplasmic Reticulum 

Perhaps the best-characterized targeting system begins 
in the ER. Most lysosomal, membrane, or secreted 
proteins have an amino-terminal signal sequence 
(Fig. 27-37 ) that marks them for translocation into the 
lumen of the ER; hundreds of such signal sequences have 
been determined. The carboxyl terminus of the signal 
sequence is defined by a cleavage site, where protease 
action removes the sequence after the protein is im- 
ported into the ER. Signal sequences vary in length from 
13 to 36 amino acid residues, but all have the following 
features: (1) about 10 to 15 hydrophobic amino acid 
residues; (2) one or more positively charged residues, 
usually near the amino terminus, preceding the hy- 
drophobic sequence; and (3) a short sequence at the 
carboxyl terminus (near the cleavage site) that is rela- 
tively polar, typically having amino acid residues with 
short side chains (especially Ala) at the positions closest 
to the cleavage site. 

As originally demonstrated by George Palade, pro- 
teins with these signal sequences are synthesized on 
ribosomes attached to the ER. The signal sequence itself 
helps to direct the ribosome to the ER, as illustrated by 
steps ® through ® in Figure 27-38. (J) The targeting 



cleavage 

Human influenza s J e 

virus A Met Lys Ala Lys Leu Leu Val Leu Leu Tyr Ala Phe Val Ala Gly Asp Gin - 

Human ^ 
preproinsulin Met Ala Leu Trp Met Arg Leu Leu Pro Leu Leu Ala Leu Leu Ala Leu Trp Gly Pro Asp Pro Ala Ala Ala Phe Val - 
Bovine 

growth , 
hormone Met Met Ala Ala Gly Pro Arg Thr Ser Leu Leu Leu Ala Phe Ala Leu Leu Cys Leu Pro Trp Thr Gin Val Val Gly Ala Phe - 
Bee I 
promellitin Met Lys Phe Leu Val Asn Val Ala Leu Val Phe Met Val Val Tyr He Ser Tyr De Tyr Ala Ala Pro - 

Drosophila glue ^ 

protein Met Lys Leu Leu Val Val Ala Val He Ala Cys Met Leu He Gly Phe Ala Asp Pro Ala Ser Gly Cys Lys - 

FIGURE 27-37 Amino-terminal signal sequences of some eukaryotic the polar and short-side-chain residues immediately preceding (to the 
proteins that direct their translocation into the ER. The hydrophobic left of, as shown here) the cleavage sites (indicated by red arrows), 
core (yellow) is preceded by one or more basic residues (blue). Note 
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FIGURE 27-38 Directing eukaryotic proteins with the appropriate sig- 
nals to the endoplasmic reticulum. This process involves the SRP cycle 
and translocation and cleavage of the nascent polypeptide. The steps 
are described in the text. SRP is a rod-shaped complex containing a 
300 nucleotide RNA (7SL-RNA) and six different proteins (combined 
M r 325,000). One protein subunit of SRP binds directly to the signal 



sequence, inhibiting elongation by sterically blocking the entry of 
aminoacyl-tRNAs and inhibiting peptidyl transferase. Another protein 
subunit binds and hydrolyzes GTP. The SRP receptor is a heterodimer 
of a (M r 69,000) and p (M r 30,000) subunits, both of which bind and 
hydrolyze multiple GTP molecules during this process. 



George Palade 



pathway begins with initiation 
of protein synthesis on free ri- 
bosomes. © The signal se- 
quence appears early in the 
synthetic process, because it is 
at the amino terminus, which 
as we have seen is synthesized 
first, (g) As it emerges from 
the ribosome, the signal se- 
quence — and the ribosome it- 
self — are bound by the large 
signal recognition particle 
(SRP); SRP then binds GTP 
and halts elongation of the polypeptide when it is about 
70 amino acids long and the signal sequence has com- 
pletely emerged from the ribosome. @ The GTP-bound 
SRP now directs the ribosome (still bound to the 
mRNA) and the incomplete polypeptide to GTP-bound 
SRP receptors in the cytosolic face of the ER; the nas- 
cent polypeptide is delivered to a peptide transloca- 
tion complex in the ER, which may interact directly 
with the ribosome. (§) SRP dissociates from the ribo- 
some, accompanied by hydrolysis of GTP in both SRP 
and the SRP receptor. (§) Elongation of the polypeptide 



now resumes, with the ATP-driven translocation com- 
plex feeding the growing polypeptide into the ER lu- 
men until the complete protein has been synthesized. 
© The signal sequence is removed by a signal peptidase 
within the ER lumen; (§) the ribosome dissociates and 
is recycled. 

Glycosylate?) Plays a Key Role in Protein 
Targeting 

In the ER lumen, newly synthesized proteins are fur- 
ther modified in several ways. Following the removal 
of signal sequences, polypeptides are folded, disulfide 
bonds formed, and many proteins glycosylated to 
form glycoproteins. In many glycoproteins the linkage 
to their oligosaccharides is through Asn residues. 
These AMinked oligosaccharides are diverse (Chapter 
7), but the pathways by which they form have a com- 
mon first step. A 14 residue core oligosaccharide is 
built up in a stepwise fashion, then transferred from a 
dolichol phosphate donor molecule to certain Asn 
residues in the protein (Fig. 27-39). The transferase 
is on the lumenal face of the ER and thus cannot cat- 
alyze glycosylation of cytosolic proteins. After transfer, 
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FIGURE 27-39 Synthesis of the core oligosaccharide of glycoproteins. 

The core oligosaccharide is built up by the successive addition of 
monosaccharide units. (T),@The first steps occur on the cytosol ic face 
of the ER. ©Translocation moves the incomplete oligosaccharide across 
the membrane (mechanism not shown), and (?) completion of the core 
oligosaccharide occurs within the lumen of the ER. The precursors that 
contribute additional mannose and glucose residues to the growing 
oligosaccharide in the lumen are dolichol phosphate derivatives. In the 
first step in the construction of the N-linked oligosaccharide moiety of 
a glycoprotein, @, @ the core oligosaccharide is transferred from 
dolichol phosphate to an Asn residue of the protein within the ER lu- 
men. The core oligosaccharide is then further modified in the ER and 
the Golgi complex in pathways that differ for different proteins, The 
five sugar residues shown surrounded by a beige screen (after step ©) 
are retained in the final structure of all N-linked oligosaccharides. 
® The released dolichol pyrophosphate is again translocated so that 
the pyrophosphate is on the cytosol ic face of the ER, then @ a phos- 
phate is hydrolytically removed to regenerate dolichol phosphate. 



the core oligosaccharide is trimmed and elaborated in 
different ways on different proteins, but all AMinked 
oligosaccharides retain a pentasaccharide core derived 
from the original 14 residue oligosaccharide. Several 
antibiotics act by interfering with one or more steps in 
this process and have aided in elucidating the steps of 
protein glycosylation. The best-characterized is tuni- 
camycin, which mimics the structure of UDP-7V- 
acetylglucosamine and blocks the first step of the 
process (Fig. 27-39, step ®). A few proteins are O- 
glycosylated in the ER, but most O-glycosylation oc- 
curs in the Golgi complex or in the cytosol (for proteins 
that do not enter the ER). 
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Suitably modified proteins can now be moved to a 
variety of intracellular destinations. Proteins travel from 
the ER to the Golgi complex in transport vesicles 
(Fig. 27-40). In the Golgi complex, oligosaccharides are 
O-linked to some proteins, and AMinked oligosaccharides 
are further modified. By mechanisms not yet fully un- 
derstood, the Golgi complex also sorts proteins and 
sends them to their final destinations. The processes 
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FIGURE 27-40 Pathway taken by proteins destined for lysosomes, the 
plasma membrane, or secretion. Proteins are moved from the ER to the 
cis side of the Golgi complex in transport vesicles. Sorting occurs pri- 
marily in the trans side of the Golgi complex. 

that segregate proteins targeted for secretion from 
those targeted for the plasma membrane or lysosomes 
must distinguish among these proteins on the basis of 
structural features other than signal sequences, which 
were removed in the ER lumen. 

This sorting process is best understood in the case 
of hydrolases destined for transport to lysosomes. On 
arrival of a hydrolase (a glycoprotein) in the Golgi com- 
plex, an as yet undetermined feature (sometimes called 
a signal patch) of the three-dimensional structure of the 
hydrolase is recognized by a phosphotransferase, which 
phosphorylates certain mannose residues in the 
oligosaccharide (Fig. 27-41). The presence of one or 
more mannose 6-phosphate residues in its TV-linked 
oligosaccharide is the structural signal that targets the 
protein to lysosomes. A receptor protein in the mem- 
brane of the Golgi complex recognizes the mannose 6- 
phosphate signal and binds the hydrolase so marked. 
Vesicles containing these receptor-hydrolase complexes 
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FIGURE 27-41 Phosphorylation of mannose 
residues on lysosome-targeted enzymes. 

N-Acetylglucosamine phosphotransferase rec- 
ognizes some as yet unidentified structural 
feature of hydrolases destined for lysosomes. 
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bud from the trans side of the Golgi complex and make 
their way to sorting vesicles. Here, the receptor-hydrolase 
complex dissociates in a process facilitated by the lower 
pH in the vesicle and by phosphatase-catalyzed removal 
of phosphate groups from the mannose 6-phosphate 
residues. The receptor is then recycled to the Golgi 
complex, and vesicles containing the hydrolases bud 
from the sorting vesicles and move to the lysosomes. 
In cells treated with tunicamycin (Fig. 27-39, step ®), 
hydrolases that should be targeted for lysosomes are 
instead secreted, confirming that the TV-linked oligosac- 
charide plays a key role in targeting these enzymes to 
lysosomes. 

The pathways that target proteins to mitochondria 
and chloroplasts also rely on amino-terminal signal se- 
quences. Although mitochondria and chloroplasts con- 
tain DNA, most of their proteins are encoded by nuclear 
DNA and must be targeted to the appropriate organelle. 
Unlike other targeting pathways, however, the mito- 
chondrial and chloroplast pathways begin only after a 
precursor protein has been completely synthesized and 
released from the ribosome. Precursor proteins des- 
tined for mitochondria or chloroplasts are bound by cy- 
tosolic chaperone proteins and delivered to receptors 
on the exterior surface of the target organelle. Special- 
ized translocation mechanisms then transport the pro- 
tein to its final destination in the organelle, after which 
the signal sequence is removed. 

Signal Sequences for Nuclear Transport Are Not Cleaved 

Molecular communication between the nucleus and the 
cytosol requires the movement of macromolecules 
through nuclear pores. RNA molecules synthesized in 
the nucleus are exported to the cytosol. Ribosomal pro- 
teins synthesized on cytosolic ribosomes are imported 
into the nucleus and assembled into 60S and 40S riboso- 
mal subunits in the nucleolus; completed subunits are 
then exported back to the cytosol. A variety of nuclear 
proteins (RNA and DNA polymerases, histones, topoiso- 
merases, proteins that regulate gene expression, and so 
forth) are synthesized in the cytosol and imported into 
the nucleus. This traffic is modulated by a complex sys- 
tem of molecular signals and transport proteins that is 
gradually being elucidated. 

In most multicellular eukaryotes, the nuclear enve- 
lope breaks down at each cell division, and once division 
is completed and the nuclear envelope reestablished, 
the dispersed nuclear proteins must be reimported. To 
allow this repeated nuclear importation, the signal 
sequence that targets a protein to the nucleus — the 
nuclear localization sequence, NLS — is not removed 
after the protein arrives at its destination. An NLS, un- 
like other signal sequences, may be located almost any- 
where along the primary sequence of the protein. NLSs 
can vary considerably, but many consist of four to eight 
amino acid residues and include several consecutive 
basic (Arg or Lys) residues. 



Nuclear importation is mediated by a number of 
proteins that cycle between the cytosol and the nucleus 
(Fig. 27-42), including importin a and jS and a small 
GTPase known as Ran (i?as-related nuclear protein). A 
heterodimer of importin a and /3 functions as a soluble 
receptor for proteins targeted to the nucleus, with the a 
subunit binding NLS-bearing proteins in the cytosol. 
The complex of the NLS-bearing protein and the im- 
portin docks at a nuclear pore and is translocated 
through the pore by an energy-dependent mechanism. 
In the nucleus, the importin /3 is bound by Ran GTPase, 
releasing importin (3 from the imported protein. Im- 
portin a is bound by Ran and by CAS (cellular apoptosis 
susceptibility protein) and separated from the NLS- 
bearing protein. Importin a and /3, in their complexes 
with Ran and CAS, are then exported from the nucleus. 
Ran hydrolyzes GTP in the cytosol to release the im- 
portins, which are then free to begin another importa- 
tion cycle. Ran itself is also cycled back into the nucleus 
by the binding of Ran-GDP to nuclear transport factor 2 
(NTF2) . Inside the nucleus, the GDP bound to Ran is re- 
placed with GTP through the action of Ran guanosine 
nucleotide exchange factor (RanGEF; see Box 12-2). 

Bacteria Also Use Signal Sequences for Protein Targeting 

Bacteria can target proteins to their inner or outer 
membranes, to the periplasmic space between these 
membranes, or to the extracellular medium. They use 
signal sequences at the amino terminus of the proteins 
(Fig. 27-43), much like those on eukaryotic proteins 
targeted to the ER, mitochondria, and chloroplasts. 

Most proteins exported fromS coli make use of the 
pathway shown in Figure 27-44. Following translation, 
a protein to be exported may fold only slowly, the 
amino-terminal signal sequence impeding the folding. 
The soluble chaperone protein SecB binds to the pro- 
tein's signal sequence or other features of its incom- 
pletely folded structure. The bound protein is then 
delivered to SecA, a protein associated with the inner 
surface of the plasma membrane. SecA acts as both a re- 
ceptor and a translocating ATPase. Released from SecB 
and bound to SecA, the protein is delivered to a translo- 
cation complex in the membrane, made up of SecY, E, 
and G, and is translocated stepwise through the mem- 
brane at the SecYEG complex in lengths of about 20 
amino acid residues. Each step is facilitated by the hy- 
drolysis of ATP, catalyzed by SecA. 

An exported protein is thus pushed through the 
membrane by a SecA protein located on the cytoplasmic 
surface, rather than being pulled through the membrane 
by a protein on the periplasmic surface. This difference 
may simply reflect the need for the translocating ATPase 
to be where the ATP is. The transmembrane electrochem- 
ical potential can also provide energy for translocation 
of the protein, by an as yet unknown mechanism. 

Although most exported bacterial proteins use this 
pathway, some follow an alternative pathway that uses 



27.3 Protein Targeting and Degradation i1105i 




0,2 M m 



Nucleoplasm 



FIGURE 27—42 Targeting of nuclear proteins, (a) (T) A protein with an 
appropriate nuclear localization signal (NLS) is bound by a complex of 
importin a and fi.(2)The resulting complex binds to a nuclear pore, 
and translocates, (5) Inside the nucleus, dissociation of importin j8 is 
promoted by the binding of Ran-GTP. @ Importin a binds to Ran-CTP 
and CAS (cellular apoptosis susceptibility protein), releasing the nuclear 
protein. (5) Importin a and jS and CAS are transported out of the 



nucleus and recycled. They are released in the cytosol when Ran hy- 
drolyzes its bound GTR © Ran-GDP is bound by NTF2, and transported 
back into the nucleus. © RanGEF promotes the exchange of GDP for 
GTP in the nucleus, and Ran-GTP is ready to process another NLS- 
bearing proteinnmportin complex, (b) Scanning electron micrograph of 
the surface of the nuclear envelope, showing numerous nuclear pores. 



Inner membrane proteins 

Phage fd, major 

coat protein Met Lys Lys Ser Leu Val Leu Lys Ala Ser Val 

Phage fd, minor 
coat protein 

Periplasmic proteins 

Alkaline phosphatase Met Lys Gin Ser Thr He Ala Leu Ala 
Leucine-specific 

binding protein Met Lys Ala Asn Ala Lys Thr He lie Ala Gly 
jS-Lactamase of 

pBR322 Met Ser lie Gin His Phe Arg Val Ala Leu He 

Outer membrane proteins 

Lipoprotein Met Lys Ala Thr Lys Leu Val Leu 

LamB Leu Arg Lys Leu Pro Leu Ala Val Ala 

OmpA Met Met He Thr Met Lys Lys Thr Ala He Ala He Ala Val Ala Leu Ala Gly Phe Ala Thr Val Ala Gin Ala^Ala Pro 



cleavage 
site 

Ala Val Ala Thr Leu Val Pro Met Leu Ser Phe Ala* Ala Glu 



Met Lys Lys Leu Leu Phe Ala lie Pro Leu Val Val Pro Phe Tyr Ser His Ser Ala Glu 



Leu Leu Pro Leu Leu Phe Thr Pro Val Thr Lys Ala Arg Thr 



Met He Ala Leu Ala He Ser His Thr Ala Met Ala Asp Asp 



Pro Phe Phe Ala Ala Phe Cys Leu Pro Val Phe Ala His Pro 



Gly Ala Val He Leu Gly Ser Thr Leu Leu Ala Gly Cys Ser 
Val Ala Ala Gly Val Met Ser Ala Gin Ala Met Ala* Val Asp 



FIGURE 27-43 Signal sequences that target proteins to different loca- 
tions in bacteria. Basic amino acids (blue) near the amino terminus 
and hydrophobic core amino acids (yellow) are highlighted. The cleav- 
age sites marking the ends of the signal sequences are indicated by red 



arrows. Note that the inner bacterial cell membrane (see Fig. 1-6) is 
where phage fd coat proteins and DNA are assembled into phage par- 
ticles. OmpA is outer membrane protein A; LamB is a cell surface re- 
ceptor protein for A phage. 
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FIGURE 27-44 Model for protein export in bacteria. 

(T) A newly translated polypeptide binds to the cy- 
tosolic chaperone protein SecB, which ©delivers it to 
SecA, a protein associated with the translocation com- 
plex (SecYEG) in the bacterial cell membrane. 
(5) SecB is released, and SecA inserts itself into the 
membrane, forcing about 20 amino acid residues of 
the protein to be exported through the translocation 
complex. (4) Hydrolysis of an ATP by SecA provides 
the energy for a conformational change that causes 
SecA to withdraw from the membrane, releasing the 
polypeptide. © SecA binds another ATP, and the next 
stretch of 20 amino acid residues is pushed across the 
membrane through the translocation complex. Steps 
(5) and © are repeated until © the entire protein 
has passed through and is released to the periplasm. 
The electrochemical potential across the membrane 
(denoted by + and -) also provides some of the driv- 
ing force required for protein translocation 
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signal recognition and receptor proteins homologous to 
components of the eukaryotic SRP and SRP receptor 
(Fig. 27-38). 

Cells Import Proteins by Receptor-Mediated Endocytosis 

Some proteins are imported into eukaryotic cells from 
the surrounding medium; examples include low-density 
lipoprotein (LDL), the iron-carrying protein transferrin, 
peptide hormones, and circulating proteins destined for 
degradation. There are several importation pathways 
(Fig. 27-45). In one path, proteins bind to receptors in 
invaginations of the membrane called coated pits, 
which concentrate endocytic receptors in preference to 
other cell-surface proteins. The pits are coated on their 
cytosolic side with a lattice of the protein clathrin, 
which forms closed polyhedral structures (Fig. 27-46). 
The clathrin lattice grows as more receptors are occupied 
by target proteins. Eventually, a complete membrane- 
bounded endocytic vesicle is pinched off the plasma 
membrane with the aid of the large GTPase dynamin, 



and enters the cytoplasm. The clathrin is quickly removed 
by uncoating enzymes, and the vesicle fuses with an 
endosome. ATPase activity in the endosomal mem- 
branes reduces the pH therein, facilitating dissociation 
of receptors from their target proteins. In a related path- 
way, caveolin causes invagination of patches of mem- 
brane containing lipid rafts associated with certain 
types of receptors (see Fig. 11-21). These endocytic 
vesicles then fuse with caveolin-containing internal 
structures, caveosomes, where the internalized mole- 
cules are sorted and redirected to other parts of the 
cell and the caveolins are prepared for recycling to 
the membrane surface. There are also clathrin- and 
caveolin-independent pathways; some make use of 
dynamin and others do not. 

The imported proteins and receptors then go their 
separate ways, their fates varying with the cell and pro- 
tein type. Transferrin and its receptor are eventually 
recycled. Some hormones, growth factors, and immune 
complexes, after eliciting the appropriate cellular re- 
sponse, are degraded along with their receptors. LDL is 



FIGURE 27-45 Summary of endocy- 
tosis pathways in eukaryotic cells. 

Pathways dependent on clathrin or 
caveolin make use of the GTRase dy- 
namin to pinch vesicles from the 
plasma membrane. Some pathways 
do not use clathrin or caveolin; 
some of these make use of dynamin 
and some do not. 
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FIGURE 27-46 Clathrin. (a) Three light (L) chains (M r 35,000) and three heavy (H) chains (M r 180,000) 
of the (HL) 3 clathrin unit, organized as a three-legged structure called a triskelion. (b) Triskelions tend 
to assemble into polyhedral lattices, (c) Electron micrograph of a coated pit on the cytosolic face of the 
plasma membrane of a fibroblast. 



degraded after the associated cholesterol has been 
delivered to its destination, but the LDL receptor is 
recycled (see Fig. 21-42). 

Receptor-mediated endocytosis is exploited by 
some toxins and viruses to gain entry to cells. Influenza 
virus, diphtheria toxin, and cholera toxin all enter cells 
in this way. 

Protein Degradation Is Mediated by Specialized 
Systems in All Cells 

Protein degradation prevents the buildup of abnormal 
or unwanted proteins and permits the recycling of 
amino acids. The half-lives of eukaryotic proteins vary 
from 30 seconds to many days. Most proteins turn over 
rapidly relative to the lifetime of a cell, although a few 
(such as hemoglobin) can last for the life of the cell 
(about 110 days for an erythrocyte). Rapidly degraded 
proteins include those that are defective because of in- 
correctly inserted amino acids or because of damage ac- 
cumulated during normal functioning. And enzymes 
that act at key regulatory points in metabolic pathways 
often turn over rapidly. 

Defective proteins and those with characteristically 
short half-lives are generally degraded in both bacterial 



and eukaryotic cells by selective ATP-dependent cy- 
tosolic systems. A second system in vertebrates, operat- 
ing in lysosomes, recycles the amino acids of membrane 
proteins, extracellular proteins, and proteins with char- 
acteristically long half-lives. 

In E coli, many proteins are degraded by an ATP- 
dependent protease called Lon (the name refers to the 
"long form" of proteins, observed only when this pro- 
tease is absent). The protease is activated in the pres- 
ence of defective proteins or those slated for rapid 
turnover; two ATP molecules are hydrolyzed for every 
peptide bond cleaved. The precise role of this ATP 
hydrolysis is not yet clear. Once a protein has been re- 
duced to small inactive peptides, other ATP-independent 
proteases complete the degradation process. 

The ATP-dependent pathway in eukaryotic cells is 
quite different, involving the protein ubiquitin, which, 
as its name suggests, occurs throughout the eukaryotic 
kingdoms. One of the most highly conserved proteins 
known, ubiquitin (76 amino acid residues) is essen- 
tially identical in organisms as different as yeasts and 
humans. Ubiquitin is covalently linked to proteins 
slated for destruction via an ATP-dependent pathway 
involving three separate enzymes (El, E2, and E3 in 
Fig. 27-47). 
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FIGURE 27-47 Three-step pathway by which ubiquitin is attached to 
a protein. Two different enzyme-ubiquitin intermediates are involved. 
The free carboxyl group of ubiquitin's carboxyl-terminal Gly residue is 
ultimately linked through an amide (isopeptide) bond to an e-amino 
group of a Lys residue of the target protein. Additional cycles produce 
polyubiquitin, a covalent polymer of ubiquitin subunits that targets the 
attached protein for destruction in eukaryotes. 



Ubiquitinated proteins are degraded by a large com- 
plex known as the 26S proteasome (Af r 2.5 X 10 6 ) 
(Fig. 27-48). The eukaryotic proteasome consists of 
two copies each of at least 32 different subunits, most of 
which are highly conserved from yeasts to humans. The 
proteasome contains two main types of subcomplexes, a 
barrel-like core particle and regulatory particles on ei- 
ther end of the barrel. The 20S core particle consists of 
four rings; the outer rings are formed from seven a sub- 
units, and the inner rings from seven /3 subunits. Three 
of the seven subunits in each /3 ring have protease activ- 
ities, each with different substrate specificities. The 
stacked rings of the core particle form the barrel-like 
structure within which target proteins are degraded. 
The 19S regulatory particle on each end of the core par- 
ticle contains approximately 18 subunits, including 
some that recognize and bind to ubiquitinated proteins. 
Six of the subunits are AAA+ ATPases (see Chapter 25) 
that probably function in unfolding the ubiquitinated 
proteins and translocating the unfolded polypeptide 
into the core particle for degradation. The 19S particle 
also deubiquitinates the proteins as they are degraded 
in the proteasome. Most cells have additional regulatory 
complexes that can replace the 19S particle. These al- 
ternative regulators do not hydrolyze ATP and do not 
bind to ubiquitin, but they are important for the degra- 
dation of particular cellular proteins. The 26S protea- 
some can be effectively "accessorized," with regulatory 
complexes changing with changing cellular conditions. 



FIGURE 27-48 Three-dimensional structure of 
the eukaryotic proteasome. The 26S proteasome 
is highly conserved in all eukaryotes. The two sub- 
assemblies are the 20S core particle and the 19S 
regulatory particle, (a) (PDB ID 1IRU) The core 
particle consists of four rings arranged to form a 
barrel-like structure. Each of the inner rings has 
seven different (3 subunits (light blue), three of 
which have protease activities (dark blue). The 
outer rings each have seven different a subunits 
(gray), (b) A regulatory particle forms a cap on 
each end of the core particle. The core particle is 
colored as in (a). The base and lid segments of 
each regulatory particle are presented in different 
shades of pink. The regulatory particle unfolds 
ubiquitinated proteins (blue) and translocates 
them into the core particle, as shown. 
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Relationship between Protein Half-Life 
and A mi no-Terminal Amino Acid Residue 



Ammo-terminal residue Half-life* 
Stabilizing 

Met, Gly, Ala, Ser, Thr, Val >20 h 
Destabilizing 

He, Gin -30 min 

Tyr, Glu -10 min 

Pro ~7min 

Leu, Phe, Asp, Lys -3 min 

Arg ~2 min 



Source: Modified from Bachmair.A., Finley, D., & Varshavsky, A. (1986) In vivo half-life 
of a protein is a function of its amino-terminal residue. Science 234, 179-186. 

♦Half-lives were measured in yeast for the jS-galactosidase protein modified so that in 
each experiment it had a different amino-terminal residue. Half-lives may vary for differ- 
ent proteins and in different organisms, but this general pattern appears to hold for all 
organisms. 

Although we do not yet understand all the signals 
that trigger ubiquitination, one simple signal has been 
found. For many proteins, the identity of the first 
residue that remains after removal of the amino-terminal 
Met residue, and any other posttranslational prote- 
olytic processing of the amino-terminal end, has a pro- 
found influence on half-life (Table 27-9). These 
amino-terminal signals have been conserved over bil- 
lions of years of evolution, and are the same in bacterial 
protein degradation systems and in the human ubiquiti- 
nation pathway. More complex signals, such as the de- 
struction box discussed in Chapter 12 (see Fig. 12-46), 
are also being identified. 

Ubiquitin-dependent proteolysis is as important for 
the regulation of cellular processes as for the elimina- 
tion of defective proteins. Many proteins required at 
only one stage of the eukaryotic cell cycle are rapidly 
degraded by the ubiquitin-dependent pathway after 
completing their function. Ubiquitin-dependent de- 
struction of cyclin is critical to cell-cycle regulation (see 
Fig. 12-46). The E2 and E3 components of the ubiquiti- 
nation pathway (Fig. 27^7) are in fact two large fami- 
lies of proteins. Different E2 and E3 enzymes exhibit 
different specificities for target proteins and thus regu- 
late different cellular processes. Some E2 and E3 
enzymes are highly localized in certain cellular compart- 
ments, reflecting a specialized function. 

Not surprisingly, defects in the ubiquitination path- 
way have been implicated in a wide range of dis- 
ease states. An inability to degrade certain proteins that 
activate cell division (the products of oncogenes) can lead 
to tumor formation, whereas a too-rapid degradation of 
proteins that act as tumor suppressors can have the same 
effect. The ineffective or overly rapid degradation of cellu- 
lar proteins also appears to play a role in a range of other 



conditions: renal diseases, asthma, neurodegenerative dis- 
orders such as Alzheimer's and Parkinson's diseases (asso- 
ciated with the formation of characteristic proteinaceous 
structures in neurons), cystic fibrosis (caused in some 
cases by a too-rapid degradation of a chloride ion channel, 
with resultant loss of function; see Box 11-3), Liddle's 
syndrome (in which a sodium channel in the kidney is not 
degraded, leading to excessive Na + absorption and early- 
onset hypertension) — and many other disorders. Drugs 
designed to inhibit proteasome function are being devel- 
oped as potential treatments for some of these conditions. 
In a changing metabolic environment, protein degradation 
is as important to a cell's survival as is protein synthesis, 
and much remains to be learned about these interesting 
pathways. ■ 

SUMMARY 27.3 Protein Targeting and 
Degradation 

■ After synthesis, many proteins are directed to 
particular locations in the cell. One targeting 
mechanism involves a peptide signal sequence, 
generally found at the amino terminus of a newly 
synthesized protein. 

m In eukaryotic cells, one class of signal sequences is 
recognized by the signal recognition particle (SRP), 
which binds the signal sequence as soon as it 
appears on the ribosome and transfers the entire 
ribosome and incomplete polypeptide to the ER. 
Polypeptides with these signal sequences are 
moved into the ER lumen as they are synthesized; 
once in the lumen they may be modified and moved 
to the Golgi complex, then sorted and sent to 
lysosomes, the plasma membrane, or transport 
vesicles. 

■ Proteins targeted to mitochondria and chloroplasts 
in eukaryotic cells, and those destined for export in 
bacteria, also make use of an amino-terminal signal 
sequence. 

■ Proteins targeted to the nucleus have an internal 
signal sequence that, unlike other signal sequences, 
is not cleaved once the protein is successfully 
targeted. 

■ Some eukaryotic cells import proteins by 
receptor-mediated endocytosis. 

■ All cells eventually degrade proteins, using 
specialized proteolytic systems. Defective proteins 
and those slated for rapid turnover are generally 
degraded by an ATP-dependent system. In eukaryotic 
cells, the proteins are first tagged by linkage 

to ubiquitin, a highly conserved protein. 
Ubiquitin-dependent proteolysis is carried out by 
proteasomes, also highly conserved, and is critical 
to the regulation of many cellular processes. 
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Key Terms 

Terms in bold are defined in the glossary. 



aminoacyl-tRNA 1 066 
aminoacyl-tRNA 

synthetases 1066 
translation 1 066 
codon 1066 
reading frame 1067 
initiation codon 1069 
termination codons 1069 
open reading frame 

(ORF) 1069 
anticodon 1070 
wobble 1072 
translational 

frameshifting 1072 
RNA editing 1073 
initiation 1088 
Shine-Dalgarno 

sequence 1088 
aminoacyl (A) site 1089 
peptidyl(P) site 1089 
exit (E) site 1089 
initiation complex 1089 
elongation 1091 
elongation factors 1091 
peptidyl 

transferase 1 09 1 
translocation 1091 



nonsense 

suppressor 1 094 

termination 1094 
release factors 1094 
polysome 1095 
posttranslational 

modification 1096 
puromycin 1098 
tetracyclines 1098 
chloramphenicol 1 098 
cycloheximide 1098 
streptomycin 1098 
diphtheria toxin 1098 
ricin 1099 

signal sequence 1100 
signal recognition particle 

(SRP) 1101 
peptide translocation 

complex 1101 
tunicamycin 1102 
nuclear localization sequence 

(NLS) 1104 
coated pits 1106 
clathrin 1106 
dynamin 1106 
ubiquitin 1107 
proteasome 1108 



Further Reading 

Genetic Code 

Ambrogelly, A., Palioura, S., & Soil, D. (2007) Natural expansion 
of the genetic code. Nat Chem Biol. 3, 29-35 

Blanc, V. & Davidson, N.O. (2003) C-to-U RNA editing: mechanisms 
leading to genetic diversity. J. Biol Chem 278, 1395-1398. 

Crick, F.H.C. (1966) The genetic code: III. Sci Am. 215 (October), 
55-62. 

An insightful overview of the genetic code at a time when the 
code words had just been worked out . 

Hohn, M.J., Park, H.S., O'Donoghue, P., Schnitzbauer, M., & 
Soil, D. (2006) Emergence of the universal genetic code imprinted 
in an RNA record. Proc Natl Acad Sci USA 103, 18,095-18,100. 
Klobutcher, L.A. & Farabaugh, P.J. (2002) Shifty ciliates: 
frequent programmed translational frameshifting in Euplotids. 
Cell 111, 763-766. 

Levanon, K., Eisenberg E., Rechavi G., & Levanon, E.Y. (2005) 
Letter from the editor: adenosine-to-inosine RNA editing in Alu 
repeats in the human genome. EMBO Rep. 6, 831-835. 

Maas, S., Rich, A., & Nishikura, K. (2003) A-to-I RNA editing: 
recent news and residual mysteries, J. Biol. Chem 278, 
1391-1394. 

Neeman, Y., Dahary, D., & Nishikura, K. (2006) Editor meets 
silencer: crosstalk between RNA editing and RNA interference. Nat. 
Rev Mol. Cell Biol 7,919-931 

Nirenberg, M. (2004) Historical review: deciphering the genetic 
code — a personal account. Trends Biochem Sci 29, 46-54. 



Schimmel, P. & Beebe, K. (2004) Molecular biology — genetic code 
seizes pyrrolysine, Nature 431, 257-258, 

Stadtman, T.C. (1996) Selenocysteine. Rev Biochem 65, 

83-100 

Vetsigian, K., Woese, C, & Goldenfeld, N. (2006) Collective 
evolution and the genetic code Proc Natl Acad Sci USA 103, 
10,696-10,701. 

Xie, J.M. & Schultz, P.G. (2006) Innovation: a chemical toolkit for 
proteins — an expanded genetic code. Nat. Rev Mol Cell Biol 7, 
775-782 

Yanofsky, C. (2007) Establishing the triplet nature of the genetic 
code Cell 128, 815-818. 

Yarus, M., Caporaso, J.G., & Knight, R. (2005) Origins of the 
genetic code: the escaped triplet theory. Annu. Rev Biochem, 74, 
179-198 

Protein Synthesis 

Ban, N., Nissen, P., Hansen, J., Moore, P.B., & Steitz, T.A. 

(2000) The complete atomic structure of the large ribosomal subunit 
at 2.4 angstrom resolution, Science 289, 905-920, 

The first high-resolution structure of a major ribosomal subunit 

Bjork, G.R., Ericson, J.U., Gustafsson, C.E.D., Hagervall, T.G., 
Jonsson, Y.H., & Wikstrom, P.M. (1987) Transfer RNA modifica- 
tion. Annu Rev Biochem 56, 263-288. 

Chapeville, F., Lipmann, E, von Ehrenstein, G., Weisblum, B., 
Ray, W.J., Jr., & Benzer, S. (1962) On the role of soluble ribonu- 
cleic acid in coding for amino acids Proc Natl Acad Sci USA 48, 
1086-1092 

Classic experiments providing proof for Crick's adaptor hypothesis 
and showing that amino acids are not checked after they are linked 
to tRNAs. 

Dintzis, H.M. (1961) Assembly of the peptide chains of hemoglobin. 
Proc Natl Acad Sci. USA 47,247-261, 

A classic experiment establishing that proteins are assembled 
beginning at the amino terminus. 

Giege, R., Sissler, M., & Florentz, C. (1998) Universal rules and 
idiosyncratic features in tRNA identity. Nucleic Acid Res. 26, 
5017-5035. 

Gray, N.K. & Wickens, M. (1998) Control of translation initiation in 
animals Annu Rev Cell Dev Biol 14, 399-458. 

Ibba, M. & Soli, D. (2000) Aminoacyl-tRNA synthesis Annu Rev 
Biochem. 69,617-650. 

Kapp, L.D. & Lorsch, J.R. (2004) The molecular mechanics of 
eukaryotic translation Annu Rev Biochem. 73, 657-704. 

Korostelev, A., Trakhanov, S., Laurberg, M., & Noller, H.F. 

(2006) Crystal structure of a 70S ribosome-tRNA complex reveals 
functional interactions and rearrangements, Cell 126, 1065-1077. 

Moore, P.B. & Steitz, T.A. (2003) The structural basis of large 
ribosomal subunit function Annu Rev Biochem 72, 813-850. 

Peske, F., Rodnina, M.V., & Wintermeyer, W. (2005) Sequence of 
steps in ribosome recycling as defined by kinetic analysis. Mol Cell 
18, 403-412. 

Poehlsgaard, J. & Douthwaite, S. (2005) The bacterial ribosome 
as a target for antibiotics. Nat Rev Microbiol 3, 870-881 

Rodnina, M.V. & Wintermeyer, W. (2001) Fidelity of aminoacyl- 
tRNA selection on the ribosome: kinetic and structural mechanisms. 
Annu. Rev. Biochem. 70, 415-435 

Woese, C.R., Olsen, G.J., Ibba, M., & Soli, D. (2000) Aminoacyl- 
tRNA synthetases, the genetic code, and the evolutionary process. 
Microbiol Mol Biol Rev 64,202-236. 

Protein Targeting and Secretion 

DeMartino, G.N. & Gillette, T.G. (2007) Proteasomes: machines 
for all reasons. Cell 129, 659-662. 



Problems 




Hartmann-Petersen, R., Seeger, M., & Gordon C. (2003) 
Transferring substrates to the 26S proteasome. Trends Biochem 
Sci 28,26-31 

Higgins, M.K. & McMahon, H.T. (2002) Snap-shots of clathrin- 
mediated endocytosis Trends Biochem Sci 27, 257-263 

Liu, C.W., Li, X.H., Thompson, D., Wooding, K., Chang, T., 
Tang, Z., Yu, H., Thomas, P.J., & DeMartino, G.N. (2006) ATP 
binding and ATP hydrolysis play distinct roles in the function of 26S 
proteasome Mol. Cell 24, 39-50. 

Luzio, J.P., Pryor, PR., & Bright, N.A. (2007) Lysosomes: fusion 
and function. Nat Rev Mol. Cell Biol. 8,622-632. 

Mayor, S. & Pagano, R.E. (2007) Pathways of clathrin-independent 
endocytosis Nat Rev Mol Cell Biol 8,603-612 

Neupert, W. (1997) Protein import into mitochondria Annu Rev, 
Biochem. 66, 863-917. 

Pickart, CM. & Cohen, R.E. (2004) Proteasomes and their kin: 
proteases in the machine age Nat Rev Mol Cell Biol 5,177-187. 

Royle, S.J. (2006) The cellular functions of clathrin. Cell Mol Life 
Sci 63, 1823-1832 

Schatz, G. & Dobberstein, B. (1996) Common principles of pro- 
tein translocation across membranes. Science 271, 1519-1525 

Schekman, R. (2007) How sterols regulate protein sorting and traf- 
fic Proc Natl Acad Sci USA 104, 6496-6497. 

Smalle, J. & Vierstra, R.D. (2004) The ubiquitin 26S proteasome 
proteolytic pathway. Annu Rev Plant Biol 55,555-590. 

Stewart, M. (2007) Molecular mechanism of the nuclear protein im- 
port cycle Nat Rev Mol Cell Biol 8, 195-208 

Problems 

1. Messenger RNA Translation Predict the amino acid se- 
quences of peptides formed by ribosomes in response to the 
following mRNA sequences, assuming that the reading frame 
begins with the first three bases in each sequence. 

(a) GGUCAGUCGCUCCUGAUU 

(b) UUGGAUGCGCCAUAAUUUGCU 

(c) CAUGAUGCCUGUUGCUAC 

(d) AUGGACGAA 

2. How Many Different mRNA Sequences Can Specify 
One Amino Acid Sequence? Write all the possible mRNA 
sequences that can code for the simple tripeptide segment 
Leu-Met-Tyr. Your answer will give you some idea about the 
number of possible mRNAs that can code for one polypeptide. 

3. Can the Base Sequence of an mRNA Be Predicted 
from the Amino Acid Sequence of Its Polypeptide Prod- 
uct? A given sequence of bases in an mRNA will code for one 
and only one sequence of amino acids in a polypeptide, if the 
reading frame is specified. From a given sequence of amino 
acid residues in a protein such as cytochrome c, can we pre- 
dict the base sequence of the unique mRNA that coded it? 
Give reasons for your answer. 

4. Coding of a Polypeptide by Duplex DNA The template 
strand of a segment of double-helical DNA contains the 
sequence 

(5')CTTAACACCCCTGACTTCGCGCCGTCG(3') 

(a) What is the base sequence of the mRNA that can be 
transcribed from this strand? 



(b) What amino acid sequence could be coded by the 
mRNA in (a), starting from the 5' end? 

(c) If the complementary (nontemplate) strand of this 
DNA were transcribed and translated, would the resulting 
amino acid sequence be the same as in (b)? Explain the biolog- 
ical significance of your answer. 

5. Methionine Has Only One Codon Methionine is one 
of two amino acids with only one codon. How does the sin- 
gle codon for methionine specify both the initiating residue 
and interior Met residues of polypeptides synthesized by E 

coli? 

6. Synthetic mRNAs The genetic code was elucidated with 
polyribonucleotides synthesized either enzymatically or chem- 
ically in the laboratory. Given what we now know about the ge- 
netic code, how would you make a polyribonucleotide that 
could serve as an mRNA coding predominantly for many Phe 
residues and a small number of Leu and Ser residues? What 
other amino acid(s) would be coded for by this polyribonu- 
cleotide, but in smaller amounts? 

7. Energy Cost of Protein Biosynthesis Determine the 
minimum energy cost, in terms of ATP equivalents expended, 
required for the biosynthesis of the /3-globin chain of hemoglo- 
bin (146 residues), starting from a pool including all necessary 
amino acids, ATP, and GTR Compare your answer with the di- 
rect energy cost of the biosynthesis of a linear glycogen chain 
of 146 glucose residues in (al^>4) linkage, starting from a 
pool including glucose, UTP, and ATP (Chapter 15). From your 
data, what is the extra energy cost of making a protein, in 
which all the residues are ordered in a specific sequence, com- 
pared with the cost of making a polysaccharide containing the 
same number of residues but lacking the informational content 
of the protein? 

In addition to the direct energy cost for the synthesis of a 
protein, there are indirect energy costs — those required for 
the cell to make the necessary enzymes for protein synthesis. 
Compare the magnitude of the indirect costs to a eukaryotic 
cell of the biosynthesis of linear (al— >4) glycogen chains and 
the biosynthesis of polypeptides, in terms of the enzymatic 
machinery involved. 

8. Predicting Anticodons from Codons Most amino 
acids have more than one codon and attach to more than one 
tRNA, each with a different anticodon. Write all possible an- 
ticodons for the four codons of glycine: (5')GGU, GGC, GGA, 
and GGG. 

(a) From your answer, which of the positions in the anti- 
codons are primary determinants of their codon specificity in 
the case of glycine? 

(b) Which of these anticodon-codon pairings has/have a 
wobbly base pair? 

(c) In which of the anticodon-codon pairings do all three 
positions exhibit strong Watson-Crick hydrogen bonding? 

9. Effect of Single-Base Changes on Amino Acid Se- 
quence Much important confirmatory evidence on the ge- 
netic code has come from assessing changes in the amino acid 
sequence of mutant proteins after a single base has been 
changed in the gene that encodes the protein. Which of the 
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following amino acid replacements would be consistent with 
the genetic code if the replacements were caused by a single 
base change? Which cannot be the result of a single-base mu- 
tation? Why? 

(a) Phe^Leu (e) Ile^Leu 

(b) Lys->Ala (f) His^Glu 

(c) Ala->Thr (g) Pro^Ser 

(d) Phe^Lys 

10. Basis of the Sickle-Cell Mutation Sickle-cell hemoglo- 
bin has a Val residue at position 6 of the /3-globin chain, instead 
of the Glu residue found in normal hemoglobin A. Can you pre- 
dict what change took place in the DNA codon for glutamate to 
account for replacement of the Glu residue by Val? 

11. Proofreading by Aminoacyl-tRNA Synthetases The 

isoleucyl-tRNA synthetase has a proofreading function that 
ensures the fidelity of the aminoacylation reaction, but the 
histidyl-tRNA synthetase lacks such a proofreading function. 
Explain. 

12. Importance of the "Second Genetic Code" Some 
aminoacyl-tRNA synthetases do not recognize and bind the 
anticodon of their cognate tRNAs but instead use other struc- 
tural features of the tRNAs to impart binding specificity. The 
tRNAs for alanine apparently fall into this category. 

(a) What features of tRNA^ a are recognized by Ala-tRNA 
synthetase? 

(b) Describe the consequences of a C— >G mutation in the 
third position of the anticodon of tRNA^. 

(c) What other kinds of mutations might have similar 
effects? 

(d) Mutations of these types are never found in natural 
populations of organisms. Why? (Hint: Consider what might 
happen both to individual proteins and to the organism as a 
whole J 

13. Maintaining the Fidelity of Protein Synthesis The 

chemical mechanisms used to avoid errors in protein synthesis 
are different from those used during DNA replication. DNA 
polymerases use a 3'^>5' exonuclease proofreading activity to 
remove mispaired nucleotides incorrectly inserted into a 
growing DNA strand. There is no analogous proofreading func- 
tion on ribosomes and, in fact, the identity of an amino acid at- 
tached to an incoming tRNA and added to the growing 
polypeptide is never checked. A proofreading step that hy- 
drolyzed the previously formed peptide bond after an incor- 
rect amino acid had been inserted into a growing polypeptide 
(analogous to the proofreading step of DNA polymerases) 
would be impractical. Why? (Hint: Consider how the link be- 
tween the growing polypeptide and the mRNA is maintained 
during elongation; see Figs 27-29 and 27-30.) 

14. Predicting the Cellular Location of a Protein The 

gene for a eukaryotic polypeptide 300 amino acid residues 
long is altered so that a signal sequence recognized by SRP oc- 
curs at the polypeptide's amino terminus and a nuclear local- 
ization signal (NLS) occurs internally, beginning at residue 
150. Where is the protein likely to be found in the cell? 



15. Requirements for Protein Translocation across a 
Membrane The secreted bacterial protein OmpA has a pre- 
cursor, ProOmpA, which has the amino-terminal signal se- 
quence required for secretion. If purified ProOmpA is 
denatured with 8 M urea and the urea is then removed (such 
as by running the protein solution rapidly through a gel filtra- 
tion column) the protein can be translocated across isolated 
bacterial inner membranes in vitro. However, translocation be- 
comes impossible if ProOmpA is first allowed to incubate for a 
few hours in the absence of urea. Furthermore, the capacity 
for translocation is maintained for an extended period if 
ProOmpA is first incubated in the presence of another bacter- 
ial protein called trigger factor. Describe the probable function 
of this factor. 

16. Protein-Coding Capacity of a Viral DNA The 5,386 bp 
genome of bacteriophage <£>X174 includes genes for 10 pro- 
teins, designated A to K, with sizes given in the table below. 
How much DNA would be required to encode these 10 pro- 
teins? How can you reconcile the size of the <£X174 genome 
with its protein-coding capacity? 
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Data Analysis Problem 

17. Designing Proteins by Using Randomly Generated 
Genes Studies of the amino acid sequence and corresponding 
three-dimensional structure of wild-type or mutant proteins 
have led to significant insights into the principles that govern 
protein folding. An important test of this understanding would 
be to design a protein based on these principles and see 
whether it folds as expected. 

Kamtekar and colleagues (1993) used aspects of the ge- 
netic code to generate random protein sequences with denned 
patterns of hydrophilic and hydrophobic residues Their clever 
approach combined knowledge about protein structure, amino 
acid properties, and the genetic code to explore the factors 
that influence protein structure. 

They set out to generate a set of proteins with the simple 
four-helix bundle structure shown at the top of page 1113 
(right), with a helices (shown as cylinders) connected by 
segments of random coil (pink). Each a helix is amphipathic — 
the R groups on one side of the helix are exclusively hydropho- 
bic (yellow) and those on the other side are exclusively 
hydrophilic (blue). A protein consisting of four of these helices 
separated by short segments of random coil would be 
expected to fold so that the hydrophilic sides of the helices 
face the solvent. 
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An amphipathic a helix Four-helix bundle 



(a) What forces or interactions hold the four a helices to- 
gether in this bundled structure? 

Figure 4-4a shows a segment of a helix consisting of 10 
amino acid residues. With the gray central rod as a divider, 
four of the R groups (purple spheres) extend from the left side 
of the helix and six extend from the right. 

(b) Number the R groups in Figure from top (amino 
terminus; 1) to bottom (carboxyl terminus; 10). Which R 
groups extend from the left side and which from the right? 

(c) Suppose you wanted to design this 10 amino acid seg- 
ment to be an amphipathic helix, with the left side hydrophilic 
and the right side hydrophobic. Give a sequence of 10 amino 
acids that could potentially fold into such a structure. There 
are many possible correct answers here, 

(d) Give one possible double-stranded DNA sequence 
that could encode the amino acid sequence you chose for (c). 
(It is an internal portion of a protein, so you do not need to in- 
clude start or stop codons.) 

Rather than designing proteins with specific sequences, 
Kamtekar and colleagues designed proteins with partially ran- 
dom sequences, with hydrophilic and hydrophobic amino acid 
residues placed in a controlled pattern. They did this by taking 
advantage of some interesting features of the genetic code to 
construct a library of synthetic DNA molecules with partially 
random sequences arranged in a particular pattern. 

To design a DNA sequence that would encode random hy- 
drophobic amino acid sequences, the researchers began with 



the degenerate codon NTN, where N can be A, G, C, or T. They 
filled each N position by including an equimolar mixture of A, 
G, C, and T in the DNA synthesis reaction to generate a mix- 
ture of DNA molecules with different nucleotides at that posi- 
tion (see Fig. 8-35). Similarly, to encode random polar amino 
acid sequences, they began with the degenerate codon NAN 
and used an equimolar mixture of A, G, and C (but in this case, 
no T) to fill the N positions. 

(e) Which amino acids can be encoded by the NTN 
triplet? Are all amino acids in this set hydrophobic? Does the 
set include all the hydrophobic amino acids? 

(f) Which amino acids can be encoded by the NAN 
triplet? Are all of these polar? Does the set include all the po- 
lar amino acids? 

(g) In creating the NAN codons, why was it necessary to 
leave T out of the reaction mixture? 

Kamtekar and coworkers cloned this library of random 
DNA sequences into plasmids, selected 48 that produced the 
correct patterning of hydrophilic and hydrophobic amino 
acids, and expressed these inE. coli. The next challenge was 
to determine whether the proteins folded as expected. It 
would be very time-consuming to express each protein, 
crystallize it, and determine its complete three-dimensional 
structure. Instead, the investigators used the E. coli protein- 
processing machinery to screen out sequences that led to 
highly defective proteins. In this initial screening, they kept 
only those clones that resulted in a band of protein with the 
expected molecular weight on SDS polyacrylamide gel 
electrophoresis (see Fig. 3-18). 

(h) Why would a grossly misfolded protein fail to produce 
a band of the expected molecular weight on electrophoresis? 

Several proteins passed this initial test, and further explo- 
ration showed that they had the expected four-helix structure. 

(i) Why didn't all of the random-sequence proteins 
that passed the initial screening test produce four-helix 
structures? 

Reference 

Kamtekar, S., Schiffer, J.M., Xiong, H., Babik, J.M., & Hecht, 
M.H. (1993) Protein design by binary patterning of polar and non- 
polar amino acids. Science 262, 1680-1685. 



The fundamental problem of chemical physiology and of embryology 
is to understand why tissue cells do not all express, all the time, all the 
potentialities inherent in their genome. 

— Francois Jacob and Jacques Monod, 
article in Journal of Molecular Biology, 7 96 7 
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Of the 4,000 or so genes in the typical bacterial 
genome, or the perhaps 29,000 genes in the human 
genome, only a fraction are expressed in a cell at 
any given time. Some gene products are present in very 
large amounts: the elongation factors required for pro- 
tein synthesis, for example, are among the most abun- 
dant proteins in bacteria, and ribulose 1,5-bisphosphate 
carboxylase/oxygenase (rubisco) of plants and photo- 
synthetic bacteria is, as far as we know, the most abun- 
dant enzyme in the biosphere. Other gene products 
occur in much smaller amounts; for instance, a cell may 
contain only a few molecules of the enzymes that repair 
rare DNA lesions. Requirements for some gene products 
change over time. The need for enzymes in certain 
metabolic pathways may wax and wane as food sources 
change or are depleted. During development of a multi- 
cellular organism, some proteins that influence cellular 
differentiation are present for just a brief time in only a 
few cells. Specialization of cellular function can dramat- 
ically affect the need for various gene products; an ex- 
ample is the uniquely high concentration of a single 
protein — hemoglobin — in erythrocytes. Given the high 
cost of protein synthesis, regulation of gene expression 
is essential to making optimal use of available energy. 

The cellular concentration of a protein is deter- 
mined by a delicate balance of at least seven processes, 
each having several potential points of regulation: 

1. Synthesis of the primary RNA transcript 
(transcription) 

2. Posttranscriptional modification of mRNA 

3. Messenger RNA degradation 



4. Protein synthesis (translation) 

5. Posttranslational modification of proteins 

6. Protein targeting and transport 

7. Protein degradation 

These processes are summarized in Figure 28-1. We 
have examined several of these mechanisms in previous 
chapters. Posttranscriptional modification of mRNA, by 
processes such as alternative splicing patterns (see 
Fig. 26-22) or RNA editing (see Figs 27-10, 27-12), can 
affect which proteins are produced from an mRNA 
transcript and in what amounts. A variety of nucleotide 
sequences in an mRNA can affect the rate of its degrada- 
tion (p. 1048). Many factors affect the rate at which 
an mRNA is translated into a protein, as well as the 
posttranslational modification, targeting, and eventual 
degradation of that protein (Chapter 27). 

Of the regulatory processes illustrated in Figure 28-1 , 
those operating at the level of transcription initiation are 
the best documented and these are a major focus of 
this chapter; other mechanisms are also considered. 
Researchers continue to discover complex and some- 
times surprising regulatory mechanisms, leading to an 
increasing appreciation of the importance of posttran- 
scriptional and translational regulation, especially in 
eukaryotes. For many genes, the regulatory processes are 
elaborate and redundant and can involve a considerable 
investment of chemical energy. 

Control of transcription initiation permits the syn- 
chronized regulation of multiple genes encoding prod- 
ucts with interdependent activities. For example, when 
their DNA is heavily damaged, bacterial cells require a 
coordinated increase in the levels of the many DNA re- 
pair enzymes. And perhaps the most sophisticated form 
of coordination occurs in the complex regulatory cir- 
cuits that guide the development of multicellular eu- 
karyotes, which can involve many types of regulatory 
mechanisms. 
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FIGURE 28-1 Seven processes that affect the steady-state concen- 
tration of a protein. Each process has several potential points of 
regulation. 

We begin by examining the interactions between 
proteins and DNA that are the key to transcriptional 
regulation. We next discuss the specific proteins that 
influence the expression of specific genes, first in bacte- 
rial and then in eukaryotic cells. Information about post- 
transcriptional and translational regulation is included 
in the discussion, where relevant, to provide a more 
complete overview of the rich complexity of regulatory 
mechanisms. 

28.1 Principles of Gene Regulation 

Genes for products that are required at all times, such as 
those for the enzymes of central metabolic pathways, 
are expressed at a more or less constant level in virtually 
every cell of a species or organism. Such genes are often 
referred to as housekeeping genes. Unvarying expres- 
sion of a gene is called constitutive gene expression. 



For other gene products, cellular levels rise and fall 
in response to molecular signals; this is regulated gene 
expression. Gene products that increase in concentra- 
tion under particular molecular circumstances are 
referred to as inducible; the process of increasing their 
expression is induction. The expression of many of the 
genes encoding DNA repair enzymes, for example, is 
induced by a system of regulatory proteins that re- 
sponds to high levels of DNA damage. Conversely, gene 
products that decrease in concentration in response to a 
molecular signal are referred to as repressible, and the 
process is called repression. For example, in bacteria, 
ample supplies of tryptophan lead to repression of 
the genes for the enzymes that catalyze tryptophan 
biosynthesis. 

Transcription is mediated and regulated by protein- 
DNA interactions, especially those involving the protein 
components of RNA polymerase (Chapter 26). We first 
consider how the activity of RNA polymerase is regu- 
lated, and proceed to a general description of the pro- 
teins participating in this regulation. We then examine 
the molecular basis for the recognition of specific DNA 
sequences by DNA-binding proteins. 

RNA Polymerase Binds to DNA at Promoters 

RNA polymerases bind to DNA and initiate transcription 
at promoters (see Fig. 26-5), sites generally found near 
points at which RNA synthesis begins on the DNA tem- 
plate. The regulation of transcription initiation often en- 
tails changes in how RNA polymerase interacts with a 
promoter. 

The nucleotide sequences of promoters vary con- 
siderably, affecting the binding affinity of RNA poly- 
merases and thus the frequency of transcription 
initiation. Some Escherichia coli genes are transcribed 
once per second, others less than once per cell genera- 
tion. Much of this variation is due to differences in pro- 
moter sequence. In the absence of regulatory proteins, 
differences in promoter sequence may affect the fre- 
quency of transcription initiation by a factor of 1,000 or 
more. Most E. coli promoters have a sequence close to a 
consensus (Fig. 28-2). Mutations that result in a shift 
away from the consensus sequence usually decrease 
promoter function; conversely, mutations toward con- 
sensus usually enhance promoter function. 

Although housekeeping genes are expressed consti- 
tutively, the cellular concentrations of the proteins 
they encode vary widely. For these genes, the RNA 
polymerase-promoter interaction strongly influences 
the rate of transcription initiation; differences in promoter 
sequence allow the cell to synthesize the appropriate 
level of each housekeeping gene product. 

The basal rate of transcription initiation at the pro- 
moters of nonhousekeeping genes is also determined 
by the promoter sequence, but expression of these 
genes is further modulated by regulatory proteins. 
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FIGURE 28-2 Consensus sequence for many E. co// promoters. Most base substitutions in the -10 and 

-35 regions have a negative effect on promoter function. Some promoters also include the UP (upstream 
promoter) element (see Fig. 26-5). By convention, DNA sequences are shown as they exist in the nontem- 
plate strand, with the 5' terminus on the left, Nucleotides are numbered from the transcription start site, 
with positive numbers to the right (in the direction of transcription) and negative numbers to the left. N 
indicates any nucleotide. 



Many of these proteins work by enhancing or interfer- 
ing with the interaction between RNA polymerase and 
the promoter. 

The sequences of eukaryotic promoters are more 
variable than their bacterial counterparts (see Fig. 
26-9). The three eukaryotic RNA polymerases usually 
require an array of general transcription factors in order 
to bind to a promoter. Yet, as with bacterial gene expres- 
sion, the basal level of transcription is determined by the 
effect of promoter sequences on the function of RNA 
polymerase and its associated transcription factors. 

Transcription Initiation Is Regulated by Proteins That 
Bind to or near Promoters 

At least three types of proteins regulate transcription 
initiation by RNA polymerase: specificity factors alter 
the specificity of RNA polymerase for a given promoter 
or set of promoters; repressors impede access of RNA 
polymerase to the promoter; and activators enhance 
the RNA polymerase-promoter interaction. 

We introduced bacterial specificity factors in Chap- 
ter 26 (see Fig. 26-5), although we did not refer to them 
by that name. The a subunit of the E. coli RNA poly- 
merase holoenzyme is a specificity factor that mediates 
promoter recognition and binding. Most E. coli promot- 
ers are recognized by a single <x subunit (M r 70,000), 
cr 70 . Under some conditions, some of the <t 70 subunits 
are replaced by one of six other specificity factors. One 
notable case arises when the bacteria are subjected to 
heat stress, leading to the replacement of <x 70 by cr 32 (M T 
32,000). When bound to o- 32 , RNA polymerase is di- 
rected to a specialized set of promoters with a different 



consensus sequence (Fig. 28-3). These promoters 
control the expression of a set of genes that encode pro- 
teins, including some protein chaperones (p. 143), that 
are part of a stress-induced system called the heat shock 
response. Thus, through changes in the binding affinity 
of the polymerase that direct it to different promoters, a 
set of genes involved in related processes is coordinately 
regulated. In eukaryotic cells, some of the general 
transcription factors, in particular the TATA-binding 
protein (TBP; see Fig. 26-9) , may be considered speci- 
ficity factors. 

Repressors bind to specific sites on the DNA. In bac- 
terial cells, such binding sites, called operators, are 
generally near a promoter. RNA polymerase binding, or 
its movement along the DNA after binding, is blocked 
when the repressor is present. Regulation by means of a 
repressor protein that blocks transcription is referred to 
as negative regulation. Repressor binding to DNA is 
regulated by a molecular signal (or effector), usually a 
small molecule or a protein, that binds to the repressor 
and causes a conformational change. The interaction be- 
tween repressor and signal molecule either increases or 
decreases transcription. In some cases, the conforma- 
tional change results in dissociation of a DNA-bound re- 
pressor from the operator (Fig. 28-4a). Transcription 
initiation can then proceed unhindered. In other cases, 
interaction between an inactive repressor and the signal 
molecule causes the repressor to bind to the operator 
(Fig. 28-Ab). In eukaryotic cells, the binding site for a 
repressor may be some distance from the promoter; 
binding has the same effect as in bacterial cells: inhibit- 
ing the assembly or activity of a transcription complex at 
the promoter. 
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FIGURE 28-3 Consensus sequence for promoters that regulate expression of the E. coli heat shock genes. 

This system responds to temperature increases as well as some other environmental stresses, resulting in the 
induction of a set of proteins, Binding of RNA polymerase to heat shock promoters is mediated by a spe- 
cialized cr subunit of the polymerase, a 32 , which replaces a 70 in the RNA polymerase initiation complex. 
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FIGURE 28-4 Common patterns of regulation of transcription initia- 
tion. Two types of negative regulation are illustrated, (a) Repressor 
(pink) binds to the operator in the absence of the molecular signal; the 
external signal causes dissociation of the repressor to permit transcrip- 
tion, (b) Repressor binds in the presence of the signal; the repressor 
dissociates and transcription ensues when the signal is removed. Posi- 
tive regulation is mediated by gene activators, Again, two types are 
shown, (c) Activator (green) binds in the absence of the molecular 




mRNA 



signal and transcription proceeds; when the signal is added, the activa- 
tor dissociates and transcription is inhibited, (d) Activator binds in the 
presence of the signal; it dissociates only when the signal is removed. 
Note that "positive" and "negative" regulation refer to the type of 
regulatory protein involved: the bound protein either facilitates or in- 
hibits transcription. In either case, addition of the molecular signal 
may increase or decrease transcription, depending on its effect on the 
regulatory protein. 



Activators provide a molecular counterpoint to re- 
pressors; they bind to DNA and enhance the activity of 
RNA polymerase at a promoter; this is positive regula- 
tion. Activator-binding sites are often adjacent to pro- 
moters that are bound weakly or not at all by RNA 
polymerase alone, such that little transcription occurs in 
the absence of the activator. Some eukaryotic activators 
bind to DNA sites, called enhancers, that are quite dis- 
tant from the promoter, affecting the rate of transcrip- 
tion at a promoter that may be located thousands of 
base pairs away. Some activators are usually bound to 
DNA, enhancing transcription until dissociation of the 
activator is triggered by the binding of a signal molecule 
(Fig. 28-4c). In other cases the activator binds to 
DNA only after interaction with a signal molecule 
(Fig. 28-4d). Signal molecules can therefore increase or 
decrease transcription, depending on how they affect 



the activator. Positive regulation is particularly common 
in eukaryotes, as we shall see. 

Many Bacterial Genes Are Clustered and 
Regulated in Operons 

Bacteria have a simple general mechanism for coordi- 
nating the regulation of genes encoding products that 
participate in a set of related processes: these genes are 
clustered on the chromosome and are transcribed 
together. Many bacterial mRNAs are polycistronic — 
multiple genes on a single transcript — and the single 
promoter that initiates transcription of the cluster is the 
site of regulation for expression of all the genes in the 
cluster. The gene cluster and promoter, plus additional 
sequences that function together in regulation, are 
called an operon (Fig. 28-5). Operons that include 
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FIGURE 28-5 Representative bacterial operon. Genes A, B, and C are 

transcribed on one polycistronic mRNA. Typical regulatory sequences 
include binding sites for proteins that either activate or repress tran- 
scription from the promoter. 



two to six genes transcribed as a unit are common; some 
operons contain 20 or more genes. 

Many of the principles of bacterial gene expression 
were first defined by studies of lactose metabolism in 
E. coli, which can use lactose as its sole carbon source. 
In I960, Frangois Jacob and Jacques Monod published a 
short paper in the Proceedings of the French Academy 
of Sciences that described how two adjacent genes 
involved in lactose metabolism were coordinately 
regulated by a genetic element located at one end of the 
gene cluster. The genes were those for /3-galactosidase, 
which cleaves lactose to galactose and glucose, and 
galactoside permease (lactose permease, p. 402), which 
transports lactose into the cell (Fig. 28-6). The terms 
"operon" and "operator" were first introduced in this pa- 
per. With the operon model, gene regulation could, for 
the first time, be considered in molecular terms. 

The lac Operon Is Subject to Negative Regulation 

The lactose {lac) operon (Fig. 28-7a) includes the 
genes for /3-galactosidase (Z), galactoside permease 
(F), and thiogalactoside transacetylase (A). The last of 
these enzymes seems to modify toxic galactosides to fa- 
cilitate their removal from the cell. Each of the three 
genes is preceded by a ribosome binding site (not shown 
in Fig. 28-7) that independently directs the translation 
of that gene (Chapter 27). Regulation of the lac operon 
by the lac repressor protein (Lac) follows the pattern 
outlined in Figure 28-4a. 

The study of lac operon mutants has revealed some 
details of the workings of the operon's regulatory sys- 
tem. In the absence of lactose, the lac operon genes are 
repressed. Mutations in the operator or in another gene, 
the / gene, result in constitutive synthesis of the gene 
products. When the / gene is defective, repression can 
be restored by introducing a functional / gene into the 
cell on another DNA molecule, demonstrating that the 
/ gene encodes a diffusible molecule that causes gene 
repression. This molecule proved to be a protein, 
now called the Lac repressor, a tetramer of identical 
monomers. The operator to which it binds most tightly 
(00 abuts the transcription start site (Fig. 28-7a). The 
/ gene is transcribed from its own promoter (P r ) inde- 
pendent of the lac operon genes, The lac operon has 
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FIGURE 28-6 Lactose metabolism in f. coli. Uptake and metabolism 
of lactose require the activities of galactoside (lactose) permease and 
/3-galactosidase. Conversion of lactose to allolactose by transglycosyl- 
ation is a minor reaction also catalyzed by /3-galactosidase. 
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FIGURE 28-7 The lac operon. (a) The lac operon in the repressed state. 
The / gene encodes the Lac repressor. The lac Z, Y f and A genes en- 
code )8-galactosidase, galactoside permease, and thiogalactoside 
transacetylase, respectively. P is the promoter for the lac genes, and P| 
is the promoter for the / gene. is the main operator for the lac 
operon; 0 2 and 0 3 are secondary operator sites of lesser affinity for the 
Lac repressor. (b)The Lac repressor binds to the main operator and 0 2 
or 0 3 , apparently forming a loop in the DNA that might wrap around 
the repressor as shown, (c) Lac repressor bound to DNA (derived from 




(b) 




PDB ID 1 LBG). This shows the protein (gray) bound to short, discon- 
tinuous segments of DNA (blue), (d) Conformational change in the Lac 
repressor caused by binding of the artificial inducer isopropylthiogalac- 
toside, IPTG (derived from PDB ID 1 LBH and 1 LBG). The structure of 
the tetrameric repressor is shown without IPTG bound (transparent im- 
age) and with IPTG bound (overlaid solid image; IPTG not shown). The 
DNA bound when IPTG is absent (transparent structure) is not shown. 
When IPTG is bound and DNA is not bo., id, the repressor's DNA-bind- 
ing domains are too ^' ,i jred to be defined in the crystal structure. 



two secondary binding sites for the Lac represser. Gn^ 
(0 2 ) is centered near position +410, within the gene 
encoding /3-galactosidase (Z)\ the other (0 3 ) is near 
position -90, within the / gene. To repress the operon, 
the Lac repressor seems to bind to both the main operator 
and one of the two secondary sites, with the intervening 
DNA looped out (Fig. 28-7b, c). Either binding arrange- 
ment blocks transcription initiation. 

Despite this elaborate binding complex, repression 
is not absolute. Binding of the Lac repressor reduces the 
rate of transcription initiation by a factor of 10 3 . If the 0 2 
and 0 3 sites are eliminated by deletion or mutation, the 
binding of repressor to O x alone reduces transcription 
by a factor of about 10 2 . Even in the repressed state, 
each cell has a few molecules of jS-galactosidase and 
galactoside permease, presumably synthesized on the 
rare occasions when the repressor transiently dissoci- 



ates from the operators. This basal level of transcription 
is essential to operon regulation. 

When cells are provided with lactose, the lac operon 
is induced. An inducer (signal) molecule binds to a spe- 
cific site on the Lac repressor, causing a conformational 
change (Fig. 28-7d) that results in dissociation of the re- 
pressor from the operator. The inducer in the lac operon 
system is not lactose itself but allolactose, an isomer of 
lactose (Fig. 28-6). After entry into the E. coli cell (via 
the few existing molecules of permease), lactose is con- 
verted to allolactose by one of the few existing /?-galac- 
tosidase molecules. Release of the operator by Lac 
repressor, triggered as the repressor binds to allolactose, 
allows expression of the lac operon genes and leads to a 
10 3 -fold increase in the concentration of /3-galactosidase. 

Several jS-galactosides structurally related to allolac- 
tose are inducers of the lac operon but are not substrates 
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FIGURE 28-8 Groups in DNA available for protein binding. Shown 
here are functional groups on all four base pairs that are displayed in 
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the major and minor grooves of DNA (see Fig. 8-13). Groups that can 
be used for base-pair recognition by proteins are shown in red. 



for /3-galactosidase; others are substrates but not induc- 
ers. One particularly effective and nonmetabolizable in- 
ducer of the lac operon that is often used experimentally 
is isopropylthiogalactoside (IPTG): 



CH 2 OH 




H OH 

Isopropyl-/?-D-thiogalactoside 
(IPTG) 

An inducer that cannot be metabolized allows researchers 
to explore the physiological function of lactose as a carbon 
source for growth, separate from its function in the regu- 
lation of gene expression. 

In addition to the multitude of operons now known 
in bacteria, a few polycistronic operons have been found 
in the cells of lower eukaryotes. In the cells of higher eu- 
karyotes, however, almost all protein-encoding genes 
are transcribed separately. 

The mechanisms by which operons are regulated 
can vary significantly from the simple model presented 
in Figure 28-7. Even the lac operon is more complex 
than indicated here, with an activator also contributing 
to the overall scheme, as we shall see in Section 28.2. 
Before any further discussion of the layers of regulation 
of gene expression, however, we examine the critical 
molecular interactions between DNA-binding proteins 
(such as repressors and activators) and the DNA 
sequences to which they bind. 

Regulatory Proteins Have Discrete DNA-Binding Domains 

Regulatory proteins generally bind to specific DNA se- 
quences. Their affinity for these target sequences is 
roughly 10 4 to 10 6 times higher than their affinity for any 
other DNA sequence. Most regulatory proteins have dis- 
crete DNA-binding domains containing substructures 
that interact closely and specifically with the DNA. 
These binding domains usually include one or more of a 
relatively small group of recognizable and characteristic 
structural motifs. 



To bind specifically to DNA sequences, regulatory 
proteins must recognize surface features on the DNA. 
Most of the chemical groups that differ among the four 
bases and thus permit discrimination between base 
pairs are hydrogen-bond donor and acceptor groups ex- 
posed in the major groove of DNA (Fig. 28-8), and 
most of the protein-DNA contacts that impart specificity 
are hydrogen bonds. A notable exception is the nonpo- 
lar surface near C-5 of pyrimidines, where thymine is 
readily distinguished from cytosine by its protruding 
methyl group. Protein-DNA contacts are also possible in 
the minor groove of the DNA, but the hydrogen-bonding 
patterns here generally do not allow ready discrimina- 
tion between base pairs. 

Within regulatory proteins, the amino acid side 
chains most often hydrogen-bonding to bases in the 
DNA are those of Asn, Gin, Glu, Lys, and Arg residues. Is 
there a simple recognition code in which a particular 
amino acid always pairs with a particular base? The two 
hydrogen bonds that can form between Gin or Asn and 
the N 6 and N-7 positions of adenine cannot form with 
any other base. And an Arg residue can form two hy- 
drogen bonds with N-7 and O 6 of guanine (Fig. 28-9). 
Examination of the structure of many DNA-binding 
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FIGURE 28-9 Two examples of specific amino acid residue-base pair 
interactions that have been observed in DNA-protein binding. 
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FIGURE 28-10 Relationship between the lac operator sequence and the lac promoter. The bases 
shaded beige exhibit twofold (palindromic) symmetry about the axis indicated by the dashed vertical line. 



proteins, however, has shown that a protein can recognize 
each base pair in more than one way, leading to the con- 
clusion that there is no simple amino acid-base code. 
For some proteins, the Gln-adenine interaction can specify 
A=T base pairs, but in others a van der Waals pocket for 
the methyl group of thymine can recognize A=T base 
pairs. Researchers cannot yet examine the structure of 
a DNA-binding protein and infer the DNA sequence to 
which it binds. 

To interact with bases in the major groove of DNA, a 
protein requires a relatively small substructure that can 
stably protrude from the protein surface. The DNA- 
binding domains of regulatory proteins tend to be small 
(60 to 90 amino acid residues), and the structural motifs 
within these domains that are actually in contact with 
the DNA are smaller still. Many small proteins are unsta- 
ble because of their limited capacity to form layers of 
structure to bury hydrophobic groups (p. 114). The 
DNA-binding motifs provide either a very compact sta- 
ble structure or a way of allowing a segment of protein 
to protrude from the protein surface. 

The DNA-binding sites for regulatory proteins are 
often inverted repeats of a short DNA sequence (a palin- 
drome) at which multiple (usually two) subunits of a reg- 
ulatory protein bind cooperatively. The Lac repressor is 
unusual in that it functions as a tetramer, with two 
dimers tethered together at the end distant from the 
DNA-binding sites (Fig. 28-7b). An E. coli cell usually 
contains about 20 tetramers of the Lac repressor. Each of 
the tethered dimers separately binds to a palindromic 
operator sequence, in contact with 17 bp of a 22 bp re- 
gion in the lac operon (Fig. 28-10). And each of the 
tethered dimers can independently bind to an operator 
sequence, with one generally binding to 0 : and the other 
to 0 2 or 0 3 (as in Fig. 28-7b). The symmetry of the Oi 
operator sequence corresponds to the twofold axis of 
symmetry of two paired Lac repressor subunits. The 
tetrameric Lac repressor binds to its operator sequences 
in vivo with an estimated dissociation constant of about 
10~ 10 m. The repressor discriminates between the oper- 
ators and other sequences by a factor of about 10 6 , so 
binding to these few base pairs among the 4.6 million or 
so of the E. coli chromosome is highly specific. 

Several DNA-binding motifs have been described, 
but here we focus on two that play prominent roles in 
the binding of DNA by regulatory proteins: the helix- 



turn-helix and the zinc finger. We also consider a type 
of DNA-binding domain — the homeodomain — found in 
some eukaryotic proteins. 

Helix-Turn-Helix This DNA-binding motif is crucial 
to the interaction of many bacterial regulatory proteins 
with DNA, and similar motifs occur in some eukaryotic 
regulatory proteins. The helix-turn-helix motif com- 
prises about 20 amino acids in two short a-helical 
segments, each seven to nine amino acid residues long, 
separated by a f3 turn (Fig. 28-11). This structure gen- 
erally is not stable by itself; it is simply the reactive por- 
tion of a somewhat larger DNA-binding domain. One of 
the two a-helical segments is called the recognition 
helix, because it usually contains many of the amino 
acids that interact with the DNA in a sequence-specific 
way. This a helix is stacked on other segments of the 
protein structure so that it protrudes from the protein 
surface. When bound to DNA, the recognition helix is 
positioned in or nearly in the major groove. The Lac 
repressor has this DNA-binding motif (Fig. 28-11). 

Zinc Finger In a zinc finger, about 30 amino acid 
residues form an elongated loop held together at the 
base by a single Zn 2+ ion, which is coordinated to four of 
the residues (four Cys, or two Cys and two His). The 
zinc does not itself interact with DNA; rather, the coor- 
dination of zinc with the amino acid residues stabilizes 
this small structural motif. Several hydrophobic side 
chains in the core of the structure also lend stability. 
Figure 28-12 shows the interaction between DNA and 
three zinc fingers of a single polypeptide from the mouse 
regulatory protein Zif268. 

Many eukaryotic DNA-binding proteins contain zinc 
fingers. The interaction of a single zinc finger with DNA 
is typically weak, and many DNA-binding proteins, like 
Zif268, have multiple zinc fingers that substantially en- 
hance binding by interacting simultaneously with the 
DNA. One DNA-binding protein of the frog Xenopus has 
37 zinc fingers. There are few known examples of the 
zinc finger motif in bacterial proteins. 

The precise manner in which proteins with zinc fin- 
gers bind to DNA differs from one protein to the next. 
Some zinc fingers contain the amino acid residues that 
are important in sequence discrimination, whereas oth- 
ers seem to bind DNA nonspecifically (the amino acids 
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FIGURE 28-11 Helix-turn-helix, (a) DNA-binding domain of the Lac repressor 
(PDB ID 1LCC). The helix-turn-helix motif is shown in red and orange; the DNA 
recognition helix is red. (b) Entire Lac repressor (derived from PDB ID 1LBG). The 
DNA-binding domains are gray, and the a helices involved in tetramerization are 
red. The remainder of the protein (shades of green) has the binding sites for allolac- 
tose. The allolactose-binding domains are linked to the DNA-binding domains 
through linker helices (yellow), (c) Surface rendering of the DNA-binding domain of 
the Lac repressor (gray) bound to DNA (blue), (d) The same DNA-binding domain as 
in (c), but separated from the DNA, with the binding interaction surfaces shown. 
Some groups on the protein and DNA that interact through hydrogen-bonding are 
shown in red; some groups that interact through hydrophobic interactions are in or- 
ange. This model shows only a few of the groups involved in sequence recognition. 
The complementary nature of the two surfaces is evident. 



required for specificity are located elsewhere in the 
protein). Zinc fingers can also function as RNA-binding 
motifs — for example, in certain proteins that bind eu- 
karyotic mRNAs and act as translational repressors. We 
discuss this role later (Section 28.3). 




Homeodomain Another type of DNA-binding domain 
has been identified in some proteins that function as 



FIGURE 28-12 Zinc fingers. Three zinc fingers (gray) of the regulatory 
protein Zif268 / complexed with DNA (blue and white) (PDB ID 1A1L). 
Each Zn 2+ (maroon) coordinates with two His and two Cys residues 
(not shown). 
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transcriptional regulators, especially during eukaryotic 
development. This domain of 60 amino acids — called 
the homeodomain, because it was discovered in 
homeotic genes (genes that regulate the development of 
body patterns) — is highly conserved and has now been 
identified in proteins from a wide variety of organisms, 
including humans (Fig. 28-13). The DNA-binding seg- 
ment of the domain is related to the helix-turn-helix mo- 
tif. The DNA sequence that encodes this domain is 
known as the homeobox. 

Regulatory Proteins Also Have Protein-Protein 
Interaction Domains 

Regulatory proteins contain domains not only for DNA 
binding but also for protein-protein interactions — with 
RNA polymerase, other regulatory proteins, or other 
subunits of the same regulatory protein. Examples in- 
clude many eukaryotic transcription factors that func- 
tion as gene activators, which often bind as dimers to 
the DNA, through DNA-binding domains that contain 
zinc fingers. Some structural domains are devoted to the 
interactions required for dimer formation, which is gen- 
erally a prerequisite for DNA binding. Like DNA-binding 
motifs, the structural motifs that mediate protein-protein 
interactions tend to fall within one of a few common 
categories. Two important examples are the leucine 
zipper and the basic helix-loop-helix. Structural 
motifs such as these are the basis for classifying some 
regulatory proteins into structural families. 




FIGURE 28-13 Homeodomain. Shown here is a homeodomain bound 
to DNA; one of the a helices (red), stacked on two others, can be seen 
protruding into the major groove (PDB ID 1B8I). This is only a small 
part of the much larger protein Ultrabithorax (Ubx), active in the regu- 
lation of development in fruit flies (see Section 28.3). 

Leucine Zipper This motif is an amphipathic a helix 
with a series of hydrophobic amino acid residues 
concentrated on one side (Fig. 28-14), with the hy- 
drophobic surface forming the area of contact between 
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FIGURE 28-14 Leucine zippers, (a) Comparison of amino acid se- 
quences of several leucine zipper proteins. Note the Leu (L) residues at 
every seventh position in the zipper region, and the number of Lys (K) 
and Arg (R) residues in the DNA-binding region, (b) Leucine zipper 
from the yeast activator protein GCN4 (PDB ID 1 YSA). Only the 
"zippered" a helices (gray and light blue), derived from different sub- 
units of the dimeric protein, are shown. The two helices wrap around 
each other in a gently coiled coil. The interacting Leu residues are 
shown in red. 
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the two polypeptides of a dimer. A striking feature of 
these a helices is the occurrence of Leu residues at every 
seventh position, forming a straight line along the hy- 
drophobic surface. Although researchers initially thought 
the Leu residues interdigitated (hence the name "zip- 
per"), we now know that they line up side by side as the 
interacting a helices coil around each other (forming a 
coiled coil; Fig. 28-1 4b). Regulatory proteins with leucine 
zippers often have a separate DNA-binding domain with a 
high concentration of basic (Lys or Arg) residues that can 
interact with the negatively charged phosphates of the 
DNA backbone. Leucine zippers have been found in many 
eukaryotic and a few bacterial proteins. 

Basic Helix-Loop-Helix Another common structural 
motif occurs in some eukaryotic regulatory proteins 
implicated in the control of gene expression during the 
development of multicellular organisms. These proteins 
share a conserved region of about 50 amino acid 
residues important in both DNA binding and protein 
dimerization. This region can form two short amphi- 
pathic a helices linked by a loop of variable length, the 
helix-loop-helix (distinct from the helix-turn-helix motif 
associated with DNA binding). The helix-loop-helix 
motifs of two polypeptides interact to form dimers 
(Fig. 28-15). In these proteins, DNA binding is mediated 
by an adjacent short amino acid sequence rich in basic 
residues, similar to the separate DNA-binding region in 
proteins containing leucine zippers. 



i 




FIGURE 28-15 Helix-loop-helix. The human transcription factor Max, 
bound to its DNA target site (PDB ID 1HLO). The protein is dimeric; 
one subunit is colored. The DNA-binding segment (pink) merges with 
the first helix of the helix-loop-helix (red). The second helix merges 
with the carboxyl-terminal end of the subunit (purple). Interaction of 
the carboxyl-terminal helices of the two subunits describes a coiled 
coil very similar to that of a leucine zipper (see Fig, 28-1 4b), but with 
only one pair of interacting Leu residues (red side chains near the top) 
in this particular example. The overall structure is sometimes called a 
helix-loop-helix/leucine zipper motif. 



Subunit Mixing in Eukaryotic Regulatory Proteins 

Several families of eukaryotic transcription factors have 
been defined based on close structural similarities. 
Within each family, dimers can sometimes form between 
two identical proteins (a homodimer) or between two 
different members of the family (a heterodimer). A hy- 
pothetical family of four different leucine-zipper pro- 
teins could thus form up to 10 different dimeric species. 
In many cases, the different combinations seem to have 
distinct regulatory and functional properties. 

In addition to structural domains devoted to DNA bind- 
ing and dimerization (or oligomerization), many regulatory 
proteins must interact with RNA polymerase, with unre- 
lated regulatory proteins, or with both. At least three types 
of additional domains for protein-protein interaction have 
been characterized (primarily in eukaiyotes): glutamine- 
rich, proline-rich, and acidic domains, the names reflecting 
the amino acid residues that are especially abundant. 

Protein-DNA binding interactions are the basis of 
the intricate regulatory circuits fundamental to gene 
function. We now turn to a closer examination of these 
gene regulatory schemes, first in bacterial, then in 
eukaryotic systems. 

SUMMARY 28.1 Principles of Gene 
Regulation 

■ The expression of genes is regulated by processes 
that affect the rates at which gene products are 
synthesized and degraded. Much of this regulation 
occurs at the level of transcription initiation, 
mediated by regulatory proteins that either repress 
transcription (negative regulation) or activate 
transcription (positive regulation) at specific 
promoters. 

■ In bacteria, genes that encode products with 
interdependent functions are often clustered in an 
operon, a single transcriptional unit. Transcription 
of the genes is generally blocked by binding of a 
specific repressor protein at a DNA site called an 
operator. Dissociation of the repressor from the 
operator is mediated by a specific small molecule, 
an inducer. These principles were first elucidated 
in studies of the lactose (lac) operon. The Lac 
repressor dissociates from the lac operator when 
the repressor binds to its inducer, allolactose. 

■ Regulatory proteins are DNA-binding proteins that 
recognize specific DNA sequences; most have 
distinct DNA-binding domains. Within these 
domains, common structural motifs that bind 
DNA are the helix-turn-helix, zinc finger, and 
homeodomain. 

■ Regulatory proteins also contain domains for 
protein-protein interactions, including the leucine 
zipper and helix-loop-helix, which are involved 

in dimerization, and other motifs involved in 
activation of transcription. 
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28.2 Regulation of Gene Expression 
in Bacteria 

As in many other areas of biochemical investigation, the 
study of the regulation of gene expression advanced 
earlier and faster in bacteria than in other experimental 
organisms. The examples of bacterial gene regulation 
presented here are chosen from among scores of well- 
studied systems, partly for their historical significance, 
but primarily because they provide a good overview of 
the range of regulatory mechanisms in bacteria. Many of 
the principles of bacterial gene regulation are also 
relevant to understanding gene expression in eukary- 
otic cells. 

We begin by examining the lactose and tryptophan 
operons; each system has regulatory proteins, but the 
overall mechanisms of regulation are very different. This 
is followed by a short discussion of the SOS response in 
E. coli, illustrating how genes scattered throughout the 
genome can be coordinately regulated. We then de- 
scribe two bacterial systems of quite different types, il- 
lustrating the diversity of gene regulatory mechanisms: 
regulation of ribosomal protein synthesis at the level of 
translation, with many of the regulatory proteins bind- 
ing to RNA (rather than DNA), and regulation of the 
process of "phase variation" in Salmonella, which re- 
sults from genetic recombination. Finally, we examine 
some additional examples of posttranscriptional regula- 
tion in which the RNA modulates its own function. 

The fac Operon Undergoes Positive Regulation 

The operator-repressor-inducer interactions described 
earlier for the lac operon (Fig. 28-7) provide an intu- 
itively satisfying model for an on/off switch in the regu- 
lation of gene expression. In truth, operon regulation is 
rarely so simple. A bacterium's environment is too com- 
plex for its genes to be controlled by one signal. Other 
factors besides lactose affect the expression of the lac 
genes, such as the availability of glucose. Glucose, me- 
tabolized directly by glycolysis, is the preferred energy 
source in E. coli. Other sugars can serve as the main or 
sole nutrient, but extra steps are required to prepare 
them for entry into glycolysis, necessitating the synthe- 
sis of additional enzymes. Clearly, expressing the genes 
for proteins that metabolize sugars such as lactose or 
arabinose is wasteful when glucose is abundant. 

What happens to the expression of the lac operon 
when both glucose and lactose are present? A regulatory 
mechanism known as catabolite repression restricts 
expression of the genes required for catabolism of lactose, 
arabinose, and other sugars in the presence of glucose, 
even when these secondary sugars are also present. The 
effect of glucose is mediated by cAMP, as a coactivator, 
and an activator protein known as cAMP receptor pro- 
tein, or CRP (the protein is sometimes called CAP, for 
catabolite gene activator protein). CRP is a homoclimer 




FIGURE 28-16 CRP homodimer. (PDB ID 1 RUN) Bound molecules of 
cAMP are shown in red. Note the bending of the DNA around the pro- 
tein. The region that interacts with RNA polymerase is shaded yellow. 



(subunit M r 22,000) with binding sites for DNA and 
cAMP. Binding is mediated by a helix-turn-helix motif in 
the protein's DNA-binding domain (Fig. 28-16). When 
glucose is absent, CRP-cAMP binds to a site near the lac 
promoter (Fig. 28-1 7a) and stimulates RNA transcrip- 
tion 50-fold. CRP-cAMP is therefore a positive regulatory 
element responsive to glucose levels, whereas the Lac re- 
pressor is a negative regulatory element responsive to 
lactose. The two act in concert. CRP-cAMP has little ef- 
fect on the lac operon when the Lac repressor is blocking 
transcription, and dissociation of the repressor from the 
lac operator has little effect on transcription of the lac 
operon unless CRP-cAMP is present to facilitate tran- 
scription; when CRP is not bound, the wild-type lac pro- 
moter is a relatively weak promoter (Fig. 28-1 7b). The 
open complex of RNA polymerase and the promoter (see 
Fig. 26-6) does not form readily unless CRP-cAMP is 
present. CRP interacts directly with RNA polymerase (at 
the region shown in Fig. 28-16) through the polymerase's 
a subunit. 

The effect of glucose on CRP is mediated by the 
cAMP interaction (Fig. 28-18). CRP binds to DNA 
most avidly when cAMP concentrations are high. In the 
presence of glucose, the synthesis of cAMP is inhibited 
and efflux of cAMP from the cell is stimulated. As 
[cAMP] declines, CRP binding to DNA declines, thereby 
decreasing the expression of the lac operon. Strong 
induction of the lac operon therefore requires both lac- 
tose (to inactivate the lac repressor) and a lowered con- 
centration of glucose (to trigger an increase in [cAMP] 
and increased binding of cAMP to CRP). 

CRP and cAMP are involved in the coordinated reg- 
ulation of many operons, primarily those that encode 
enzymes for the metabolism of secondary sugars such 
as lactose and arabinose. A network of operons with a 
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FIGURE 28-17 Activation of transcription of the lac operon by CRP. (a) The binding site for CRP-cAMP is 
near the promoter. As in the case of the lac operator, the CRP site has twofold symmetry (bases shaded 
beige) about the axis indicated by the dashed line, (b) Sequence of the lac promoter compared with the 
promoter consensus sequence. The differences mean that RNA polymerase binds relatively weakly to the 
lac promoter until the polymerase is activated by CRP-cAMP. 
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FIGURE 28-18 Combined effects of glucose and lactose on expression of the lac operon. (a) High levels 
of transcription take place only when glucose concentrations are low (so [cAMP] is high and CRP-cAMP 
is bound) and lactose concentrations are high (so the Lac repressor is not bound), (b) Without bound acti- 
vator (CRP-cAMP), the lac promoter is poorly transcribed even when lactose concentrations are high and 
the Lac repressor is not bound. 



common regulator is called a regulon. This arrange- 
ment, which allows for coordinated shifts in cellular 
functions that can require the action of hundreds of 
genes, is a major theme in the regulated expression of 
dispersed networks of genes in eukaryotes. Other bacte- 
rial regulons include the heat-shock gene system that 
responds to changes in temperature (p. 1028) and the 
genes induced in E. coli as part of the SOS response to 
DNA damage, described later. 

Many Genes for Amino Acid Biosynthetic Enzymes Are 
Regulated by Transcription Attenuation 

The 20 common amino acids are required in large 
amounts for protein synthesis, and E. coli can synthesize 



all of them. The genes for the enzymes needed to 
synthesize a given amino acid are generally clustered in 
an operon and are expressed whenever existing sup- 
plies of that amino acid are inadequate for cellular 
requirements. When the amino acid is abundant, the 
biosynthetic enzymes are not needed and the operon is 
repressed. 

The E. coli tryptophan {trp) operon (Fig. 28-19) 
includes five genes for the enzymes required to convert 
chorismate to tryptophan. Note that two of the enzymes 
catalyze more than one step in the pathway. The mRNA 
from the trp operon has a half-life of only about 3 min, 
allowing the cell to respond rapidly to changing needs 
for this amino acid. The Trp repressor is a homo- 
dimer, each subunit containing 107 amino acid residues 
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FIGURE 28-19 The frp operon. This operon is regulated by two mech- 
anisms: when tryptophan levels are high, (1) the repressor (upper left) 
binds to its operator and (2) transcription of trp mRNA is attenuated 
(see Fig. 28-21). The biosynthesis of tryptophan by the enzymes 
encoded in the trp operon is diagrammed at the bottom (see also 
Fig. 22-17) 
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(Fig. 28-20). When tryptophan is abundant it binds to 
the Trp repressor, causing a conformational change that 
permits the repressor to bind to the trp operator and in- 
hibit expression of the trp operon. The trp operator site 




FIGURE 28-20 Trp repressor. The repressor is a dimer, with both sub- 
units (gray and light blue) binding the DNA at helix-turn-helix motifs 
(PDB ID 1TRO). Bound molecules of tryptophan are in red. 



overlaps the promoter, so binding of the repressor 
blocks binding of RNA polymerase. 

Once again, this simple on/off circuit mediated by a 
repressor is not the entire regulatory story. Different 
cellular concentrations of tryptophan can vary the 
rate of synthesis of the biosynthetic enzymes over a 
700-fold range. Once repression is lifted and transcrip- 
tion begins, the rate of transcription is fine-tuned by a 
second regulatory process, called transcription atten- 
uation, in which transcription is initiated normally but 
is abruptly halted before the operon genes are tran- 
scribed. The frequency with which transcription is at- 
tenuated is regulated by the availability of tryptophan 
and relies on the very close coupling of transcription 
and translation in bacteria. 

The trp operon attenuation mechanism uses signals 
encoded in four sequences within a 162 nucleotide 
leader region at the 5' end of the mRNA, preceding the 
initiation codon of the first gene (Fig. 28-21a). Within 
the leader lies a region known as the attenuator, made 
up of sequences 3 and 4. These sequences base-pair to 
form a G=C-rich stem-and-loop structure closely followed 
by a series of U residues. The attenuator structure acts as 
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When tryptophan levels are high, the ribosome quickly translates 
sequence 1 (open reading frame encoding leader peptide) and blocks 
sequence 2 before sequence 3 is transcribed. Continued transcription 
leads to attenuation at the terminator-like attenuator structure 
formed by sequences 3 and 4. 



Incomplete 
leader peptide 




When tryptophan levels are low, the ribosome pauses at the 
Trp codons in sequence 1, Formation of the paired structure 
between sequences 2 and 3 prevents attenuation, because 
sequence 3 is no longer available to form the attenuator 
structure with sequence 4. The 2:3 structure, unlike the 
3:4 attenuator, does not prevent transcription. 



FIGURE 28-21 Transcriptional attenuation in the trp operon. Tran- 
scription is initiated at the beginning of the 162 nucleotide mRNA 
leader encoded by a DNA region called trpL (see Fig. 28-1 9). A regu- 
latory mechanism determines whether transcription is attenuated at 
the end of the leader or continues into the structural genes, (a) The trp 
mRNA leader {trpL). The attenuation mechanism in the trp operon in- 
volves sequences 1 to 4 (highlighted), (b) Sequence 1 encodes a small 
peptide, the leader peptide, containing two Trp residues (W); it is trans- 
lated immediately after transcription begins. Sequences 2 and 3 are 
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complementary, as are sequences 3 and 4. The attenuator structure 
forms by the pairing of sequences 3 and 4 (top). Its structure and func- 
tion are similar to those of a transcription terminator (see Fig. 26-8). 
Fairing of sequences 2 and 3 (bottom) prevents the attenuator structure 
from forming. Note that the leader peptide has no other cellular func- 
tion. Translation of its open reading frame has a purely regulatory role 
that determines which complementary sequences (2 and 3 or 3 and 4) 
are paired, (c) Base-pairing schemes for the complementary regions of 
the trp mRNA leader. 
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a transcription terminator (Fig. 28-2 lb). Sequence 2 is 
an alternative complement for sequence 3 (Fig. 28-2 lc). 
If sequences 2 and 3 base-pair, the attenuator structure 
cannot form and transcription continues into the trp 
biosynthetic genes; the loop formed by the pairing of se- 
quences 2 and 3 does not obstruct transcription. 

Regulatory sequence 1 is crucial for a tryptophan- 
sensitive mechanism that determines whether sequence 
3 pairs with sequence 2 (allowing transcription to con- 
tinue) or with sequence 4 (attenuating transcription). 
Formation of the attenuator stem-and-loop structure 
depends on events that occur during translation of reg- 
ulatory sequence 1, which encodes a leader peptide (so 
called because it is encoded by the leader region of 
the mRNA) of 14 amino acids, two of which are Trp 
residues. The leader peptide has no other known cellu- 
lar function; its synthesis is simply an operon regulatory 
device. This peptide is translated immediately after it is 
transcribed, by a ribosome that follows closely behind 
RNA polymerase as transcription proceeds. 

When tryptophan concentrations are high, concen- 
trations of charged tryptophan tRNA (Trp-tRNA^) are 
also high. This allows translation to proceed rapidly past 
the two Trp codons of sequence 1 and into sequence 2, 
before sequence 3 is synthesized by RNA polymerase. In 
this situation, sequence 2 is covered by the ribosome and 
unavailable for pairing to sequence 3 when sequence 3 is 
synthesized; the attenuator structure (sequences 3 and 
4) forms and transcription halts (Fig. 28-2 lb, top). When 
tryptophan concentrations are low, however, the ribo- 
some stalls at the two Trp codons in sequence 1, because 



charged tRNA 1 ^ is less available. Sequence 2 remains 
free while sequence 3 is synthesized, allowing these two 
sequences to base-pair and permitting transcription to 
proceed (Fig. 28-21b, bottom). In this way, the propor- 
tion of transcripts that are attenuated declines as trypto- 
phan concentration declines. 

Many other amino acid biosynthetic operons use a sim- 
ilar attenuation strategy to fine-tune biosynthetic enzymes 
to meet the prevailing cellular requirements. The 15 amino 
acid leader peptide produced by the phe operon contains 
seven Phe residues. The leu operon leader peptide has four 
contiguous Leu residues. The leader peptide for the his 
operon contains seven contiguous His residues. In fact, in 
the his operon and a number of others, attenuation is suffi- 
ciently sensitive to be the only regulatory mechanism. 

Induction of the SOS Response Requires Destruction 
of Repressor Proteins 

Extensive DNA damage in the bacterial chromosome 
triggers the induction of many distantly located genes. 
This response, called the SOS response (p. 1001), pro- 
vides another good example of coordinated gene regula- 
tion. Many of the induced genes are involved in DNA 
repair (see Table 25-6). The key regulatory proteins are 
the RecA protein and the LexA repressor. 

The LexA repressor (M r 22,700) inhibits transcrip- 
tion of all the SOS genes (Fig. 28-22), and induction of 
the SOS response requires removal of LexA. This is not a 
simple dissociation from DNA in response to binding of a 
small molecule, as in the regulation of the lac operon 



FIGURE 28-22 SOS response in E. coli. See Table 25-6 
for the functions of many of these proteins. The LexA 
protein is the repressor in this system, which has an op- 
erator site (red) near each gene. Because the recA gene 
is not entirely repressed by the LexA repressor, the nor- 
mal cell contains about 1,000 RecA monomers. © 
When DNA is extensively damaged (such as by UV 
light), DNA replication is halted and the number of sin- 
gle-strand gaps in the DNA increases. © RecA protein 
binds to this damaged, single-stranded DNA, activating 
the protein's coprotease activity. © While bound to 
DNA, the RecA protein facilitates cleavage and inacti- 
vation of the LexA repressor When the repressor is in- 
activated, the SOS genes, including recA, are induced; 
RecA levels increase 50- to 100-fold. 
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described above. Instead, the LexA repressor is inacti- 
vated when it catalyzes its own cleavage at a specific 
Ala-Gly peptide bond, producing two roughly equal pro- 
tein fragments. At physiological pH, this autocleavage re- 
action requires the RecA protein. RecA is not a protease 
in the classical sense, but its interaction with LexA facil- 
itates the repressor's self-cleavage reaction. This func- 
tion of RecA is sometimes called a coprotease activity. 

The RecA protein provides the functional link be- 
tween the biological signal (DNA damage) and induc- 
tion of the SOS genes. Heavy DNA damage leads to 
numerous single-strand gaps in the DNA, and only RecA 
that is bound to single-stranded DNA can facilitate 
cleavage of the LexA repressor (Fig. 28-22, bottom). 
Binding of RecA at the gaps eventually activates its co- 
protease activity, leading to cleavage of the LexA re- 
pressor and SOS induction. 

During induction of the SOS response in a severely 
damaged cell, RecA also cleaves and thus inactivates the 
repressors that otherwise allow propagation of certain 
viruses in a dormant lysogenic state within the bacterial 
host. This provides a remarkable illustration of evolu- 
tionary adaptation. These repressors, like LexA, also un- 
dergo self-cleavage at a specific Ala-Gly peptide bond, 
so induction of the SOS response permits replication of 
the virus and lysis of the cell, releasing new viral parti- 
cles. Thus the bacteriophage can make a hasty exit from 
a compromised bacterial host cell. 
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Synthesis of Ribosomal Proteins Is Coordinated 
with rRNA Synthesis 

In bacteria, an increased cellular demand for protein 
synthesis is met by increasing the number of ribosomes 
rather than altering the activity of individual ribosomes. 
In general, the number of ribosomes increases as the 
cellular growth rate increases. At high growth rates, ri- 
bosomes make up approximately 45% of the cell's dry 
weight. The proportion of cellular resources devoted to 
making ribosomes is so large, and the function of ribo- 
somes so important, that cells must coordinate the syn- 
thesis of the ribosomal components: the ribosomal 
proteins (r-proteins) and RNAs (rRNAs). This regula- 
tion is distinct from the mechanisms described so far, 
because it occurs largely at the level of translation. 

The 52 genes that encode the r-proteins occur 
in at least 20 operons, each with 1 to 11 genes. Some 
of these operons also contain the genes for the subunits of 
DNA primase (see Fig. 25-13), RNA polymerase (see Fig. 
26-4), and protein synthesis elongation factors (see 
Fig. 27-28) — revealing the close coupling of replication, 
transcription, and protein synthesis during cell growth. 

The r-protein operons are regulated primarily 
through a translational feedback mechanism. One 
r-protein encoded by each operon also functions as a 
translational repressor, which binds to the mRNA 
transcribed from that operon and blocks translation of 
all the genes the messenger encodes (Fig. 28-23). In 
general, the r-protein that plays the role of repressor 
also binds directly to an rRNA. Each translational re- 
pressor r-protein binds with higher affinity to the appro- 
priate rRNA than to its mRNA, so the mRNA is bound 
and translation repressed only when the level of the r- 
protein exceeds that of the rRNA. This ensures that 
translation of the mRNAs encoding r-proteins is re- 
pressed only when synthesis of these r-proteins exceeds 
that needed to make functional ribosomes. In this way, 
the rate of r-protein synthesis is kept in balance with 
rRNA availability. 

FIGURE 28-23 Translational feedback in some ribosomal protein 
operons. The r-proteins that act as translational repressors are shaded 
pink. Each translational repressor blocks the translation of all genes in 
that operon by binding to the indicated site on the mRNA. Genes that 
encode subunits of RNA polymerase are shaded yellow; genes that en- 
code elongation factors are blue. The r-proteins of the large (50S) ribo- 
somal subunit are designated L1 to L34; those of the small (30S) 
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The mRNA binding site for the translational repres- 
sor is near the translational start site of one of the genes 
in the operon, usually the first gene (Fig. 28-23). In 
other operons this would affect only that one gene, be- 
cause in bacterial polycistronic mRNAs most genes have 
independent translation signals. In the r-protein oper- 
ons, however, the translation of one gene depends on 
the translation of all the others. The mechanism of this 
translational coupling is not yet understood in detail. 
However, in some cases the translation of multiple genes 
seems to be blocked by folding of the mRNA into an 
elaborate three-dimensional structure that is stabilized 
both by internal base-pairing '(as in Fig. 8-23) and by 
binding of the translational repressor protein. When the 
translational repressor is absent, ribosome binding and 
translation of one or more of the genes disrupts the 
folded structure of the mRNA and allows all the genes to 
be translated. 

Because the synthesis of r-proteins is coordinated 
with the available rRNA, the regulation of ribosome pro- 
duction reflects the regulation of rRNA synthesis. In 
E. coli, rRNA synthesis from the seven rRNA operons 
responds to cellular growth rate and to changes in the 
availability of crucial nutrients, particularly amino acids. 
The regulation coordinated with amino acid concentra- 
tions is known as the stringent response (Fig. 28-24). 
When amino acid concentrations are low, rRNA synthe- 




FIGURE 28-24 Stringent response in F. coli. This response to amino 
acid starvation is triggered by binding of an uncharged tRNA in the ri- 
bosomal A site. A protein called stringent factor binds to the ribosome 
and catalyzes the synthesis of pppGpp, which is converted by a phos- 
phohydrolase to ppGpp. The signal ppGpp reduces transcription of 
some genes and increases that of others, in part by binding to the £ 
subunit of RNA polymerase and altering the enzyme's promoter speci- 
ficity. Synthesis of rRNA is reduced when ppGpp levels increase. 



sis is halted. Amino acid starvation leads to the binding 
of uncharged tRNAs to the ribosomal A site; this triggers 
a sequence of events that begins with the binding of an 
enzyme called stringent factor (RelA protein) to the 
ribosome. When bound to the ribosome, stringent factor 
catalyzes formation of the unusual nucleotide guanosine 
tetraphosphate (ppGpp; see Fig. 8-39); it adds py- 
rophosphate to the 3' position of GTP, in the reaction 

GTP + ATP pppGpp + AMP 

then a phosphohydrolase cleaves off one phosphate to 
form ppGpp. The abrupt rise in ppGpp level in response 
to amino acid starvation results in a great reduction in 
rRNA synthesis, mediated at least in part by the binding 
of ppGpp to RNA polymerase. 

The nucleotide ppGpp, along with cAMP, belongs 
to a class of modified nucleotides that act as cellular 
second messengers (p. 298). InE. coli, these two nu- 
cleotides serve as starvation signals; they cause large 
changes in cellular metabolism by increasing or decreas- 
ing the transcription of hundreds of genes. In eukaryotic 
cells, similar nucleotide second messengers also have 
multiple regulatory functions. The coordination of cellu- 
lar metabolism with cell growth is highly complex, and 
further regulatory mechanisms undoubtedly remain to 
be discovered. 

The Function of Some mRNAs Is Regulated by 
Small RNAs in Cis or in Trans 

As described throughout this chapter, proteins play an 
important and well-documented role in regulating gene 
expression. But RNA also has a crucial role— one that is 
becoming better recognized as more examples of regula- 
tory RNAs are discovered. Once an mRNA is synthesized, 
its functions can be controlled by RNA-binding proteins, 
as seen for the r-protein operons just described, or by an 
RNA. A separate RNA molecule may bind to the mRNA 
"in trans" and affect its activity. Alternatively, a portion of 
the mRNA itself may regulate its own function. When part 
of a molecule affects the function of another part of the 
same molecule, it is said to act "in cis." 

A well-characterized example of RNA regulation in 
trans is seen in the regulation of the mRNA of the gene 
rpoS (RNA polymerase sigma factor), which encodes cr 3 , 
one of the sevens, coli sigma factors (see Table 26-1). 
The cell uses this specificity factor in certain stress situa- 
tions, such as when it enters the stationary phase (a state 
of no growth, necessitated by lack of nutrients) and is 
needed to transcribe large numbers of stress response 
genes. The (7 s mRNA is present at low levels under most 
conditions but is not translated, because a large hairpin 
structure upstream of the coding region inhibits ribosome 
binding (Fig. 28-25). Under certain stress conditions, 
one or both of two small special-function RNAs, DsrA 
(downstream region^) and RprA (ftpos regulator RNA 
A), are induced. Both can pair with one strand of the hair- 
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FIGURE 28-25 Regulation of bacterial mRNA function in trans by 

sRNAs. Several sRNAs (small RNAs) — DsrA, RprA, and OxyS— are 
involved in regulation of the rpoS gene. All require the protein Hfq, an 
RNA chaperone that facilitates RNA-RNA pairing. Hfq has a toroid 
structure, with a pore in the center, (a) DsrA promotes translation by 
pairing with one strand of a stem-loop structure that otherwise blocks 
the ribosome-binding site. RprA acts in a similar way. (b) OxyS blocks 
translation by pairing with the ribosome-binding site. 

pin in the mRNA, disrupting the hairpin and thus al- 
lowing translation of rpoS. Another small RNA, OxyS 
(oxidative stress gene S), is induced under conditions of 
oxidative stress and inhibits the translation of rpoS, prob- 
ably by pairing with and blocking the ribosome-binding 
site on the mRNA. OxyS is expressed as part of a system 
that responds to a different type of stress (oxidative dam- 
age) than does rpoS, and its task is to prevent expression 
of unneeded repair pathways. DsrA, RprA, and OxyS are 
all relatively small bacterial RNA molecules (less than 300 
nucleotides), designated sRNAs (s for small; there are of 
course other "small" RNAs with other designations in eu- 
karyotes). All require for their function a protein called 
Hfq, an RNA chaperone that facilitates RNA-RNA pairing. 
The known bacterial genes regulated in this way are few 
in number, just a few dozen in a typical bacterial species. 
However, these examples provide good model systems 
for understanding patterns present in the more complex 



and numerous examples of RNA-mediated regulation in 
eukaryotes. 

Regulation in cis involves a class of RNA structures 
known as riboswitches. As described in Box 26-3, ap- 
tamers are RNA molecules, generated in vitro, that are 
capable of specific binding to a particular ligand. As one 
might expect, such ligand-binding RNA domains are also 
present in nature— in riboswitches— in a significant 
number of bacterial mRNAs (and even in some eukary- 
otic mRNAs). These natural aptamers are structured 
domains found in untranslated regions at the 5' ends of 
certain bacterial mRNAs. Binding of an mRNAs ri- 
boswitch to its appropriate ligand results in a conforma- 
tional change in the mRNA, and transcription is 
inhibited by stabilization of a premature transcription 
termination structure, or translation is inhibited (in cis) 
by occlusion of the ribosome-binding site (Fig. 28-26). 
In most cases, the riboswitch acts in a kind of feedback 
loop. Most genes regulated in this way are involved in 
the synthesis or transport of the ligand that is bound by 
the riboswitch; thus, when the ligand is present in high 



Aptamer 




Ribosome- 
binding site 



Stabilization of a 
transcription termina- 
poly(U) tor structure aborts 
terminator transcription. 
3' 



Blockage of the 
ribosome-binding site 
blocks translation. 



■ GUACGG 

X 



Regulation of intron 
splicing in fungal and 
plant introns. 



3' splice site 



5' splice site 



FIGURE 28-26 Regulation of bacterial mRNA function in cis by 
riboswitches. The known modes of action are illustrated by several 
different riboswitches based on a widespread natural aptamer that 
binds thiamine pyrophosphate. TPP binding to the aptamer leads to a 
conformational change that produces the varied results illustrated in 
parts (a), (b), and (c) in the different systems in which the aptamer is 
utilized. 
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concentrations, the riboswitch inhibits expression of the 
genes needed to replenish this ligand. 

Each riboswitch binds only one ligand. Distinct ri- 
boswitches have been detected that respond to more 
than a dozen different ligands, including thiamine py- 
rophosphate (TPP, vitamin BJ, cobalamin (vitamin B 12 ), 
flavin mononucleotide, lysine, S-adenosylmethionine 
(adoMet), purines, iV-acetylglucosamine 6-phosphate, 
and glycine. It is likely that many more remain to be dis- 
covered. The riboswitch that responds to TPP seems to 
be the most widespread; it is found in many bacteria, 
fungi, and some plants. The bacterial TPP riboswitch in- 
hibits translation in some species and induces prema- 
ture transcription termination in others (Fig. 28-26). 
The eukaryotic TPP riboswitch is found in the introns of 
certain genes and modulates the alternative splicing of 
those genes (see Fig. 26-22). It is not yet clear how 
common riboswitches are. However, estimates suggest 
that more than 4% of the genes of Bacillus subtilis are 
regulated by riboswitches. 

E9 As riboswitches become better understood, re- 
Bfl searchers are finding medical applications. For 
example, most of the riboswitches described to date, in- 
cluding the one that responds to adoMet, have been 
found only in bacteria. A drug that bound to and acti- 
vated the adoMet riboswitch would shut down the genes 
encoding the enzymes that synthesize and transport 
adoMet, effectively starving the bacterial cells of this es- 
sential cofactor. Drugs of this type are being sought for 
use as a new class of antibiotics. ■ 

The pace of discovery of functional RNAs shows no 
signs of abatement and continues to enrich the hypoth- 
esis that RNA played a special role in the evolution of life 
(Chapter 26). The sRNAs and riboswitches, like ri- 
bozymes and ribosomes, may be vestiges of an RNA 
world obscured by time but persisting as a rich array of 
biological devices still functioning in the extant bio- 
sphere. The laboratory selection of aptamers and ri- 
bozymes with novel ligand-binding and enzymatic 
functions (see Box 26-3) tells us that the RNA-based 
activities necessary for a viable RNA world are possible. 
Discovery of many of the same RNA functions in living 
organisms tells us that key components for RNA-based 
metabolism do exist. For example, the natural aptamers 
of riboswitches may be derived from RNAs that, billions 
of years ago, bound to cofactors needed to promote the 
enzymatic processes required for metabolism in the 
RNA world. 

Some Genes Are Regulated by Genetic Recombination 

We turn now to another mode of bacterial gene regula- 
tion, at the level of DNA rearrangement— recombination. 
Salmonella typhimurium, which inhabits the mam- 
malian intestine, moves by rotating the flagella on its cell 
surface (Fig. 28-27). The many copies of the protein 
flagellin (M r 53,000) that make up the flagella are promi- 
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FIGURE 28-27 Salmonella typhimurium, with flagella evident. 

nent targets of mammalian immune systems. But Sal- 
monella cells have a mechanism that evades the im- 
mune response: they switch between two distinct 
flagellin proteins (FljB and FliC) roughly once every 
1,000 generations, using a process called phase 
variation. 

The switch is accomplished by periodic inversion of 
a segment of DNA containing the promoter for a flagellin 
gene. The inversion is a site-specific recombination re- 
action (see Fig. 25-41) mediated by the Hin recombi- 
nase at specific 14 bp sequences Qiix sequences) at 
either end of the DNA segment. When the DNA segment 
is in one orientation, the gene for FljB flagellin and the 
gene encoding a repressor (FljA) are expressed 
(Fig. 28-28a); the repressor shuts down expression of 
the gene for FliC flagellin. When the DNA segment is 
inverted (Fig. 28-28b), the fljA and fljB genes are no 
longer transcribed, and the fliC gene is induced as the 
repressor becomes depleted. The Hin recombinase, 
encoded by the hin gene in the DNA segment that 
undergoes inversion, is expressed when the DNA segment 
is in either orientation, so the cell can always switch 
from one state to the other. 

This type of regulatory mechanism has the advan- 
tage of being absolute: gene expression is impossible 
when the gene is physically separated from its pro- 
moter (note the position of the fljB promoter in Fig. 
28-28b). An absolute on/off switch may be important in 
this system (even though it affects only one of the two 
flagellin genes), because a flagellum with just one copy 
of the wrong flagellin might be vulnerable to host anti- 
bodies against that protein. The Salmonella system is 
by no means unique. Similar regulatory systems occur 
in some other bacteria and in some bacteriophages, and 
recombination systems with similar functions have 
been found in eukaryotes (Table 28-1). Gene regula- 
tion by DNA rearrangements that move genes and/or 
promoters is particularly common in pathogens that 
benefit by changing their host range or by changing 
their surface proteins, thereby staying ahead of host 
immune systems. 
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FIGURE 28-28 Regulation of flagellin genes in Salmonella: phase 
variation. The products of genes fliC and fljB are different flagellins. 
The hin gene encodes the recombinase that catalyzes inversion of the 
DNA segment containing the fljB promoter and the hin gene. The re- 
combination sites (inverted repeats) are called hix (yellow), (a) In one 
orientation, fljB is expressed along with a repressor protein (product of 
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the fljA gene) that represses transcription of the fliC gene, (b) In the op- 
posite orientation only the fliC gene is expressed; the fljA and fljB 
genes cannot be transcribed. The interconversion between these two 
states, known as phase variation, also requires two other nonspecific 
DNA-binding proteins (not shown), HU and FIS. 



TABLE 28-1 



System 



Examples of Gene Regulation by Recombination 



Recombinase/ 
recombination site 



Type of 
recombination 



Function 



Phase variation (Salmonella) 


Bm/hix 


Site-specific 


Alternative expression of two 
flagellin genes allows evasion 
of host immune response. 


Host range (bacteriophage fi) 


Gm/gix 


Site- specific 


Alternative expression of two 
sets of tail fiber genes affects 
host range. 


Mating-type switch (yeast) 


HO endonuclease, 
RAD52 protein, other 
proteins/M^T 7 


Nonreciprocal 
gene conversion* 


Alternative expression of two 
mating types of yeast, a and 
a, creates cells of different 
mating types that can mate 
and undergo meiosis. 


Antigenic variation 
(trypanosomes) t 


Varies 


Nonreciprocal gene 
conversion* 


Successive expression of 
different genes encoding the 
variable surface glycoproteins 
(VSGs) allows evasion of host 
immune response. 



*ln nonreciprocal gene conversion (a class of recombination events not discussed in Chapter 25), genetic information is moved from one part of 
the genome (where it is silent) to another (where it is expressed). The reaction is similar to replicative transposition (see Fig. 25-45) 
tTrypanosomes cause African sleeping sickness and other diseases (see Box 22-3). The outer surface of a trypanosome is made up of multiple 
copies of a single VSG, the major surface antigen. A cell can change surface antigens to more than 100 different forms, precluding an effective 
defense by the host immune system. 
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SUMMARY 28.2 Regulation of Gene 

Expression in Bacteria 

■ In addition to repression by the Lac repressor, 

the E. coli lac operon undergoes positive regulation 
by the cAMP receptor protein (CRP). When 
[glucose] is low, [cAMP] is high and CRP-cAMP 
binds to a specific site on the DNA, stimulating 
transcription of the lac operon and production of 
lactose-metabolizing enzymes. The presence of 
glucose depresses [cAMP], decreasing expression 
of lac and other genes involved in metabolism of 
secondary sugars, A group of coordinately regulated 
operons is referred to as a regulon. 

■ Operons that produce the enzymes of amino acid 
synthesis have a regulatory circuit called attenuation, 
which uses a transcription termination site (the 
attenuator) in the mRNA. Formation of the 
attenuator is modulated by a mechanism that 
couples transcription and translation while 
responding to small changes in amino acid 
concentration. 

■ In the SOS system, multiple unlinked genes 
repressed by a single repressor are induced 
simultaneously when DNA damage triggers RecA 
protein-facilitated autocatalytic proteolysis of 
the repressor. 

■ In the synthesis of ribosomal proteins, one protein 
in each r-protein operon acts as a translational 
repressor. The mRNA is bound by the repressor, 
and translation is blocked only when the r-protein 
is present in excess of available rRNA. 

■ Posttranscriptional regulation of some mRNAs 
is mediated by sRNAs that act in trans or by 
riboswitches, part of the mRNA structure itself, 
that act in cis. 

■ Some genes are regulated by genetic recombination 
processes that move promoters relative to the 
genes being regulated. Regulation can also take 
place at the level of translation. 

28.3 Regulation of Gene Expression in 
Eukaryotes 

Initiation of transcription is a crucial regulation point for 
gene expression in all organisms. Although eukaryotes 
and bacteria use some of the same regulatory mecha- 
nisms, the regulation of transcription in the two systems 
is fundamentally different. 

We can define a transcriptional ground state as the in- 
herent activity of promoters and transcriptional machinery 
in vivo in the absence of regulatory sequences. In bacteria, 
RNA polymerase generally has access to every promoter 
and can bind and initiate transcription at some level of 
efficiency in the absence of activators or repressors; the 



transcriptional ground state is therefore nonrestrictive. In 
eukaryotes, however, strong promoters are generally inac- 
tive in vivo in the absence of regulatory proteins; that is, 
the transcriptional ground state is restrictive. This funda- 
mental difference gives rise to at least four important fea- 
tures that distinguish the regulation of gene expression in 
eukaryotes from that in bacteria. 

First, access to eukaryotic promoters is restricted 
by the structure of chromatin, and activation of tran- 
scription is associated with many changes in chromatin 
structure in the transcribed region. Second, although 
eukaryotic cells have both positive and negative regula- 
tory mechanisms, positive mechanisms predominate in 
all systems characterized so far. Thus, given that the 
transcriptional ground state is restrictive, virtually every 
eukaryotic gene requires activation in order to be tran- 
scribed. Third, eukaryotic cells have larger, more com- 
plex multimeric regulatory proteins than do bacteria. 
Finally, transcription in the eukaryotic nucleus is sepa- 
rated from translation in the cytoplasm in both space 
and time. 

The complexity of regulatory circuits in eukaryotic 
cells is extraordinary, as the following discussion shows. 
We conclude the section with an illustrated description 
of one of the most elaborate circuits: the regulatory cas- 
cade that controls development in fruit flies. 

Transcriptionally Active Chromatin Is Structurally 
Distinct from Inactive Chromatin 

The effects of chromosome structure on gene regulation 
in eukaryotes have no clear parallel in bacteria. In the 
eukaryotic cell cycle, interphase chromosomes appear, 
at first viewing, to be dispersed and amorphous (see 
Figs 12-43, 24-25). Nevertheless, several forms of chro- 
matin can be found along these chromosomes. About 
10% of the chromatin in a typical eukaryotic cell is in a 
more condensed form than the rest of the chromatin. 
This form, heterochromatin, is transcriptionally inac- 
tive. Heterochromatin is generally associated with par- 
ticular chromosome structures— the centromeres, for 
example. The remaining, less condensed chromatin is 
called euchromatin. 

Transcription of a eukaryotic gene is strongly re- 
pressed when its DNA is condensed within heterochro- 
matin. Some, but not all, of the euchromatin is tran- 
scriptionally active. Transcriptionally active chromosomal 
regions are characterized not only by a more open chro- 
matin structure but also by the presence of nucleosomes 
with particular compositions and modifications. Tran- 
scriptionally active chromatin tends to be deficient in 
histone HI, which binds to the linker DNA between 
nucleosome particles, and enriched in the histone variants 
H3.3 and H2AZ (see Box 24-2). 

Histones within transcriptionally active chromatin 
and heterochromatin differ in their patterns of covalent 
modification. The core histones of nucleosome particles 
(H2A, H2B, H3, H4; see Fig. 24-27) are modified by 
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methylation of Lys or Arg residues, phosphorylation of 
Ser or Thr residues, acetylation (see below), ubiquitina- 
tion (see Fig. 27-47), or sumoylation. Each of the core 
histories has two distinct structural domains. A central 
domain is involved in histone-histone interaction and 
the wrapping of DNA around the nucleosome. A second, 
lysine-rich amino-terminal domain is generally posi- 
tioned near the exterior of the assembled nucleosome 
particle; the covalent modifications occur at specific 
residues concentrated in this amino-terminal domain. 
The patterns of modification have led some researchers 
to propose the existence of a histone code, in which 
modification patterns are recognized by enzymes that 
alter the structure of chromatin. Modifications associ- 
ated with transcriptional activation — primarily methyla- 
tion and acetylation — would be recognized by enzymes 
that make the chromatin more accessible to the tran- 
scription machinery. Indeed, some of the modifications 
are essential for interactions with proteins that play key 
roles in transcription. 

5-Methylation of cytosine residues of CpG se- 
quences is common in eukaryotic DNA (p. 292), but 
DNA in transcriptionally active chromatin tends to be 
undermethylated. Furthermore, CpG sites in particular 
genes are more often undermethylated in the cells of 



tissues where the genes are expressed than in those 
where the genes are not expressed. The overall pattern 
suggests that active chromatin is prepared for transcrip- 
tion by the removal of potential structural barriers. 

Chromatin Is Remodeled by Acetylation and Nucleosomal 
Displacement/Repositioning 

The transcription-associated structural changes in chro- 
matin are generated by a process called chromatin re- 
modeling. The remodeling involves enzymes that 
promote the changes described above. Some enzymes 
covalently modify the histones of the nucleosome. Oth- 
ers use the chemical energy of ATP to reposition nucle- 
osomes on the DNA (Table 28-2). Still others alter the 
histone composition of the nucleosomes. 

The acetylation and methylation of histones figure 
prominently in the processes that activate chromatin for 
transcription. As noted above, the amino-terminal do- 
mains of the core histones are generally rich in Lys and 
Arg residues. During transcription, histone H3 is methyl- 
ated (by specific histone methylases) at Lys 4 in nucleo- 
somes near the 5' end of the coding region and at Lys 36 
throughout the coding region. These methylations facili- 
tate the binding of histone acetyltransferases (HATs), 



TABLE 28-2 



ome Enzyme Complexes Catalyzing Chromatin Structural Changes Associated with Transcription 



Enzyme complex* 



Oligomeric structure 
(number of polypeptides) 



Source 



Activities 



Histone modification 

GCN5-ADA2-ADA3 
SAGA/PCAF 

NuA4 

Histone movement/replacement 
enzymes that require ATP 

SWI/SNF 
ISWI family 



SWR1 family 

Histone chaperones that do 
not require ATP 

HIRA 



3 

>20 

At least 12 



>6; total M T 2 X 10 6 
Varies 

-12 



Yeast 

Eukaryotes 
Eukaryotes 



Eukaryotes 



Eukaryotes 



Eukaryotes 



Eukaryotes 



GCN5 has type A HAT activity 
Includes GCN5-ADA2-ADA3; 

acetylates residues in H3 and H2B 
Esal component has HAT activity; 

acetylates H4, H2A, and H2AZ 



Nucleosome remodeling; 
transcriptional activation 

Nucleosome remodeling; 

transcriptional repression; 

transcriptional activation in some 

cases (NURF) 
H2AZ deposition 



Deposition of H3.3 during 
transcription 



♦The abbreviations for eukaryotic genes and proteins are often more confusing or obscure than those used for bacteria. The complex of GCN5 
{general control nonderepressible) and ADA (alteration/deficiency activation) proteins was discovered during investigation of the regulation of 
nitrogen metabolism genes in yeast. These proteins can be part of the larger SAGA complex (SPF, 4DA2,3, GCN5, acetyltransferase) in yeasts. The 
equivalent of SAGA in humans is PCAF (p300/CBP-associated factor). NuA4 is nucleosome acetyltransferase of H4; ESA1 is essential SAS2-related 
acetyltransferase. SWI (swrtching) was discovered as a protein required for expression of certain genes involved in matmg-type switching ir i yeast, 
and SNF (sucrose nonfermenting) as a factor for expression of the yeast gene for sucrase. Subsequent studies revealed multiple SWI and SNF pro- 
teins that acted in a complex. The SWI/SNF complex has a role in the expression of a wide range of genes and has been found in many eukaryotes, 
including humans. ISWI is imitation SWI; NURF^ nuclear remodeling factor; SWR1, Swi2/Snf2-related ATPase 1; and HIRA, n/stone regulator A. 
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enzymes that acetylate particular Lys residues. Cytosolic 
(type B) HATs acetylate newly synthesized histones be- 
fore the histones are imported into the nucleus. The sub- 
sequent assembly of the histones into chromatin after 
replication is facilitated by histone chaperones: CAF1 for 
H3 and H4, and NAP1 for H2A and H2B (see Box 24-2). 

Where chromatin is being activated for transcrip- 
tion, the nucleosomal histones are further acetylated by 
nuclear (type A) HATs. The acetylation of multiple Lys 
residues in the amino-terminal domains of histones H3 
and H4 can reduce the affinity of the entire nucleosome 
for DNA. Acetylation of particular Lys residues is criti- 
cal for the interaction of nucleosomes with other pro- 
teins. When transcription of a gene is no longer 
required, the extent of acetylation of nucleosomes in 
that vicinity is reduced by the activity of histone 
deacetylases (HDACs), as part of a general gene-si- 
lencing process that restores the chromatin to a tran- 
scriptionally inactive state. In addition to the removal of 
certain acetyl groups, new covalent modification of his- 
tones marks chromatin as transcriptionally inactive. 
For example, Lys 9 of histone H3 is often methylated in 
heterochromatin. 

There are five known families of enzyme complexes 
that actively move or displace nucleosomes, hydrolyzing 
ATP in the process, three of which are particularly im- 
portant in transcriptional activation (Table 28-2; see the 
table footnote for an explanation of the abbreviated 
names of the enzyme complexes described here). 
SWI/SNF, found in all eukaryotic cells, contains at least 
six core polypeptides that together remodel chromatin 
so that nucleosomes become more irregularly spaced, 
and stimulate the binding of transcription factors. The 
complex includes a component called a bromodomain 
near the carboxyl terminus of the active ATPase sub- 
unit, which interacts with acetylated histone tails. 
SWI/SNF is not required for the transcription of every 
gene. NURF, a member of the ISW1 family, remodels 
chromatin in ways that complement and overlap the ac- 
tivity of SWI/SNF. These two enzyme complexes are cru- 
cial in preparing a region of chromatin for active 
transcription. Some members of a third family, SWR1, 
are involved in deposition of the H2AZ histone variant in 
transcriptionally active chromatin. 

In the other families of chromatin remodelers, some 
are required to reorder nucleosomes within chromatin 
when genes are being silenced. The net effect of chro- 
matin remodeling is to make a segment of the chromo- 
some more accessible and to "label" (chemically modify) 
it so as to facilitate the binding and activity of transcrip- 
tion factors that regulate expression of the gene or 
genes in that region. 

Many Eukaryotic Promoters Are Positively Regulated 

As already noted, eukaryotic RNA polymerases have lit- 
tle or no intrinsic affinity for their promoters; initiation 
of transcription is almost always dependent on the action 



of multiple activator proteins. One important reason for 
the apparent predominance of positive regulation seems 
obvious: the storage of DNA within chromatin effec- 
tively renders most promoters inaccessible, so genes are 
silent in the absence of other regulation. The structure 
of chromatin affects access to some promoters more 
than others, but repressors that bind to DNA so as to 
preclude access of RNA polymerase (negative regula- 
tion) would often be simply redundant. Other factors 
must be at play in the use of positive regulation, and 
speculation generally centers around two: the large size 
of eukaryotic genomes and the greater efficiency of pos- 
itive regulation. 

First, nonspecific DNA binding of regulatory pro- 
teins becomes a more important problem in the much 
larger genomes of higher eukaryotes. And the chance 
that a single specific binding sequence will occur ran- 
domly at an inappropriate site also increases with 
genome size. Specificity for transcriptional activation 
can be improved if each of several positive-regulatory 
proteins must bind specific DNA sequences and then 
form a complex in order to become active. The average 
number of regulatory sites for a gene in a multicellular 
organism is probably at least five. The requirement for 
binding of several positive-regulatory proteins to spe- 
cific DNA sequences vastly reduces the probability of 
the random occurrence of a functional juxtaposition of 
all the necessary binding sites. In principle, a similar 
strategy could be used by multiple negative-regulatory 
elements, but this brings us to the second reason for the 
use of positive regulation: it is simply more efficient. If 
the -29,000 genes in the human genome were nega- 
tively regulated, each cell would have to synthesize, at 
all times, this same number of different repressors (or 
many times this number if multiple regulatory elements 
were used at each promoter) in concentrations suffi- 
cient to permit specific binding to each "unwanted" 
gene. In positive regulation, most of the genes are usu- 
ally inactive (that is, RNA polymerases do not bind to 
the promoters) and the cell synthesizes only the activa- 
tor proteins needed to promote transcription of the sub- 
set of genes required in the cell at that time. These 
arguments notwithstanding, there are examples of neg- 
ative regulation in eukaryotes, from yeasts to humans, 
as we shall see. 

DNA-Binding Activators and Coactivators Facilitate 
Assembly of the General Transcription Factors 

To continue our exploration of the regulation of gene 
expression in eukaryotes, we return to the interactions 
between promoters and RNA polymerase II (Pol II), the 
enzyme responsible for the synthesis of eukaryotic 
mRNAs. Although many (but not all) Pol II promoters 
include the TATA box and Inr (initiator) sequences, 
with their standard spacing (see Fig. 26-9), they vary 
greatly in both the number and the location of additional 
sequences required for the regulation of transcription. 
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These additional regulatory sequences are usually called 
enhancers in higher eukaryotes and upstream activa- 
tor sequences (UASs) in yeast. A typical enhancer 
may be found hundreds or even thousands of base pairs 
upstream from the transcription start site, or may even 
be downstream, within the gene itself. When bound by 
the appropriate regulatory proteins, an enhancer in- 
creases transcription at nearby promoters regardless of 
its orientation in the DNA. The UASs of yeast function in 
a similar way, although generally they must be posi- 
tioned upstream and within a few hundred base pairs of 
the transcription start site. An average Pol II promoter 
may be affected by a half-dozen regulatory sequences of 
this type, and even more-complex promoters are quite 
common (see Fig. 15-23, for example). 

Successful binding of active RNA polymerase II 
holoenzyme at one of its promoters usually requires the 
action of other proteins (Fig. 28-29), of four types: (1) 
transcription activators, which bind to enhancers or 
UASs and facilitate transcription; (2) chromatin modi- 
fication and remodeling proteins, described above; 
(3) coactivators; and (4) basal transcription fac- 
tors (see Fig. 26-10, Table 26-2), required at every Pol II 
promoter. The coactivators act indirectly — not by bind- 
ing to the DNA — and are required for essential commu- 
nication between the activators and the complex 
composed of Pol II and the basal (or general) transcrip- 
tion factors. Furthermore, a variety of repressor pro- 
teins can interfere with communication between the 
RNA polymerase and the activators, resulting in repres- 
sion of transcription (Fig. 28-29b). Here we focus on 
the protein complexes shown in Figure 28-29 and on 
how they interact to activate transcription. 

Transcription Activators The requirements for acti- 
vators vary greatly from one promoter to another. A few 
activators are known to facilitate transcription at hun- 
dreds of promoters, whereas others are specific for a few 
promoters. Many activators are sensitive to the binding 
of signal molecules, providing the capacity to activate or 
deactivate transcription in response to a changing cellu- 
lar environment. Some enhancers bound by activators 
are quite distant from the promoter's TATA box. How do 
the activators function at a distance? The answer in most 
cases seems to be that, as indicated earlier, the interven- 
ing DNA is looped so that the various protein complexes 
can interact directly. The looping is promoted by certain 
nonhistone proteins that are abundant in chromatin and 
bind nonspecifically to DNA. These high mobility 
group (HMG) proteins (Fig. 28-29; "high mobility" 
refers to their electrophoretic mobility in polyacrylamide 
gels) play an important structural role in chromatin re- 
modeling and transcriptional activation, 

Coactivator Protein Complexes Most transcription 
requires the presence of additional protein complexes. 
Some major regulatory protein complexes that interact 
with Pol II have been denned both genetically and bio- 
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FIGURE 28-29 Eukaryotic promoters and regulatory proteins. RNA 

polymerase II and its associated basal (general) transcription factors 
form a preinitiation complex at the TATA box and Inr site of the cognate 
promoters, a process facilitated by transcription activators, acting 
through mediator, (a) A composite promoter with typical sequence 
elements and protein complexes found in both yeast and higher eu- 
karyotes. The carboxyi-terminal domain (CTD) of Pol II (see Fig. 26-1 0) 
is an important point of interaction with mediator and other protein 
complexes. The histone modification enzymes catalyze methylation 
and acetylation; the remodeling enzymes alter the content and place- 
ment of nucleosomes. For the transcription activators, DNA-binding 
domains are shown in green, activation domains in pink. The interac- 
tions symbolized by blue arrows are discussed in the text, (b) Eukary- 
otic transcriptional repressors function by a range of mechanisms 
Some bind directly to DNA, displacing a protein complex required for 
activation; others interact with various parts of the transcription or ac- 
tivation complexes to prevent activation. Possible points of interaction 
are indicated with red arrows. 

chemically. These coactivator complexes act as interme- 
diaries between the transcription activators and the Pol II 
complex. 

The principal eukaryotic coactivator consists of 20 
to 30 or more polypeptides in a protein complex called 
mediator (Fig. 28-29); many of the 20 core polypep- 
tides are highly conserved from fungi to humans. An 
additional complex of four subunits can interact with 
mediator and inhibit transcription initiation. Mediator 
binds tightly to the carboxyi-terminal domain (CTD) of 



1140| Regulation of Gene Expression 



the largest subunit of Pol II. The mediator complex is 
required for both basal and regulated transcription at 
promoters used by Pol II, and it also stimulates phos- 
phorylation of the CTD by TFIIH (a basal transcription 
factor). Transcription activators interact with one or 
more components of the mediator complex, with the 
precise interaction sites differing from one activator to 
another. Coactivator complexes function at or near the 
promoter's TATA box. 

Additional coactivators, functioning with one or a 
few genes, have also been described. Some of these op- 
erate in conjunction with mediator, and some may act in 
systems that do not employ mediator. 

TATA-Binding Protein The first component to bind 
in the assembly of a preinitiation complex (PIC) at 

the TATA box of a typical Pol II promoter is the TATA- 
binding protein (TBP). The complete complex in- 
cludes the basal transcription factors TFIIB, TFIIE, 
TFIIF, TFIIH; Pol II; and perhaps TFIIA. This minimal 
PIC, however, is often insufficient for the initiation of 
transcription and generally does not form at all if the 
promoter is obscured within chromatin. Positive regulation, 
leading to transcription, is imposed by the activators 
and coactivators. 

Choreography of Transcriptional Activation We 

can now begin to piece together the sequence of tran- 
scriptional activation events at a typical Pol II promoter 
(Fig. 28-30). The exact order of binding of some com- 
ponents may vary, but the model in Figure 28-30 illus- 
trates the principles of activation as well as one common 
path. Many transcription activators have significant affin- 
ity for their binding sites even when the sites are within 
condensed chromatin. The binding of activators is often 
the event that triggers subsequent activation of the pro- 
moter. Binding of one activator may enable the binding of 
others, gradually displacing some nucleosomes. 

Crucial remodeling of the chromatin then takes 
place in stages, facilitated by interactions between acti- 
vators and HATs or enzyme complexes such as SWI/SNF 
(or both). In this way, a bound activator can draw 
in other components necessary for further chromatin 
remodeling to permit transcription of specific genes. 
The bound activators interact with the large mediator 
complex. Mediator, in turn, provides an assembly surface 
for the binding of first TBP (or TFIID), then TFIIB, and 
then other components of the PIC including RNA poly- 
merase II. Mediator stabilizes the binding of Pol II and its 
associated transcription factors and greatly facilitates 
formation of the PIC. Complexity in these regulatory cir- 
cuits is the rule rather than the exception, with multiple 
DNA-bound activators promoting transcription. 

The script can change from one promoter to an- 
other, but most promoters seem to require a precisely 
ordered assembly of components to initiate transcrip- 
tion. The assembly process is not always fast. At some 
genes it may take minutes; at certain genes of higher eu- 
karyotes the process can take days. 
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FIGURE 28-30 The components of transcriptional activation. Activators 
may bind the DNA first. The activators recruit the histone modifica- 
tion/nucleosome remodeling complexes and mediator. Mediator facil- 
itates the binding of TBP and TFIIB, and the other basal transcription 
factors and Pol II then bind. Phosphorylation of the carboxyl-terminal 
domain (CTD) of Pol II leads to transcription initiation (not shown). 

Reversible Transcriptional Activation Although rarer, 
some eukaryotic regulatory proteins that bind to Pol II 
promoters can act as repressors, inhibiting the formation 
of active PICs (Fig. 28-29b). Some activators can adopt 
different conformations, enabling them to serve as tran- 
scription activators or as repressors. For example, some 
steroid hormone receptors (described later) function in 
the nucleus as transcription activators, stimulating the 
transcription of certain genes when a particular steroid 
hormone signal is present. When the hormone is absent, 
the receptor proteins revert to a repressor conformation, 
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preventing the formation of PICs. In some cases this re- 
pression involves interaction with histone deacetylases 
and other proteins that help restore the surrounding chro- 
matin to its transcriptionally inactive state. Mediator, 
when it includes the inhibitory subunits, may also block 
transcription initiation. This may be a regulatory mecha- 
nism to ensure ordered assembly of the PIC (by delaying 
transcriptional activation until all required factors are 
present), or it may be a mechanism that helps deactivate 
promoters when transcription is no longer required. 

The Genes of Galactose Metabolism in Yeast Are Subject 
to Both Positive and Negative Regulation 

Some of the general principles described above can be 
illustrated by one well-studied eukaryotic regulatory 
circuit (Fig. 28-31). The enzymes required for the 
importation and metabolism of galactose in yeast are 
encoded by genes scattered over several chromosomes 
(Table 28-3). Each of the GAL genes is transcribed 
separately, and yeast cells have no operons like those 
in bacteria. However, all the GAL genes have similar 
promoters and are regulated coordinately by a com- 
mon set of proteins. The promoters for the GAL genes 
consist of the TATA box and Inr sequences, as well as 
an upstream activator sequence (UAS G ) recognized by 
the transcription activator Gal4 protein (Gal4p). Regu- 
lation of gene expression by galactose entails an inter- 
play between Gal4p and two other proteins, Gal80p 
and Gal3p (Fig. 28-31). Gal80p forms a complex with 
Gal4p, preventing Gal4p from functioning as an activa- 
tor of the GAL promoters. When galactose is present, it 
binds Gal3p, which then interacts with Gal80p, allow- 
ing Gal4p to function as an activator at the various GAL 
promoters. 




FIGURE 28-31 Regulation of transcription of GAL genes in yeast. 

Galactose imported into the yeast cell is converted to galactose 
6-phosphate by a pathway involving six enzymes, whose genes are 
scattered over three chromosomes (see Table 28-3). Transcription of 
these genes is regulated by the combined actions of the proteins 
Gal4p, Gal80p, and GaI3p, with Gal4p playing the central role of tran- 
scription activator. The Gal4p-Gal80p complex is inactive. Binding of 
galactose to Gal3p and interaction of Gal3p with Gal80p produces a 
conformational change in Gal80p. The Gal3p-Gal80p complex is then 
exported from the nucleus, allowing Gal4p to recruit mediator and 
SAGA and function in transcriptional activation. 
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FIGURE 28-32 Protein complexes involved in transcriptional activa- 
tion of a group of related eukaryotic genes. The GAL system illustrates 
the complexity of this process, but not all these protein complexes are 
yet known to affect GAL gene transcription. Note that many of the 
complexes (such as SWI/SNF, GCN5-ADA2-ADA3, and mediator) 
affect the transcription of many genes. The complexes assemble step- 
wise, as shown in Fig. 28-30. 



Other protein complexes also have a role in activat- 
ing transcription of the GAL genes. These include the 
SAGA complex for histone acetylation, the SWI/SNF 
complex for nucleosome remodeling, and the mediator 
complex. The Gal4 protein, when not bound to Gal80p, 
is responsible for the recruitment of these additional 
factors needed for transcriptional activation. Figure 28-32 
provides an idea of the complexity of protein interac- 
tions in the overall process of transcriptional activation 
in eukaryotic cells. 

Glucose is the preferred carbon source for yeast, as 
it is for bacteria. When glucose is present, most of the 
GAL genes are repressed— whether galactose is present 
or not. The GAL regulatory system described above is 
effectively overridden by a complex catabolite repres- 
sion system that includes several proteins (not depicted 
in Pig. 28-32). 

Transcription Activators Have a Modular Structure 

Transcription activators typically have a distinct struc- 
tural domain for specific DNA binding and one or more 



additional domains for transcriptional activation or for 
interaction with other regulatory proteins. Interaction 
of two regulatory proteins is often mediated by do- 
mains containing leucine zippers (Fig. 28-14) or helix- 
loop-helix motifs (Fig. 28-15). We consider here three 
distinct types of structural domains used in activation 
by transcription activators (Fig. 28-33a): Gal4p, Spl, 
and CTF1. 

Gal4p contains a zinc finger-like structure in its 
DNA-binding domain, near the amino terminus; this do- 
main has six Cys residues that coordinate two Zn 2+ . The 
protein functions as a homodimer (with dimerization 
mediated by interactions between two coiled coils) and 
binds to UAS G , a palindromic DNA sequence about 17 
bp long. Gal4p has a separate activation domain with 
many acidic amino acid residues. Experiments that 
substitute a variety of different peptide sequences 
for the acidic activation domain of Gal4p suggest 
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FIGURE 28-33 Transcription activators, (a) Typical activators such as 
CTF1, Gal4p, and Spl have a DNA-binding domain and an activation 
domain. The nature of the activation domain is indicated by symbols: 
, acidic; Q Q Q, glutamine-rich; PPP, proline-rich. These pro- 
teins generally activate transcription by interacting with coactivator 
complexes such as mediator. Note that the binding sites illustrated here 
are not generally found together near a single gene, (b) A chimeric 
protein containing the DNA-binding domain of Spl and the activation 
domain of CTF1 activates transcription if a GC box is present. 
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that the acidic nature of this domain is critical to its 
function, although its precise amino acid sequence can 
vary considerably. 

Spl (M r 80,000) is a transcription activator for a 
large number of genes in higher eukaryotes. Its DNA 
binding site, the GC box (consensus sequence GGGCGG), 
is usually quite near the TATA box. The DNA-binding 
domain of the Spl protein is near its carboxyl terminus 
and contains three zinc fingers. Two other domains in 
Spl function in activation and are notable in that 25% of 
their amino acid residues are Gin. A wide variety of 
other activator proteins also have these glutamine-rich 
domains. 

CTF1 (CCAAT-binding transcription factor 1) be- 
longs to a family of transcription activators that bind a 
sequence called the CCAAT site (its consensus se- 
quence is TGGN 6 GCCAA, where N is any nucleotide). 
The DNA-binding domain of CTF1 contains many basic 
amino acid residues, and the binding region is probably 
arranged as an a helix. This protein has neither a helix- 
turn-helix nor a zinc finger motif; its DNA-binding 
mechanism is not yet clear. CTF1 has a proline-rich 
activation domain, with Pro accounting for more than 
20% of the amino acid residues. 

The discrete activation and DNA-binding domains 
of regulatory proteins often act completely independ- 
ently, as has been demonstrated in "domain-swapping" 
experiments. Genetic engineering techniques (Chapter 
9) can join the proline-rich activation domain of CTF1 to 
the DNA-binding domain of Spl to create a protein that, 
like intact Spl, binds to GC boxes on the DNA and acti- 
vates transcription at a nearby promoter (as in Fig. 
28-33b). The DNA-binding domain of Gal4p has simi- 
larly been replaced experimentally with the DNA- 
binding domain of the E. coli LexA repressor (of the 
SOS response; Fig. 28-22). This chimeric protein neither 
binds at UAS G nor activates the yeast GAL genes (as 
would intact Gal4p) unless the UAS G sequence in the 
DNA is replaced by the LexA recognition site. 

Eukaryotic Gene Expression Can Be Regulated by 
Intercellular and Intracellular Signals 

The effects of steroid hormones (and of thyroid and 
retinoid hormones, which have the same mode of ac- 
tion) provide additional well-studied examples of the 
modulation of eukaryotic regulatory proteins by direct 
interaction with molecular signals (see Fig. 12-29). Un- 
like other types of hormones, steroid hormones do not 
have to bind to plasma membrane receptors. Instead, 
they can interact with intracellular receptors that are 
themselves transcription activators. Steroid hormones 
too hydrophobic to dissolve readily in the blood (estro- 
gen, progesterone, and Cortisol, for example) travel on 
specific carrier proteins from their point of release to 
their target tissues. In the target tissue, the hormone 
passes through the plasma membrane by simple diffu- 
sion and binds to its specific receptor protein in the nu- 
cleus. The hormone-receptor complex acts by binding 



TABLE 28-4 



Receptor 



Hormone Response Elements (HREsJ 
Bound by Steroid-Type Hormone 
Receptors 

Consensus sequence bound* 



Androgen 
Glucocorticoid 
Retinoic acid (some) 
Vitamin D 
Thyroid hormone 
RX 1 " 



GG (A/T) AC AN 2 TGTTCT 

GGTACAN 3 TGTTCT 

AGGTCAN5AGGTCA 

AGGTCAN3AGGTCA 

AGGTCAN3AGGTCA 

AGGTCANAGGTCANAG 
GTCANAGGTCA 



*N represents any nucleotide. 

t Forms a dimer with the retinoic acid receptor or vitamin D receptor. 

to highly specific DNA sequences called hormone re- 
sponse elements (HREs), thereby altering gene ex- 
pression. Hormone binding triggers changes in the 
conformation of the receptor proteins so that they be- 
come capable of interacting with additional transcrip- 
tion factors. The bound hormone-receptor complex 
can either enhance or suppress the expression of adja- 
cent genes. 

The DNA sequences (HREs) to which hormone- 
receptor complexes bind are similar in length and 
arrangement, but differ in sequence, for the various 
steroid hormones. Each receptor has a consensus HRE 
sequence (Table 28-4) to which the hormone-receptor 
complex binds well, with each consensus consisting of 
two six-nucleotide sequences, either contiguous or sep- 
arated by three nucleotides, in tandem or in a palin- 
dromic arrangement. The hormone receptors have a 
highly conserved DNA-binding domain with two zinc 
fingers (Fig. 28-34). The hormone-receptor complex 
binds to the DNA as a dimer, with the zinc finger do- 
mains of each monomer recognizing one of the six- 
nucleotide sequences. The ability of a given hormone to 
act through the hormone-receptor complex to alter the 
expression of a specific gene depends on the exact se- 
quence of the HRE, its position relative to the gene, and 
the number of HREs associated with the gene. 

Unlike the DNA-binding domain, the ligand-binding 
region of the receptor protein— always at the carboxyl 
terminus— is quite specific to the particular receptor. In 
the ligand-binding region, the glucocorticoid receptor is 
only 30% similar to the estrogen receptor and 17% simi- 
lar to the thyroid hormone receptor. The size of the 
ligand-binding region varies dramatically; in the vitamin D 
receptor it has only 25 amino acid residues, whereas in the 
mineralocorticoid receptor it has 603 residues. Mutations 
that change one amino acid in these regions can result in 
loss of responsiveness to a specific hormone. Some hu- 
mans unable to respond to Cortisol, testosterone, vitamin 
D, or thyroxine have mutations of this type. 

Some hormone receptors, including the human 
progesterone receptor, activate transcription with the 
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FIGURE 28-34 Typical steroid hormone receptors. These re- 
ceptor proteins have a binding site for the hormone, a DNA- 
binding domain, and a region that activates transcription of the 
regulated gene. The highly conserved DNA-binding domain 
has two zinc fingers. The sequence shown here is that for the 
estrogen receptor, but the residues in bold type are common to 
all steroid hormone receptors. 
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aid of an unusual coactivator— steroid receptor RNA 
activator (SRA), a -700 nucleotide RNA. SRA acts as 
part of a ribonucleoprotein complex, but it is the RNA 
component that is required for transcription coactiva- 
tion. The detailed set of interactions between SRA and 
other components of the regulatory systems for these 
genes remains to be worked out. 

Regulation Can Result from Phosphorylation 
of Nuclear Transcription Factors 

We noted in Chapter 12 that the effects of insulin on 
gene expression are mediated by a series of steps lead- 
ing ultimately to the activation of a protein kinase in the 
nucleus that phosphorylates specific DNA-binding pro- 
teins and thereby alters their ability to act as transcrip- 
tion factors (see Fig. 12-15). This general mechanism 
mediates the effects of many nonsteroid hormones. For 
example, the /3-adrenergic pathway that leads to ele- 
vated levels of cytosolic cAMP, which acts as a second 
messenger in eukaryotes as well as in bacteria (see 
Figs 12-4, 28-18), also affects the transcription of a set 
of genes, each of which is located near a specific DNA 
sequence called a cAMP response element (CRE). 
The catalytic subunit of protein kinase A, released when 
cAMP levels rise (see Fig. 12-6), enters the nucleus and 
phosphorylates a nuclear protein, the CRE-binding 
protein (CREB). When phosphorylated, CREB binds 
to CREs near certain genes and acts as a transcription 
factor, turning on the expression of these genes. 

Many Eukaryotic mRNAs Are Subject to 
Translational Repression 

Regulation at the level of translation assumes a much 
more prominent role in eukaryotes than in bacteria and is 
observed in a range of cellular situations. In contrast to the 



tight coupling of transcription and translation in bacteria, 
the transcripts generated in a eukaryotic nucleus must be 
processed and transported to the cytoplasm before trans- 
lation. This can impose a significant delay on the appear- 
ance of a protein. When a rapid increase in protein 
production is needed, a translationally repressed mRNA 
already in the cytoplasm can be activated for translation 
without delay. Translational regulation may play an espe- 
cially important role in regulating certain very long eu- 
karyotic genes (a few are measured in the millions of base 
pairs), for which transcription and mRNA processing can 
require many hours. Some genes are regulated at both the 
transcriptional and translational stages, with the latter 
playing a role in the fine-tuning of cellular protein levels. 
In some anucleate cells, such as reticulocytes (immature 
erythrocytes), transcriptional control is entirely unavail- 
able and translational control of stored mRNAs becomes 
essential. As described below, translational controls can 
also have spatial significance during development, when 
the regulated translation of prepositioned mRNAs creates 
a local gradient of the protein product. 

Eukaryotes have at least four main mechanisms of 
translational regulation. 

1. Translational initiation factors are subject 
to phosphorylation by protein kinases. The 
phosphorylated forms are often less active 
and cause a general depression of translation 
in the cell. 

2. Some proteins bind directly to mRNA and act as 
translational repressors, many of them binding at 
specific sites in the 39 untranslated region 
(39UTR). So positioned, these proteins interact 
with other translation initiation factors bound to 
the mRNA or with the 40S ribosomal subunit to 
prevent translation initiation (Fig. 28-35; compare 
this with Fig. 27-27). 
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FIGURE 28-35 Translational regulation of eukaryotic mRNA. One of 

the most important mechanisms for translational regulation in eukary- 
otes involves the binding of translational repressors (RNA-binding pro- 
teins) to specific sites in the 3' untranslated region (3'UTR) of the 
mRNA. These proteins interact with eukaryotic initiation factors or 
with the ribosome (see Fig. 27-27) to prevent or slow translation. 



3. Binding proteins, present in eukaryotes from yeast 
to mammals, disrupt the interaction between 
eIF4E and eIF4G (see Fig. 27-27). The mammalian 
versions are known as 4E-BPs (eIF4E binding 
proteins). When cell growth is slow, these proteins 
limit translation by binding to the site on eIF4E 
that normally interacts with eIF4G. When cell 
growth resumes or increases in response to growth 
factors or other stimuli, the binding proteins are 
inactivated by protein kinase-dependent 
phosphorylation. 

4. RNA-mediated regulation of gene expression, 
considered later, often occurs at the level of 
translational repression. 

The variety of translational regulation mechanisms pro- 
vides flexibility, allowing focused repression of a few 
mRNAs or global regulation of all cellular translation. 

Translational regulation has been particularly well 
studied in reticulocytes. One such mechanism in these 
cells involves eIF2, the initiation factor that binds to the 
initiator tRNA and conveys it to the ribosome; when 
Met-tRNA has bound to the P site, the factor eIF2B 
binds to eIF2, recycling it with the aid of GTP binding 
and hydrolysis. The maturation of reticulocytes includes 
destruction of the cell nucleus, leaving behind a plasma 
membrane packed with hemoglobin. Messenger RNAs 
deposited in the cytoplasm before the loss of the nu- 
cleus allow for the replacement of hemoglobin. When 
reticulocytes become deficient in iron or heme, the 
translation of globin mRNAs is repressed. A protein ki- 
nase called HCR (hemin-controlled repressor) is ac- 
tivated, catalyzing the phosphorylation of eIF2. When 
phosphorylated, eIF2 forms a stable complex with 



eIF2B that sequesters the eIF2, making it unavailable 
for participation in translation. In this way, the reticulo- 
cyte coordinates the synthesis of globin with the avail- 
ability of heme. 

Many additional examples of translational regulation 
have been found in studies of the development of multi- 
cellular organisms, as discussed in more detail below. 

Posttranscriptional Gene Silencing Is Mediated 
by RNA Interference 

In higher eukaryotes, including nematodes, fruit flies, 
plants, and mammals, a class of small RNAs called 
micro-RNAs (miRNAs) mediates the silencing of many 
genes. The RNAs function by interacting with mRNAs, 
often in the 3'UTR, resulting in either mRNA degrada- 
tion or translation inhibition. In either case, the mRNA, 
and thus the gene that produces it, is silenced. This form 
of gene regulation controls developmental timing in at 
least some organisms. It is also used as a mechanism to 
protect against invading RNA viruses (particularly im- 
portant in plants, which lack an immune system) and to 
control the activity of transposons. In addition, small 
RNA molecules may play a critical (but still undefined) 
role in the formation of heterochromatin. 

Many miRNAs are present only transiently during 
development, and these are sometimes referred to as 
small temporal RNAs (stRNAs). Thousands of differ- 
ent miRNAs have been identified in higher eukaryotes, 
and they may affect the regulation of a third of mam- 
malian genes. They are transcribed as precursor RNAs 
about 70 nucleotides long, with internally complemen- 
tary sequences that form hairpinlike structures (see Fig. 
26-28). The precursors are cleaved by endonucleases 
such as Drosha and Dicer to form short duplexes about 
20 to 25 nucleotides long. One strand of the processed 
miRNA is transferred to the target mRNA (or to a viral or 
transposon RNA), leading to inhibition of translation or 
degradation of the RNA (Fig. 28-36). Some miRNAs 
bind to and affect a single mRNA and thus affect expres- 
sion of only one gene. Others interact with multiple 
mRNAs and thus form the mechanistic core of regulons 
that coordinate the expression of multiple genes. 

This gene regulation mechanism has an interesting 
and very useful practical side. If an investigator intro- 
duces into an organism a duplex RNA molecule corre- 
sponding in sequence to virtually any mRNA, Dicer 
cleaves the duplex into short segments, called small 
interfering RNAs (siRNAs). These bind to the 
mRNA and silence it (Fig. 28-36b). The process is 
known as RNA interference (RNAi). In plants, virtu- 
ally any gene can be effectively shut down in this way. 
Nematodes readily ingest entire, functional RNAs, and 
simply introducing the duplex RNA into the worm's 
diet produces very effective suppression of the target 
gene. The technique has rapidly become an important 
tool in the ongoing efforts to study gene function, be- 
cause it can disrupt gene function without creating a 
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FIGURE 28-36 Gene silencing by RNA interference, (a) Small tempo- 
ral RNAs (stRNAs) are generated by Dicer-mediated cleavage of longer 
precursors that fold to create duplex regions. The stRNAs then bind to 
mRNAs, leading to degradation of mRNA or inhibition of translation, 
(b) Double-stranded RNAs can be constructed and introduced into a 
cell. Dicer processes the duplex RNAs into small interfering RNAs 
(siRNAs), which interact with the target mRNA. Again, either the 
mRNA is degraded or translation is inhibited. 



mutant organism. The procedure can be applied to hu- 
mans as well. Laboratory-produced siRNAs have been 
used to block HIV and poliovirus infections in cultured 
human cells for a week or so at a time. Rapid progress 
in research into RNA interference makes this a field to 
watch for future medical advances. 

RNA-Mediated Regulation of Gene Expression 
Takes Many Forms in Eukaryotes 

The special-function RNAs in eukaryotes include miRNAs, 
described above; snRNAs, involved in RNA splicing (see 
Fig. 26-16); and snoRNAs, involved in rRNA modifica- 
tion (see Fig. 26-26). All RNAs that do not encode pro- 
teins, including rRNAs and tRNAs, come under the 
general designation of ncRNAs (noncoding RNAs). 
Mammalian genomes seem to encode more ncRNAs 
than coding mRNAs (see Box 26-4). Not surprisingly, 
additional functional classes of ncRNAs are still being 
discovered. 

Many of the newly found examples interact with pro- 
teins rather than with RNAs, and affect the function of 
the bound proteins. The SRA that functions as a coactiva- 
tor of steroid hormone-responsive genes is one example: 
it affects the activation of transcription. The heat shock 
response in human cells provides another example. Heat 



shock factor 1 (HSF-1) is an activator protein that, in 
nonstressed cells, exists as a monomer bound by the 
chaperone Hsp90. Under stress conditions, HSF-1 is re- 
leased from Hsp90 and trimerizes. The HSF-1 trimer 
binds to DNA and activates transcription of genes whose 
products are required to deal with the stress. An ncRNA 
called HSR1 (heat shock RNA 1; -600 nucleotides) stim- 
ulates the HSF-1 trimerization and DNA binding. HSR1 
does not act alone; it functions in a complex with the 
translation elongation factor eEFlA. 

Additional RNAs affect transcription in a variety of 
ways. Besides its role in splicing (see Fig. 26-17), the 
snRNA Ul directly binds to the transcription factor 
TPIIH. Its function in this context is not yet clear, but it 
may regulate TFIIH or affect the coupling between tran- 
scription and splicing, or both. A 331 nucleotide ncRNA 
called 7SK, abundant in mammals, binds to the Pol II 
transcription elongation factor p-TEFb (see Table 26-2) 
and represses transcript elongation. Another ncRNA, B2 
(-178 nucleotides), binds directly to Pol II during heat 
shock and represses transcription. The B2-bound Pol II 
assembles into stable PICs, but transcription is blocked. 
The B2 RNA thus halts the transcription of many genes 
during heat shock, and the mechanism that allows HSF- 
1-responsive genes to be expressed in the presence of 
B2 remains to be worked out. 

The recognized roles of ncRNAs in gene expression 
and in many other cellular processes are rapidly ex- 
panding. At the same time, the study of the biochemistry 
of gene regulation is becoming much less protein-centric. 

Development Is Controlled by Cascades of 
Regulatory Proteins 

For sheer complexity and intricacy of coordination, the 
patterns of gene regulation that bring about develop- 
ment of a zygote into a multicellular animal or plant have 
no peer. Development requires transitions in morphol- 
ogy and protein composition that depend on tightly co- 
ordinated changes in expression of the genome. More 
genes are expressed during early development than in 
any other part of the life cycle. For example, in the sea 
urchin, an oocyte has about 18,500 different mRNAs, 
compared with about 6,000 different mRNAs in the cells 
of a typical differentiated tissue. The mRNAs in the 
oocyte give rise to a cascade of events that regulate the 
expression of many genes across both space and time. 

Several organisms have emerged as important 
model systems for the study of development, because 
they are easy to maintain in a laboratory and have rela- 
tively short generation times. These include nematodes, 
fruit flies, zebra fish, mice, and the plant Arabidopsis. 
This discussion focuses on the development of fruit flies. 
Our understanding of the molecular events during de- 
velopment of Drosophila melanogaster is particularly 
well advanced and can be used to illustrate patterns and 
principles of general significance. 
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FIGURE 28-37 Life cycle of the fruit fly Drosophila melanogaster. By the late embryonic stage, segments have formed, each containing 

Drosophila undergoes a complete metamorphosis, which means that specialized structures from which the various appendages and other 

the adult insect is radically different in form from its immature stages, a features of the adult fly will develop, 
transformation that requires extensive alterations during development. 



The life cycle of the fruit fly includes complete 
metamorphosis during its progression from an embryo 
to an adult (Fig. 28-37). Among the most important 
characteristics of the embryo are its polarity (the ante- 
rior and posterior parts of the animal are readily distin- 
guished, as are its dorsal and ventral parts) and its 
metamerism (the embryo body is made up of serially 
repeating segments, each with characteristic features). 
During development, these segments become organized 
into a head, thorax, and abdomen. Each segment of the 
adult thorax has a different set of appendages. Develop- 
ment of this complex pattern is under genetic control, 
and a variety of pattern-regulating genes have been 
discovered that dramatically affect the organization of 
the body. 

The Drosophila egg, along with 15 nurse cells, is 
surrounded by a layer of follicle cells (Fig. 28-38). As 
the egg cell forms (before fertilization), mRNAs and 
proteins originating in the nurse and follicle cells are de- 
posited in the egg cell, where some play a critical role in 
development. Once a fertilized egg is laid, its nucleus di- 
vides and the nuclear descendants continue to divide in 
synchrony every 6 to 10 min. Plasma membranes are not 
formed around the nuclei, which are distributed within 
the egg cytoplasm (forming a syncytium). Between the 
eighth and eleventh rounds of nuclear division, the nu- 
clei migrate to the outer layer of the egg, forming a 
monolayer of nuclei surrounding the common yolk-rich 
cytoplasm; this is the syncytial blastoderm. After a few 
additional divisions, membrane invaginations surround 



the nuclei to create a layer of cells that form the cellular 
blastoderm. At this stage, the mitotic cycles in the 
various cells lose their synchrony. The developmental 
fate of the cells is determined by the mRNAs and pro- 
teins originally deposited in the egg by the nurse and 
follicle cells. 

Proteins that, through changes in local concentra- 
tion or activity, cause the surrounding tissue to take up 
a particular shape or structure are sometimes referred 
to as morphogens; they are the products of pattern- 
regulating genes. As defined by Christiane Niisslein- 
Volhard, Edward B. Lewis, and Eric F. Wieschaus, three 
major classes of pattern-regulating genes— maternal, 
segmentation, and homeotic genes— function in succes- 
sive stages of development to specify the basic features 
of the Drosophila embryo body. Maternal genes are 
expressed in the unfertilized egg, and the resulting 
maternal mRNAs remain dormant until fertilization. 
These provide most of the proteins needed in very early 
development, until the cellular blastoderm is formed. 
Some of the proteins encoded by maternal mRNAs di- 
rect the spatial organization of the developing embryo at 
early stages, establishing its polarity. Segmentation 
genes, transcribed after fertilization, direct the forma- 
tion of the proper number of body segments. At least 
three subclasses of segmentation genes act at succes- 
sive stages: gap genes divide the developing embryo 
into several broad regions, and pair-rule genes to- 
gether with segment polarity genes define 14 stripes 
that become the 14 segments of a normal embryo. 
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FIGURE 28-38 Early development in Drosophila, During development 
of the egg, maternal mRNAs (including the bicoid and nanos gene 
transcripts, discussed in the text) and proteins are deposited in the de- 
veloping oocyte (unfertilized egg cell) by nurse cells and follicle cells. 
After fertilization, the two nuclei of the fertilized egg divide in syn- 
chrony within the common cytoplasm (syncytium), then migrate to the 
periphery. Membrane invaginations surround the nuclei to create a 
monolayer of cells at the periphery; this is the cellular blastoderm 
stage. During the early nuclear divisions, several nuclei at the far pos- 
terior become pole cells, which later become the germ-line cells. 

Homeotic genes are expressed still later; they specify 
which organs and appendages will develop in particular 
body segments. 

The many regulatory genes in these three classes 
direct the development of an adult fly, with a head, tho- 
rax, and abdomen, with the proper number of segments, 
and with the correct appendages on each segment. Al- 
though embryogenesis takes about a day to complete, all 
these genes are activated during the first four hours. 
Some mRNAs and proteins are present for only a few 
minutes at specific points during this period. Some of 
the genes code for transcription factors that affect the 
expression of other genes in a kind of developmental 
cascade. Regulation at the level of translation also oc- 
curs, and many of the regulatory genes encode transla- 
tional repressors, most of which bind to the 3'UTR of 
the mRNA (Fig. 28-35). Because many mRNAs are 
deposited in the egg long before their translation is 
required, translational repression provides an especially 
important avenue for regulation in developmental 
pathways. 

Maternal Genes Some maternal genes are expressed 
within the nurse and follicle cells, and some in the egg 
itself. In the unfertilized Drosophila egg, the maternal 
gene products establish two axes — anterior-posterior 
and dorsal-ventral — and thus define which regions of the 
radially symmetric egg will develop into the head and ab- 
domen and the top and bottom of the adult fly. A key 
event in very early development is establishment of 
mRNA and protein gradients along the body axes. Some 
maternal mRNAs have protein products that diffuse 
through the cytoplasm to create an asymmetric distribu- 
tion in the egg. Different cells in the cellular blastoderm 
therefore inherit different amounts of these proteins, 
setting the cells on different developmental paths. The 
products of the maternal mRNAs include transcription 
activators or repressors as well as translational repressors, 
all regulating the expression of other pattern-regulating 
genes. The resulting specific patterns and sequences of 
gene expression therefore differ between cell lineages, 
ultimately orchestrating the development of each adult 
structure. 

The anterior-posterior axis in Drosophila is de- 
fined at least in part by the products of the bicoid and 
nanos genes. The bicoid gene product is a major ante- 
rior morphogen, and the nanos gene product is a major 
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posterior morphogen. The 
mRNA from the bicoid gene is 
synthesized by nurse cells and 
deposited in the unfertilized 
egg near its anterior pole . 
Nusslein-Volhard found that 
this mRNA is translated soon 
after fertilization, and the Bi- 
coid protein diffuses through 
the cell to create, by the sev- 
enth nuclear division, a con- 
centration gradient radiating 
out from the anterior pole 
(Fig. 28-39a). The Bicoid protein is a transcription fac- 
tor that activates the expression of several segmentation 
genes; the protein contains a homeodomain (p. 1124). 
Bicoid is also a translational repressor that inactivates 
certain mRNAs. The amounts of Bicoid protein in various 
parts of the embryo affect the subsequent expression of 
other genes in a threshold-dependent manner. Genes are 
transcriptionally activated or translationally repressed 
only where the Bicoid protein concentration exceeds the 
threshold. Changes in the shape of the Bicoid concentra- 
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tion gradient have dramatic effects on the body pattern. 
Lack of Bicoid protein results in development of an em- 
bryo with two abdomens but neither head nor thorax 
(Fig. 28-39b); however, embryos without Bicoid will de- 
velop normally if an adequate amount of bicoid mRNA is 
injected into the egg at the appropriate end. The nanos 
gene has an analogous role, but its mRNA is deposited at 
the posterior end of the egg and the anterior-posterior 
protein gradient peaks at the posterior pole. The Nanos 
protein is a translational repressor. 

A broader look at the effects of maternal genes re- 
veals the outline of a developmental circuit. In addition 
to the bicoid and nanos mRNAs, which are deposited 
in the egg asymmetrically, several other maternal mR- 
NAs are deposited uniformly throughout the egg cyto- 
plasm. Three of these mRNAs encode the Pumilio, 
Hunchback, and Caudal proteins, all affected by nanos 
and bicoid (Fig. 28-40). Caudal and Pumilio are in- 
volved in development of the posterior end of the fly. 
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FIGURE 28-39 Distribution of a maternal gene product in a Drosophila 

egg. (a) Micrograph of an immunologically stained egg, showing distri- 
bution of the bicoid (bed) gene product. The graph measures stain in- 
tensity. This distribution is essential for normal development of the 
anterior structures, (b) If the bed gene is not expressed by the mother 
(hcd~/bcd~ mutant) and thus no bicoid mRNA is deposited in the egg, 
the resulting embryo has two posteriors (and soon dies). 
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FIGURE 28-40 Regulatory circuits of the anterior-posterior axis 
in a Drosophila egg. The bicoid and nanos mRNAs are localized 
near the anterior and posterior poles, respectively. The caudal, 
hunchback, and pumilio mRNAs are distributed throughout the 
egg cytoplasm. The gradients of Bicoid (Bed) and Nanos proteins 
lead to accumulation of Hunchback protein in the anterior and 
Caudal protein in the posterior of the egg. Because Pumilio pro- 
tein requires Nanos protein for its activity as a translational re- 
pressor of hunchback, it functions only at the posterior end. 
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Caudal is a transcription activator with a homeodomain; 
Pumilio is a translational repressor. Hunchback protein 
plays an important role in development of the anterior 
end and is also a transcriptional regulator of a variety of 
genes, in some cases a positive regulator, in other cases 
negative. Bicoid suppresses translation of caudal at the 
anterior end and also acts as a transcription activator of 
hunchback in the cellular blastoderm. Because hunch- 
back is expressed both from maternal mRNAs and from 
genes in the developing egg, it is considered both a 
maternal and a segmentation gene. The result of the 
activities of Bicoid is an increased concentration of 
Hunchback at the anterior end of the egg. The Nanos 
and Pumilio proteins act as translational repressors of 
hunchback, suppressing synthesis of its protein near 
the posterior end of the egg. Pumilio does not function 
in the absence of the Nanos protein, and the gradient of 
Nanos expression confines the activity of both proteins 
to the posterior region. Translational repression of the 
hunchback gene leads to degradation of hunchback 
mRNA near the posterior end. However, lack of Bicoid 
in the posterior leads to expression of caudal. In this 
way, the Hunchback and Caudal proteins become asym- 
metrically distributed in the egg. 

Segmentation Genes Gap genes, pair-rule genes, 
and segment polarity genes, three subclasses of seg- 
mentation genes in Drosophila, are activated at suc- 
cessive stages of embryonic development. Expression of 
the gap genes is generally regulated by the products of 
one or more maternal genes. At least some of the gap 
genes encode transcription factors that affect the ex- 
pression of other segmentation or (later) homeotic 
genes. 

One well-characterized segmentation gene is 
fushi tarazu (ftz), of the pair-rule subclass. When ftz 
is deleted, the embryo develops 7 segments instead of 
the normal 14, each segment twice the normal width. 
The Fushi-tarazu protein (Ftz) is a transcription 
activator with a homeodomain. The mRNAs and proteins 
derived from the normal ftz gene accumulate in a 
striking pattern of seven stripes that encircle the 
posterior two-thirds of the embryo (Fig. 28-41). The 
stripes demarcate the positions of segments that de- 
velop later; these segments are eliminated if ftz function 
is lost. The Ftz protein and a few similar regulatory 
proteins directly or indirectly regulate the expression 
of vast numbers of genes in the continuing develop- 
mental cascade. 

Homeotic Genes A set of 8 to 11 homeotic genes di- 
rects the formation of particular structures at specific 
locations in the body plan. These genes are now 
more commonly referred to as Hox genes, the term 
derived from "homeobox," the conserved gene sequence 
that encodes the homeodomain and is present in all of 
these genes. Despite the name, these are not the only 
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FIGURE 28-41 Distribution of the fushi tarazu (ftz) gene product in 
early Drosophila embryos, (a) In the normal embryo, the gene product 
can be detected in seven bands around the circumference of the 
embryo (shown schematically). These bands (b) appear as dark spots 
(generated by a radioactive label) in a cross-sectional autoradiograph 
and (c) demarcate the anterior margins of the segments in the late 
embryo (marked in red). 



development-related proteins to include a homeodomain 
(for example, the bicoid gene product described above 
has a homeodomain), and "Hox" is more a functional 
than a structural classification. The Hox genes are or- 
ganized in genomic clusters. Drosophila has one such 
cluster and mammals have four (Fig. 28-42). The genes 
in these clusters are remarkably similar from nematodes 
to humans. In Drosophila, each of the Hox genes is ex- 
pressed in a particular segment of the embryo and con- 
trols the development of the corresponding part of the 
mature fly. The terminology used to describe Hox genes 
can be confusing. They have historical names in the fruit 
fly (for example, ultrabithorax'), whereas in mammals 
they are designated by two competing systems based on 
lettered (A, B, C, D) or numbered (1, 2, 3, 4) clusters. 

Loss of Hox genes in fruit flies by mutation or dele- 
tion causes the appearance of a normal appendage or 
body structure at an inappropriate body position. An im- 
portant example is the ultrabithorax (ubx) gene. When 
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Ubx function is lost, the first abdominal segment devel- 
ops incorrectly, having the structure of the third thoracic 
segment. Other known homeotic mutations cause the 
formation of an extra set of wings, or two legs at the po- 
sition in the head where the antennae are normally found 
(Fig. 28-43). The Hox genes often span long regions of 
DNA. The ubx gene, for example, is 77,000 bp long. More 
than 73,000 bp of this gene are in introns, one of which is 
more than 50,000 bp long. Transcription of the ubx gene 
takes nearly an hour. The delay this imposes on ubx gene 
expression is believed to be a timing mechanism involved 
in the temporal regulation of subsequent steps in devel- 
opment. Many Hox genes are further regulated by miR- 
NAs encoded by intergenic regions of the Hox gene 
clusters. All of the Hox gene products are themselves 
transcription factors that regulate the expression of an 
array of downstream genes. Identification of these down- 
stream targets is ongoing. 



Many of the principles of development outlined 
above apply to other eukaryotes, from nematodes to 
humans. Some of the regulatory proteins are con- 
served. For example, the products of the homeobox- 
containing genes HOXA7 in mouse and antennapedia 
in fruit fly differ in only one amino acid residue. Of 
course, although the molecular regulatory mecha- 
nisms may be similar, many of the ultimate develop- 
mental events are not conserved (humans do not have 
wings or antennae). The different outcomes are 
brought about by differences in the downstream tar- 
get genes controlled by the Hox genes. The discovery 
of structural determinants with identifiable molecular 
functions is the first step in understanding the molec- 
ular events underlying development. As more genes 
and their protein products are discovered, the bio- 
chemical side of this vast puzzle will be elucidated in 
increasingly rich detail. 
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FIGURE 28-42 The Hox gene clusters and their effects on development. 

(a) Each Hox gene in the fruit fly is responsible for the development of 
structures in a defined part of the body and is expressed in defined re- 
gions of the embryo, as shown here with color coding, (b) Drosophila 
has one Hox gene cluster, the human genome has four. Many of these 
genes are highly conserved in multicellular animals. Evolutionary rela- 
tionships, as indicated by sequence alignments, between genes in the 
Drosophila Hox gene cluster and those in the mammalian Hox gene 
clusters are shown by dashed lines. Similar relationships among the 
four sets of mammalian Hox genes are indicated by vertical alignment. 
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FIGURE 28-43 Effects of mutations in Hox genes in Drosophila. 

(a) Normal head, (b) Homeotic mutant {antennapedia) in which anten- 
nae are replaced by legs, (c) Normal body structure, (d) Homeotic 

Our discussion of developmental regulation brings us 
full circle, back to a biochemical beginning — both 
figuratively and literally. Evolution appropriately pro- 
vides the first and last words of text in this book. If 
evolution is to generate the kind of changes in an or- 
ganism that we associate with a different species, it is 

BOX 28-1 Of Fins, Wings, Beaks, and Things 

South America has several species of seed-eating 
finches, commonly called grassquits. About 3 million 
years ago, a small group of grassquits, of a single 
species, took flight from the continent's Pacific coast. 
Perhaps driven by a storm, they lost sight of land and 
traveled nearly 1,000 km. Small birds such as these 
might easily have perished on such a journey, but the 
smallest of chances brought this group to a newly 
formed volcanic island in an archipelago later to be 
known as the Galapagos. It was a virgin landscape wi , 
untapped plant and insect food sources, and the newly 
arrived finches survived. Over the years, new islands 
formed and were colonized by new plants and insects— 
and by the finches. The birds exploited the new re- 
sources on the islands, and groups of birds gradually 
specialized and diverged into new species. By the time 
Charles Darwin stepped onto the islands in 1835, there 
were many different finch species to be found on the 
various islands of the archipelago, feeding on seeds, 
fruits, insects, pollen, or even blood. 

The diversity of living creatures was a source of won- 
der for humans long before scientists sought to under- 
stand its origins. The extraordinary insight handed down 
to us by Darwin, inspired in part by his encounter with 
the Galapagos finches, provided a broad explanation for 



(d) 

mutant {bithorax) in which a segment has developed incorrectly to 
produce an extra set of wings. 



the developmental program that must be affected. 
Developmental and evolutionary processes are closely 
allied, each informing the other (Box 28-1). The con- 
tinuing study of biochemistry has everything to do 
with enriching the future of humanity and under- 
standing our origins. 



the existence of organisms with a vast array of appear- 
ances and characteristics. It also gave rise to many ques- 
tions about the mechanisms underlying evolution. 
Answers to those questions have started to appear, first 
through the study of genomes and nucleic acid metabo- 
lism in the last half of the twentieth century, and more re- 
cently through an 6 n^ neld nicknamed evo-devo — a 
blend of evofr J jnary and developmental biology. 

in its modern synthesis, the theory of evolution has 
two main elements: mutations in a population generate 
genetic diversity; natural selection then acts on this di- 
versity to favor individuals with more useful genomic 
tools, and to disfavor others. Mutations occur at signifi- 
cant rates in every individual's genome, in every cell 
(Chapters 8 and 25). Advantageous mutations in single- 
celled organisms or in the germ line of multicellular or- 
ganisms can be inherited, and they are more likely to be 
inherited (that is, are passed on to greater numbers of 
offspring) if they confer an advantage. It is a straightfor- 
ward scheme. But many have wondered whether it is 
enough to explain, say, the many different beak shapes 
in the Galapagos finches, or the diversity of size and 
shape among mammals. Until recent decades, there 
were several widely held assumptions about the evolu- 
tionary process: that many mutations and new genes 
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would be needed to bring about a new physical struc- 
ture; that more-complex organisms would have larger 
genomes, and that very different species would have few 
genes in common. All of these assumptions were wrong. 

Modern genomics has revealed that the human 
genome contains fewer genes than expected — not many 
more than the fruit fly genome, and fewer than some am- 
phibian genomes. The genomes of every mammal, from 
mouse to human, are surprisingly similar in the number, 
types, and chromosomal arrangement of genes. Mean- 
while, evo-devo is telling us how complex and very differ- 
ent creatures can evolve within these genomic realities. 

The kinds of mutant organisms shown in Figure 
28-43 were studied by the English biologist William 
Bateson in the late nineteenth century. Bateson used 
his observations to challenge the Darwinian notion that 
evolutionary change would have to be gradual. Recent 
studies of the genes that control organismal develop- 
ment have put an exclamation point on Bateson's 
ideas. Subtle changes in regulatory patterns during 
development, reflecting just one or a few mutations, 
can result in startling physical changes and fuel sur- 
prisingly rapid evolution. 

The Galapagos finches provide a wonderful example 
of the link between evolution and development. There 
are at least 14 (some specialists list 15) species of Gala- 
pagos finches, distinguished in large measure by their 
beak structure. The ground finches, for example, have 
broad, heavy beaks adapted to crushing large, hard 
seeds. The cactus finches have longer, slender beaks 
ideal for probing cactus fruits and flowers (Fig. 1). 
Clifford Tabin and colleagues carefully surveyed a set of 
genes expressed during avian craniofacial development. 
They identified a single gene, Bmp4, whose expression 
level correlated with formation of the more robust beaks 
of the ground finches. More robust beaks were also 
formed in chicken embryos when high levels of Bmp4 
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were artificially expressed in the appropriate tissues, 
confirming the importance of Bmp4. In a similar study, 
the formation of long, slender beaks was linked to the 
expression of calmodulin (see Fig. 12-11) in particular 
tissues at appropriate developmental stages. Thus, ma- 
jor changes in the shape and function of the beak can be 
brought about by subtle changes in the expression of 
just two genes involved in developmental regulation. 
Very few mutations are required, and the needed muta- 
tions affect regulation. New genes are not required. 

The system of regulatory genes that guides devel- 
opment is remarkably conserved among all vertebrates. 
Elevated expression of Bmp4 in the right tissue at the 
right time leads to more robust jaw parts in zebrafish. 
The same gene plays a key role in tooth development in 
mammals. The development of eyes is triggered by the 
expression of a single gene, Pax6, in fruit flies and in 
mammals. The mouse Pax6 gene will trigger the devel- 
opment of fruit fly eyes in the fruit fly, and the fruit fly 
Pax6 gene will trigger the development of mouse eyes 
in the mouse. In each organism, these genes are part of 
the much larger regulatory cascade that ultimately cre- 
ates the correct structures in the correct locations in 
each organism. The cascade is ancient; for example, the 
Hox genes (described in the text) have been part of the 
developmental program of multicellular eukaryotes for 
more than 500 million years. Subtle changes in the cas- 
cade can have large effects on development, and thus 
on the ultimate appearance of the organism. These 
same subtle changes can fuel remarkably rapid evolu- 
tion. For example, the 400 to 500 described species of 
cichlids (spiny-finned fish) in Lake Malawi and Lake 
Victoria on the African continent are all derived from 
one or a few populations that colonized each lake in the 
past 100,000 to 200,000 years. The Galapagos finches 
simply followed a path of evolution and change that liv- 
ing creatures have been traveling for billions of years. 
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FIGURE 1 Evolution of new beak structures to exploit new food 
sources. In the Galapagos finches, the different beak structures of the 
cactus finch and the large ground finch, which feed on different, 



specialized food sources, were produced to a large extent by a few 
mutations that altered the timing and level of expression of just two 
genes: those encoding calmodulin (CaM) and Bmp4. 
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SUMMARY 28.3 Regulation of Gene 
Expression in 
Eukaryotes 

■ In eukaryotes, positive regulation is more 
common than negative regulation, and 
transcription is accompanied by large changes 
in chromatin structure. 

■ Promoters for Pol II typically have a TATA box and 
Inr sequence, as well as multiple binding sites for 
transcription activators. The latter sites, sometimes 
located hundreds or thousands of base pairs away 
from the TATA box, are called upstream activator 
sequences in yeast and enhancers in higher 
eukaryotes. 

■ Large complexes of proteins are generally 
required to regulate transcriptional activity. The 
effects of transcription activators on Pol II are 
mediated by coactivator protein complexes such 
as mediator. The modular structures of the activa- 
tors have distinct activation and DNA-binding 
domains. Other protein complexes, including 
histone acetyltransferases and ATP-dependent 
complexes such as SWI/SNF and NURF, 
reversibly remodel and modify chromatin 
structure. 

■ Hormones affect the regulation of gene expression 
in one of two ways. Steroid hormones interact 
directly with intracellular receptors that are 
DNA-binding regulatory proteins; binding of the 
hormone has either positive or negative effects on 
the transcription of genes targeted by the hormone. 
Nonsteroid hormones bind to cell-surface receptors, 
triggering a signaling pathway that can lead to 
phosphorylation of a regulatory protein, affecting 
its activity. 

■ RNA-mediated regulation plays an important role 
in eukaryotic gene expression, with the range of 
known mechanisms expanding. 

■ Development of a multicellular organism presents 
the most complex regulatory challenge. The fate 
of cells in the early embryo is determined by 
establishment of anterior-posterior and dorsal- 
ventral gradients of proteins that act as 
transcription activators or translational 
repressors, regulating the genes required for 
the development of structures appropriate to 

a particular part of the organism. Sets of 
regulatory genes operate in temporal and spatial 
succession, transforming given areas of 
an egg cell into predictable structures in the 
adult organism. 



Key Terms 

Terms in bold are defined in the glossary 
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induction 1116 
repression 1116 
specificity factor 1117 
repressor 1117 
activator 1117 
operator 1117 
negative regulation 1117 
positive regulation 1118 
operon 1118 
helix- turn-helix 1122 
zinc finger 1122 
homeodomain 1124 
homeobox 1124 
leucine zipper 1124 
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catabolite repression 1126 
cAMP receptor protein 

(CRP) 1126 
regulon 1127 
transcription 

attenuation 1128 
translational 

repressor 1131 
stringent response 1132 
riboswitch 1133 
phase variation 1 1 34 
chromatin remodeling 
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histone acetyltransferases 

(HATs) 1137 
SWI/SNF 1138 
enhancers 1139 



upstream activator 

sequences (UASs) 1139 
transcription 

activators 1139 
coactivators 1139 
basal transcription 

factors 1139 
high mobility group 

(HMG) 1139 
mediator 1139 
preinitiation complex 

(PIC) 1140 
TATA-binding protein 

(TBP) 1140 
hormone response 

elements (HREs) 1143 
micro-RNAs 

(miRNAs) 1145 
RNA interference 

(RNAi) 1145 
ncRNAs 1146 
polarity 1147 
metamerism 1 147 
morphogens 1147 
maternal genes 1147 
maternal mRNAs 1 147 
segmentation genes 1147 
gap genes 1147 
pair-rule genes 1147 
segment polarity 

genes 1147 
homeotic genes 1148 
Hox genes 1150 
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Problems 

1. Effect of mRNA and Protein Stability on Regulation 

E coli cells are growing in a medium with glucose as the sole 
carbon source. Tryptophan is suddenly added. The cells con- 
tinue to grow, and divide every 30 min. Describe (qualita- 
tively) how the amount of tryptophan synthase activity in the 
cells changes with time under the following conditions: 

(a) The trp mRNA is stable (degraded slowly over many 
hours). 

(b) The trp mRNA is degraded rapidly, but tryptophan 
synthase is stable. 

(c) The trp mRNA and tryptophan synthase are both de- 
graded rapidly. 

2. Negative Regulation Describe the probable effects on 
gene expression in the lac operon of a mutation in (a) the 
lac operator that deletes most of Oi; (b) the lad gene that 
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inactivates the repressor; and (c) the promoter that alters 
the region around position - 10. 

3. Specific DNA Binding by Regulatory Proteins A typi- 
cal bacterial repressor protein discriminates between its spe- 
cific DNA binding site (operator) and nonspecific DNA by a 
factor of 10 4 to 10 6 About 10 molecules of repressor per cell 
are sufficient to ensure a high level of repression. Assume that 
a very similar repressor existed in a human cell, with a similar 
specificity for its binding site. How many copies of the repres- 
sor would be required to elicit a level of repression similar to 
that in the bacterial cell? (Hint: The E. coli genome contains 
about 4 6 million bp; the human haploid genome has about 
3 2 billion bp ) 

4. Repressor Concentration in E. coli The dissociation 
constant for a particular repressor-operator complex is very 
low, about 10~ 13 m. AnE coli cell (volume 2 X 10~ 12 mL) con- 
tains 10 copies of the repressor. Calculate the cellular concen- 
tration of the repressor protein. How does this value compare 
with the dissociation constant of the repressor-operator com- 
plex? What is the significance of this answer? 

5. Catabolite Repression E coli cells are growing in a 
medium containing lactose but no glucose. Indicate whether 
each of the following changes or conditions would increase, 
decrease, or not change the expression of the lac operon. It 
may be helpful to draw a model depicting what is happening in 
each situation. 

(a) Addition of a high concentration of glucose 

(b) A mutation that prevents dissociation of the Lac re- 
pressor from the operator 

(c) A mutation that completely inactivates jS-galactosidase 

(d) A mutation that completely inactivates galactoside 
permease 

(e) A mutation that prevents binding of CRP to its binding 
site near the lac promoter 

6. IVanscription Attenuation How would transcription of 
the E. coli trp operon be affected by the following manipula- 
tions of the leader region of the trp mRNA? 

(a) Increasing the distance (number of bases) between 
the leader peptide gene and sequence 2 

(b) Increasing the distance between sequences 2 and 3 

(c) Removing sequence 4 

(d) Changing the two Trp codons in the leader peptide 
gene to His codons 

(e) Eliminating the rib osome -binding site for the gene 
that encodes the leader peptide 

(f) Changing several nucleotides in sequence 3 so that it 
can base-pair with sequence 4 but not with sequence 2 

7. Repressors and Repression How would the SOS re- 
sponse in E coli be affected by a mutation in the lexA gene 
that prevented autocatalytic cleavage of the LexA protein? 

8. Regulation by Recombination In the phase variation 
system of Salmonella, what would happen to the cell if the 
Hin recombinase became more active and promoted recombi- 
nation (DNA inversion) several times in each cell generation? 



9. Initiation of Transcription in Eukaryotes A new RNA 

polymerase activity is discovered in crude extracts of cells de- 
rived from an exotic fungus. The RNA polymerase initiates 
transcription only from a single, highly specialized promoter. 
As the polymerase is purified its activity declines, and the pu- 
rified enzyme is completely inactive unless crude extract is 
added to the reaction mixture. Suggest an explanation for 
these observations. 

10. Functional Domains in Regulatory Proteins A 

biochemist replaces the DNA-binding domain of the yeast 
Gal4 protein with the DNA-binding domain from the Lac re- 
pressor, and finds that the engineered protein no longer 
regulates transcription of the GAL genes in yeast. Draw a 
diagram of the different functional domains you would ex- 
pect to find in the Gal4 protein and in the engineered pro- 
tein Why does the engineered protein no longer regulate 
transcription of the GAL genes? What might be done to the 
DNA-binding site recognized by this chimeric protein to 
make it functional in activating transcription of GAL 
genes? 

11. Nucleosome Modification during Transcriptional 
Activation To prepare genomic regions for transcription, 
certain histones in the resident nucleosomes are acetylated 
and methylated at specific locations, Once transcription is no 
longer needed, these modifications need to be reversed. In 
mammals, the methylation of Arg residues in histones is re- 
versed by peptidylarginine deiminases (PADIs). The reaction 
promoted by these enzymes does not yield unmethylated argi- 
nine. Instead, it produces citrulline residues in the histone. 
What is the other product of the reaction? Suggest a mecha- 
nism for this reaction. 

12. Inheritance Mechanisms in Development A Droso- 
phila egg that is bcd~/bcd~ may develop normally, but the 
adult fruit fly will not be able to produce viable offspring. Ex- 
plain. 

Biochemistry on the Internet 

13. TATA-Binding Protein and the TATA Box To exam 
ine the roles of hydrogen bonds and hydrophobic interactions 
between transcription factors and DNA, go to FirstGlance in 
Jmol at http://firstglance.jmol.org and enter the PDB ID 
1TGH. This file models the interactions between a human 
TATA-binding protein and a segment of double-stranded 
DNA. Once the structure loads, click the "Spin" button to stop 
the molecule from rotating. When the molecule has reloaded, 
click the "Contacts" link. With the radio button for "Chains" 
selected, click on any part of the protein (Chain A, displays in 
blue) to select it as the target. Click "Show Atoms Contacting 
Target" and, in the list of contacts to display, check only 
"Show putatively hydrogen-bonded non-water" to display hy- 
drogen bonds between the protein and the TATA box DNA. 
Then click on the rightmost option for viewing images (Maxi- 
mum detail: Target & Contacts Balls and Sticks, Colored by 
Element). With this view you should be able to use the zoom 
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and rotate controls and mouse clicks to answer the following 
questions. 

(a) Which of the base pairs in the DNA form hydrogen 
bonds with the protein? Which of these contribute to the spe- 
cific recognition of the TATA box by this protein? (Hydrogen- 
bond length between hydrogen donor and hydrogen acceptor 
ranges from 2.5 to 3.3 A.) 

(b) Which amino acid residues in the protein interact with 
these base pairs? 

(c) What is the sequence of the DNA in this model and 
which portions of the sequence are recognized by the TATA- 
binding protein? 

(d) Examine the hydrophobic interactions in this com- 
plex. Are they rare or numerous? To answer this question, 
click on "Return to contacts" and check the option to "Show 
hydrophobic (apolar van der Waals) interactions." 



Data Analysis Problem — 

14. Engineering a Genetic Toggle Switch in Escherichia 
coli Gene regulation is often described as an "on or off 
phenomenon — a gene is either fully expressed or not expressed 
at all. In fact, repression and activation of a gene involve ligand- 
binding reactions, so genes can show intermediate levels of 
expression when intermediate levels of regulatory molecules 
are present. For example, for the E. coli lac operon, consider 
the binding equilibrium of the Lac repressor, operator DNA, 
and inducer (see Fig. 28-7). Although this is a complex, coop- 
erative process, it can be approximately modeled by the fol- 
lowing reaction (R is repressor; IPTG is the inducer 
isopropyl-/3-D-thiogalactoside) : 

R + IPTG ; R • IPTG 

Free repressor, R, binds to the operator and prevents tran- 
scription of the lac operon; the R • IPTG complex does not 
bind to the operator and thus transcription of the lac operon 
can proceed. 

(a) Using Equation 5-8, we can calculate the relative ex- 
pression level of the proteins of the lac operon as a function of 
[IPTG] t Use this calculation to determine over what range of 
[IPTG] the expression level would vary from 10% to 90% 

(b) Describe qualitatively the level of lac operon proteins 
present in an E coli cell before, during, and after induction 
with IPTG. You need not give the amounts at exact times — -just 
indicate the general trends. 

Gardner, Cantor, and Collins (2000) set out to make a "ge- 
netic toggle switch" — a gene-regulatory system with two key 
characteristics of a light switch. (A) It has only two states: it 
is either fully on or fully off; it is not a dimmer switch. In bio- 
chemical terms, the target gene or gene system (operon) is ei- 
ther fully expressed or not expressed at all; it cannot be 
expressed at an intermediate level. (B) Both states are stable: 
although you must use a finger to flip the light switch from one 
state to the other, once you have flipped it and removed your 
finger, the switch stays in that state. In biochemical terms, ex- 
posure to an inducer or some other signal changes the expres- 



sion state of the gene or operon, and it remains in that state 
once the signal is removed. 

(c) Explain how the lac operon lacks both characteristics 
A and B. 

To make their "toggle switch," Gardner and coworkers 
constructed a plasmid from the following components: 

OPi ac The operator-promoter region of the E. coli lac operon 

OP A The operator-promoter region of A phage 

lad The gene encoding the lac repressor protein, Lad. In 
the absence of IPTG, this protein strongly represses OP lac ; in 
the presence of IPTG, it allows full expression from OPi ac . 

rep ts The gene encoding a temperature-sensitive mutant A 
repressor protein, rep ts . At 37 °C this protein strongly re- 
presses OP A ; at 42 °C it allows full expression from OP A . 

GFP The gene for green fluorescent protein (GFP) , a highly 
fluorescent reporter protein (see Fig. 9-15) 

T Transcription terminator 

The investigators arranged these components (see figure be- 
low) so that the two promoters were reciprocally repressed: 
OP lac controlled expression of rep ts , and OP A controlled expres- 
sion of lad. The state of this system was reported by the expres- 
sion level of GFP, which was also under the control of OPi ac , 



lac represssor 




(d) The constructed system has two states: GFP-on (high 
level of expression) and GFP-off (low level of expression) . For 
each state, describe which proteins are present and which pro- 
moters are being expressed. 

(e) Treatment with IPTG would be expected to toggle the 
system from one state to the other. From which state to 
which? Explain your reasoning. 

(f) Treatment with heat (42 °C) would be expected to 
toggle the system from one state to the other. From which 
state to which? Explain your reasoning. 

(g) Why would this plasmid be expected to have charac- 
teristics A and B as described above? 

To confirm that their construct did indeed exhibit these 
characteristics, Gardner and colleagues first showed that, 
once switched, the GFP expression level (high or low) was sta- 
ble for long periods of time (characteristic B). Next, they 
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measured GFP level at different concentrations of the inducer 
IPTG, with the following results. 




They noticed that the average GFP expression level was inter- 
mediate at concentration X of IPTG. However, when they 
measured the GFP expression level in individual cells at 
[IPTG] = X, they found either a high level or a low level of 
GFP — no cells showed an intermediate level. 

(h) Explain how this finding demonstrates that the sys- 
tem has characteristic A. What is happening to cause the bi- 
modal distribution of expression levels at [IPTG] = X? 

Reference 

Gardner, T.S., Cantor, C.R., & Collins, J.J. (2000) Construction of 
a genetic toggle switch in Escherichia coli. Nature 403, 339-342. 
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a 

ABC transporter: A member of a large family 
of membrane transporters with sequences that 
make up /I TP-binding cassettes These trans- 
porters move a large variety of substrates, 
including inorganic ions, lipids, and nonpolar 
drugs, outward across the plasma membrane, 
using ATP as the energy source, 
absolute configuration: The configuration of 
four different substituent groups around an 
asymmetric carbon atom, in relation to D- and 
L-glyceraldehyde. 

absorption: Transport of the products of 
digestion from the intestinal tract into the 
blood. 

acceptor control: Regulation of the rate of 
respiration by the availability of ADP as phos- 
phate group acceptor 

accessory pigments: Visible light-absorbing 
pigments (carotenoids, xanthophylls, and 
phycobilins) in plants and photosynthetic 
bacteria that complement chlorophylls in 
trapping energy from sunlight 
acid dissociation constant: The dissociation 
constant (XJ of an acid, describing its dissoci- 
ation into its conjugate base and a proton, 
acidosis: A metabolic condition in which the 
capacity of the body to buffer H + is diminished; 
usually accompanied by decreased blood pH. 
actin: A protein that makes up the thin fila- 
ments of muscle; also an important component 
of the cytoskeleton of many eukaryotic cells, 
activation energy (AG*): The amount of 
energy (in joules) required to convert all the 
molecules in 1 mol of a reacting substance 
from the ground state to the transition state, 
activator: (1) A DNA-binding protein that 
positively regulates the expression of one 
or more genes; that is, transcription rates 
increase when an activator is bound to the 
DNA (2) A positive modulator of an 
allosteric enzyme. 

active site: The region of an enzyme surface 
that binds the substrate molecule and 
catalytically transforms it; also known as 
the catalytic site. 

active transport: Energy-requiring transport 
of a solute across a membrane in the direction 
of increasing concentration, 
activity: The true thermodynamic activity or 
potential of a substance, as distinct from its 
molar concentration 

activity coefficient: The factor by which the 
numerical value of the concentration of a 
solute must be multiplied to give the true 
thermodynamic activity of that solute, 
acyl phosphate: Any molecule with the 
general chemical form R— C — 0 — OPO?~. 

II 

0 



adenosine 3,5-cyclic monophosphate: 

See cyclic AMP. 

adenosine diphosphate: See ADP 
adenosine triphosphate: See ATP. 
5-adenosylmethionine (adoMet): An enzy- 
matic cofactor involved in methyl group 
transfers. 

adipocyte: An animal cell specialized for the 
storage of fats (triacylglycerols) . 
adipose tissue: Connective tissue specialized 
for the storage of large amounts of triacylglyc- 
erols See also brown adipose tissue; white 
adipose tissue. 

ADP (adenosine diphosphate): A ribonu- 
cleoside 5 '-diphosphate serving as phosphate 
group acceptor in the cell energy cycle, 
aerobe: An organism that lives in air and uses 
oxygen as the terminal electron acceptor in 
respiration. 

aerobic: Requiring or occurring in the 
presence of oxygen. 

agonist: A compound, typically a hormone or 
neurotransmitter, that elicits a physiological 
response when it binds to its specific receptor, 
alcohol fermentation: See ethanol 
fermentation. 

aldose: A simple sugar in which the carbonyl 
carbon atom is an aldehyde; that is, the 
carbonyl carbon is at one end of the carbon 
chain. 

alkalosis: A metabolic condition in which the 
capacity of the body to buffer OH~ is dimin- 
ished; usually accompanied by an increase in 
blood pH. 

allosteric enzyme: A regulatory enzyme with 
catalytic activity modulated by the noncova- 
lent binding of a specific metabolite at a site 
other than the active site, 
allosteric protein: A protein (generally with 
multiple subunits) with multiple ligand- 
binding sites, such that ligand binding at one 
site affects ligand binding at another 
allosteric site: The specific site on the 
surface of an allosteric enzyme molecule to 
which the modulator or effector molecule is 
bound. 

a helix: A helical conformation of a polypep- 
tide chain, usually right-handed, with maximal 
intrachain hydrogen bonding; one of the most 
common secondary structures in proteins 
a oxidation: An alternative path for the oxida- 
tion of /3-methyl fatty acids in peroxisomes. 
Ames test: A simple bacterial test for carcino- 
genicity, based on the assumption that 
carcinogens are mutagens 
amino acid activation: ATP-dependent 
enzymatic esterification of the carboxyl group 
of an amino acid to the 3'-hydroxyl group of its 
corresponding tRNA 



amino acids: a-Amino-substituted carboxylic 
acids, the building blocks of proteins. 
aminoacyl-tRNA: An aminoacyl ester of a 
tRNA 

aminoacyl-tRNA synthetases: Enzymes that 
catalyze synthesis of an aminoacyl-tRNA at the 
expense of ATP energy, 
amino-terminal residue: The only amino 
acid residue in a polypeptide chain with a free 
a-amino group; defines the amino terminus of 
the polypeptide. 

aminotransferases: Enzymes that catalyze 
the transfer of amino groups from a-amino to 
a-keto acids; also called transaminases, 
ammonotelic: Excreting excess nitrogen in 
the form of ammonia. 

amphibolic pathway: A metabolic pathway 
used in both catabolism and anabolism. 
amphipathic: Containing both polar and 
nonpolar domains 

amphitropic proteins: Proteins that asso- 
ciate reversibly with the membrane and thus 
can be found in the cytosol, in the membrane, 
or in both places. 

ampholyte: A substance that can act as either 
a base or an acid. 

amphoteric: Capable of donating and 
accepting protons, thus able to serve as an 
acid or a base. 

anabolism: The phase of intermediary metab- 
olism concerned with the energy-requiring 
biosynthesis of cell components from smaller 
precursors 

anaerobe: An organism that lives without 
oxygen. Obligate anaerobes die when exposed 
to oxygen. 

anaerobic: Occurring in the absence of air or 
oxygen 

analyte: A molecule to be analyzed by mass 
spectrometry. 

anammox: Anaerobic oxidation of ammonia 
to N 2 , using nitrite as electron acceptor; carried 
out by specialized chemolithotrophic bacteria, 
anaplerotic reaction: An enzyme-catalyzed 
reaction that can replenish the supply of 
intermediates in the citric acid cycle. 

angstrom (A): A unit of length (1CT 8 cm) 
used to indicate molecular dimensions. 

anhydride: The product of the condensation 
of two carboxyl or phosphate groups in which 
the elements of water are eliminated to form 
a compound with the general structure 

R — X — 0 — X — R, where X is either carbon 
II II 
O O 

or phosphorus. 

anion-exchange resin: A polymeric resin 
with fixed cationic groups, used in the chro- 
matographic separation of anions 

[c,] 
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anomeric carbon: The carbon atom in a 
sugar at the new stereocenter formed when a 
sugar cyclizes to form a hemiacetal. This is the 
carbonyl carbon of aldehydes and ketones 
anomers: Two stereoisomers of a given sugar 
that differ only in the configuration about the 
carbonyl (anomeric) carbon atom, 
antagonist: A compound that interferes with 
the physiological action of another substance 
(the agonist), usually at a hormone or neuro- 
transmitter receptor, 

antibiotic: One of many different organic 
compounds that are formed and secreted by 
various species of microorganisms and plants, 
are toxic to other species, and presumably 
have a defensive function 
antibody: A defense protein synthesized by 
the immune system of vertebrates. See also 
immunoglobulin. 

anticodon: A specific sequence of three 
nucleotides in a tRNA, complementary to a 
codon for an amino acid in an mRNA. 
antigen: A molecule capable of eliciting the 
synthesis of a specific antibody in vertebrates, 
antiparallel: Describes two linear polymers 
that are opposite in polarity or orientation 
antiport: Cotransport of two solutes across a 
membrane in opposite directions, 
apoenzyme: The protein portion of an 
enzyme, exclusive of any organic or inorganic 
cofactors or prosthetic groups that might be 
required for catalytic activity, 
apolipoprotein: The protein component of a 
lipoprotein. 

apoprotein: The protein portion of a protein, 
exclusive of any organic or inorganic cofactors 
or prosthetic groups that might be required for 
activity. 

apoptosis: (app'-a-toe'-sis) Programmed cell 
death in which a cell brings about its own death 
and lysis, in response to a signal from outside 
or programmed in its genes, by systematically 
degrading its own macromolecules. 
aptamer: Oligonucleotide that binds specifi- 
cally to one molecular target, usually selected 
by an iterative cycle of affinity-based enrich- 
ment (SELEX). 

Arena ea: One of the five kingdoms of living 
organisms; includes many species that thrive 
in extreme environments of high ionic 
strength, high temperature, or low pH. 
arrestins: A family of proteins that bind to the 
phosphorylated carboxyl- terminal region of 
G protein-coupled receptors, preventing their 
interactions with G proteins and thereby termi- 
nating the signal through those receptors, 
asymmetric carbon atom: A carbon atom 
that is covalently bonded to four different 
groups and thus may exist in two different 
tetrahedral configurations, 
ATP (adenosine triphosphate): A ribonu- 
cleoside 5 '-triphosphate functioning as a 
phosphate group donor in the cellular energy 
cycle; carries chemical energy between 
metabolic pathways by serving as a shared 
intermediate coupling endergonic and exer- 
gonic reactions. 



ATPase: An enzyme that hydrolyzes ATP to 
yield ADP and phosphate, usually coupled to 
a process requiring energy 
ATP synthase: An enzyme complex that 
forms ATP from ADP and phosphate during 
oxidative phosphorylation in the inner mito- 
chondrial membrane or the bacterial plasma 
membrane, and during photophosphorylation 
in chloroplasts. 

attenuator: An RNA sequence involved in 
regulating the expression of certain genes; 
functions as a transcription terminator, 
autophosphorylation: Strictly, the phosphoryl- 
ation of an amino acid residue in a protein that 
is catalyzed by the same protein molecule; often 
extended to include phosphorylation of one 
subunit of a homodimer by the other subunit, 
autotroph: An organism that can synthesize 
its own complex molecules from very simple 
carbon and nitrogen sources, such as carbon 
dioxide and ammonia, 
auxin: A plant growth hormone, 
auxotrophic mutant (auxotroph): A 
mutant organism defective in the synthesis of 
a particular biomolecule, which must therefore 
be supplied for the organism's growth. 
Avogadro's number (AO: The number of 
molecules in a gram molecular weight (a mole) 
of any compound (6.02 X 10 23 ) 

b 

back-mutation: A mutation that causes a 
mutant gene to regain its wild-type base 
sequence 

Bacteria: One of the five kingdoms of living 
organisms; bacteria have a plasma membrane 
but no internal organelles or nucleus . 
bacteriophage: A virus capable of replicating 
in a bacterial cell; also called phage, 
basal metabolic rate: An animal's rate of 
oxygen consumption when at complete rest, 
long after a meal. 

base pair: Two nucleotides in nucleic acid chains 

that are paired by hydrogen bonding of their 

bases; for example, A with T or U, and G with C. 

BAT: See brown adipose tissue 

B cell: See B lymphocyte 

p conformation: An extended, zigzag 

arrangement of a polypeptide chain; a common 

secondary structure in proteins. 

p oxidation: Oxidative degradation of fatty 

acids into acetyl-CoA by successive oxidations 

at the /3-carbon atom. 

p turn: A type of protein secondary structure 
consisting of four amino acid residues 
arranged in a tight turn so that the polypeptide 
turns back on itself. 

bilayer: A double layer of oriented amphi- 
pathic lipid molecules, forming the basic 
structure of biological membranes The hydro- 
carbon tails face inward to form a continuous 
nonpolar phase. 

bile salts: Amphipathic steroid derivatives 
with detergent properties, participating in 
digestion and absorption of lipids. 



binding energy: The energy derived from 
noncovalent interactions between enzyme and 
substrate or receptor and ligand 
binding site: The crevice or pocket on a 
protein in which a ligand binds, 
bioassay: A method for measuring the 
amount of a biologically active substance 
(such as a hormone) in a sample by quanti- 
fying the biological response to aliquots of 
that sample. 

biocytin: The conjugate amino acid residue 
arising from covalent attachment of biotin, 
through an amide linkage, to a Lys residue 
bioinformatics: The computerized analysis 
of biological data, using methods derived from 
statistics, linguistics, mathematics, chemistry, 
biochemistry, and physics. The data are 
often nucleic acid or protein sequence or 
structural data, but can also involve experi- 
mental data from many sources, patient statis- 
tics, and materials in the scientific literature. 
Bioinformatics research focuses on methods 
for data storage, retrieval, and analysis, 
biomolecule: An organic compound normally 
present as an essential component of living 
organisms. 

biopterin: An enzymatic cofactor derived 
from pterin and involved in certain oxidation- 
reduction reactions. 

biosphere: All the living matter on or in the 

earth, the seas, and the atmosphere. 

biotin: A vitamin; an enzymatic cofactor 

involved in carboxylation reactions. 

B lymphocyte (B cell): One of a class of 

blood cells (lymphocytes), responsible for the 

production of circulating antibodies. 

bond energy: The energy required to break a 

bond. 

branch migration: Movement of the branch 
point in a branched DNA formed from two 
DNA molecules with identical sequences. See 
also Holliday intermediate, 
brown adipose tissue (BAT): Thermogenic 
adipose tissue rich in mitochondria that 
contain the uncoupling protein thermogenin, 
which uncouples electron transfer through the 
respiratory chain from ATP synthesis 
Compare white adipose tissue, 
buffer: A system capable of resisting changes 
in pH, consisting of a conjugate acid-base pair 
in which the ratio of proton acceptor to proton 
donor is near unity. 

C 

calorie: The amount of heat required to raise 
the temperature of 1.0 g of water from 14,5 
to 15.5 °C. One calorie (cal) equals 4 18 
joules (J). 

Calvin cycle: The cyclic pathway in plants 

that fixes carbon dioxide and produces triose 

phosphates, 

cAMP: See cyclic AMP. 

cAMP receptor protein (CRP): In bacteria, 

a specific regulatory protein that controls 

initiation of transcription of the genes that 

produce the enzymes required for the cell to 
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use some other nutrient when glucose is 
lacking; also called catabolite gene activator 
protein (CAP) 

CAP: See cAMP receptor protein. 

capsid: The protein coat of a virion or virus 

particle 

carbanion: A negatively charged carbon atom, 
carbocation: A positively charged carbon 
atom; also called a carbonium ion. 
carbon- assimilation reactions: Reaction 
sequences in which atmospheric C0 2 is 
converted into organic compounds 
carbon-fixation reaction: The reaction, 
catalyzed by rubisco during photosynthesis or 
by other carboxylases, in which atmospheric 
C0 2 is initially incorporated (fixed) into an 
organic compound. 

carboxyl-terminal residue: The only amino 
acid residue in a polypeptide chain with a free 
a-carboxyl group; defines the carboxyl 
terminus of the polypeptide, 
carnitine shuttle: Mechanism for moving 
fatty acids from the cytosol to the mitochon- 
drial matrix as fatty esters of carnitine, 
carotenoids: Lipid-soluble photosynthetic 
pigments made up of isoprene units, 
cascade: See enzyme cascade, 
catabolism: The phase of intermediary 
metabolism concerned with the energy- 
yielding degradation of nutrient molecules, 
catabolite gene activator protein (CAP): 
See cAMP receptor protein, 
catalytic site: See active site, 
catecholamines: Hormones, such as epineph- 
rine, that are amino derivatives of catechol 
catenane: Two or more circular polymeric 
molecules interlinked by one or more noncova- 
lent topological links, resembling the links of a 
chain 

cation-exchange resin: An insoluble polymer 
with fixed negative charges, used in the chro- 
matographic separation of cationic substances 
cDNA: See complementary DNA, 
cDNA library: DNA library consisting entirely 
of cloned cDNAs from a particular organism or 
cell type. 

cellular differentiation: The process in 
which a precursor cell becomes specialized to 
carry out a particular function, by acquiring a 
new complement of proteins and RNA, 
central dogma: The organizing principle of 
molecular biology: genetic information flows 
from DNA to RNA to protein, 
centromere: A specialized site in a chromo- 
some, serving as the attachment point for the 
mitotic or meiotic spindle 
cerebroside: Sphingolipid containing one 
sugar residue as a head group, 
channeling: The direct transfer of a reaction 
product (common intermediate) from the 
active site of an enzyme to the active site of the 
enzyme catalyzing the next step in a pathway 
chemiosmotic coupling: Coupling of ATP 
synthesis to electron transfer via a transmem- 
brane difference in charge and pH. 



chemiosmotic theory: The theory that energy 
derived from electron transfer reactions is 
temporarily stored as a transmembrane 
difference in charge and pH, which subse- 
quently drives the formation of ATP in oxidative 
phosphorylation and photophosphorylation. 
chemotaxis: A cell's sensing of and movement 
toward or away from a specific chemical agent 
chemotroph: An organism that obtains 
energy by metabolizing organic compounds 
derived from other organisms, 
chiral center: An atom with substituents 
arranged so that the molecule is not superpos- 
able on its mirror image, 
chiral compound: A compound that contains 
an asymmetric center (chiral atom or chiral 
center) and thus can occur in two nonsuper- 
posable mirror-image forms (enantiomers) . 
chlorophylls: A family of green pigments that 
function as receptors of light energy in photo- 
synthesis; magnesium-porphyrin complexes, 
chloroplasts: Chlorophyll-containing photo- 
synthetic organelles in some eukaryotic cells, 
chondroitin sulfate: One of a family of 
sulfated glycosaminoglycans, a major compo- 
nent of the extracellular matrix, 
chromatin: A filamentous complex of DNA, 
histones, and other proteins, constituting the 
eukaryotic chromosome, 
chromatography: A process in which 
complex mixtures of molecules are separated 
by many repeated partitionings between a 
flowing (mobile) phase and a stationary phase, 
chromosome: A single large DNA molecule 
and its associated proteins, containing many 
genes; stores and transmits genetic information, 
chylomicron: A plasma lipoprotein consisting 
of a large droplet of triacylglycerols stabilized 
by a coat of protein and phospholipid; carries 
lipids from the intestine to the tissues, 
chymotrypsin: A well-studied protease that 
cleaves polypeptides on the carbonyl side of 
amino acid residues with aromatic side chains, 
chymotrypsinogen: The zymogen (inactive) 
precursor of chymotrypsin. 
cis and trans isomers: See geometric isomers 
cistron: A unit of DNA or RNA corresponding 
to one gene. 

citric acid cycle: A cyclic pathway for the 
oxidation of acetyl residues to carbon dioxide, 
in which formation of citrate is the first step; 
also known as the Krebs cycle or tricarboxylic 
acid cycle 

clones: The descendants of a single cell, 
cloning: The production of large numbers of 
identical DNA molecules, cells, or organisms 
from a single ancestral DNA molecule, cell, or 
organism 

closed system: A system that exchanges 
neither matter nor energy with the surround- 
ings See also system, 
cobalamin: See coenzyme B ]2 
codon: A sequence of three adjacent 
nucleotides in a nucleic acid that codes for a 
specific amino acid. 



coenzyme: An organic cofactor required for 
the action of certain enzymes; often has a 
vitamin component, 

coenzyme A: A pantothenic acid-containing 
coenzyme that serves as an acyl group carrier 
in certain enzymatic reactions, 
coenzyme B 12 : An enzymatic cofactor derived 
from the vitamin cobalamin, involved in certain 
types of carbon skeletal rearrangements, 
cofactor: An inorganic ion or a coenzyme 
required for enzyme activity, 
cognate: Describes two biomolecules that 
normally interact; for example, an enzyme and 
its usual substrate, or a receptor and its usual 
ligand. 

cohesive ends: See sticky ends, 
cointegrate: An intermediate in the migration 
of certain DNA transposons in which the donor 
DNA and target DNA are covalently attached, 
colligative properties: Properties of a solu- 
tion that depend on the number of solute 
particles per unit volume; for example, 
freezing-point depression, 
common intermediate: A chemical compound 
common to two chemical reactions, as a product 
of one and a reactant in the other, 
competitive inhibition: A type of enzyme 
inhibition reversed by increasing the substrate 
concentration; a competitive inhibitor gener- 
ally competes with the normal substrate or 
ligand for a protein's binding site, 
complementary: Having a molecular surface 
with chemical groups arranged to interact 
specifically with chemical groups on another 
molecule, 

complementary DNA (cDNA): A DNA 

complementary to a specific mRNA, used in 
DNA cloning; usually made by reverse 
transcriptase. 

configuration: The spatial arrangement of an 
organic molecule that is conferred by the pres- 
ence of either (1) double bonds, about which 
there is no freedom of rotation, or (2) chiral 
centers, around which substituent groups are 
arranged in a specific sequence. Configurational 
isomers cannot be interconverted without 
breaking one or more covalent bonds, 
conformation: A spatial arrangement of 
substituent groups that are free to assume 
different positions in space, without breaking 
any bonds, because of the freedom of bond 
rotation 

conjugate acid-base pair: A proton donor 
and its corresponding deprotonated species; 
for example, acetic acid (donor) and acetate 
(acceptor) 

conjugated protein: A protein containing 
one or more prosthetic groups 
conjugate redox pair: An electron donor and 
its corresponding electron acceptor form; for 
example, Cu + (donor) and Cu 2+ (acceptor), 
or NADH (donor) and NAD + (acceptor) 
consensus sequence: A DNA or amino acid 
sequence consisting of the residues that most 
commonly occur at each position in a set of 
similar sequences. 
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conservative substitution: Replacement of 
an amino acid residue in a polypeptide by 
another residue with similar properties; for 
example, substitution of Glu by Asp 
constitutive enzymes: Enzymes required 
at all times by a cell and present at some 
constant level; for example, many enzymes of 
the central metabolic pathways. Sometimes 
called housekeeping enzymes, 
contig: A series of overlapping clones or a 
continuous sequence denning an uninter- 
rupted section of a chromosome, 
contour length: The length of a helical 
polymeric molecule as measured along its 
helical axis. 

cooperativity: The characteristic of an 
enzyme or other protein in which binding of 
the first molecule of a ligand changes the 
affinity for the second molecule. In positive 
cooperativity, the affinity for the second ligand 
molecule increases; in negative cooperativity, 
it decreases 

corticosteroids: Steroid hormones formed by 
the adrenal cortex 

cotransport: The simultaneous transport, by 
a single transporter, of two solutes across a 
membrane. See also antiport; symport. 
coupled reactions: Two chemical reactions 
that have a common intermediate and thus a 
means of energy transfer from one to the 
other. 

covalent bond: A chemical bond that involves 

sharing of electron pairs. 

cristae: Infoldings of the inner mitochondrial 

membrane, 

CRP: See cAMP receptor protein, 
cruciform: Secondary structure in double- 
stranded RNA or DNA in which the double 
helix is denatured at palindromic repeat 
sequences in each strand, and each separated 
strand is paired internally to form opposing 
hairpin structures See also hairpin 
cyclic AMP (cAMP): A second messenger; its 
formation in a cell by adenylyl cyclase is stimu- 
lated by certain hormones or other molecular 
signals, 

cyclic electron flow: In chloroplasts, the 
light-induced flow of electrons originating from 
and returning to photosystem I 
cyclic photophosphorylation: ATP 

synthesis driven by cyclic electron flow 
through photosystem I, 
cyclin: One of a family of proteins that acti- 
vate cyclin-dependent protein kinases and 
thereby regulate the cell cycle, 
cytochrome P-450: A family of heme- 
containing enzymes, with a characteristic 
absorption band at 450 nm, that participate 
in biological hydroxylations with 0 2 . 
cytochromes: Heme proteins serving as 
electron carriers in respiration, photosyn- 
thesis, and other oxidation-reduction 
reactions. 

cytokine: One of a family of small secreted 
proteins (such as interleukins or interferons) 
that activate cell division or differentiation by 



binding to plasma membrane receptors in 
target cells. 

cytokinesis: The final separation of daughter 

cells following mitosis. 

cytoplasm: The portion of a cell's contents 

outside the nucleus but within the plasma 

membrane; includes organelles such as 

mitochondria 

cytoskeleton: The filamentous network that 
provides structure and organization to the 
cytoplasm; includes actin filaments, micro- 
tubules, and intermediate filaments, 
cytosol: The continuous aqueous phase of the 
cytoplasm, with its dissolved solutes; excludes 
the organelles such as mitochondria 

d 

dalton: Unit of atomic or molecular weight; 

1 dalton (Da) is the weight of a hydrogen 

atom (1.66 X 10" 24 g) 

dark reactions: See carbon-assimilation 

reactions. 

deamination: The enzymatic removal of 
amino groups from biomolecules such as 
amino acids or nucleotides 
degenerate code: A code in which a single 
element in one language is specified by more 
than one element in a second language, 
dehydrogenases: Enzymes that catalyze the 
removal of pairs of hydrogen atoms from 
substrates 

deletion mutation: A mutation resulting 
from the deletion of one or more nucleotides 
from a gene or chromosome, 
denaturation: Partial or complete unfolding 
of the specific native conformation of a 
polypeptide chain, protein, or nucleic acid 
such that the function of the molecule is lost, 
denatured protein: A protein that has lost 
enough of its native conformation by exposure 
to a destabilizing agent such as heat or deter- 
gent that its function is lost, 
de novo pathway: Pathway for the synthesis 
of a biomolecule, such as a nucleotide, from 
simple precursors; as distinct from a salvage 
pathway. 

deoxyribonucleic acid: See DNA. 
deoxyribonucleotides: Nucleotides containing 
2-deoxy-D-ribose as the pentose component, 
desaturases: Enzymes that catalyze the 
introduction of double bonds into the hydro- 
carbon portion of fatty acids, 
desensitization: Universal process by which 
sensory mechanisms cease to respond after 
prolonged exposure to the specific stimulus 
they detect. 

desolvation: In aqueous solution, the release 
of bound water surrounding a solute, 
dextrorotatory isomer: A stereoisomer that 
rotates the plane of plane-polarized light 
clockwise. 

diabetes mellitus: A group of metabolic 
diseases with symptoms that result from a 
deficiency in insulin production or utilization; 
characterized by a failure in glucose transport 



from the blood into cells at normal glucose 
concentrations. 

dialysis: Removal of small molecules from a 
solution of a macromolecule by their diffusion 
through a semipermeable membrane into a 
suitably buffered solution, 
differential centrifugation: Separation of 
cell organelles or other particles of different 
size by their different rates of sedimentation in 
a centrifugal field. 

differentiation: Specialization of cell structure 
and function during growth and development, 
diffusion: The net movement of molecules in 
the direction of lower concentration, 
digestion: Enzymatic hydrolysis of major 
nutrients in the gastrointestinal system to 
yield their simpler components, 
diploid: Having two sets of genetic informa- 
tion; describes a cell with two chromosomes of 
each type. Compare haploid, 
dipole: A molecule with both positive and 
negative charges. 

diprotic acid: An acid with two dissociable 
protons. 

disaccharide: A carbohydrate consisting of 
two covalently joined monosaccharide units, 
dissociation constant: An equilibrium 
constant (AT d ) for the dissociation of a complex 
of two or more biomolecules into its compo- 
nents; for example, dissociation of a substrate 
from an enzyme. 

disulfide bond: A covalent bond involving the 
oxidative linkage of two Cys residues, from the 
same or different polypeptide chains, forming 
a cystine residue. 

DNA (deoxyribonucleic acid): A polynu- 
cleotide with a specific sequence of deoxyri- 
bonucleotide units covalently joined through 
3',5'-phosphodiester bonds; serves as the 
carrier of genetic information. 
DNA chimera: DNA containing genetic 
information derived from two different species, 
DNA chip: Informal term for a DNA 
microarray, referring to the small size of 
typical microarrays. 
DNA cloning: See cloning , 
DNA library: A collection of cloned DNA 
fragments 

DNA ligase: An enzyme that creates a phos- 
phodiester bond between the 3' end of one 
DNA segment and the 5' end of another. 
DNA looping: The interaction of proteins 
bound at distant sites on a DNA molecule so 
that the intervening DNA forms a loop. 
DNA microarray: A collection of DNA 
sequences immobilized on a solid surface, with 
individual sequences laid out in patterned 
arrays that can be probed by hybridization. 
DNA polymerase: An enzyme that catalyzes 
template-dependent synthesis of DNA from its 
deoxyribonucleoside 5 '-triphosphate 
precursors. 

DNA replicase system: The entire complex 
of enzymes and specialized proteins required 
in biological DNA replication. 
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DNA supercoiling: The coiling of DNA upon 
itself, generally as a result of bending, under- 
winding, or overwinding of the DNA helix. 
DNA transposition: See transposition, 
domain: A distinct structural unit of a 
polypeptide; domains may have separate 
functions and may fold as independent, 
compact units- 

double helix: The natural coiled conforma- 
tion of two complementary, antiparallel DNA 
chains. 

double-reciprocal plot: A plot of 1/V 0 versus 
1/[S], which allows a more accurate determina- 
tion of V max and K m than a plot of V 0 versus 
[S]; also called the Lineweaver-Burk plot. 



E'°: See standard reduction potential 
E. coli (Escherichia coli)'. A common 
bacterium found in the small intestine of 
vertebrates; the most well-studied organism, 
electrochemical gradient: The sum of the 
gradients of concentration and of electric 
charge of an ion across a membrane; the 
driving force for oxidative phosphorylation and 
photophosphorylation. 
electrochemical potential: The energy 
required to maintain a separation of charge 
and of concentration across a membrane, 
electrogenic: Contributing to an electrical 
potential across a membrane, 
electron acceptor: A substance that receives 
electrons in an oxidation-reduction reaction, 
electron carrier: A protein, such as a flavo- 
protein or a cytochrome, that can reversibly 
gain and lose electrons; functions in the 
transfer of electrons from organic nutrients to 
oxygen or some other terminal acceptor, 
electron donor: A substance that donates 
electrons in an oxidation-reduction reaction, 
electron transfer: Movement of electrons 
from electron donor to electron acceptor; 
especially, from substrates to oxygen via the 
carriers of the respiratory (electron-transfer) 
chain 

electrophile: An electron-deficient group 
with a strong tendency to accept electrons 
from an electron-rich group (nucleophile) . 
electrophoresis: Movement of charged 
solutes in response to an electrical field; often 
used to separate mixtures of ions, proteins, or 
nucleic acids. 

electroporation: Introduction of macromole- 
cules into cells after rendering the cells tran- 
siently permeable by the application of a high- 
voltage pulse, 

elongation factors: (1) Proteins that 
function in the elongation phase of eukaryotic 
transcription (2) Specific proteins required in 
the elongation of polypeptide chains by 
ribosomes. 

eluate: The effluent from a chromatographic 
column. 

enantiomers: Stereoisomers that are 
nonsuperposable mirror images of each other. 



endergonic reaction: A chemical reaction 
that consumes energy (that is, for which AG is 
positive). 

endocrine glands: Groups of cells specialized 
to synthesize hormones and secrete them into 
the blood to regulate other types of cells, 
endocytosis: The uptake of extracellular 
material by its inclusion in a vesicle (endosome) 
formed by invagination of the plasma 
membrane. 

endonuclease: An enzyme that hydrolyzes 
the interior phosphodiester bonds of a nucleic 
acid; that is, it acts at bonds other than the 
terminal bonds 

endoplasmic reticulum: An extensive 
system of double membranes in the cytoplasm 
of eukaryotic cells; it encloses secretory chan- 
nels and is often studded with ribosomes 
(rough endoplasmic reticulum) . 
endothermic reaction: A chemical reaction 
that takes up heat (that is, for which AH is 
positive) . 

end-product inhibition: See feedback 
inhibition. 

energy charge: The fractional degree to 
which the ATP/ADP/AMP system is filled with 
high-energy phosphate groups 
energy coupling: The transfer of energy from 
one process to another. 

enhancers: DNA sequences that facilitate the 
expression of a given gene; may be located a 
few hundred, or even thousand, base pairs 
away from the gene, 

enthalpy (If): The heat content of a system, 
enthalpy change (AZT): For a reaction, 
approximately equal to the difference between 
the energy used to break bonds and the energy 
gained by the formation of new ones 
entropy (£): The extent of randomness or 
disorder in a system. 

enzyme: A biomolecule, either protein or 
RNA, that catalyzes a specific chemical reac- 
tion. It does not affect the equilibrium of the 
catalyzed reaction; it enhances the rate of the 
reaction by providing a reaction path with a 
lower activation energy, 
enzyme cascade: A series of reactions, often 
involved in regulatory events, in which one 
enzyme activates another (often by phospho- 
rylation), which activates a third, and so on. 
The effect of a catalyst activating a catalyst is 
a large amplification of the signal that initiated 
the cascade. 

epigenetic: Describes any inherited charac- 
teristic of a living organism that is acquired by 
means that do not involve the nucleotide 
sequence of the parental chromosomes; for 
example, covalent modifications of histones, 
epimerases: Enzymes that catalyze the 
reversible interconversion of two epimers. 
epimers: Two stereoisomers differing in 
configuration at one asymmetric center in a 
compound having two or more asymmetric 
centers, 

epithelial cell: Any cell that forms part of the 
outer covering of an organism or organ. 



epitope: An antigenic determinant; the partic- 
ular chemical group or groups in a macromole- 
cule (antigen) to which a given antibody binds, 
epitope tag: A protein sequence or domain 
bound by some well-characterized antibody, 
equilibrium: The state of a system in which 
no further net change is occurring; the free 
energy is at a minimum, 
equilibrium constant (K eq ): A constant, 
characteristic for each chemical reaction, that 
relates the specific concentrations of all reac- 
tants and products at equilibrium at a given 
temperature and pressure 
erythrocyte: A cell containing large amounts 
of hemoglobin and specialized for oxygen 
transport; a red blood cell 
Escherichia coli: See E. coli 
essential amino acids: Amino acids that 
cannot be synthesized by humans (and other 
vertebrates) and must be obtained from the diet, 
essential fatty acids: The group of polyun- 
saturated fatty acids produced by plants, but 
not by humans; required in the human diet, 
EST: See expressed sequence tag. 
ethanol fermentation: The anaerobic 
conversion of glucose to ethanol via glycolysis; 
also called alcohol fermentation. See also 
fermentation. 

euchromatin: The regions of interphase 
chromosomes that stain diffusely, as opposed 
to the more condensed, heavily staining, 
heterochromatin. These are often regions in 
which genes are being actively expressed 
eukaryote: A unicellular or multicellular 
organism with cells having a membrane- 
bounded nucleus, multiple chromosomes, and 
internal organelles. 

excited state: An energy-rich state of an 
atom or molecule; produced by the absorption 
of light energy. 

exergonic reaction: A chemical reaction that 
proceeds with the release of free energy (that 
is, for which AG is negative), 
exocytosis: The fusion of an intracellular 
vesicle with the plasma membrane, releasing 
the vesicle contents to the extracellular space, 
exon: The segment of a eukaryotic gene that 
encodes a portion of the final product of the 
gene; a segment of RNA that remains after 
posttranscriptional processing and is tran- 
scribed into a protein or incorporated into the 
structure of an RNA. See also intron. 
exonuclease: An enzyme that hydrolyzes only 
those phosphodiester bonds that are in the 
terminal positions of a nucleic acid, 
exothermic reaction: A chemical reaction 
that releases heat (that is, for which AH is 
negative) . 

expressed sequence tag (EST): A specific 
type of sequence-tagged site in DNA repre- 
senting a gene that is expressed 
expression vector: See vector, 
extracellular matrix: An interwoven 
combination of glycosaminoglycans, proteo- 
glycans, and proteins, just outside the plasma 
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membrane, that provides cell anchorage, 
positional recognition, and traction during cell 
migration 

extrahepatic: Describes all tissues outside 
the liver; implies the centrality of the liver in 
metabolism. 

f 

facilitated diffusion: Diffusion of a polar 
substance across a biological membrane 
through a protein transporter; also called 
passive diffusion or passive transport 
facultative cells: Cells that can live in the 
presence or absence of oxygen. 
FAD (flavin adenine dinucleotide): The 
coenzyme of some oxidation-reduction 
enzymes; contains riboflavin, 
fatty acid: A long-chain aliphatic carboxylic acid 
found in natural fats and oils; also a component 
of membrane phospholipids and glycolipids. 
feedback inhibition: Inhibition of an allosteric 
enzyme at the beginning of a metabolic 
sequence by the end product of the sequence; 
also known as end-product inhibition 
fermentation: Energy-yielding anaerobic 
breakdown of a nutrient molecule, such as 
glucose, without, net oxidation; yields lactate, 
ethanol, or some other simple product 
ferredoxin: An iron-sulfur protein (2Fe-2S) of 
chloroplasts that carries electrons from iron- 
sulfur centers associated with photosystem I to 
NADP + during photophosphorylation, 
fibroblast: A cell of the connective tissue that 
secretes connective tissue proteins such as 
collagen. 

fibrous proteins: Insoluble proteins that 
serve a protective or structural role; contain 
polypeptide chains that generally share a 
common secondary structure, 
fingerprinting: See peptide mapping, 
first law of thermodynamics: The law 
stating that, in all processes, the total energy 
of the universe remains constant. 
Fischer projection formulas: See projection 
formulas. 

5' end: The end of a nucleic acid that lacks a 
nucleotide bound at the 5' position of the 
terminal residue. 

flagellum: A cell appendage used in propul- 
sion. Bacterial flagella have a much simpler 
structure than eukaryotic flagella, which are 
similar to cilia. 

flavin adenine dinucleotide: See FAD 
flavin-linked dehydrogenases: 

Dehydrogenases requiring one of the riboflavin 
coenzymes, FMN or FAD. 

flavin mononucleotide: See FMN 
flavin nucleotides: Nucleotide coenzymes 
(FMN and FAD) containing riboflavin 
flavoprotein: An enzyme containing a flavin 
nucleotide as a tightly bound prosthetic group, 
flippases: Membrane proteins in the ABC 
transporter family that catalyze the movement 
of phospholipids from the extracellular leaflet 
to the cytosolic leaflet of a membrane bilayer. 



floppases: Membrane proteins in the ABC 
transporter family that catalyze movement of 
phospholipids from the cytosolic leaflet to the 
extracellular leaflet of a membrane bilayer 
fluid mosaic model: A model describing 
biological membranes as a fluid lipid bilayer 
with embedded proteins; the bilayer exhibits 
both structural and functional asymmetry 
fluorescence: Emission of light by excited 
molecules as they revert to the ground state. 
FMN (flavin mononucleotide): Riboflavin 
phosphate, a coenzyme of certain oxidation- 
reduction enzymes, 
fold: See motif 

footprinting: A technique for identifying the 
nucleic acid sequence bound by a DNA- or 
RNA-binding protein 

fraction: A portion of a biological sample that 
has been subjected to a procedure designed to 
separate macromolecules based on a property 
such as solubility, net charge, molecular 
weight, or function. 

fractionation: The process of separating the 
proteins or other components of a complex 
molecular mixture into fractions based on 
differences in properties such as solubility, net 
charge, molecular weight, or function 
frame shift: A mutation caused by insertion 
or deletion of one or more paired nucleotides, 
changing the reading frame of codons during 
protein synthesis; the polypeptide product has 
a garbled amino acid sequence beginning at 
the mutated codon 

FRAP (fluorescence recovery after photo- 
bleaching): A technique used to quantify the 
diffusion of membrane components (lipids or 
proteins) in the plane of the bilayer. 
free energy (G): The component of the total 
energy of a system that can do work at 
constant temperature and pressure, 
free energy of activation (AG*): See activa- 
tion energy, 

free-energy change (AG): The amount of 

free energy released (negative AG) or 

absorbed (positive AG) in a reaction at 

constant temperature and pressure. 

free radical: See radical 

functional group: The specific atom or group 

of atoms that confers a particular chemical 

property on a biomolecule 

furanose: A simple sugar containing the five- 

membered furan ring 

fusion protein: (1) One of a family of 

proteins that facilitate membrane fusion. 

(2) The protein product of a gene created by 

the fusion of two distinct genes or portions of 

genes. 

futile cycle: A cycle of enzyme-catalyzed 
reactions that results in release of thermal 
energy by the hydrolysis of ATR 



AG' 0 : See standard free-energy change, 
gametes: Reproductive cells with a haploid 
gene content; sperm or egg cells. 



gangliosides: Sphingolipids containing 
complex oligosaccharides as head groups; 
especially common in nervous tissue 
gel filtration: See size-exclusion 
chromatography. 

gene: A chromosomal segment that codes for 
a single functional polypeptide chain or RNA 
molecule. 

gene expression: Transcription, and in the 
case of proteins, translation, to yield the 
product of a gene; a gene is expressed when 
its biological product is present and active, 
general acid-base catalysis: Catalysis 
involving proton transfer (s) to or from a mole- 
cule other than water. 

gene splicing: The enzymatic attachment of 
one gene, or part of a gene, to another, 
genetic code: The set of triplet code words in 
DNA (or mRNA) coding for the amino acids of 
proteins. 

genetic information: The hereditary infor- 
mation contained in a sequence of nucleotide 
bases in chromosomal DNA or RNA. 
genetic map: A diagram showing the relative 
sequence and position of specific genes along a 
chromosome. 

genome: All the genetic information encoded 
in a cell or virus 

genomic library: A DNA library containing 
DNA segments that represent all (or most) of 
the sequences in an organism's genome, 
genomics: A science devoted broadly to the 
understanding of cellular and organism 
genomes. 

genotype: The genetic constitution of an 
organism, as distinct from its physical charac- 
teristics, or phenotype. 

geometric isomers: Isomers related by 
rotation about a double bond; also called cis 
and trans isomers. 

germ-line cell: A type of animal cell that is 
formed early in embryogenesis and may 
multiply by mitosis or produce by meiosis 
cells that develop into gametes (egg or sperm 
cells), 

GFP: See green fluorescent protein 
globular proteins: Soluble proteins with a 
globular (somewhat rounded) shape, 
glucogenic amino acids: Amino acids with 
carbon chains that can be metabolically 
converted into glucose or glycogen via 
gluconeogenesis 

gluconeogenesis: The biosynthesis of a 
carbohydrate from simpler, noncarbohydrate 
precursors such as oxaloacetate or pyruvate. 
GLUT: Designation for a family of membrane 
proteins that transport glucose, 
glycan: Another term for polysaccharide; a 
polymer of monosaccharide units joined by 
glycosidic bonds, 

glycerophospholipid: An amphipathic lipid 
with a glycerol backbone; fatty acids are ester- 
linked to C-l and C-2 of glycerol, and a polar 
alcohol is attached through a phosphodiester 
linkage to C-3 
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glycoconjugate: A compound containing a 
carbohydrate component bound covalently to 
a protein or lipid, forming a glycoprotein or 
glycolipid 

glycogenesis: The process of converting 
glucose to glycogen, 

glycogenin: The protein that both primes the 
synthesis of new glycogen chains and catalyzes 
the polymerization of the first few sugar 
residues of each chain before glycogen 
synthase continues the extension. 

glycogenolysis: The enzymatic breakdown of 

stored (not dietary) glycogen. 

glycolipid: A lipid containing a carbohydrate 

group. 

glycolysis: The catabolic pathway by which a 
molecule of glucose is broken down into two 
molecules of pyruvate. 

glycomics: The systematic characterization of 
all the carbohydrate components of a cell or 
tissue, including those bound covalently to a 
protein or lipid. 

glycoprotein: A protein containing a carbohy- 
drate group. 

glycosaminoglycan: A heteropolysaccharide 
of two alternating units: one is either N- 
acetylglucosamine or ^/-acetylgalactosamine; 
the other is a uronic acid (usually glucuronic 
acid). Formerly called a mucopolysaccharide, 
glycosidic bonds: Bonds between a sugar and 
another molecule (typically an alcohol, purine, 
pyrimidine, or sugar) through an intervening 
oxygen 

glyoxylate cycle: A variant of the citric acid 
cycle, for the net conversion of acetate into 
succinate and, eventually, new carbohydrate; 
present in bacteria and some plant cells, 
glyoxysome: A specialized peroxisome 
containing the enzymes of the glyoxylate 
cycle; found in cells of germinating seeds. 
Golgi complex: A complex membranous 
organelle of eukaryotic cells; functions in the 
posttranslational modification of proteins and 
their secretion from the cell or incorporation 
into the plasma membrane or organellar 
membranes, 

G protein-coupled receptors (GPCRs): A 

large family of membrane receptor proteins 
with seven transmembrane helical segments, 
often associating with G proteins to transduce 
an extracellular signal into a change in cellular 
metabolism; also called serpentine receptors 
or heptahelical receptors. 

G proteins: A family of heterotrimeric 
GTP-binding proteins that act in intracellular 
signaling pathways. Commonly, ligand binding 
to a G protein-coupled receptor induces the 
exchange of GTP for bound GDP, enabling 
the G protein to activate a downstream 
enzyme in a signaling pathway. G proteins 
have intrinsic GTPase activity, and therefore 
self-inactivate 

gram molecular weight: For a compound, 
the weight in grams that is numerically equal 
to its molecular weight; the weight of one 
mole. 



grana: Stacks of thylakoids, flattened 
membranous sacs or disks, in chloroplasts. 
green fluorescent protein (GFP): A small 
protein from a marine organism that produces 
a bright fluorescence in the green region of the 
visible spectrum. Fusion proteins with GFP are 
commonly used to determine the subcellular 
location of the fused protein by fluorescence 
microscopy- 
ground state: The normal, stable form of an 
atom or molecule; as distinct from the excited 
state. 

group transfer potential: A measure of the 
ability of a compound to donate an activated 
group (such as a phosphate or acyl group); 
generally expressed as the standard free 
energy of hydrolysis 

growth factors: Proteins or other molecules 
that act from outside a cell to stimulate cell 
growth and division. 

guanosine nucleotide-binding protein: 

One of a family of regulatory proteins that bind 
either GDP or GTP, are active in the GTP- 
bound form, and have intrinsic GTPase activity 
that turns them off. 



hairpin: Secondary structure in single- 
stranded RNA or DNA, in which complemen- 
tary parts of a palindromic repeat fold back 
and are paired to form an antiparallel duplex 
helix that is closed at one end. 
half-life: The time required for the disappear- 
ance or decay of one-half of a given compo- 
nent in a system 

haploid: Having a single set of genetic infor- 
mation; describes a cell with one chromosome 
of each type Compare diploid, 
hapten: A small molecule that, when linked to 
a larger molecule, elicits an immune response. 
Haworth perspective formulas: A method 
for representing cyclic chemical structures 
so as to define the configuration of each 
substituent group; commonly used for repre- 
senting sugars. 

helicase: An enzyme that catalyzes the sepa- 
ration of strands in a DNA molecule before 
replication 

helix: See a helix; double helix. 

heme: The iron-porphyrin prosthetic group of 

heme proteins. 

heme protein: A protein containing a heme 
as a prosthetic group 

hemoglobin: A heme protein in erythrocytes; 
functions in oxygen transport. 
Henderson-Hasselbalch equation: An 

equation relating the pH, the pK a , and the ratio 
of the concentrations of proton-acceptor (A") 
and proton-donor (HA) species in a solution: 
[A"] 



pH = pK a + log 



[HA] 



heparan sulfate: A sulfated polymer of 
alternating Af-acetylglucosamine and a uronic 
acid, either glucuronic or iduronic acid; typi- 
cally found in the extracellular matrix. 



hepatocyte: The major cell type of liver tissue, 
heteroduplex DNA: Duplex DNA containing 
complementary strands derived from two 
different DNA molecules with similar sequences, 
often as a product of genetic recombination 
heteropolysaccharide: A polysaccharide 
containing more than one type of sugar, 
heterotroph: An organism that requires 
complex nutrient molecules, such as glucose, 
as a source of energy and carbon, 
heterotropic: Describes an allosteric modu- 
lator that is distinct from the normal ligand. 
heterotropic enzyme: An allosteric enzyme 
requiring a modulator other than its substrate, 
hexose: A simple sugar with a backbone 
containing six carbon atoms, 
hexose monophosphate pathway: See 
pentose phosphate pathway, 
high-energy compound: A compound that on 
hydrolysis undergoes a large decrease in free 
energy under standard conditions, 
high-performance liquid chromatography 
(HPLC): Chromatographic procedure, often 
conducted at relatively high pressures, using 
automated equipment that permits refined and 
highly reproducible profiles. 
Hill coefficient: A measure of cooperative 
interaction between protein subunits. 
Hill reaction: The evolution of oxygen and 
photoreduction of an artificial electron 
acceptor by a chloroplast preparation in 
the absence of carbon dioxide, 
histones: The family of basic proteins that 
associate tightly with DNA in the chromo- 
somes of all eukaryotic cells. 
Holliday intermediate: An intermediate in 
genetic recombination in which two double- 
stranded DNA molecules are joined by a recip- 
rocal crossover involving one strand of each 
molecule. 

holoenzyme: A catalytically active enzyme, 
including all necessary subunits, prosthetic 
groups, and cofactors. 
homeobox: A conserved DNA sequence of 
180 base pairs that encodes a protein domain 
found in many proteins that play a regulatory 
role in development. 

homeodomain: The protein domain encoded 
by the homeobox; a regulatory unit that deter- 
mines the segmentation of a body plan, 
homeostasis: The maintenance of a dynamic 
steady state by regulatory mechanisms that 
compensate for changes in external circum- 
stances. 

homeotic genes: Genes that regulate devel- 
opment of the pattern of segments in the 
Drosophila body plan; similar genes are found 
in most vertebrates 

homologous genetic recombination: 

Recombination between two DNA molecules of 
similar sequence, occurring in all cells; occurs 
during meiosis and mitosis in eukaryotes. 
homologous proteins: Proteins having 
similar sequences and functions in different 
species; for example, the hemoglobins. 
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homopolysaccharide: A polysaccharide 
made up of one type of monosaccharide unit, 
homotropic: Describes an allosteric 
modulator that is identical to the normal 
ligand, 

homotropic enzyme: An allosteric enzyme 
that uses its substrate as a modulator 
hormone: A chemical substance synthesized 
in small amounts by an endocrine tissue and 
carried in the blood to another tissue, where it 
acts as a messenger to regulate the function of 
the target tissue or organ, 
hormone receptor: A protein in, or on the 
surface of, target cells that binds a specific 
hormone and initiates the cellular response, 
hormone response element (HRE): A short 
(12 to 20 bp) DNA sequence that binds recep- 
tors for steroid, retinoid, thyroid, and vitamin 
D hormones, altering the expression of the 
contiguous genes Each hormone has a 
consensus sequence preferred by the cognate 
receptor 

HPLC: See high-performance liquid 
chromatography 

HRE: See hormone response element, 
hyaluronan: A high molecular weight, acidic 
polysaccharide typically composed of the 
alternating disaccharide GlcUAG31->3) 
GlcNAc; a major component of the extracel- 
lular matrix, forming larger complexes 
(proteoglycans) with proteins and other 
acidic polysaccharides Also called hyaluronic 
acid, 

hydrogen bond: A weak electrostatic attrac- 
tion between one electronegative atom (such 
as oxygen or nitrogen) and a hydrogen atom 
covalently linked to a second electronegative 
atom 

hydrolases: Enzymes (proteases, lipases, 
phosphatases, nucleases, for example) that 
catalyze hydrolysis reactions, 
hydrolysis: Cleavage of a bond, such as an 
anhydride or peptide bond, by the addition of 
the elements of water, yielding two or more 
products 

hydronium ion: The hydrated hydrogen ion 
(H 3 0 + ) 

hydropathy index: A scale that expresses 
the relative hydrophobic and hydrophilic 
tendencies of a chemical group 
hydrophilic: Polar or charged; describes 
molecules or groups that associate with 
(dissolve easily in) water, 
hydrophobic: Nonpolar; describes molecules 
or groups that are insoluble in water 
hydrophobic interactions: The association 
of nonpolar groups or compounds with 
each other in aqueous systems, driven by 
the tendency of the surrounding water 
molecules to seek their most stable 
(disordered) state. 

hyperchromic effect: The large increase in 
light absorption at 260 nm occurring as a 
double-helical DNA unwinds (melts), 
hypoxia: The metabolic condition in which 
the supply of oxygen is severely limited. 



i 

immune response: The capacity of a verte- 
brate to generate antibodies to an antigen, a 
macromolecule foreign to the organism, 
immunoglobulin: An antibody protein gener- 
ated against, and capable of binding specifi- 
cally to, an antigen. 

induced fit: A change in the conformation of 
an enzyme in response to substrate binding 
that renders the enzyme catalytically active; 
also used to denote changes in the conforma- 
tion of any macromolecule in response to 
ligand binding such that the binding site of the 
macromolecule better conforms to the shape 
of the ligand 

inducer: A signal molecule that, when bound 
to a regulatory protein, produces an increase 
in the expression of a given gene . 
induction: An increase in the expression of a 
gene in response to a change in the activity of 
a regulatory protein 

informational macromolecules: 

Biomolecules containing information in the 
form of specific sequences of different 
monomers; for example, many proteins, lipids, 
polysaccharides, and nucleic acids 
inhibitory G protein (Gj): A trimeric GTP- 
binding protein that, when activated by an 
associated plasma membrane receptor, acts 
to inhibit a neighboring membrane enzyme 
such as adenylyl cyclase Its effects oppose 
those of G K . 

initiation codon: AUG (sometimes GUG or, 
even more rarely, UUG in bacteria and archaea); 
codes for the first amino acid in a polypeptide 
sequence: A^-formylmethionine in bacteria; 
methionine in archaea and eukaryotes, 
initiation complex: A complex of a ribosome 
with an mRNA and the initiating Met-tRNA Met 
or fMet-tRNA™ ct , ready for the elongation 
steps 

inorganic pyrophosphatase: An enzyme 
that hydrolyzes a molecule of inorganic 
pyrophosphate to yield two molecules of 
(ortho)phosphate; also known as pyrophos- 
phatase. 

insertion mutation: A mutation caused by 
insertion of one or more extra bases, or a 
mutagen, between successive bases in DNA, 
insertion sequence: Specific base sequences 
at either end of a transposable segment of DNA. 
in situ: "In position"; that is, in its natural 
position or location. 

integral proteins: Proteins firmly bound to 
a membrane by hydrophobic interactions; 
as distinct from peripheral proteins 
integrin: One of a large family of hetero- 
dimeric transmembrane proteins that mediate 
adhesion of cells to other cells or to the extra- 
cellular matrix 

intercalating mutagen: A mutagen that 
inserts itself between successive bases in a 
nucleic acid, causing a frame-shift mutation, 
intercalation: Insertion between stacked 
aromatic or planar rings; for example, the 



insertion of a planar molecule between two 
successive bases in a nucleic acid, 
interferons: A class of glycoproteins with 
antiviral activities 

intermediary metabolism: In cells, the 
enzyme-catalyzed reactions that extract 
chemical energy from nutrient molecules and 
use it to synthesize and assemble cell 
components. 

intron: A sequence of nucleotides in a gene 
that is transcribed but excised before the gene 
is translated; also called intervening sequence 
See also exon. 

in vitro: "In glass"; that is, in the test tube, 
in vivo: "In life"; that is, in the living cell or 
organism. 

ion channel: An integral protein that provides 
for the regulated transport of a specific ion, or 
ions, across a membrane, 
ion-exchange resin: A polymeric resin that 
contains fixed charged groups, used in chro- 
matographic columns to separate ionic 
compounds. 

ionizing radiation: A type of radiation, such 
as x rays, that causes loss of electrons from 
some organic molecules, thus making them 
more reactive. 

ionophore: A compound that binds one or 
more metal ions and is capable of diffusing 
across a membrane, carrying the bound ion, 
ion product of water (iT w ): The product of 
the concentrations of H + and OH~ in pure 
water: K w = [H + ][OH _ ] - 1 X 10" 14 at 25 °C. 
iron-sulfur center: A prosthetic group of 
certain redox proteins involved in electron 
transfers; Fe 2+ or Fe 3+ is bound to inorganic 
sulfur and to Cys groups in the protein, 
iron-sulfur protein: One of a large family of 
electron-transfer proteins in which the elec- 
tron carrier is one or more iron ions associated 
with two or more sulfur atoms of Cys residues 
or of inorganic sulfide 
isoelectric focusing: An electrophoretic 
method for separating macromolecules on the 
basis of isoelectric pH, 

isoelectric pH (isoelectric point): The pH 

at which a solute has no net electric charge 
and thus does not move in an electric field 
isoenzymes: See isozymes, 
isomerases: Enzymes that catalyze the trans- 
formation of compounds into their positional 
isomers. 

isomers: Any two molecules with the same 
molecular formula but a different arrangement 
of molecular groups. 

isoprene: The hydrocarbon 2-methyl-l,3- 
butadiene, a recurring structural unit of 
terpenoids 

isoprenoid: Any of a large number of natural 
products synthesized by enzymatic polymer- 
ization of two or more isoprene units; also 
called terpenoid. 

isothermal: Occurring at constant temperature, 
isotopes: Stable or radioactive forms of an 
element that differ in atomic weight but are 
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otherwise chemically identical to the naturally 
abundant, form of the element; used as tracers, 
isozymes: Multiple forms of an enzyme 
that catalyze the same reaction but differ in 
amino acid sequence, substrate affinity, 
^maxj and/or regulatory properties; also called 
isoenzymes 

k 

keratins: Insoluble protective or structural 
proteins consisting of parallel polypeptide 
chains in a-helical or (3 conformations, 
ketogenic amino acids: Amino acids with 
carbon skeletons that can serve as precursors 
of the ketone bodies. 

ketone bodies: Acetoacetate, D-jS-hydroxy- 
butyrate, and acetone; water-soluble fuels 
normally exported by the liver but overpro- 
duced during fasting or in untreated diabetes 
mellitus 

ketose: A simple monosaccharide in which 
the carbonyl group is a ketone 
ketosis: A condition in which the concentra- 
tion of ketone bodies in the blood, tissues, and 
urine is abnormally high, 
kinases: Enzymes that catalyze the phospho- 
rylation of certain molecules by ATP. 
kinetics: The study of reaction rates 
Krebs cycle: See citric acid cycle, 

I 

lagging strand: The DNA strand that, during 
replication, must be synthesized in the direc- 
tion opposite to that in which the replication 
fork moves 

law of mass action: The law stating that the 
rate of any given chemical reaction is propor- 
tional to the product of the activities (or 
concentrations) of the reactants, 
leader: A short sequence near the amino 
terminus of a protein or the 5' end of an RNA 
that has a specialized targeting or regulatory 
function. 

leading strand: The DNA strand that, during 
replication, is synthesized in the same direc- 
tion in which the replication fork moves, 
leaky mutant: A mutant gene that gives rise 
to a product with a detectable level of biolog- 
ical activity. 

leaving group: The departing or displaced 
molecular group in a unimolecular elimination 
or bimolecular substitution reaction 
lectin: A protein that binds a carbohydrate, 
commonly an oligosaccharide, with very high 
affinity and specificity mediating cell-cell 
interactions, 

lethal mutation: A mutation that inactivates 
a biological function essential to the life of the 
cell or organism 

leucine zipper: A protein structural motif 
involved in protein-protein interactions in 
many eukaryotic regulatory proteins; consists 
of two interacting a helices in which Leu 
residues in every seventh position are a promi- 
nent feature of the interacting surfaces. 



leukocyte: White blood cell; involved in the 
immune response in mammals, 
leukotrienes: A family of molecules derived 
from arachidonate; muscle contractants that 
constrict air passages in the lungs and are 
involved in asthma. 

levorotatory isomer: A stereoisomer that 
rotates the plane of plane-polarized light 
counterclockwise , 

ligand: A small molecule that binds specifi- 
cally to a larger one; for example, a hormone 
is the ligand for its specific protein receptor, 
ligases: Enzymes that catalyze condensation 
reactions in which tw T o atoms are joined using 
the energy of ATP or another energy-rich 
compound. 

light-dependent reactions: The reactions of 
photosynthesis that require light and cannot 
occur in the dark; also known as light 
reactions, 

Lineweaver-Burk equation: An algebraic 
transform of the Michaelis-Menten equation, 
allowing determination of V nvdX and K m by 
extrapolation of [S] to infinity: 



v 0 v max [S] v max 

linking number: The number of times one 
closed circular DNA strand is wound about 
another; the number of topological links 
holding the circles together, 
lipases: Enzymes that catalyze the hydrolysis 
of triacylglycerols, 

lipid: A small water-insoluble biomolecule 

generally containing fatty acids, sterols, or 

isoprenoid compounds. 

lipidome: The full complement of lipids in 

a cell or tissue under a particular set of 

conditions 

lipoate (lipoic acid): A vitamin for some 
microorganisms; an intermediate carrier of 
hydrogen atoms and acyl groups in a-keto acid 
dehydrogenases. 

lipoprotein: A lipid-protein aggregate that 
serves to carry water-insoluble lipids in the 
blood. The protein component alone is an 
apolipoprotein 

liposome: A small, spherical vesicle composed 
of a phospholipid bilayer, forming sponta- 
neously when phospholipids are suspended 
in an aqueous buffer 

London forces: Weak, temporary, attractive 
forces between molecules that induce dipoles 
in each other, 

low-energy phosphate compound: A phos- 
phorylated compound with a relatively small 
standard free energy of hydrolysis, 
lyases: Enzymes that catalyze the removal of 
a group from a molecule to form a double 
bond, or the addition of a group to a double 
bond. 

lymphocytes: A subclass of leukocytes 
involved in the immune response. See also B 
lymphocytes; T lymphocytes, 
lysis: Destruction of a plasma membrane or 
(in bacteria) cell wall, releasing the cellular 
contents and killing the cell, 



lysosome: A membrane-bounded organelle of 
eukaryotic cells; it contains many hydrolytic 
enzymes and serves as a degrading and recy- 
cling center for unneeded components- 

m 

macromolecule: A molecule having a molec- 
ular weight in the range of a few thousand to 
many millions. 

mass-action ratio (<?): For the reaction aA 
+ bB cC + dD, the ratio [C] c [D] d /[A] a [B] b 
matrix: The aqueous contents of a cell or 
organelle (the mitochondrion, for example) 
with dissolved solutes, 
meiosis: A type of cell division in which 
diploid cells give rise to haploid cells destined 
to become gametes. 

membrane potential (F m ): The difference 
in electrical potential across a biological 
membrane, commonly measured by the inser- 
tion of a microelectrode. Typical membrane 
potentials vary from -25 mV (by convention, 
the negative sign indicates that the inside is 
negative relative to the outside) to greater 
than - 100 mV across some plant vacuolar 
membranes. 

membrane transport: Movement of a polar 
solute across a membrane via a specific 
membrane protein (a transporter), 
messenger RNA (mRNA): A class of RNA 
molecules, each of which is complementary 
to one strand of DNA; carries the genetic 
message from the chromosome to the 
ribosomes. 

metabolic control: The mechanisms by 
which the flux through a metabolic pathway 
is changed to reflect a cell's altered circum- 
stances. 

metabolic regulation: The mechanisms by 
which a cell resists changes in the concentra- 
tion of individual metabolites that would 
otherwise occur when metabolic control mech- 
anisms alter the flux through a pathway, 
metabolism: The entire set of enzyme- 
catalyzed transformations of organic molecules 
in living cells; the sum of anabolism and 
catabolism. 

metabolite: A chemical intermediate in the 
enzyme-catalyzed reactions of metabolism 
metabolome: The complete set of small- 
molecule metabolites (metabolic inter- 
mediates, signals, secondary metabolites) 
present in a given cell or tissue under 
specific conditions. 

metabolon: A supramolecular assembly of 
sequential metabolic enzymes, 
metalloprotein: A protein with a metal ion as 
its prosthetic group, 
metamerism: Division of the body into 
segments, as in insects, for example, 
micelle: An aggregate of amphipathic mole- 
cules in water, with the nonpolar portions in 
the interior and the polar portions at the 
exterior surface, exposed to water 
Michaelis constant (iT m ): The substrate 
concentration at which an enzyme-catalyzed 
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reaction proceeds at one-half its maximum 
velocity. 

Michaelis-Menten equation: The equation 
describing the hyperbolic dependence of the 
initial reaction velocity, V 0 , on substrate 
concentration, [S], in many enzyme-catalyzed 
reactions: 

0 K m + [sy 

Michaelis-Menten kinetics: A kinetic 
pattern in which the initial rate of an enzyme - 
catalyzed reaction exhibits a hyperbolic 
dependence on substrate concentration, 
microbodies: Cytoplasmic, membrane- 
bounded vesicles containing peroxide -forming 
and peroxide-destroying enzymes; include 
lysosomes, peroxisomes, and glyoxysomes 
microfilaments: Thin filaments composed of 
actin, found in the cytoplasm of eukaryotic 
cells; serve in structure and movement. 
micro-RNA: See miRNA. 
microsomes: Membranous vesicles formed by 
fragmentation of the endoplasmic reticulum of 
eukaryotic cells; recovered by differential 
centrifugation 

microtubules: Thin tubules assembled from 
two types of globular tubulin subunits; present 
in cilia, flagella, centrosomes, and other 
contractile or motile structures 
miRNA (micro-RNA): A class of small RNA 
molecules (20 to 25 nucleotides after 
processing is complete) involved in gene 
silencing by inhibiting translation and/or 
promoting the degradation of particular 
mRNAs 

mismatch: A base pair in a nucleic acid that 
cannot form normal Watson-Crick pairs, 
mismatch repair: An enzymatic system for 
repairing base mismatches in DNA, 
mitochondrion: Membrane-bounded 
organelle of eukaryotic cells; contains the 
enzyme systems required for the citric acid 
cycle, fatty acid oxidation, electron transfer, 
and oxidative phosphorylation 
mitosis: In eukaryotic cells, the multistep 
process that results in the replication of 
chromosomes and cell division, 
mixed-function oxidases: Enzymes (a 
monooxygenase, for example) that catalyze 
reactions in which two reductants— one of 
which is generally NADPH, the other the 
substrate— are oxidized. One oxygen atom is 
incorporated into the product, the other is 
reduced to H 2 0. These enzymes often use 
cytochrome P-450 to carry electrons from 
NADPH to 0 2 

mixed inhibition: The reversible inhibition 
pattern resulting when an inhibitor molecule 
can bind to either the free enzyme or the 
enzyme-substrate complex (not necessarily 
with the same affinity). 

modulator: A metabolite that, when bound to 
the allosteric site of an enzyme, alters its 
kinetic characteristics 

molar solution: One mole of solute dissolved 
in water to give a total volume of 1,000 mL. 



mole: One gram molecular weight of a 
compound. See also Avogadro's number, 
monocistronic mRNA: An mRNA that can be 
translated into only one protein, 
monoclonal antibodies: Antibodies 
produced by a cloned hybridoma cell, which 
therefore are identical and directed against the 
same epitope of the antigen, 
monolayer: A single layer of oriented lipid 
molecules. 

monoprotic acid: An acid with only one 
dissociable proton. 

monosaccharide: A carbohydrate consisting 
of a single sugar unit 

moonlighting enzymes: Enzymes that play 
two distinct roles, at least one of which is 
catalytic; the other may be catalytic, regula- 
tory, or structural. 

motif: Any distinct folding pattern for 
elements of secondary structure, observed in 
one or more proteins A motif can be simple or 
complex, and can represent all or just a small 
part of a polypeptide chain. Also called a fold 
or supersecondary structure, 
mRNA: See messenger RNA. 
mucopolysaccharide: See glycosaminoglycan. 
multienzyme system: A group of related 
enzymes participating in a given metabolic 
pathway. 

mutarotation: The change in specific rotation 
of a pyranose or furanose sugar or glycoside 
accompanying the equilibration of its a- and 
/3-anomeric forms. 

mutases: Enzymes that catalyze the transpo- 
sition of functional groups, 
mutation: An inheritable change in the 
nucleotide sequence of a chromosome, 
myocyte: A muscle cell 

myofibril: A unit of thick and thin filaments of 
muscle fibers. 

myosin: A contractile protein; the major 
component of the thick filaments of muscle 
and other actin-myosin systems. 

n 

NAD, NADP (nicotinamide adenine 
dinucleotide, nicotinamide adenine 
dinucleotide phosphate): Nicotinamide- 
containing coenzymes functioning as carriers 
of hydrogen atoms and electrons in some 
oxidation-reduction reactions, 
native conformation: The biologically active 
conformation of a macromolecule. 
ncRNA (noncoding RNA): Any RNA that does 
not encode instructions for a protein product, 
negative cooperativity: A property of some 
multisubunit enzymes or proteins in which 
binding of a ligand or substrate to one subunit 
impairs binding to another subunit , 
negative feedback: Regulation of a biochem- 
ical pathway in which a reaction product 
inhibits an earlier step in the pathway 
neuron: A cell of nervous tissue specialized 
for transmission of a nerve impulse. 



neurotransmitter: A low molecular weight 
compound (usually containing nitrogen) 
secreted from the axon terminal of a neuron 
and bound by a specific receptor in the next 
neuron; serves to transmit a nerve impulse, 
nicotinamide adenine dinucleotide, 
nicotinamide adenine dinucleotide 
phosphate: See NAD, NADP 
ninhydrin reaction: A color reaction given by 
amino acids and peptides on heating with 
ninhydrin, widely used for their detection and 
estimation. 

nitrogenase complex: A system of enzymes 
capable of reducing atmospheric nitrogen to 
ammonia in the presence of ATP. 
nitrogen cycle: The cycling of various forms 
of biologically available nitrogen through the 
plant, animal, and microbial worlds, and 
through the atmosphere and geosphere. 
nitrogen fixation: Conversion of atmospheric 
nitrogen (N 2 ) into a reduced, biologically 
available form by nitrogen-fixing organisms, 
NMR: See nuclear magnetic resonance 
spectroscopy: 

noncoding RNA: See ncRNA. 

noncyclic electron flow: The light-induced 

flow of electrons from water to NADP + in 

oxygen-evolving photosynthesis; it involves 

both photosystems 1 and II 

nonessential amino acids: Amino acids that 

can be made by humans and other vertebrates 

from simpler precursors and are thus not 

required in the diet, 

nonheme iron proteins: Proteins, usually 
acting in oxidation-reduction reactions, 
containing iron but no porphyrin groups, 
nonpolar: Hydrophobic; describes molecules 
or groups that are poorly soluble in water, 
nonsense codon: A codon that does not 
specify an amino acid, but signals the termina- 
tion of a polypeptide chain 
nonsense mutation: A mutation that results 
in the premature termination of a polypeptide 
chain. 

nonsense suppressor: A mutation, usually in 
the gene for a tRNA, that causes an amino acid 
to be inserted into a polypeptide in response 
to a termination codon. 

nuclear magnetic resonance (NMR) spec- 
troscopy: A technique that utilizes certain 
quantum mechanical properties of atomic 
nuclei to study the structure and dynamics of 
the molecules of which they are a part, 
nucleases: Enzymes that hydrolyze the inter- 
nucleotide (phosphodiester) linkages of 
nucleic acids. 

nucleic acids: Biologically occurring polynu- 
cleotides in which the nucleotide residues are 
linked in a specific sequence by phosphodi- 
ester bonds; DNA and RNA. 
nucleoid: In bacteria, the nuclear zone that 
contains the chromosome but has no 
surrounding membrane, 
nucleolus: In eukaryotic cells, a densely 
staining structure in the nucleus; involved in 
rRNA synthesis and ribosome formation. 
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nucleophile: An electron-rich group with a 
strong tendency to donate electrons to an 
electron-deficient nucleus (electrophile); the 
entering reactant in a bimolecular substitution 
reaction, 

nucleoplasm: The portion of a eukaryotic 
cell's contents enclosed by the nuclear 
membrane; also called the nuclear matrix, 
nucleoside: A compound consisting of a 
purine or pyrimidine base covalently linked to 
a pentose. 

nucleoside diphosphate kinase: An enzyme 
that catalyzes the transfer of the terminal 
phosphate of a nucleoside 5 '-triphosphate to a 
nucleoside 5'-diphosphate. 
nucleoside diphosphate sugar: A coen- 
zymelike carrier of a sugar molecule, functioning 
in the enzymatic synthesis of polysaccharides 
and sugar derivatives 

nucleoside monophosphate kinase: An 

enzyme that catalyzes the transfer of the 
terminal phosphate of ATP to a nucleoside 
5' -monophosphate. 

nucleosome: In eukaryotes, structural unit 
for packaging chromatin; consists of a DNA 
strand wound around a histone core 
nucleotide: A nucleoside phosphorylated at 
one of its pentose hydroxyl groups, 
nucleus: In eukaryotes, a membrane-bounded 
organelle that contains chromosomes. 

0 

oligomer: A short polymer, usually of amino 

acids, sugars, or nucleotides; the definition of 

"short" is somewhat arbitrary, but usually 

fewer than 50 subunits. 

oligomeric protein: A multisubunit protein 

having two or more identical polypeptide 

chains. 

oligonucleotide: A short polymer of 
nucleotides (usually fewer than 50). 
oligopeptide: A few amino acids joined by 
peptide bonds. 

oligosaccharide: Several monosaccharide 

groups joined by glycosidic bonds. 

o> oxidation: An alternative mode of fatty acid 

oxidation in which the initial oxidation is at the 

carbon most distant from the carboxyl carbon; 

as distinct from (B oxidation. 

oncogene: A cancer-causing gene; any of 

several mutant genes that cause cells to 

exhibit rapid, uncontrolled proliferation. See 

also proto-oncogene. 

open reading frame: A group of contiguous 
nonoverlapping nucleotide codons in a DNA or 
RNA molecule that does not include a termina- 
tion codon. 

open system: A system that exchanges 
matter and energy with its surroundings. See 
also system. 

operator: A region of DNA that interacts with 
a repressor protein to control the expression 
of a gene or group of genes, 
operon: A unit of genetic expression 
consisting of one or more related genes and 



the operator and promoter sequences that 
regulate their transcription - 
optical activity: The capacity of a substance 
to rotate the plane of plane-polarized light, 
optimum pH: The characteristic pH at which 
an enzyme has maximal catalytic activity, 
organelles: Membrane-bounded structures 
found in eukaryotic cells; contain enzymes and 
other components required for specialized cell 
functions. 

origin: The nucleotide sequence or site in 
DNA where DNA replication is initiated, 
orthologs: Genes in different organisms that 
possess a clear sequence and functional 
relationship to each other, 
osmosis: Bulk flow of water through a semi- 
permeable membrane into another aqueous 
compartment containing solute at a higher 
concentration. 

osmotic pressure: Pressure generated by the 
osmotic flow of water through a semiperme- 
able membrane into an aqueous compartment 
containing solute at a higher concentration, 
oxidases: Enzymes that catalyze oxidation 
reactions in which molecular oxygen serves 
as the electron acceptor, but neither of the 
oxygen atoms is incorporated into the product. 
Compare oxygenases, 
oxidation: The loss of electrons from a 
compound. 

oxidation-reduction reaction: A reaction in 
which electrons are transferred from a donor 
to an acceptor molecule; also called a redox 
reaction. 

oxidative phosphorylation: The enzymatic 
phosphorylation of ADP to ATP coupled to 
electron transfer from a substrate to molecular 
oxygen. 

oxidizing agent (oxidant): The acceptor of 
electrons in an oxidation-reduction reaction, 
oxygenases: Enzymes that catalyze reactions 
in which oxygen atoms are directly incorpo- 
rated into the product, forming a hydroxyl or 
carboxyl group. In reactions catalyzed by a 
monooxygenase, only one of the two 0 atoms 
is incorporated; the other is reduced to H 2 0. 
In reactions catalyzed by a dioxygenase, both 
0 atoms are incorporated into the product. 
Compare oxidases. 

oxygen debt: The extra oxygen (above the 
normal resting level) consumed in the 
recovery period after strenuous physical 
exertion. 

P 

palindrome: A segment of duplex DNA in 
which the base sequences of the two strands 
exhibit twofold rotational symmetry about 
an axis 

paradigm: In biochemistry, an experimental 
model or example. 

partition coefficient: A constant that 
expresses the ratio in which a given solute will 
be partitioned or distributed between two 
given immiscible liquids at equilibrium. 



pathogenic: Disease-causing. 
PCR: See polymerase chain reaction 
PDB (Protein Data Bank): An international 
database (www.rcsb.org/pdb) that archives the 
data describing the three-dimensional struc- 
ture of nearly all macromolecules for which 
structures have been published 
pentose: A simple sugar with a backbone 
containing five carbon atoms , 
pentose phosphate pathway: A pathway 
present in most organisms that serves to 
inter convert hexoses and pentoses and is a 
source of reducing equivalents (NADPH) and 
pentoses for biosynthetic processes; it begins 
with glucose 6-phosphate and includes 6- 
phosphogluconate as an intermediate. Also 
called the phosphogluconate pathway and the 
hexose monophosphate pathway, 
peptidases: Enzymes that hydrolyze peptide 
bonds. 

peptide: Two or more amino acids covalently 
joined by peptide bonds, 
peptide bond: A substituted amide linkage 
between the a-amino group of one amino acid 
and the a-carboxyl group of another, with the 
elimination of the elements of water, 
peptide mapping: The characteristic two- 
dimensional pattern (on paper or gel) formed 
by the separation of a mixture of peptides 
resulting from partial hydrolysis of a protein; 
also known as peptide fingerprinting, 
peptidoglycan: A major component of bacte- 
rial cell walls; generally consists of parallel 
heteropolysaccharides cross-linked by short 
peptides. 

peptidyl transferase: The enzyme activity 
that synthesizes the peptide bonds of proteins; 
a ribozyme, part of the rRNA of the large 
ribosomal subunit. 

peripheral proteins: Proteins loosely or 
reversibly bound to a membrane by hydrogen 
bonds or electrostatic forces; generally water 
soluble once released from the membrane, 
permeases: See transporters, 
peroxisome: Membrane-bounded organelle of 
eukaryotic cells; contains peroxide-forming 
and peroxide-destroying enzymes, 
peroxisome proliferator-activated 
receptor: See PPAR. 

pH: The negative logarithm of the hydrogen 
ion concentration of an aqueous solution, 
phage: See bacteriophage, 
phenotype: The observable characteristics of 
an organism 

phosphatases: Enzymes that hydrolyze a 
phosphate ester or anhydride, releasing inor- 
ganic phosphate, P P 
phosphodiester linkage: A chemical 
grouping that contains two alcohols esterified 
to one molecule of phosphoric acid, which thus 
serves as a bridge between them, 
phosphogluconate pathway: See pentose 
phosphate pathway. 

phospholipid: A lipid containing one or more 
phosphate groups. 
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phosphorolysis: Cleavage of a compound 
with phosphate as the attacking group; analo- 
gous to hydrolysis 

phosphorylases: Enzymes that catalyze 
phosphorolysis. 

phosphorylation: Formation of a phosphate 
derivative of a biomolecule, usually by enzy- 
matic transfer of a phosphoryl group from ATP. 
phosphorylation potential (AG P ): The 

actual free-energy change of ATP hydrolysis 
under the nonstandard conditions prevailing in 
a cell 

photochemical reaction center: The part of 
a photosynthetic complex where the energy of 
an absorbed photon causes charge separation, 
initiating electron transfer 

photon: The ultimate unit (a quantum) of 
light energy 

photophosphorylation: The enzymatic 
formation of ATP from ADP coupled to the 
light-dependent transfer of electrons in photo- 
synthetic cells. 

photoreduction: The light-induced reduction 
of an electron acceptor in photosynthetic cells 
photorespiration: Oxygen consumption 
occurring in illuminated temperate-zone 
plants, largely due to oxidation of 
phosphoglycolate, 

photosynthesis: The use of light energy to 
produce carbohydrates from carbon dioxide 
and a reducing agent such as water 
photosynthetic phosphorylation: See 

photophosphorylation 
photosystem: In photosynthetic cells, a 
functional set of light-absorbing pigments and 
its reaction center. 

phototroph: An organism that can use the 
energy of light to synthesize its own fuels from 
simple molecules such as carbon dioxide, 
oxygen, and water; as distinct from a 
chemotroph. 

pK a : The negative logarithm of an equilibrium 
constant 

plasmalogen: A phospholipid with an alkenyl 
ether substituent on the C-l of glycerol 
plasma membrane: The exterior membrane 
surrounding the cytoplasm of a cell, 

plasma proteins: The proteins present in 
blood plasma. 

plasmid: An extrachromosomal, independ- 
ently replicating, small circular DNA molecule; 
commonly employed in genetic engineering, 
plastid: In plants, a self-replicating organelle; 
may differentiate into a chloroplast 
platelets: Small, enucleated cells that initiate 
blood clotting; they arise from bone marrow 
cells called megakaryocytes Also known as 
thrombocytes, 

pleated sheet: The side-by-side, hydrogen- 
bonded arrangement of polypeptide chains in 
the extended 0 conformation, 
plectonemic: Describes a structure in a 
molecular polymer that has a net twisting of 
strands about each other in some simple and 
regular way. 



polar: Hydrophilic, or "water-loving"; 
describes molecules or groups that are 
soluble in water. 

polarity: (1) In chemistry, the nonuniform 
distribution of electrons in a molecule; polar 
molecules are usually soluble in water. (2) In 
molecular biology, the distinction between the 
5' and 3' ends of nucleic acids 
poly(A) tail: A length of adenosine residues 
added to the 3' end of many mRNAs in 
eukaryotes (and sometimes in bacteria), 
polycistronic mRNA: A contiguous mRNA 
with more than two genes that can be trans- 
lated into proteins. 

polyclonal antibodies: A heterogeneous pool 
of antibodies produced in an animal by 
different B lymphocytes in response to an 
antigen. Different antibodies in the pool recog- 
nize different parts of the antigen 
polylinker: A short, often synthetic, fragment 
of DNA containing recognition sequences for 
several restriction endonucleases. 
polymerase chain reaction (PCR): A 
repetitive laboratory procedure that results in 
a geometric amplification of a specific DNA 
sequence. 

polymorphic: Describes a protein for which 
amino acid sequence variants exist in a 
population of organisms, but the variations 
do not destroy the protein's function 
polynucleotide: A covalently linked 
sequence of nucleotides in which the 3' 
hydroxyl of the pentose of one nucleotide 
residue is joined by a phosphodiester bond 
to the 5' hydroxyl of the pentose of the next 
residue 

polypeptide: A long chain of amino acids 
linked by peptide bonds; the molecular weight 
is generally less than 10,000 

polyribosome: See polysome, 
polysaccharide: A linear or branched 
polymer of monosaccharide units linked by 
glycosidic bonds. 

polysome: A complex of an mRNA molecule 
and two or more ribosomes; also called polyri- 
bosome. 

polyunsaturated fatty acid: See PUFA. 
P/O ratio: The number of moles of ATP 
formed in oxidative phosphorylation per \0 2 
reduced (thus, per pair of electrons passed to 
0 2 ) Experimental values used in this text are 
2,5 for passage of electrons from NADH to 0 2 , 
and 1 .5 for passage of electrons from FADH to 
0 2 Some textbooks use the integral values of 
3.0 and 2.0. 

porphyria: Inherited condition resulting from 
the lack of one or more enzymes required to 
synthesize porphyrins. 

porphyrin: Complex nitrogenous compound 
containing four substituted pyrroles covalently 
joined into a ring; often complexed with a 
central metal atom. 

positive cooperativity: A property of some 
multisubunit enzymes or proteins in which 
binding of a ligand or substrate to one subunit 
facilitates binding to another subunit. 



posttranscriptional processing: The 

enzymatic processing of the primary RNA 
transcript to produce functional RNAs, 
including mRNAs, tRNAs, rRNAs, and many 
other classes of RNAs 

posttranslational modification: Enzymatic 
processing of a polypeptide chain after transla- 
tion from its mRNA. 

PPAR (peroxisome proliferator-activated 
receptor): A family of nuclear transcription 
factors, activated by lipidic ligands, that alter 
the expression of specific genes, including 
those encoding enzymes of lipid synthesis and 
breakdown 

primary structure: A description of the 
covalent backbone of a polymer (macromole- 
cule), including the sequence of monomeric 
subunits and any interchain and intrachain 
covalent bonds, 

primary transcript: The immediate RNA 
product of transcription before any posttran- 
scriptional processing reactions, 
primase: An enzyme that catalyzes the forma- 
tion of RNA oligonucleotides used as primers 
by DNA polymerases, 
primer: A short oligomer (of sugars or 
nucleotides, for example) to which an enzyme 
adds additional monomeric subunits. 
primer terminus: The end of a primer to 
which monomeric subunits are added, 
priming: (1) In protein phosphorylation, the 
phosphorylation of an amino acid residue that 
becomes the binding site and point of refer- 
ence for phosphorylation of other residues in 
the same protein (2) In DNA replication, the 
synthesis of a short oligonucleotide to which 
DNA polymerases can add additional 
nucleotides, 

primosome: An enzyme complex that synthe- 
sizes the primers required for lagging strand 
DNA synthesis. 

probe: A labeled fragment of nucleic acid 
containing a nucleotide sequence complemen- 
tary to a gene or genomic sequence that one 
wishes to detect in a hybridization experiment 
processivity: For any enzyme that catalyzes 
the synthesis of a biological polymer, the 
property of adding multiple subunits to the 
polymer without dissociating from the 
substrate 

prochiral molecule: A symmetric molecule 
that can react asymmetrically with an enzyme 
having an asymmetric active site, generating a 
chiral product, 

projection formulas: A method for repre- 
senting molecules to show the configuration of 
groups around chiral centers; also known as 
Fischer projection formulas, 
prokaryote: A term used historically to refer 
to any species in the kingdoms Bacteria and 
Archaea The differences between bacteria 
(formerly referred to as "eubacteria") and 
archaea are sufficiently great that the inclusive 
term is of marginal usefulness , A tendency 
to use "prokaryote" when referring only 
to bacteria is common and misleading; 
"prokaryote" also implies an ancestral 
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relationship to eukaryotes, which is incorrect. 
In this text, "prokaryote" and "prokaryotic" are 
no longer used 

promoter: A DNA sequence at which RNA 
polymerase may bind, leading to initiation of 
transcription. 

proofreading: The correction of errors in 
the synthesis of an information-containing 
biopolymer by removing incorrect monomeric 
subunits after they have been covalently 
added to the growing polymer, 
prostaglandins: A class of lipid-soluble, 
hormonelike regulatory molecules derived 
from arachidonate and other polyunsaturated 
fatty acids 

prosthetic group: A metal ion or an organic 
compound (other than an amino acid) that is 
covalently bound to a protein and is essential 
to its activity, 

proteasome: Supramolecular assembly of 
enzymatic complexes that function in the 
degradation of damaged or unneeded cellular 
proteins, 

protein: A macromolecule composed of one 
or more polypeptide chains, each with a char- 
acteristic sequence of amino acids linked by 
peptide bonds. 

Protein Data Bank: See PDB 
protein kinases: Enzymes that transfer the 
terminal phosphoryl group of ATP or another 
nucleoside triphosphate to a Ser, Thr, Tyr, Asp, 
or His side chain in a target protein, thereby 
regulating the activity or other properties of 
that protein, 

protein targeting: The process by which 
newly synthesized proteins are sorted and 
transported to their proper locations in 
the cell, 

proteoglycan: A hybrid macromolecule 
consisting of a heteropolysaccharide joined to 
a polypeptide; the polysaccharide is the major 
component 

proteome: The full complement of proteins 
expressed in a given cell, or the complete 
complement of proteins that can be expressed 
by a given genome. 

proteomics: Broadly, the study of the protein 
complement of a cell or organism, 
protomer: A general term describing any 
repeated unit of one or more stably associated 
protein subunits in a larger protein structure. 
If a protomer has multiple subunits, the 
subunits may be identical or different 
proton acceptor: An anionic compound 
capable of accepting a proton from a proton 
donor; that is, a base 

proton donor: The donor of a proton in an 
acid-base reaction; that is, an acid, 
proton-motive force: The electrochemical 
potential inherent in a transmembrane 
gradient of H + concentration; used in oxida- 
tive phosphorylation and photophosphoryla- 
tion to drive ATP synthesis, 
proto-oncogene: A cellular gene, usually 
encoding a regulatory protein, that can be 
converted into an oncogene by mutation 



protoplasm: A general term referring to the 
entire contents of a living cell, 
PUFA (polyunsaturated fatty acid): A fatty 
acid with more than one double bond, gener- 
ally nonconjugated 

purine: A nitrogenous heterocyclic base found 
in nucleotides and nucleic acids; containing 
fused pyrimidine and imidazole rings 
puromycin: An antibiotic that inhibits 
polypeptide synthesis by being incorporated 
into a growing polypeptide chain, causing its 
premature termination, 
pyranose: A simple sugar containing the six- 
membered pyran ring, 

pyridine nucleotide: A nucleotide coenzyme 
containing the pyridine derivative 
nicotinamide; NAD or NAD P. 
pyridoxal phosphate: A coenzyme 
containing the vitamin pyridoxine (vitamin 
B 6 ); functions in reactions involving amino 
group transfer. 

pyrimidine: A nitrogenous heterocyclic base 
found in nucleotides and nucleic acids, 
pyrimidine dimer: A covalently joined dimer 
of two adjacent pyrimidine residues in DNA, 
induced by absorption of UV light; most 
commonly derived from two adjacent thymines 
(a thymine dimer) . 

pyrophosphatase: See inorganic pyrophos- 
phatase, 

q 

quantum: The ultimate unit of energy, 
quaternary structure: The three- 
dimensional structure of a multisubunit 
protein, particularly the manner in which the 
subunits fit together 

r 

racemic mixture (racemate): An equimolar 
mixture of the D and L stereoisomers of an 
optically active compound, 
radical: An atom or group of atoms possessing 
an unpaired electron; also called a free radical, 
radioactive isotope: An isotopic form of an 
element with an unstable nucleus that stabi- 
lizes itself by emitting ionizing radiation, 
radioimmunoassay (RIA): A sensitive, 
quantitative method for detecting trace 
amounts of a biomolecule, based on its 
capacity to displace a radioactive form of the 
molecule from combination with its specific 
antibody, 

rate constant: The proportionality constant 
that relates the velocity of a chemical 
reaction to the concentration(s) of the 
reactant(s). 

rate-limiting step: (1) Generally, the step in 
an enzymatic reaction with the greatest activa- 
tion energy or the transition state of highest 
free energy (2) The slowest step in a meta- 
bolic pathway. 

reaction intermediate: Any chemical 
species in a reaction pathway that has a finite 
chemical lifetime 



reactive oxygen species (ROS): Highly 
reactive products of the partial reduction of 
0 2 , including hydrogen peroxide (H 2 0 2 ), 
superoxide ('O2), and hydroxyl free radical 
*OH, produced as minor byproducts during 
oxidative phosphorylation, 
reading frame: A contiguous, nonoverlapping 
set of three-nucleotide codons in DNA or RNA. 
receptor T^r kinase (RTK): A large family 
of plasma membrane proteins with ligand- 
binding sites on the extracellular domain, a 
single transmembrane helix, and a cytoplasmic 
domain with protein Tyr kinase activity 
controlled by the extracellular ligand. 
recombinant DNA: DNA formed by the 
joining of genes into new combinations, 
recombination: Any enzymatic process by 
which the linear arrangement of nucleic acid 
sequences in a chromosome is altered by 
cleavage and rejoining, 
recombinational DNA repair: 
Re combinational processes directed at the 
repair of DNA strand breaks or cross-links, 
especially at inactivated replication forks, 
redox pair: An electron donor and its corre- 
sponding oxidized form; for example, NADH 
and NAD + . 

redox reaction: See oxidation-reduction 
reaction. 

reducing agent (reductant): The electron 
donor in an oxidation-reduction reaction, 
reducing end: The end of a polysaccharide 
having a terminal sugar with a free anomeric 
carbon; the terminal residue can act as a 
reducing sugar 

reducing equivalent: A general term for an 
electron or an electron equivalent in the form 
of a hydrogen atom or a hydride ion 
reducing sugar: A sugar in which the 
carbonyl (anomeric) carbon is not involved in 
a glycosidic bond and can therefore undergo 
oxidation. 

reduction: The gain of electrons by a 
compound or ion. 

regulatory enzyme: An enzyme with a regula- 
tory function, through its capacity to undergo a 
change in catalytic activity by allosteric mecha- 
nisms or by covalent modification, 
regulatory gene: A gene that gives rise to a 
product involved in the regulation of the 
expression of another gene; for example, a 
gene encoding a repressor protein, 
regulatory sequence: A DNA sequence 
involved in regulating the expression of a gene; 
for example, a promoter or operator, 
regulon: A group of genes or operons that are 
coordinately regulated even though some, or 
all, may be spatially distant in the chromosome 
or genome 

relaxed DNA: Any DNA that exists in its most 
stable and unstrained structure, typically the B 
form under most cellular conditions, 
release factors: Protein factors of the cytosol 
required for the release of a completed 
polypeptide chain from a ribosome; also known 
as termination factors. 
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releasing factors: Hypothalamic hormones 
that stimulate release of other hormones by 
the pituitary gland 

renaturation: Refolding of an unfolded 
(denatured) globular protein so as to restore 
its native structure and function, 
replication: Synthesis of daughter nucleic 
acid molecules identical to the parental nucleic 
acid. 

replication fork: The Y-shaped structure 
generally found at the point where DNA is 
being synthesized 

replicative form: Any of the full-length 
structural forms of a viral chromosome that 
serve as distinct replication intermediates 
replisome: The multiprotein complex that 
promotes DNA synthesis at the replication 
fork 

repressible enzyme: In bacteria, an enzyme 
whose synthesis is inhibited when its reaction 
product is readily available to the cell 
repression: A decrease in the expression of a 
gene in response to a change in the activity of 
a regulatory protein 

repressor: The protein that binds to the 
regulatory sequence or operator for a gene, 
blocking its transcription 
residue: A single unit in a polymer; for 
example, an amino acid in a polypeptide chain. 
The term reflects the fact that sugars, 
nucleotides, and amino acids lose a few atoms 
(generally the elements of water) when incor- 
porated in their respective polymers 
respiration: Any metabolic process that leads 
to the uptake of oxygen and the release of C0 2 
respiration-linked phosphorylation: ATP 
formation from ADP and P b driven by electron 
flow through a series of membrane-bound 
carriers, with a proton gradient as the direct 
source of energy driving rotational catalysis by 
ATP synthase. 

respiratory chain: The electron-transfer 
chain; a sequence of electron-carrying proteins 
that transfers electrons from substrates to 
molecular oxygen in aerobic cells, 
response element: A region of DNA, near 
(upstream from) a gene, that is bound by 
specific proteins that influence the rate of 
transcription of the gene, 
restriction endonucleases: Site-specific 
endodeoxyribonucleases that cleave both 
strands of DNA at points in or near the specific 
site recognized by the enzyme; important tools 
in genetic engineering 

restriction fragment: A segment of double- 
stranded DNA produced by the action of a 
restriction endonuclease on a larger DNA 
restriction fragment length polymor- 
phisms (RFLPs): Variations, among individ- 
uals in a population, in the length of certain 
restriction fragments in which certain genomic 
sequences occur. These variations result from 
rare sequence changes that create or destroy 
restriction sites in the genome 

retrovirus: An RNA virus containing a reverse 
transcriptase 



reverse transcriptase: An RNA-directed 
DNA polymerase in retroviruses; capable of 
making DNA complementary to an RNA. 
reversible inhibition: Inhibition by a mole- 
cule that binds reversibly to the enzyme, such 
that the enzyme activity returns when the 
inhibitor is no longer present. 
RFLPs: See restriction fragment length 
polymorphisms 

R group: (1) Formally, an abbreviation 
denoting any alkyl group (2) Occasionally, 
used in a more general sense to denote virtu- 
ally any organic substituent (the R groups of 
amino acids, for example) 
ribonuclease: A nuclease that catalyzes the 
hydrolysis of certain internucleotide linkages 
of RNA. 

ribonucleic acid: See RNA. 
ribonucleotide: A nucleotide containing D- 
ribose as its pentose component, 
ribosomal RNA (rRNA): A class of RNA 
molecules serving as components of ribo- 
somes. 

ribosome: A supramolecular complex of 
rRNAs and proteins, approximately 18 to 22 nm 
in diameter; the site of protein synthesis 
riboswitch: A structured segment of an 
mRNA that binds to a specific ligand and 
affects the translation or processing of the 
mRNA. 

ribozymes: Ribonucleic acid molecules with 
catalytic activities; RNA enzymes, 
Rieske iron-sulfur protein: A type of iron- 
sulfur protein in which two of the ligands to 
the central iron ion are His side chains These 
proteins act in many electron-transfer 
sequences, including oxidative phosphoryla- 
tion and photophosphorylation. 
RNA (ribonucleic acid): A 
polyribonucleotide of a specific sequence 
linked by successive 3',5'-phosphodiester 
bonds. 

RNA editing: Posttranscriptional modification 
of an mRNA that alters the meaning of one or 
more codons during translation. 
RNA polymerase: An enzyme that catalyzes 
the formation of RNA from ribonucleoside 
5'-triphosphates, using a strand of DNA or 
RNA as a template. 

RNA splicing: Removal of introns and joining 
of exons in a primary transcript. 
ROS: See reactive oxygen species. 
rRNA: See ribosomal RNA. 
RTK: See receptor Tyr kinase. 



salvage pathway: Synthesis of a biomolecule, 
such as a nucleotide, from intermediates in the 
degradative pathway for the biomolecule; a 
recycling pathway, as distinct from a de novo 
pathway. 

saponification: Alkaline hydrolysis of triacyl- 
glycerols to yield fatty acids as soaps, 
sarcomere: A functional and structural unit of 
the muscle contractile system. 



satellite DNA: Highly repeated, nontrans- 
lated segments of DNA in eukaryotic chromo- 
somes; most often associated with the 
centromeric region. Its function is unknown. 
Also called simple-sequence DNA. 
saturated fatty acid: A fatty acid containing 
a fully saturated alkyl chain, 
scaffold proteins: Noncatalytic proteins that 
nucleate formation of multienzyme complexes 
by providing two or more specific binding sites 
for those proteins 

scramblases: Membrane proteins that 
catalyze the movement of phospholipids across 
the membrane bilayer, leading to uniform 
distribution of a lipid between the two 
membrane leaflets. 

secondary metabolism: Pathways that lead 
to specialized products not found in every 
living cell. 

secondary structure: The local spatial 
arrangement of the main-chain atoms in a 
segment of a polypeptide chain; also applied to 
polynucleotide structure. 

second law of thermodynamics: The law 

stating that, in any chemical or physical process, 
the entropy of the universe tends to increase, 
second messenger: An effector molecule 
synthesized in a cell in response to an external 
signal (first messenger) such as a hormone, 
sedimentation coefficient: A physical 
constant specifying the rate of sedimentation 
of a particle in a centrifugal field under speci- 
fied conditions 

selectins: A large family of membrane 
proteins, lectins that bind oligosaccharides on 
other cells tightly and specifically and serve to 
carry signals across the plasma membrane. 
SELEX: A method for rapid experimental 
identification of nucleic acid sequences 
(usually RNA) that have particular catalytic 
or ligand-binding properties, 
sequence polymorphisms: Any alterations in 
genomic sequence (base-pair changes, inser- 
tions, deletions, rearrangements) that help 
distinguish subsets of individuals in a popula- 
tion or distinguish one species from another, 
sequence-tagged site (STS): Any known 
sequence that has been mapped in a chromo- 
some and/or clones derived from it. 
serpentine receptors: See G protein- 
coupled receptors. 

Shine-Dalgarno sequence: A sequence in an 
mRNA that is required for binding bacterial 
ribosomes 

short tandem repeat (STR): A short (typi- 
cally 3 to 6 bp) DNA sequence, repeated many 
times in tandem at a particular location in a 
chromosome. 

SH2 domain: A protein domain that binds 
tightly to a phosphotyrosine residue in certain 
proteins such as the receptor Tyr kinases, 
initiating the formation of a multiprotein 
complex that acts in a signaling pathway, 
shuttle vector: A recombinant DNA vector 
that can be replicated in two or more different 
host species. See also vector, 
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sialoadhesin: A lectin on the outside surface 
of a cell that binds carbohydrate ligands with 
terminal sialic acids; one of the family of 
siglecs. 

sickle-cell anemia: A human disease charac- 
terized by defective hemoglobin molecules in 
individuals homozygous for a mutant allele 
coding for the /3 chain of hemoglobin, 
sickle-cell trait: A human condition recog- 
nized by the sickling of erythrocytes when 
exposed to low oxygen tension; occurs in 
individuals heterozygous for the allele respon- 
sible for sickle-cell anemia 
siglec: One of a family of cell surface lectins, 
the sialic acid binding Ig-like lectins, with 
multiple immunoglobulin-like domains; it 
binds surface structures containing sialic 
acid. 

signal sequence: An amino acid sequence, 
often at the amino terminus, that signals the 
cellular fate or destination of a newly synthe- 
sized protein. 

signal transduction: The process by which 
an extracellular signal (chemical, mechanical, 
or electrical) is amplified and converted to a 
cellular response. 

silent mutation: A mutation in a gene that 
causes no detectable change in the biological 
characteristics of the gene product, 
simple diffusion: The movement of solute 
molecules across a membrane to a region of 
lower concentration, unassisted by a protein 
transporter. 

simple protein: A protein yielding only amino 
acids on hydrolysis. 

single nucleotide polymorphism (SNP): A 

genomic base-pair change that helps distin- 
guish one species from another or one subset 
of individuals in a population, 
site-directed mutagenesis: A set of methods 
used to create specific alterations in the 
sequence of a gene. 

site-specific recombination: A type of 
genetic recombination that occurs only at 
specific sequences. 

size-exclusion chromatography: A proce- 
dure for the separation of molecules by size, 
based on the capacity of porous polymers to 
exclude solutes above a certain size; also 
called gel filtration. 

small nuclear RNA (snRNA): A class of 
short RNAs, typically 100 to 200 nucleotides 
long, found in the nucleus and involved in the 
splicing of eukaryotic mRNAs. 
small nucleolar RNA (snoRNA): A class of 
short RNAs, generally 60 to 300 nucleotides 
long, that guide the modification of rRNAs in 
the nucleolus. 

somatic cells: All body cells except the germ- 
line cells. 

SOS response: In bacteria, a coordinated 
induction of a variety of genes in response to 
high levels of DNA damage 
Southern blot: A DNA hybridization proce- 
dure in which one or more specific DNA frag- 
ments are detected in a larger population by 



hybridization to a complementary, labeled 
nucleic acid probe, 

specific acid-base catalysis: Acid or base 
catalysis involving the constituents of water 
(hydroxide or hydronium ions), 
specific activity: The number of micromoles 
(yLtmol) of a substrate transformed by an 
enzyme preparation per minute per milligram 
of protein at 25 °C; a measure of enzyme purity, 
specific heat: The amount of energy (in 
joules or calories) needed to raise the temper- 
ature of 1 g of a pure substance by 1 °C. 
specific rotation: The rotation, in degrees, of 
the plane of plane-polarized light (D-line of 
sodium) by an optically active compound at 
25 °C, with a specified concentration and light 
path. 

specificity: The ability of an enzyme or 
receptor to discriminate among competing 
substrates or ligands 

sphingolipid: An amphipathic lipid with a 
sphingosine backbone to which are attached a 
long-chain fatty acid and a polar alcohol, 
spliceosome: A complex of RNAs and 
proteins involved in the splicing of mRNAs in 
eukaryotic cells. 

splicing: See gene splicing; RNA splicing, 
standard free-energy change (AG 0 ): The 

free-energy change for a reaction occurring 
under a set of standard conditions: tempera- 
ture, 298 K; pressure, 1 atm or 101.3 kPa; and 
all solutes at 1 m concentration. AG'° denotes 
the standard free-energy change at pH 7.0 in 
55 5 M water. 

standard reduction potential (£'°): The 

electromotive force exhibited at an electrode 
by 1 m concentrations of a reducing agent 
and its oxidized form at 25 °C and pH 7 0; a 
measure of the relative tendency of the 
reducing agent to lose electrons, 
steady state: A nonequilibrium state of a 
system through which matter is flowing and in 
which all components remain at a constant 
concentration. 

stem cells: The common, self-regenerating 
cells in bone marrow that give rise to differen- 
tiated blood cells such as erythrocytes and 
lymphocytes. 

stereoisomers: Compounds that have the 
same composition and the same order of 
atomic connections but different molecular 
arrangements. 

sterols: A class of lipids containing the steroid 
nucleus. 

sticky ends: Two DNA ends in the same DNA 
molecule, or in different molecules, with short 
overhanging single-stranded segments that are 
complementary to one another, facilitating 
ligation of the ends; also known as cohesive 
ends 

stimulatory G protein (G s ): A trimeric 
regulatory GTP-binding protein that, when 
activated by an associated plasma membrane 
receptor, stimulates a neighboring membrane 
enzyme such as adenylyl cyclase. Its effects 
oppose those of G ir 



stop codons: See termination codons. 
STR: See short tandem repeat, 
stroma: The space and aqueous solution 
enclosed within the inner membrane of a 
chloroplast, not including the contents in the 
thylakoid membranes 

structural gene: A gene coding for a protein 
or RNA molecule; as distinct from a regulatory 
gene 

substitution mutation: A mutation caused 
by the replacement of one base by another, 
substrate: The specific compound acted upon 
by an enzyme, 

substrate channeling: Movement of the 
chemical intermediates in a series of enzyme- 
catalyzed reactions from the active site of one 
enzyme to that of the next enzyme in the 
pathway, without leaving the surface of a 
protein complex that includes both enzymes, 
substrate-level phosphorylation: 
Phosphorylation of ADP or some other nucleo- 
side 5 '-diphosphate coupled to the dehydro- 
genation of an organic substrate; independent 
of the electron-transfer chain, 
suicide inactivator: A relatively inert mole- 
cule that is transformed by an enzyme, at its 
active site, into a reactive substance that 
irreversibly inactivates the enzyme, 
supercoil: The twisting of a helical (coiled) 
molecule on itself; a coiled coil, 
supercoiled DNA: DNA that twists upon 
itself because it is under- or overwound (and 
thereby strained) relative to B-form DNA. 
superhelical density: In a helical molecule 
such as DNA, the number of supercoils (super- 
helical turns) relative to the number of coils 
(turns) in the relaxed molecule, 
supersecondary structure: See motif, 
suppressor mutation: A mutation that 
totally or partially restores a function lost by a 
primary mutation; located at a site different 
from the site of the primary mutation. 
Svedberg (S): A unit of measure of the rate 
at which a particle sediments in a centrifugal 
field. 

symbionts: Two or more organisms that are 
mutually interdependent; usually living in 
physical association. 

symport: Cotransport of solutes across a 
membrane in the same direction, 
synteny: Conserved gene order along the 
chromosomes of different species, 
synthases: Enzymes that catalyze condensa- 
tion reactions in which no nucleoside triphos- 
phate is required as an energy source, 
synthetases: Enzymes that catalyze conden- 
sation reactions using ATP or another nucleo- 
side triphosphate as an energy source, 
system: An isolated collection of matter; all 
other matter in the universe apart from the 
system is called the surroundings, 
systems biology: The study of complex 
biochemical systems, integrating the functions 
of several to all of the macromolecules in a cell 
(RNA, DNA, proteins). 
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telomere: Specialized nucleic acid structure 
found at the ends of linear eukaryotic 
chromosomes, 

template: A macromolecular mold or pattern 
for the synthesis of an informational 
macromolecule. 

template strand: A strand of nucleic acid 
used by a polymerase as a template to synthe- 
size a complementary strand, 
terminal transferase: An enzyme that 
catalyzes the addition of nucleotide residues of 
a single kind to the 3' end of DNA chains 
termination codons: UAA, UAG, and UGA; in 
protein synthesis, these codons signal the 
termination of a polypeptide chain Also 
known as stop codons 
termination factors: See release factors 
termination sequence: A DNA sequence, at 
the end of a transcriptional unit, that signals 
the end of transcription, 
terpenes: Organic hydrocarbons or hydro- 
carbon derivatives constructed from recurring 
isoprene units. They produce some of the 
scents and tastes of plant products, 
tertiary structure: The three-dimensional 
conformation of a polymer in its native folded 
state. 

tetrahydrobiopterin: The reduced coenzyme 
form of biopterin 

tetrahydrofolate: The reduced, active coen- 
zyme form of the vitamin folate 
thermogenesis: The biological generation of 
heat by muscle activity (shivering), uncoupled 
oxidative phosphorylation, or the operation of 
futile cycles. 

thermogenin: A protein of the inner mito- 
chondrial membrane in brown adipose tissue 
that allows transmembrane movement of 
protons, short-circuiting the normal use of 
protons to drive ATP synthesis and dissipating 
the energy of substrate oxidation as heat; also 
called uncoupling protein (UCP) 
thiamine pyrophosphate (TPP): The active 
coenzyme form of vitamin involved in 
aldehyde transfer reactions, 
thioester: An ester of a carboxylic acid with a 
thiol or mercaptan. 

3' end: The end of a nucleic acid that lacks 
a nucleotide bound at the 3' position of 
the terminal residue 
thrombocytes: See platelets, 
thromboxanes: A class of molecules derived 
from arachidonate and involved in platelet 
aggregation during blood clotting 
thylakoid: Closed cisterna, or disk, formed by 
the pigment-bearing internal membranes of 
chloroplasts, 

thymine dimer: See pyrimidine dimer. 
tissue culture: Method by which cells derived 
from multicellular organisms are grown in 
liquid media 

titration curve: A plot of pH versus the 
equivalents of base added during titration of 
an acid. 



T lymphocyte (T cell): One of a class of 
blood cells (lymphocytes) of thymic origin, 
involved in cell-mediated immune reactions, 
tocopherols: Forms of vitamin E , 
topoisomerases: Enzymes that introduce 
positive or negative supercoils in closed, 
circular duplex DNA 

topoisomers: Different forms of a covalently 
closed, circular DNA molecule that differ only 
in their linking number. 

topology: The study of the properties of an 
object that do not change under continuous 
deformations such as twisting or bending, 
toxins: Proteins produced by some organisms 
and toxic to certain other species. 
TPP: See thiamine pyrophosphate, 
trace element: A chemical element required 
by an organism in only trace amounts, 
transaminases: See aminotransferases, 
transamination: Enzymatic transfer of an 
amino group from an a-amino acid to an 
a-keto acid. 

transcription: The enzymatic process 
whereby the genetic information contained in 
one strand of DNA is used to specify a comple- 
mentary sequence of bases in an mRNA chain, 
transcriptional control: The regulation 
of a protein's synthesis by regulation of the 
formation of its mRNA. 

transcription factor: In eukaryotes, a 
protein that affects the regulation and tran- 
scription initiation of a gene by binding to a 
regulatory sequence near or within the gene 
and interacting with RNA polymerase and/or 
other transcription factors, 
transcriptome: The entire complement of 
RNA transcripts present in a given cell or 
tissue under specific conditions, 
transduction: (1) Generally, the conversion 
of energy or information from one form to 
another. (2) The transfer of genetic informa- 
tion from one cell to another by means of a 
viral vector. 

transfer RNA (tRNA): A class of RNA mole- 
cules (M r 25,000 to 30,000), each of which 
combines covalently with a specific amino acid 
as the first step in protein synthesis, 
transformation: Introduction of an exoge- 
nous DNA into a cell, causing the cell to 
acquire a new phenotype. 
transgenic: Describes an organism that has 
genes from another organism incorporated in 
its genome as a result of recombinant DNA 
procedures. 

transition state: An activated form of a 
molecule in which the molecule has undergone 
a partial chemical reaction; the highest point 
on the reaction coordinate, 
translation: The process in which the genetic 
information present in an mRNA molecule 
specifies the sequence of amino acids during 
protein synthesis. 

translational control: The regulation of a 
protein's synthesis by regulation of the rate of 
its translation on the ribosome 



translational frameshifting: A programmed 
change in the reading frame during translation 
of an mRNA on a ribosome, occurring by any 
of several mechanisms, 
translational repressor: A repressor that 
binds to an mRNA, blocking translation, 
translocase: (1) An enzyme that catalyzes 
membrane transport . (2) An enzyme that 
causes a movement such as the movement 
of a ribosome along an mRNA. 
transpiration: Passage of water from the 
roots of a plant to the atmosphere via the 
vascular system and the stomata of the leaves, 
transporters: Proteins that span a membrane 
and transport specific nutrients, metabolites, 
ions, or proteins across the membrane; some- 
times called permeases, 
transposition: The movement of a gene or 
set of genes from one site in the genome to 
another. 

transposon (transposable element): A 

segment of DNA that can move from one 
position in the genome to another, 
triacylglycerol: An ester of glycerol with 
three molecules of fatty acid; also called a 
triglyceride or neutral fat. 

tricarboxylic acid cycle: See citric acid 
cycle, 

triose: A simple sugar with a backbone 
containing three carbon atoms, 
tRNA: See transfer RNA. 
tropic hormone (tropin): A peptide 
hormone that stimulates a specific target 
gland to secrete its hormone; for example, 
thyrotropin produced by the pituitary stimu- 
lates secretion of thyroxine by the thyroid 
tumor suppressor gene: One of a class of 
genes that encode proteins that normally 
suppress the division of cells, When defective, 
the normal gene becomes a tumor suppressor 
gene, and when both copies are defective, the 
cell is allowed to continue dividing without 
limitation; it becomes a tumor celL 
turnover number: The number of times an 
enzyme molecule transforms a substrate 
molecule per unit time, under conditions 
giving maximal activity at substrate concentra- 
tions that are saturating, 
two-component signaling systems: Signal- 
transducing systems found in bacteria and 
plants, composed of a receptor His kinase that 
phosphorylates an internal His residue when 
occupied by its ligand. It then catalyzes phos- 
phoryl transfer to a second component, the 
response regulator, which, when phosphory- 
lated, alters the output of the signaling system 

U 

ubiquitin: A small, highly conserved protein 
that targets an intracellular protein for degra- 
dation by proteasomes Several ubiquitin 
molecules are covalently attached in tandem 
to a Lys residue in the target protein by a 
specific ubiquitinating enzyme, 
ultraviolet (UV) radiation: Electromagnetic 
radiation in the region of 200 to 400 nm 
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uncompetitive inhibition: The reversible 
inhibition pattern resulting when an inhibitor 
molecule can bind to the enzyme-substrate 
complex but not to the free enzyme, 
uncoupling agent: A substance that uncou- 
ples phosphorylation of ADP from electron 
transfer; for example, 2,4-dinitrophenoL 
uniport: A transport system that carries only 
one solute, as distinct from cotransport. 
unsaturated fatty acid: A fatty acid 
containing one or more double bonds 
urea cycle: A cyclic metabolic pathway in 
vertebrate liver, synthesizing urea from amino 
groups and carbon dioxide, 
ureotelic: Excreting excess nitrogen in the 
form of urea. 

uricotelic: Excreting excess nitrogen in the 
form of urate (uric acid), 

V 

F max : The maximum velocity of an enzymatic 
reaction when the binding site is saturated 
with substrate. 

van der Waals interaction: Weak intermole- 
cular forces between molecules as a result of 
each inducing polarization in the other, 
vector: A DNA molecule known to replicate 
autonomously in a host cell, to which a 
segment of DNA may be spliced to allow its 
replication; for example, a plasmid or an artifi- 
cial chromosome. 

vectorial: Describes an enzymatic reaction or 
transport process in which the protein has a 



specific orientation in a biological membrane 
such that the substrate is moved from one side 
of the membrane to the other as it is converted 
into product, 

vectorial metabolism: Metabolic transforma- 
tions in which the location (not the chemical 
composition) of a substrate changes relative to 
the plasma membrane or a membrane between 
two cellular compartments Transporters 
catalyze vectorial reactions, as do the proton 
pumps of oxidative phosphorylation and 
photophosphorylation . 
viral vector: A viral DNA altered so that it 
can act as a vector for recombinant DNA. 
virion: A virus particle, 
virus: A self-replicating, infectious, nucleic 
acid-protein complex that requires an intact 
host cell for its replication; its genome is either 
DNA or RNA 

vitamin: An organic substance required in 
small quantities in the diet of some species; 
generally functions as a component of a coen- 
zyme 

W 

white adipose tissue (WAT): Nonthermo- 
genic adipose tissue rich in triacylglycerols 
stored and mobilized in response to hormonal 
signals. Transfer of electrons in the mito- 
chondrial respiratory chain is tightly coupled 



to ATP synthesis. Compare brown adipose 
tissue. 

wild type: The normal (unmutated) genotype 
or phenotype. 

wobble: The relatively loose base pairing 
between the base at the 3' end of a codon and 
the complementary base at the 5' end of the 
anticodon 

X 

x-ray crystallography: The analysis of x-ray 
diffraction patterns of a crystalline compound, 
used to determine the molecule's three-dimen- 
sional structure. 

Z 

zinc finger: A specialized protein motif 
involved in DNA recognition by some DNA- 
binding proteins; characterized by a single 
atom of zinc coordinated to four Lys residues 
or to two His and two Lys residues, 
zwitterion: A dipolar ion with spatially 
separated positive and negative charges, 
zymogen: An inactive precursor of an enzyme; 
for example, pepsinogen, the precursor of 
pepsin. 
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Common Abbreviations in the Biochemical Research Literature 



A 

A 

A 

Ab 

ABC 

ACAT 

ACC 

ACh 

ACP 

ACTH 

acyl-CoA 

ADH 

adoHcy 

adoMet 

AFM 

Ag 

AGE 

AIDS 

AKAP 

Ala 

[«]rf C 

AMP, ADP, ATP 

AMPK 

ANF 

AQP 

Arg 

ARS 

Asn 

Asp 

ATCase 

atm 

ATPase 

B12 

BAC 

BAT 

BMR 

bp 

1,3-BPG 

C 

CAM 

CaM 

cAMP 

CD 

CDK 

cDNA 

CFTR 

cGMP 
Chi 



angstrom 

adenine, adenosine, or adenylate 

absorbance 

antibody 

ATP-binding cassette 

acyl-CoA cholesterol acyl transferase 

acetyl-CoA carboxylase 

acetylcholine 

acyl carrier protein 

adrenocorticotropic hormone 

acyl derivatives of coenzyme A 

(also, acyl-S-CoA) 

alcohol dehydrogenase 

iS-adenosylhomocysteine 

S-adenosylmethionine (also, SAM) 

atomic force microscopy 

antigen 

advanced glycation end product 
acquired immunodeficiency syndrome 
A kinase anchoring protein 
alanine 

specific rotation 

adenosine 5'-mono-, di-, triphosphate 

AMP-activated protein kinase 

atrial natriuretic factor 

aquaporin 

arginine 

autonomously replicating sequence 

asparagine 

aspartate 

aspartate transcarbamoylase 
atmosphere 

adenosine triphosphatase 
coenzyme B 12 , cobalamin 
bacterial artificial chromosome 
brown adipose tissue 
basal metabolic rate 
base pair 

1 ,3-bisphosphoglycerate 
cytosine, cytidine, or cytidylate 
crassulacean acid metabolism 
calmodulin 
3\5'-cyclie AMP 
circular dichroism 
cyclin-dependent protein kinase 
complementary DNA 

cystic fibrosis transmembrane conductance 

regulator 

3',5'-cyclic GMP 

chlorophyll 



ChREBP carbohydrate response element binding 
protein 

CKII casein kinase II 

CL cardiolipin 

CMP, CDP, CTP cytidine 5' -mono-, di-, triphosphate 

CoA coenzyme A (also, CoASH) 

COHb carbon monoxide hemoglobin 

CoQ coenzyme Q (ubiquinone; also, UQ) 

COX cyclooxygenase; cytochrome oxidase 

CREB cyclic AMP response element binding 
protein 

CRP cAMP receptor protein 

Cys cysteine 

Cyt cytochrome 

D dihydrouridine 

D diffusion coefficient 

d density 

dADP, dGDP, etc. deoxyadenosine 5 '-diphosphate, 

deoxyguanosine 5'-diphosphate, etc. 

dAMP, dGMP, etc. deoxyadenosine 5 '-monophosphate, 

deoxyguanosine 5 '-monophosphate, etc. 

dATP, dGTP, etc. deoxyadenosine 5 '-triphosphate, 

deoxyguanosine 5'-triphosphate, etc. 

DEAE diethylarninoethyl 

DFP diisopropylfluorophosphate (also, DIFP) 

DHAP dihydroxyacetone phosphate 

DHF dihydrofolate (also, H 2 folate) 

DMS dimethyl sulfate 

DNA deoxyribonucleic acid 

DNase deoxyribonuclease 

DNP 2,4-dinitrophenol 

Dol dolichol 

dopa dmydroxyphenylalanine 

E electrical potential 

E .C Enzyme Commission (followed by numbers 
indicating the formal classification of an 
enzyme) 

ECM extracellular matrix 

EDTA ethylenediaminetetraacetate 

EF elongation factor 

EGF epidermal growth factor 

ELISA enzyme-linked immunosorbent assay 

EM electron microscopy 

emf electromotive force 

EPO erythropoietin 

e molar absorption coefficient 

ER endoplasmic reticulum 

EST expressed sequence tag 

7] viscosity 

j faraday 

/ fractional coefficient 




Common Abbreviations 



FA 

FAD, FADH 2 
FAS 

FBPase-1 
FBPase-2 
Fd 

FDNB (DNFB) 

FFA 

FH 

Met 

FMN, FMNH 2 

FOXOl 

FP 

F1P 

F6P 

FRAP 

FRET 

Fru 

AG 

AG° 

AG'° 

AG* 
AG B 
AG n 



GABA 

Gal 

GalN 

GalNAc 

GAP 

GDH 

GEF 

GFP 

GH 

GHB 

GLC 

Glc 

GlcA 

GlcN 

GlcNAc 

GlcUA 

Gin 

Glu 

GLUT 

Gly 

GMP, GDP, GTP 

GPCR 

GPI 

G1P 
G6P 



fatty acid 

flavin adenine dinucleotide, and its reduced 
form 

fatty acid synthase 
fructose 1,6-bisphosphatase 
fructose 2,6-bisphosphatase 
ferredoxin 

l-fluoro-2,4-dinitrobenzene 
mitochondrial ATP synthase 
free (unesterified) fatty acid 
familial hypercholesterolemia 
A/"-formylmethionine 

flavin mononucleotide, and its reduced form 

forkhead box other 

flavoprotein 

fructose 1 -phosphate 

fructose 6-phosphate 

fluorescence recovery after photobleaching 

fluorescence resonance energy transfer 

D-fructose 

free-energy change 

standard free-energy change 

standard free-energy change in the 

biochemical standard state 

activation energy 

binding energy 

free-energy change of ATP hydrolysis 
(phosphorylation potential) under cellular 
conditions 

guanine, guanosine, or guanylate 
acceleration due to gravity 
y-aminobutyric acid 
D-galactose 
D-galactosamine 
7V-acetyl-D-galactosamme 
glyceraldehyde 3-phosphate; GTPase activa- 
tor protein 

glutamate dehydrogenase 

guanosine nucleotide-exchange factor 

green fluorescent protein 

growth hormone 

glycated hemoglobin 

gas-liquid chromatography 

D-glucose 

D-gluconic acid 

D-glucosamine 

A^acetyl-D-glucosamine (also, NAG) 

D-glucuronic acid 

glutamine 

glutamate 

glucose transporter 

glycine 

guanosine 5 '-mono-, di-, triphosphate 
G protein-coupled receptor 
glycosylated derivative of 
phosphatidylinositol 
glucose 1-phosphate 
glucose 6-phosphate 



G6PD 
GRK 

GSH, GSSG 

GSK3 

AH 

HA 

Hb, Hb0 2 

HDL 

H 2 folate 

H 4 folate 

HIF 

His 

HIV 

HMG-CoA 

hnRNA 

HPLC 

HRE 

Hsp 

HTH 

Hyp 

I 

IF 

Ig 

IgG 

He 

IMP, IDP, ITP 

IP 3 
IR 
IRP 

IRS-l.IRS-2 

IS 

J 

JAK 

K a 
K d 
K D 

^eq 
-^eq 

Ki 
Km 

k 

^cat 

kb 

kbp 

a-KG 

lac 

A 

LDH 

LDL 

Leu 

LH 

LHC 

Lk 

In 

log 

LT 



glucose 6-phosphate dehydrogenase 
G protein-coupled receptor kinase 
glutathione, and its oxidized form 
glycogen synthase kinase 3 
enthalpy change 
hemagglutinin protein 
hemoglobin, oxyhemoglobin 
high-density lipoprotein 
dihydrofolate (also, DHF) 
tetrahydrofolate (also, THF) 
hypoxia-inducible transcription factor 
histidine 

human immunodeficiency virus 

j8-hydroxy-j8-methylglutaryl-CoA 

heterogeneous nuclear RNA 

high-performance liquid chromatography 

hormone response element 

heat shock protein 

helix-turn-helix 

hydroxyproline 

inosme 

initiation factor 
immunoglobulin 
immunoglobulin G 
isoleucine 

inosine 5'-mono-, di-, triphosphate 
inositol 1 ,4,5-trisphosphate 
infrared 

iron regulatory protein 
insulin receptor substrate- 1, -2 
insertion sequence 
joule 

Janus kinase 

acid dissociation constant 

dissociation constant 

diffusion constant 

equilibrium constant 

equilibrium constant under standard 

conditions 

inhibition constant 

Michaelis constant 

rate constant 

turnover number 

kilobase 

kilobase pair 

a-ketoglutarate 

lactose 

wavelength 

lactate dehydrogenase 

low-density lipoprotein 

leucine 

luteinizing hormone 
light-harvesting complex 
linking number 
logarithm to the base e 
logarithm to the base 10 
leukotriene 



Common Abbreviations 




LTR long terminal repeat 

Lys lysine 

M r relative molecular mass 

Man D-mannose 

MAPK mitosis-associated protein kinase 

Mb, Mb0 2 myoglobin, oxymyoglobin 

MCAD medium-chain acyl-CoA dehydrogenase 

MCM minichromosome maintenance 

mDNA mitochondrial DNA 

Met methionine 

MFP multifunctional protein (in fatty 

acid oxidation) 

miRNA micro-RNA 

MODY mature onset diabetes of the young 

mRNA messenger RNA 

MS mass spectroscopy 

MSH melanocyte -stimulating hormone 

ju. electrophoretic mobility 

Mur muramic acid 

Mur2Ac 7V-acetylmuramic acid (also, NAM) 

NAD + , NADH nicotinamide adenine dinucleotide, and its 
reduced form 

NADP + , NADPH nicotinamide adenine dinucleotide 
phosphate, and its reduced form 

NAG A/"-acetylglucosamine (also, GlcNAc) 

NAM AT-acetylmuramic acid (also, MurNAc) 

ncRNA noncoding RNA 

Neu5Ac Af-acetylneuraminic acid (sialic acid) 

NMN + , NMNH nicotinamide mononucleotide, and 
its reduced form 

NMP, NDP, NTP nucleoside mono-, di-, and triphosphate 

NMR nuclear magnetic resonance 

NO nitric oxide 

OAA oxaloacetate 

ORC origin recognition complex 

ORF open reading frame 

P pressure 

Pi inorganic orthophosphate 
(inorganic phosphate) 

PAB or PABA p-aminobenzoate 

PAGE polyacrylamide gel electrophoresis 

PC plastocyanin; phosphatidylcholine 

PCR polymerase chain reaction 

PDB Protein Data Bank 

PDE cyclic nucleotide phosphodiesterase 

PDH pyruvate dehydrogenase 

PE phosphatidylethanolamine 

PEP phosphoenolpyruvate 

PEPCK phosphoenolpyruvate carboxykinase 

PET positron emission tomography 

PFK phosphofructokinase 

PG prostaglandin 

2PGA 2-phosphoglycerate 

3PGA 3-phosphoglycerate 

pH log (1/[H + ]) 

Phe phenylalanine 

PI phosphatidylinositol 

pi isoelectric point 



PIP 2 phosphatidylinositol 4,5-bisphosphate 

PK protein kinase; pyruvate kinase 

pK log (liK) 

PKA cAMP-dependent protein kinase (protein 
kinase A) 

PKB protein kinase B (also, Akt) 

PKC Ca 2+ /calmodulin-dependent protein kinase 

(protein kinase C) 

PKG cGMP-dependent protein kinase 

(protein kinase G) 

PKU phenylketonuria 

PLC phospholipase C 

PLP pyridoxal 5-phosphate 

p0 2 partial pressure of oxygen 

Pol polymerase (DNA or RNA) 

PPi inorganic pyrophosphate 

PP1 phosphoprotein phosphatase 1 

PPAR peroxisome proliferator-activated receptor 

PQ plastoquinone 

pRb retinoblastoma protein 

Pro proline 

PRPP 5-phosphoribosyl- 1 -pyrophosphate 

PS phosphatidylserine; photosystem 

transmembrane electrical potential 

PTB phosphotyrosine binding 

PUFA polyunsaturated fatty acid 

Q mass action ratio 

Q ? QH 2 ubiquinone, reduced ubiquinone 

Rb retinoblastoma gene (a tumor suppressor 

gene) 

RC photoreaction center 

RER rough endoplasmic reticulum 

RF release factor; replicative form 

RFLP restriction fragment length polymorphism 

RGS regulator of G protein signaling 

RIA radioimmunoassay 

Rib D-ribose 

RLK receptor-like kinase 

RNA ribonucleic acid 

RNAi RNA interference 

RNase ribonuclease 

ROS reactive oxygen species; rod outer segment 

rRNA ribosomal RNA 

RSV Rous sarcoma virus 

RTK receptor tyrosine kinase (receptor 
TVr kinase) 

AS entropy change 

SAM S-adenosylmethionine (also, adoMet) 

SDS sodium dodecyl sulfate 

SELEX systematic evolution of ligands by exponen- 
tial enrichment 

SER smooth endoplasmic reticulum 

Ser serine 

SERCA sarcoplasmic and endoplasmic 

reticulum calcium ATPase 

SMC structural maintenance of chromosomes 

SNARE synaptosome-associated protein receptor 

snoRNA small nucleolar RNA 



[a-4 I Common Abbreviations 



SNP single nucleotide polymorphism 

snRNA small nuclear RNA 

snRNP small nuclear ribonucleoprotein 

SREBP sterol response element binding protein 

sRNA small RNA 

SRP signal recognition particle 

STAT signal transducer and activator of 
transcription 

STP standard temperature and pressure 

STR short tandem repeat 

stRNA small temporal RNA 

STS sequence-tagged site 

SUR sulfonylurea receptor 

T thymine, thymidine, or thymidylate 

T absolute temperature 

T2DM type 2 diabetes mellitus 

TBP TATA-binding protein 

TCA tricarboxylic acid 

TFP trifunctional protein 

THF tetrahydrofolate (also, H 4 folate) 

Thr threonine 

TLC thin layer chromatography 

TMP, TDP, TTP thymidine 5'-mono-, di-, triphosphate 

TMV tobacco mosaic virus 

TPI triose phosphate isomerase 

TPP thiamine pyrophosphate 



tRNA transfer RNA 

Trp tryptophan 

TX thromboxane 

Tyr tyrosine 

U uracil, uridine, or uridylate 

UAS upstream activator sequence 

UCP uncoupling protein 

UDP-Gal uridine diphosphate galactose (also, UDP- 
galactose) 

UDP-Glc uridine diphosphate glucose (also, UDP- 
glucose) 

UMP, UDP, UTP uridine 5'-mono-, di-, triphosphate 

UQ ubiquinone (coenzyme Q; also, CoQ) 

UV ultraviolet 

V m transmembrane electrical potential (mem- 
brane potential) 

Vmax maximum velocity 

V 0 initial velocity 

Val valine 

VDR vitamin D receptor 

VEGF vascular endothelial growth factor 

VLDL very low-density lipoprotein 

WAT white adipose tissue 

YAC yeast artificial chromosome 

Z net charge 
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Abbreviated Solutions to Problems 



Fuller solutions to all chapter problems are published in the Absolute 
Ultimate Study Guide to Accompany Principles of Biochemistry . 
For all numerical problems, answers are expressed with the correct 
number of significant figures. 

Chapter 1 

1. (a) Diameter of magnified cell = 500 mm (b) 2.7 X 10 12 actin 
molecules (c) 36,000 mitochondria (d) 3.9 x 10 10 glucose 
molecules (e) 50 glucose molecules per hexokinase molecule 

2. (a) 1 x 10" 12 g = 1 pg (b) 10% (c) 5% 

3. (a) 1.6 mm; 800 times longer than the cell; DNA must be tightly 
coiled, (b) 4,000 proteins 

4. (a) Metabolic rate is limited by diffusion, which is limited by 
surface area, (b) 12 fim~ l for the bacterium; 0.04 /xm"" 1 for the 
amoeba; surface-to-volume ratio 300 times higher in the bacterium, 

5. 2 x 10 6 s (about 23 days) 

6. The vitamin molecules from the two sources are identical; the 
body cannot distinguish the source; only associated impurities 
might vary with the source. 



fill 



H 
I 

-C- 
I 

H 



Amino 



I " 
H 

Hydroxyl 



(b) 




Hydroxyls 



The two enantiomers have different interactions with a chiral 
biological "receptor" (a protein). 
9. (a) Only the amino acids have amino groups; separation could 
be based on the charge or binding affinity of these groups. 
Fatty acids are less soluble in water than amino acids, and the 
two types of molecules also differ in size and shape — either of 
these property differences could be the basis for separation, 
(b) Glucose is a smaller molecule than a nucleotide; separation 
could be based on size. The nitrogenous base and/or the 
phosphate group also endows nucleotides with < haracteristics 
(solubility, charge) that could be used for separation from 
glucose 

10. It is improbable that silicon could serve as the central organizing 
element for life, especially in an 0 2 -containing atmosphere such 
as that of Earth. Long chains of silicon atoms are not readily syn- 
thesized; the polymeric macromolecules necessary for more 
complex functions would not readily form. Oxygen disrupts 
bonds between silicon atoms, and silicon-oxygen bonds are ex- 
tremely stable and difficult to break, preventing the breaking 
and making of bonds that is essential to life processes 

11. Only one enantiomer of the drug was physiologically active. 
Dexedrine consisted of the single enantiomer; Benzedrine 
consisted of a racemic mixture. 

12. (a) 3 Phosphoric acid groups; a-D-ribose; guanine (b) Choline; 
phosphoric acid; glycerol; oleic acid; palmitic acid (c) Tyrosine; 
2 glycines; phenylalanine; methionine 

13. (a) CH 2 0; C 3 H 6 0 3 
(b) 



(c) 



(d) 



H 



O 

\ I 

c=c-cocr 



(e) 



Carboxyl 



\ # Carboxyl 
T 

I 



(f) 





COO 


H 3 N- 
EE- 


1 

-C-H 

c-oe 


Methyl 



I 

NH 



^\ ^ Aldehyde 



H — C— |H16fe Amino 

i 



Amide 



Hydroxy] ;Pfgf-C— H 

H — c — ililii 



H~ C^Olf Hydroxy] 
Methyl BaC— CH^ Methyl 
CH^OH Hydroxy! 
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AS-2 Abbreviated Solutions to Problems 



(c) X contains a chiral center; eliminates all but 6 and 8. (d) X 
contains an acidic functional group; eliminates 8; structure 6 is 
consistent with all data, (e) Structure 6; we cannot distinguish 
between the two possible enantiomers 
14. (a) A more negative AG° corresponds to a larger K eq for the 
binding reaction, so the equilibrium is shifted more toward prod- 
ucts and tighter binding— and thus greater sweetness and higher 
MRS (b) Animal-based sweetness assays are time-consuming. 
A computer program to predict sweetness, even if not always com- 
pletely accurate, would allow chemists to design effective sweet- 
eners much faster. Candidate molecules could then be tested in 
the conventional assay, (c) The range 0.25 to 0,4 nm corre- 
sponds to about 1.5 to 2.5 single-bond lengths. The figure below 
can be used to construct an approximate ruler; any atoms in the 
pink rectangle are between 0.25 and 0.4 nm from the origin of 
the ruler 



There are many possible AH-B groups in the molecules; a few are 
shown here. 



H OH 




\, OH 



(d) First, each molecule has multiple AH-B groups, so it is 
difficult to know which is the important one. Second, because 
the AH-B motif is very simple, many nonsweet molecules will 
have this group, (e) Sucrose and deoxysucrose. Deoxysucrose 
lacks one of the AH-B groups present in sucrose and has a 
slightly lower MRS than sucrose — as is expected if the AH-B 
groups are important for sweetness, (f) There are many 
such examples; here are a few: (1) D-Tryptophan and 
6-chloro-D-tryptophan have the same AH-B group but very 
different MRS values. (2) Aspartame and neotame have the 
same AH-B groups but very different MRS values. (3) Neotame 
has two AH-B groups and alitame has three, yet neotame is 
more than five times sweeter than alitame. (4) Bromine is 
less electronegative than oxygen and thus is expected to 
weaken an AH-B group, yet tetrabromosucrose is much 
sweeter than sucrose, (g) Given enough "tweaking" of parame- 
ters, any model can be made to fit a denned dataset. Because 
the objective was to create a model to predict AG ° for molecules 
not tested in vivo, the researchers needed to show that the 
model worked well for molecules it had not been trained on. 
The degree of inaccuracy with the test set could give researchers 
an idea of how the model would behave for novel molecules, 
(h) MRS is related to K eq , which is related exponentially to AG 0 , 
so adding a constant amount to AG ° multiplies the MRS by a 
constant amount, Based on the values given with the structures, 
a change in AG° of 1.3 kcal/mol corresponds to a 10-fold change 
in MRS 
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Sucrose 
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OH 



CH 3 
Aspartame 



NHg- 0 
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6-Chloro-D-tryptophan 




NH — G 



CH 3 
Neolame 



Br OH 
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H ^N-C 9 H 17 
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Tetrabromosucrose 



Sucronic acid 



Chapter 2 

1. Ethanol is polar; ethane is not. The ethanol — OH group can 
hydrogen-bond with water 

2. (a) 4 76 (b) 9 19 (c) 4 0 (d) 4 82 

3. (a) 1.51 X 10" 4 m (b) 3,02 X 10" 7 m (c) 7 76 X 10" 12 m 

4. 1 1 

5. (a) HC1^=^H + + cr (b)3 3 (c)NaOH^^Na + + OH" 
(d)9 8 

6. 1 1 

7. 1.7 x 10" 9 mol of acetylcholine 

8. 0.1 M HC1 

9. 3 3 mL 

10. (a) RCOCT (b) RNH 2 (c) H 2 PC>4 (d) HCO3 

11. (a) 5 06 (b) 4.28 (c) 5 46 (d) 4.76 (e) 3.76 

12. (a) 0.1 m HC1 (b) 0 1 M NaOH (c) 0.1 M NaOH 

13. (d) Bicarbonate, a weak base, titrates —OH to — 0~, making 
the compound more polar and more water-soluble. 

14. Stomach; the neutral form of aspirin present at the lower 
pH is less polar and passes through the membrane more 
easily. 

15. 9 

16. 7.4 

17. (a) pH 8.6 to 10 6 (b) 4/5 (c) 10 mL (d) pH = pK a -2 

18. 1.4 

19. NaH 2 P0 4 ■ H 2 0, 5.8 g/L; Na 2 HP0 4 , 8 2 g/L 

20. [A"]/[HA] = 0 # 10 

21. Mix 150 mL of 0 10 M sodium acetate and 850 mL of 0.10 m 
acetic acid. 

22. Acetic acid; its pif a is closest to the desired pH. 

23. (a) 4.6 (b) 0 1 pH unit (c) 4 pH units 

24. 4.3 

25. 0.13 M acetate and 0.07 m acetic acid 

26. 1 7 

27. 7 
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28. (a) 



COOH 

+ I 
H 3 N-C-H 

I 

CH 3 
Fully protonated 



coo- 



coo- 

H 2 N— C— H 

I 

CH 3 

Fully deprotonated 



30. 
31. 



(b) fully protonated (c) zwitterion (d) zwitterion (e) fully 
deprotonated 

29. (a) Blood pH is controlled by the carbon dioxide-bicarbonate 
buffer system, C0 2 + H 2 0 H + + HCO ; J. During hypoven- 
tilation, [C0 2 ] increases in the lungs and arterial blood, driving the 
equilibrium to the right, raising [H + ] and lowering pH (b) During 
hyperventilation, [C0 2 ] decreases in the lungs and arterial blood, 
reducing [H + ] and increasing pH above the normal 7.4 value. 

(c) Lactate is a moderately strong acid, completely dissociating 
under physiological conditions and thus lowering the pH of blood 
and muscle tissue. Hyperventilation removes H + , raising the pH of 
blood and tissues in anticipation of the acid buildup, 
74 

Dissolving more C0 2 in the blood increases [H + ] in blood and 
extracellular fluids, lowering pH: C0 2 (d) + H 2 0 v 
H 2 C0 3 H + + HCO : J 
32. (a) Use the substance in its surfactant form to emulsify the spilled 
oil, collect the emulsified oil, then switch to the nonsurfactant 
form. The oil and water will separate and the oil can be collected 
for further use. (b) The equilibrium lies strongly to the right. The 
stronger acid (lower piQ, H 2 C0 3 , donates a proton to the conju- 
gate base of the weaker acid (higher piT a ), amidine. (c) The 
strength of a surfactant depends on the hydrophilicity of its head 
groups: the more hydrophilic, the more powerful the surfactant. 
The amidinium form of s-surf is much more hydrophilic than the 
amidine form, so it is a more powerful surfactant (d) Point A: 
amidinium; the C0 2 has had plenty of time to react with the ami- 
dine to produce the amidinium form. Point B: amidine; Ar has re- 
moved C0 2 from the solution, leaving the amidine form (e) The 
conductivity rises as uncharged amidine reacts with C0 2 to pro- 
duce the charged amidinium form, (f) The conductivity falls as Ar 
removes C0 2 , shifting the equilibrium to the uncharged amidine 
form, (g) Treat s-surf with C0 2 to produce the surfactant ami- 
dinium form and use this to emulsify the spill. Treat the emulsion 
with Ar to remove the C0 2 and produce the nonsurfactant amidine 
form. The oil will separate from the water and can be recovered. 

Chapter 3 

1. l; determine the absolute configuration at the a carbon and com- 
pare it with D- and L-glyceraldehyde. 

2. (a) I (b) II (c) IV (d) II (e) IV (f ) II and IV (g) III (h) III (i) V 
( j) III (k) V (1) II (m) III (n) V (o) I, III, and V 

3. (a) pi > pif a of the a-carboxyl group and pi < pK a of the 
ct-amino group, so both groups are charged (ionized). 

(b) 1 in 2,19 X 10 7 , The pi of alanine is 6,01 From Table 3-1 and 
the Henderson-Hasselbalch equation, 1/4,680 carboxyl groups and 
1/4,680 amino groups are uncharged The fraction of alanine mole- 
cules with both groups uncharged is the product of these fractions. 

(a)-(c) 



4. 



HN 



-CH 2 



COOH 

I + pKi 

-CH-NH,— 



H+ 



HN 



coo- 
I 



pK R = 6.0 



- CH 2 — CH— NH 3 

H- 







HN^\ 

i >-CH 2 - 

L / 2 


+ pK 2 = 9.17 

CH— NH 3 — — ^ ► 
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COO" 
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4 


pH 


Structure 


Net charge 


Migrates toward 


1 


1 


+2 


Cathode 


4 


2 


+ 1 


Cathode 


8 


3 


0 


Does not migrate 


12 


4 


-1 


Anode 



5. (a) Asp (b) Met (c) Glu (d) Gly (e) Ser 

6. (a) 2 (b) 4 (c) 

COO" COO" COO" 



H3N-C-H 

H-C-CH3 

CH, 
I 



CH 3 

7. (a) Structure at pH 7: 

O 

II 



H 3 N— C — H H-C — -NH 3 
H— C— CH 3 

CH 2 
CH 3 



H 3 C-C— H 

CH 2 



I 

CH, 



coo- 

H— C^NH 3 

I 

H 3 C— C— H 

CH 2 
CH, 



O 



O 



M-CH-C-N-CH-C-N-CH-C-0 



CH, 



I 

H CH, 



I I 
H CH, 



pK 2 = 8.03 



p^ = 3.39 



(b) Electrostatic interaction between the carboxylate anion and 
the protonated amino group of the alanine zwitterion favorably 
affects the ionization of the carboxyl group. This favorable elec- 
trostatic interaction decreases as the length of the poly(Ala) in- 
creases, resulting in an increase in pKi (c) Ionization of the 
protonated amino group destroys the favorable electrostatic in- 
teraction noted in (b). With increasing distance between the 
charged groups, removal of the proton from the amino group in 
poly(Ala) becomes easier and thus pK 2 is lower. The intramolec- 
ular effects of the amide (peptide bond) linkages keep piT a val- 
ues lower than they would be for an alkyl-substituted amine. 

8. 75,000 

9. (a) 32,000. The elements of water are lost when a peptide bond 
forms, so the molecular weight of a Trp residue is not the same 
as the molecular weight of free tryptophan, (b) 2 

10. The protein has four subunits, with molecular masses of 160, 90, 
90, and 60 kDa The two 90 kDa subunits (possibly identical) are 
linked by one or more disulfide bonds. 

11. (a) atpH3, +2;atpH8, 0; atpHll, -1 (b) pi = 7.8 

12. pi » 1; carboxylate groups; Asp and Glu 

13. Lys, His, Arg; negatively charged phosphate groups in DNA inter- 
act with positively charged side groups in histones. 

14. (a) (Glu) 20 (b) (Lys-Ala) 3 (c) (Asn-Ser-His) 5 
(d) (Asn-Ser-His) 5 

15. (a) Specific activity after step 1 is 200 units/mg; step 2, 600 
units/mg; step 3, 250 units/mg; step 4, 4,000 units/mg; step 5, 
15,000 units/mg; step 6, 15,000 units/mg (b) Step 4 (c) Step 3 
(d) Yes. Specific activity did not increase in step 6; SDS poly- 
acrylamide gel electrophoresis 
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16. (a) [NaCl] = 0,5 mM (b) [NaCl] = 0 05 mM. 

17. C elutes first, B second, A last, 

18. TVr-Gly-Gly-Phe-Leu 



19. 



Leu- 



Orn 

/ 
Val 

t 

Pro 

\ 

Phe 



Phe 

\ 

Pro 

I 

Val 
Orn 



Leu^ 



The arrows correspond to the orientation of the peptide bonds, 
—CO -> NH— . 

20. 88%, 97%, The percentage (x) of correct amino acid residues re- 
leased in cycle n is xjx, All residues released in the first cycle 
are correct, even though the efficiency of cleavage is not perfect, 

21. (a) Y (1), F (7), and R (9) (b) Positions 4 and 9; K (Lys) 

is more common at 4, R (Arg) is invariant at 9 (c) Positions 5 
and 10; E (Glu) is more common at both positions 
(d) Position 2; S (Ser) 

22. (a) The protein to be isolated (citrate synthase, CS) is a rela- 
tively small fraction of the total cellular protein. Cold tempera- 
tures reduce protein degradation; sucrose provides an isotonic 
environment that preserves the integrity of organelles during ho- 
mogenization. (b) This step separates organelles on the basis of 
relative size_ (c) The first addition of ammonium sulfate removes 
some unwanted proteins from the homogenate. Additional am- 
monium sulfate precipitates CS. (d) To resuspend (solubilize) 
CS, ammonium sulfate must be removed under conditions of pH 
and ionic strength that support the native conformation, (e) CS 
molecules are larger than the pore size of the chromatographic 
gel. Protein is detectable at 280 nm because of absorption at this 
wavelength by Tyr and Trp residues, (f) CS has a positive charge 
and thus binds to the negatively charged cation-exchange col- 
umn. After the neutral and negatively charged proteins pass 
through, CS is displaced from the column using the washing so- 
lution of higher pH, which alters the charge on CS. (g) Different 
proteins can have the same pi. The SDS gel confirmed that only a 
single protein was purified. SDS is difficult to remove completely 
from a protein, and its presence distorts the acid-base properties 
of the protein, including pi 

23. (a) Any linear polypeptide chain has only two kinds of free 
amino groups: a single a-amino group at the amino terminus, 
and an e-amino group on each Lys residue present. These 
amino groups react with FDNB to form a DNP-amino acid de- 
rivative. Insulin gave two different a-amino-DNP derivatives, 
suggesting that it has two amino termini and thus two polypep- 
tide chains — one with an amino-terminal Gly and the other with 
an amino-terminal Phe. Because the DNP-lysine product is 
e-DNP-lysine, the Lys is not at an amino terminus (b) Yes The 
A chain has amino-terminal Gly; the B chain has amino-terminal 
Phe; and (nonterminal) residue 29 in the B chain is Lys. (c) 
Phe-Val-Asp-Glu- Peptide Bl shows that the amino-terminal 
residue is Phe. Peptide B2 also includes Val, but since no DNP- 
Val is formed, Val is not at the amino terminus; it must be on 
the carboxyl side of Phe. Thus the sequence of B2 is DNP- 
Phe-Val. Similarly, the sequence of B3 must be DNP-Phe-Val-Asp, 
and the sequence of the A chain must begin Phe-Val-Asp-Glu-. 
(d) No, The known amino-terminal sequence of the A chain is 
Phe-Val-Asn-Gln-. The Asn and Gin appear in Sanger's analy- 
sis as Asp and Glu because the vigorous hydrolysis in step 

© hydrolyzed the amide bonds in Asn and Gin (as well as the 
peptide bonds), forming Asp and Glu. Sanger et al. could not 
distinguish Asp from Asn or Glu from Gin at this stage in their 
analysis, (e) The sequence exactly matches that in Fig. 3-24. Each 
peptide in the table gives specific information about which Asx 
residues are Asn or Asp and which Glx residues are Glu or Gin. 



Acl: residues 20-21 This is the only Cys-Asx sequence in the A 
chain; there is -1 amido group in this peptide, so it must be 
Cys-Asn: 

A^-Gly-Ile-Val-Glx-Glx-Cys-Cys-Ala-Ser-Val-Cys-Ser- 
1 5 10 

Leu-Tyr-Glx-Leu-Glx-Asx-Tyr-Cys-Asn-C 
15 20 

Apl5: residues 14-15-16. This is the only Tyr-Glx-Leu in the A 
chain; there is ~1 amido group, so the peptide must be 
Tyr-Gln-Leu: 

AT-Gly-Ile-Val-Glx-Glx-Cys-Cys-Ala-Ser-Val-Cys-Ser- 
1 5 10 

Leu-Tyr-Gln-Leu-Glx-Asx-Tyr-Cys-Asn-C 
15 20 

Apl4: residues 14-15-16-17. It has ~1 amido group, and we al- 
ready know that residue 15 is Gin, so residue 17 must be Glu: 

A^-Gly-Ile-Val-Glx-Glx-Cys-Cys-Ala-Ser-Val-Cys-Ser- 
1 5 10 

Leu-Tyr-Gln-Leu-Glu-Asx-Tyr-Cys-Asn-C 
15 20 

Ap3: residues 18-19-20-21 It has ~2 amido groups, and we 
know that residue 21 is Asn, so residue 18 must be Asn: 

A/^-Gly-Ile-Val-Glx-Glx-Cys-Cys-Ala-Ser-Val-Cys-Ser- 
1 5 10 

Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn-C 
15 20 

Apl: residues 17-18-19-20-21, which is consistent with 
residues 18 and 21 being Asn, 

ApSpal: residues 1-2-3-4. It has ~0 amido group, so residue 4 
must be Glu: 

Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn-C 
15 20 
Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn-C 
15 20 

Ap5; residues 1 through 13 It has ~1 amido group, and we know 
that residue 14 is Glu, so residue 5 must be Gin: 

A^-Gly-Ile-Val-Glu-Gln-Cys-Cys-Ala-Ser-Val-Cys-Ser- 
1 5 10 

Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn-C 
15 20 

Chapter 4 

1. (a) Shorter bonds have a higher bond order (are multiple rather 
than single) and are stronger. The peptide C— N bond is stronger 
than a single bond and is midway between a single and a double 
bond in character, (b) Rotation about the peptide bond is diffi- 
cult at physiological temperatures because of its partial double- 
bond character. 

2. (a) The principal structural units in the wool fiber polypeptide 
(a-keratin) are successive turns of the a helix, at 5 4 A intervals; 
coiled coils produce the 5.2 A spacing. Steaming and stretching 
the fiber yields an extended polypeptide chain with the 0 confor- 
mation, with a distance between adjacent R groups of about 7.0 A, 
As the polypeptide reassumes an a-helical structure, the fiber 
shortens, (b) Processed wool shrinks when polypeptide chains 
are converted from an extended (3 conformation to the native 
a-helical conformation in the presence of moist heat. The 
structure of silk — (5 sheets, with their small, closely packed 
amino acid side chains — is more stable than that of wool. 

3. —42 peptide bonds per second 

4. At pH > 6, the carboxyl groups of poly(Glu) are deprotonated; 
repulsion among negatively charged carboxylate groups leads to 
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unfolding. Similarly, at pH 7, the amino groups of poly(Lys) are 
protonated; repulsion among these positively charged groups 
also leads to unfolding. 

5. (a) Disulfide bonds are covalent bonds, which are much stronger 
than the noncovalent interactions that stabilize most proteins, 
They cross-link protein chains, increasing their stiffness, me- 
chanical strength, and hardness (b) Cystine residues (disulfide 
bonds) prevent the complete unfolding of the protein, 

6. (a) Bends are most likely at residues 7 and 19; Pro residues in 
the cis configuration accommodate turns well, (b) The Cys 
residues at positions 13 and 24 can form disulfide bonds. 

(c) External surface: polar and charged residues (Asp, Gin, Lys); 
interior: nonpolar and aliphatic residues (Ala, He); Thr, though 
polar, has a hydropathy index near zero and thus can be found 
either on the external surface or in the interior of the protein, 

7. 30 amino acid residues; 0 87 

8. Myoglobin is all three. The folded structure, the "globin fold," is 
a motif found in all globins. The polypeptide folds into a single 
domain, which for this protein represents the entire three- 
dimensional structure. 

9. The bacterial enzyme is a collagenase; it destroys the connective- 
tissue barrier of the host, allowing the bacterium to invade the 
tissues, Bacteria do not contain collagen. 

10. (a) The number of moles of DNP-valine formed per mole of pro- 
tein equals the number of amino termini and thus the number of 
polypeptide chains, (b) 4 (c) Different chains would probably 
run as discrete bands on an SDS polyacrylamide gel. 

11. (a); it has more amino acid residues that favor a-helical struc- 
ture (see Table 4-1). 

12. (a) Aromatic residues seem to play an important role in stabiliz- 
ing amyloid fibrils. Thus, molecules with aromatic substituents 
may inhibit amyloid formation by interfering with the stacking or 
association of the aromatic side chains (b) Amyloid is formed in 
the pancreas in association with type 2 diabetes, as it is in the 
brain in Alzheimer's disease. Although the amyloid fibrils in the 
two diseases involve different proteins, the fundamental structure 
of the amyloid is similar and similarly stabilized in both, and thus 
they are potential targets for similar drugs designed to disrupt 
this structure. 

13. (a) NFkB transcription factor, also called RelA transforming 
factor, (b) No. You will obtain similar results, but with additional 
related proteins listed, (c) The protein has two subunits. There 
are multiple variants of the subunits, with the best-characterized 
being 50, 52, or 65 kDa. These pair with each other to form a 
variety of homodimers and heterodimers. The structures of a 
number of different variants can be found in the PDB. (d) The 
NFkB transcription factor is a dimeric protein that binds specific 
DNA sequences, enhancing transcription of nearby genes One 
such gene is the immunoglobulin k light chain, from which the 
transcription factor gets its name, 

14. (a) Aba is a suitable replacement because Aba and Cys have 
approximately the same sized side chain and are similarly hy- 
drophobic However, Aba cannot form disulfide bonds so it will 
not be a suitable replacement if these are required, (b) There 
are many important differences between the synthesized protein 
and HIV protease produced by a human cell, any of which could 
result in an inactive synthetic enzyme: (1) Although Aba and Cys 
have similar size and hydrophobicity, Aba may not be similar 
enough for the protein to fold properly. (2) HIV protease may re- 
quire disulfide bonds for proper functioning (3) Many proteins 
synthesized by ribosomes fold as they are produced; the protein 
in this study folded only after the chain was complete. (4) Pro- 
teins synthesized by ribosomes may interact with the ribosomes 
as they fold; this is not possible for the protein in the study. 

(5) Cytosol is a more complex solution than the buffer used in the 
study; some proteins may require specific, unknown proteins for 
proper folding, (6) Proteins synthesized in cells often require 



chaperones for proper folding; these are not present in the study 
buffer. (7) In cells, HIV protease is synthesized as part of a larger 
chain that is then proteolytically processed; the protein in the 
study was synthesized as a single molecule, (c) Because the en- 
zyme is functional with Aba substituted for Cys, disulfide bonds 
do not play an important role in the structure of HIV protease, 
(d) Model 1 : it would fold like the L-protease. Argument for: 
the covalent structure is the same (except for chirality), so it 
should fold like the L-protease. Argument against: chirality is 
not a trivial detail; three-dimensional shape is a key feature of 
biological molecules The synthetic enzyme will not fold like the 
L-protease. Model 2: it would fold to the mirror image of the 
L-protease. For: because the individual components are mirror 
images of those in the biological protein, it will fold in the mirror- 
image shape Against- the interactions involved in protein fold- 
ing are very complex, so the synthetic protein will most likely 
fold in another form. Model 3: it would fold to something else. 
For: the interactions involved in protein folding are very com- 
plex, so the synthetic protein will most likely fold in another 
form. Against: because the individual components are mirror 
images of those in the biological protein, it will fold in the mirror- 
image shape, (e) Model 1 . The enzyme is active, but with the 
enantiomeric form of the biological substrate, and it is inhibited by 
the enantiomeric form of the biological inhibitor. This is consistent 
with the D-protease being the mirror image of the L-protease. 

(f) Evans blue is achiral; it binds to both forms of the enzyme. 

(g) No. Because proteases contain only L-amino acids and recog- 
nize only L-peptides, chymotrypsin would not digest the D-protease. 

(h) Not necessarily. Depending on the individual enzyme, any of 
the problems listed in (b) could result in an inactive enzyme. 



Chapters 

1. Protein B has a higher affinity for ligand X; it will be half- 
saturated at a much lower concentration of X than will protein A. 
Protein A has K a = 10 6 m -1 ; protein B has K a = 10 9 m" 1 . 

2. (a), (b), and (c) all have ™H < 1.0. Apparent negative coopera- 
tivity in ligand binding can be caused by the presence of two or 
more types of ligand-binding sites with different affinities for the 
ligand on the same or different proteins in the same solution. 
Apparent negative cooperativity is also commonly observed in 
heterogeneous protein preparations. There are few well- 
documented cases of true negative cooperativity. 

3. (a) decreases (b) increases (c) decreases (d) decreases 

4. fe d = 8.9 x 10" 5 s _1 . 

5. The cooperative behavior of hemoglobin arises from subunit 
interactions. 

6. (a) The observation that hemoglobin A (HbA; maternal) is about 
60% saturated when the p0 2 is 4 kPa, whereas hemoglobin F 
(HbF; fetal) is more than 90% saturated under the same physio- 
logical conditions, indicates that HbF has a higher 0 2 affinity 
than HbA (b) The higher 0 2 affinity of HbF ensures that oxygen 
will flow from maternal blood to fetal blood in the placenta. Fetal 
blood approaches full saturation where the 0 2 affinity of HbA is 
low. (c) The observation that the 0 2 -saturation curve of HbA 
undergoes a larger shift on BPG binding than that of HbF sug- 
gests that HbA binds BPG more tightly than does HbF. Differen- 
tial binding of BPG to the two hemoglobins may determine the 
difference in their 0 2 affinities. 

7. (a) Hb Memphis (b) HbS, Hb Milwaukee, Hb Providence, possi- 
bly Hb Cowtown (c) Hb Providence 

8. More tightly An inability to form tetramers would limit the 
cooperativity of these variants, and the binding curve would 
become more hyperbolic. Also, the BPG-binding site would be 
disrupted. Oxygen binding would probably be tighter, because 
the default state in the absence of bound BPG is the tight- 
binding R state. 
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9. (a) 1 X 1(T 8 m (b) 5 X 10" 8 M (c) 8 X 1(T 8 m 

(d) 2 X 10" 7 m. Note that a rearrangement of Eqn 5-8 gives 
[L] = bkjci - ex 

10. The epitope is likely to be a structure that is buried when G-actin 
polymerizes to F-actin 

11. Many pathogens, including HIV, have evolved mechanisms by 
which they can repeatedly alter the surface proteins to which im- 
mune system components initially bind. Thus the host organism 
regularly faces new antigens and requires time to mount an im- 
mune response to each one As the immune system responds to 
one variant, new variants are created 

12. Binding of ATP to myosin triggers dissociation of myosin from 
the actin thin filament In the absence of ATP, actin and myosin 
bind tightly to each other 




(a) (b) (c) (d) 

14. (a) Chain L is the light chain and chain H is the heavy chain of 
the Fab fragment of this antibody molecule Chain Y is lysozyme 

(b) j3 structures are predominant in the variable and constant 
regions of the fragment (c) Fab heavy-chain fragment, 218 
amino acid residues; light-chain fragment, 214; lysozyme, 129. 
Less than 15% of the lysozyme molecule is in contact with the 
Fab fragment, (d) In the H chain, residues that seem to be in 
contact with lysozyme include Gly 31 , TVr^ 2 , Arg 99 , Asp 100 , and 
TVr 101 . In the L chain the residues that seem to be in contact 
with lysozyme include Tyr 32 , Tyr 49 , Tyr™, and Trp 92 . In lysozyme, 
residues Asn 19 , Gly 22 , T>r 23 , Ser 24 , Lys 116 , Gly 117 , Thr 118 , Asp 119 , 
Gin 121 , and Arg 125 seem to be situated at the antigen-antibody 
interface Not all these residues are adjacent in the primary 
structure Folding of the polypeptide chain into higher levels of 
structure brings the nonconsecutive residues together to form 
the antigen-binding site. 

15. (a) The protein with a K a of 5 /am will have the highest affinity 
for ligand L. When the K d is 10 /am, doubling [L] from 0.2 to 0.4 
/am (values well below the K d ) will nearly double 0 (the actual in- 
crease factor is 1.96) This is a property of the hyperbolic curve; 
at low ligand concentrations, 0 is an almost linear function of [L], 
On the other hand, doubling [L] from 40 to 80 /am (well above the 
K d , where the binding curves is approaching its asymptotic limit) 
will increase 0 by a factor of only 1 1. The increase factors are 
identical for the curves generated from Eqn 5-11 (b) 0 = 0 998. 

(c) A variety of answers will be obtained depending on the val- 
ues entered for the different parameters 

16. (a) 




The drawing is not to scale; any given cell would have many more 
myosin molecules on its surface, (b) ATP is needed to provide 
the chemical energy to drive the motion (see Chapter 13). (c) An 
antibody that bound to the myosin tail, the actin-binding site, 



would block actin binding and prevent movement An antibody 
that bound to actin would also prevent actin-myosin interaction 
and thus movement (d) There are two possible explanations: 
(1) Trypsin cleaves only at Lys and Arg residues (see Table 3-7) so 
would not cleave at many sites in the protein (2) Not all Arg or 
Lys residues are equally accessible to trypsin; the most-exposed 
sites would be cleaved first, (e) The SI model The hinge model 
predicts that bead-antibody-HMM complexes (with the hinge) 
would move, but bead-antibody-SHMM complexes (no hinge) 
would not The SI model predicts that because both complexes 
include SI, both would move. The finding that the beads move 
with SHMM (no hinge) is consistent only with the SI model, 
(f) With fewer myosin molecules bound, the beads could tem- 
porarily fall off the actin as a myosin let go of it. The beads would 
then move more slowly, as time is required for a second myosin to 
bind. At higher myosin density, as one myosin lets go another 
quickly binds, leading to faster motion, (g) Above a certain den- 
sity, what limits the rate of movement is the intrinsic speed with 
which myosin molecules move the beads. The myosin molecules 
are moving at a maximum rate and adding more will not increase 
speed, (h) Because the force is produced in the SI head, damag- 
ing the SI head would probably inactivate the resulting molecule, 
and SHMM would be incapable of producing movement (i) The 
SI head must be held together by noncovalent interactions that 
are strong enough to retain the active shape of the molecule, 

Chapter 6 

1. The activity of the enzyme that converts sugar to starch is de- 
stroyed by heat denaturation 

2. 2.4 X 10" 6 M 

3. 9.5 X 10 8 years 

4. The enzyme-substrate complex is more stable than the enzyme 
alone 

5. (a) 190 A (b) Three-dimensional folding of the enzyme brings 
the amino acid residues into proximity. 

6. The reaction rate can be measured by following the decrease in 
absorption by NADH (at 340 nm) as the reaction proceeds Deter- 
mine the K m value; using substrate concentrations well above the 
K ni1 measure initial rate (rate of NADH disappearance with time, 
measured spectrophotometrically) at several known enzyme 
concentrations, and make a plot of initial rate versus concentra- 
tion of enzyme. The plot should be linear, with a slope that pro- 
vides a measure of LDH concentration. 

7. (b), (e), (g) 

8. (a) 1 7 X 10" 3 m (b) 0.33; 0,67; 0 91 (c) The upper curve corre- 
sponds to enzyme B ([X] > K m for this enzyme); the lower curve, 
enzyme A, 

9. (a) 400 s" 1 (b) 10 /am (c) a = 2, a' = 3 (d) Mixed inhibitor 
10. (a) 24 nM (b) 4 /am (V 0 is exactly half 7 max , so [A] = (c) 40 /am 

(Yo is exactly half V max , so [A] = 10 times K m in the presence 
of inhibitor) (d) No. k cat /K m = 0.33/(4 x 10" 6 m" 1 s" 1 ) = 
8 25 x 10 4 m^ 1 s" 1 , well below the diffusion-controlled limit. 
11- Lax ~ 140 /AM/min; K m ~lx 10~ 5 m 

12. (a) V max = 51.5 mM/min, K m = 0.59 mM 
(b) Competitive inhibition 

13. K m = 22 mM; V max = 0.50 /Amol/min 

14. Curve A 

15. k cat = 2.0 X lO^in" 1 

16. The basic assumptions of the Michaelis-Menten equation still 
hold. The reaction is at steady state, and the rate is determined 
by V 0 = k 2 [ES]. The equations needed to solve for [ES] are 

[E t ] = [E] + [ES] + [EI] and [EI] = 

[E] can be obtained by rearranging Eqn 6-19 The rest follows the 
pattern of the Michaelis-Menten equation derivation in the text. 
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17. Minimum M T = 29,000 

18. Activity of the prostate enzyme equals total phosphatase activity 
in a blood sample minus phosphatase activity in the presence of 
enough tartrate to completely inhibit the prostate enzyme. 

19. The inhibition is mixed. Because K m seems not to change appre- 
ciably, this could be the special case of mixed inhibition called 
noncompetitive 

20. The [S] at which V 0 = V mSLX /2a' is obtained when all terms 
except K max on the right side of Eqn 6-30 — that is, [S]/(aK m + 
a'[S])— equal ^ a'. Begin with [S]/(aK m + a'[S]) = \ a' and 
solve for [S]. 

21. The optimum activity occurs when Glu 35 is protonated and Asp 52 
is unprotonated. 

22. (a) Increase factor = 1 96; V 0 = 50 ^iM s _1 ; increase factor = 
1 048 

(b) When a = 2.0, the curve is shifted to the right as the K m is 
increased by a factor of 2. When a' = 3.0, the asymptote of the 
curve (the V max ) declines by a factor of 3 When a = 2.0 and 
a' = 3.0, the curve briefly rises above the curve where both a 
and a' = 1.0, due to a decline in K m . However, the asymptote is 
lower, because V max declines by a factor of 3. (c) When a = 2.0, 
the x intercept moves to the right When a = 2 0 and a' = 3.0, 
the x intercept moves to the left. 

23. (a) In the wild-type enzyme, the substrate is held in place by a 
hydrogen bond and an ion-dipole interaction between the 
charged side chain of Arg 109 and the polar carbonyl of pyruvate 
During catalysis, the charged Arg 109 side chain also stabilizes the 
polarized carbonyl transition state. In the mutant, the binding is 
reduced to just a hydrogen bond, substrate binding is weaker, 
and ionic stabilization of the transition state is lost, reducing cat- 
alytic activity, (b) Because Lys and Arg are roughly the same 
size and have a similar positive charge, they probably have very 
similar properties Furthermore, because pyruvate binds to 
Arg 171 by (presumably) an ionic interaction, an Arg to Lys muta- 
tion would probably have little effect on substrate binding 

(c) The "forked" arrangement aligns two positively charged 
groups of Arg residues with the negatively charged oxygens of 
pyruvate and facilitates two combined hydrogen-bond and ion- 
dipole interactions. When Lys is present, only one such com- 
bined hydrogen-bond and ion-dipole interaction is possible, thus 
reducing the strength of the interaction. The positioning of the 
substrate is less precise, (d) Ile 2D ° interacts hydrophobically 
with the ring of NADH This type of interaction is not possible 
with the hydrophilic side chain of Gin (e) The structure is 
shown below 
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(f) The mutant enzyme rejects pyruvate because pyruvate's 
hydrophobic methyl group will not interact with the highly 
hydrophilic guanidinium group of Arg 102 . The mutant binds 
oxaloacetate because of the strong ionic interaction between the 
Arg 102 side chain and the carboxyl of oxaloacetate. (g) The pro- 
tein must be flexible enough to accommodate the added bulk of 
the side chain and the larger substrate. 

Chapter 7 

1. With reduction of the carbonyl oxygen to a hydroxyl group, the 
chemistry at C-l and C-3 is the same; the glycerol molecule is 
not chiral. 

2. Epimers differ by the configuration about only one carbon. 

(a) D-altrose (C-2), D-glucose (C-3), D-gulose (C-4). (b) D-idose 
(C-2), D-galactose (C-3), D-allose (C-4) (c) D-arabinose (C-2), 
D-xylose (C-3) 

3. Osazone formation destroys the configuration around C-2 of al- 
doses, so aldoses differing only at the C-2 configuration give the 
same derivative, with the same melting point. 

4. (a) 



CH,QH 



CH^OH 




HO 











OH 





OH 



H OH 

a-D-Glucose 



H OH 

/3-D-Glucose 



(b) A fresh solution of a-D-glucose, or of /3-D-glucose, undergoes 
mutarotation to an equilibrium mixture of the a and (3 forms. 

(c) 36% a form; 64% 0 form 

5. To convert a-D-glucose to /3-D-glucose, the bond between C-l and 
the hydroxyl on C-5 (as in Fig. 7-6) . To convert D-glucose to 
D-mannose, either the — H or the —OH on C-2 Conversion be- 
tween chair conformations does not require bond breakage; this 
is the critical distinction between configuration and conformation. 

6. No; glucose and galactose differ at C-4. 

7. (a) Both are polymers of D-glucose, but they differ in the glyco- 
side linkage: Q31->4) for cellulose, (al->4) for glycogen 

(b) Both are hexoses, but glucose is an aldohexose, fructose a 
ketohexose. (c) Both are disaccharides, but maltose has two 
(al-»4)-linked D-glucose units; sucrose has (alo2j8)-linked 
D-glucose and D-fructose, 



8. 



cH./m 



HO 




OH 

} reducing 
sugar 



H H 



9. A hemiacetal is formed when an aldose or ketose condenses with 
an alcohol; a glycoside is formed when a hemiacetal condenses 
with an alcohol (see Fig. 7-5, p. 238). 

10. Fructose cyclizes to either the pyranose or the furanose struc- 
ture Increasing the temperature shifts the equilibrium in the 
direction of the furanose, the less sweet form 

11. Maltose; sucrose has no reducing (oxidizable) group, as the 
anomeric carbons of both monosaccharides are involved in the 
glycosidic bond. 
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12. The rate of mutarotation is sufficiently high that, as the 
enzyme consumes /3-D-glucose, more a-D-glucose is converted 
to the j8 form and, eventually, all the glucose is oxidized. 
Glucose oxidase is specific for glucose and does not detect 
other reducing sugars (such as galactose) that react with 
Fehling's reagent. 

13. (a) Measure the change in optical rotation with time, (b) The 
optical rotation of the mixture is negative (inverted) relative to 
that of the sucrose solution, (c) -2 0° 

14. Prepare a slurry of sucrose and water for the core; add a small 
amount of sucrase (invertase); immediately coat with chocolate 

15. Sucrose has no free anomeric carbon to undergo mutarotation. 
16. 




H OH 



Yes; yes 

17. TV-Acetyl-jS-D-glucosamine is a reducing sugar; its C-l can be 
oxidized (see Fig. 7-10, p. 241). D-Gluconate is not a reducing 
sugar; its C-l is already at the oxidation state of a carboxylic 
acid. GlcN(al^->la)Glc is not a reducing sugar; the anomeric 
carbons of both monosaccharides are involved in the glycosidic 
bond. 

18. Humans lack cellulase in the gut and cannot break down 
cellulose 

19. Native cellulose consists of glucose units linked by Q31-»4) 
glycosidic bonds, which force the polymer chain into an ex- 
tended conformation Parallel series of these extended chains 
form intermolecular hydrogen bonds, aggregating into long, 
tough, insoluble fibers. Glycogen consists of glucose units 
linked by (al->4) glycosidic bonds, which cause bends in the 
chain and prevent formation of long fibers In addition, glyco- 
gen is highly branched and, because many of its hydroxyl 
groups are exposed to water, is highly hydrated and disperses 
in water. 

Cellulose is a structural material in plants, consistent with its 
side-by-side aggregation into insoluble fibers. Glycogen is a stor- 
age fuel in animals. Highly hydrated glycogen granules with their 
many nonreducing ends are rapidly hydrolyzed by glycogen 
phosphorylase to release glucose 1 -phosphate 

20. Cellulose is several times longer; it assumes an extended confor- 
mation, whereas amylose has a helical structure. 

21. 6,000 residues/s 

22. 11 s 

23. The ball-and-stick model of the disaccharide in Fig. 7-19 shows 
no steric interactions, but a space-filling model, showing atoms 
with their real relative sizes, would show several strong steric 
hindrances in the -170°, -170° conformer that are not present in 
the 30°, -40° conformer. 

24. The negative charges on chondroitin sulfate repel each other 
and force the molecule into an extended conformation The 
polar molecule attracts many water molecules, increasing the 
molecular volume In the dehydrated solid, each negative 
charge is counterbalanced by a positive ion, and the molecule 
condenses. 

25. Positively charged amino acid residues would bind the highly 
negatively charged groups on heparin. In fact, Lys residues of 
antithrombin III interact with heparin 

26. 8 possible sequences, 144 possible linkages, and 64 stereochemi- 
cal possibilities, for a total of 73,728 permutations! 



27. 

•53 Oligosaccharide 

8 chains: 
o 

J3 2 

I- . 

© 

28. Oligosaccharides; their subunits can be combined in more ways 
than the amino acid subunits of oligopeptides Each hydroxyl 
group can participate in glycosidic bonds, and the configuration 
of each glycosidic bond can be either a or /3. The polymer can be 
linear or branched 

29. (a) Branch-point residues yield 2,3-di-O-methylglucose; the un- 
branched residues yield 2,3,6-tri-O-methylglucose (b) 3.75% 

30. Chains of (l->6)-linked D-glucose residues with occasional 
(l->3) -linked branches, with about one branch every 20 residues 

31. (a) The tests involve trying to dissolve only part of the sample in a 
variety of solvents, then analyzing both dissolved and undissolved 
materials to see whether their compositions differ, (b) For a pure 
substance, all molecules are the same and any dissolved fraction 
will have the same composition as any undissolved fraction. An im- 
pure substance is a mixture of more than one compound. When 
treated with a particular solvent, more of one component may dis- 
solve, leaving more of the other component(s) behind As a result, 
the dissolved and undissolved fractions have different composi- 
tions, (c) A quantitative assay allows researchers to be sure that 
none of the activity has been lost through degradation. When de- 
termining the structure of a molecule, it is important that the sam- 
ple under analysis consist only of intact (undegraded) molecules 

If the sample is contaminated with degraded material, this will 
give confusing and perhaps uninterpretable structural results A 
qualitative assay would detect the presence of activity, even if it 
had become significantly degraded, (d) Results 1 and 2. Result 1 
is consistent with the known structure, because type B antigen 
has three molecules of galactose; types A and O each have only 
two. Result 2 is also consistent, because type A has two amino 
sugars (^-acetylgalactosamine and W-acetylglucosamine); types 
B and O have only one (A^-acetylglucosamine). Result 3 is not 
consistent with the known structure: for type A, the glucosamine: 
galactosamine ratio is 1:1; for type B, it is 1:0. (e) The samples 
were probably impure and/or partly degraded The first two re- 
sults were correct possibly because the method was only roughly 
quantitative and thus not as sensitive to inaccuracies in measure- 
ment. The third result is more quantitative and thus more likely 
to differ from predicted values, because of impure or degraded 
samples, (f) An exoglycosidase. If it were an endoglycosidase, 
one of the products of its action on O antigen would include 
galactose, A^-acetylglucosamine, or ^acetylgalactosamine, and at 
least one of those sugars would be able to inhibit the degradation. 
Given that the enzyme is not inhibited by any of these sugars, 
it must be an exoglycosidase, removing only the terminal sugar 
from the chain. The terminal sugar of O antigen is fucose, so fu- 
cose is the only sugar that could inhibit the degradation of O anti- 
gen (g) The exoglycosidase removes ^-acetylgalactosamine 
from A antigen and galactose from B antigen. Because fucose is 
not a product of either reaction, it will not prevent removal of 
these sugars, and the resulting substances will no longer be active 
as A or B antigen However, the products should be active as O 
antigen, because degradation stops at fucose. (h) All the results 
are consistent with Fig 10-15. (1) D-Fucose and L-galactose, 
which would protect against degradation, are not present in any 
of the antigens. (2) The terminal sugar of A antigen is ^-acetyl- 
galactosamine, and this sugar alone protects this antigen from 
degradation. (3) The terminal sugar of B antigen is galactose, 
which is the only sugar capable of protecting this antigen 
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Chapter 8 

1. N-3 and N-7 

2. (5 ' ) GCGCAATATTTTGAGAAATATTGCGC (3') ; it contains a 
palindrome. The individual strands can form hairpin structures; 
the two strands can form a cruciform. 

3. 9.4 X 10" 4 g 

4. (a) 40° (b) 0° 

5. The RNA helix is in the A conformation; the DNA helix is gener- 
ally in the B conformation. 

6. In eukaryotic DNA, about 5% of C residues are methylated. 
5-Methylcytosine can spontaneously deaminate to form thymine; 
the resulting G-T pair is one of the most common mismatches in 
eukaryotic cells. 

7. Without the base, the ribose ring can be opened to generate the 
noncyclic aldehyde form. This, and the loss of base-stacking 
interactions, could contribute significant flexibility to the DNA 
backbone. 

8. Base stacking in nucleic acids tends to reduce the absorption of 
UV light. Denaturation involves loss of base stacking, and UV 
absorption increases 

9. 0 35 mg/mL 
10. 
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Solubilities: phosphate > deoxyribose > guanine. The highly po- 
lar phosphate groups and sugar moieties are on the outside of 
the double helix, exposed to water; the hydrophobic bases are in 
the interior of the helix. 
11. If dCTP is omitted, when the first G residue is encountered in 
the template, ddCTP will be added, and polymerization will halt. 
Only one band will be seen in the sequencing geL 



12. 



13. 



14 



(5')P — GCGCCAUUGC(3')— OH 
(5')P — GCGCCAUUG(3') — OH 
(5')P — GCGCCAUU(3') — OH 
(5')P — GCGCCAU(3') — OH 
(5')P— GCGCCA(3')— OH 
(5')P — GCGCC(3') — OH 
(5')P — GCGC(3') — OH 
(5')P — GCG(3') — OH 
(5')P — GC(3') — OH 
and the nucleoside 5 '-phosphates 

(a) Water is a participant in most biological reactions, including 
those that cause mutations. The low water content in 



endospores reduces the activity of mutation-causing enzymes 
and slows the rate of nonenzymatic depurination reactions, 
which are hydrolysis reactions, (b) UV light induces formation of 
cyclobutane pyrimidine dimers. Because B. subtilis is a soil or- 
ganism, spores can be lofted to the top of the soil or into the air, 
where they may be subject to prolonged UV exposure. 

15. DMT is a blocking group that prevents reaction of the incoming 
base with itself 

16. (a) Right-handed. The base at one 5' end is adenine; at the 
other 5' end, cytidine. (b) Left-handed (c) If you cannot see the 
structures in stereo, see additional tips in the expanded solu- 
tions manual, or use a search engine to find tips online, 

17. (a) It would not be easy! The data for different samples from the 
same organism show significant variation, and the recovery is 
never 100%. The numbers for C and T show much more consis- 
tency than those for A and G, so for C and T it is much easier to 
make the case that samples from the same organism have the 
same composition. But even with the less consistent values for A 
and G, (1) the range of values for different tissues does overlap 
substantially; (2) the difference between different preparations 
of the same tissue is about the same as the difference between 
samples from different tissues; and (3) in samples for which re- 
covery is high, the numbers are more consistent, (b) This 
technique would not be sensitive enough to detect a difference 
between normal and cancerous cells. Cancer is caused by muta- 
tions, but these changes in DNA— a few base pairs out of several 
billion— would be too small to detect with these techniques. 

(c) The ratios of A:G and T:C vary widely among different species, 
For example, in the bacterium Serratia marcescens, both ratios 
are 0.4, meaning that the DNA contains mostly G and C. In 
Haemophilus influenzae, by contrast, the ratios are 1 74 and 
1 .54, meaning that the DNA is mostly A and T. (d) Conclusion 4 
has three requirements: A = T: The table shows an A:T ratio very 
close to 1 in all cases Certainly, the variation in this ratio is sub- 
stantially less than the variation in the A:G and T:C ratios. G = C: 
Again, the G:C ratio is very close to 1, and the other ratios vary 
widely (A + G) = (T + C): This is the puime:pyrimidine ratio, 
which also is very close to 1 . (e) The different "core" fractions 
represent different regions of the wheat germ DNA. If the DNA 
were a monotonous repeating sequence, the base composition of 
all regions would be the same. Because different core regions have 
different sequences, the DNA sequence must be more complex. 



Chapter 9 

1. (a) (5')- 
(30- 

(b) (50- 
(30- 

(c) (50- 
(30- 

(d) (5 )- 
(30- 

(e) (50- 
(3')- 

(0 (50- 
(30- 
(g) (50- 
(30- 
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(50CTG- 
(30GAC- 

-(30 

-(50 



-(30 
-(50 



(h) Method 1: cut the DNA with EcoRl as in (a). At this point, 
one could treat the DNA as in (b) or (d), then ligate a synthetic 
DNA fragment with the BamHI recognition sequence between 
the two resulting blunt ends. Method 2 (more efficient): synthe- 
size a DNA fragment with the structure 

(50AATTGGATCC(30 

(30CCTAGGTTAA(50 

This would ligate efficiently to the sticky ends generated by 
EcoRl cleavage, would introduce a£araHI site, but would not 
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regenerate the EcoRl site, (i) The four fragments (with N = any 
nucleotide), in order of discussion in the problem, are 

(5 ' ) AATTCNNNNCTGCA(3 ') 

(3')GNNNNG(5') 
(5 , )AATTCNNNNGTGCA(3 / ) 

(3')GNNNNC(5') 
(5')AATTGNNNNCTGCA(3') 

(3')CNNNNG(5') 
(5')AATTGNNNNGTGCA(3') 

(3')CNNNNC(5') 

2. A phage DNA can be packaged into infectious phage particles only 
if it is between 40,000 and 53,000 bp in length. Since bacterio- 
phage vectors generally include about 30,000 bp (in two pieces), 
they will not be packaged into phage particles unless they contain 
a sufficient length of inserted DNA (10,000 to 23,000 bp). 

3. (a) Plasmids in which the original pBR322 was regenerated 
without insertion of a foreign DNA fragment; these would retain 
resistance to ampicillin Also, two or more molecules of pBR322 
might be ligated together with or without insertion of foreign 
DNA. (b) The clones in lanes 1 and 2 each have one DNA frag- 
ment inserted in different orientations The clone in lane 3 has 
two DNA fragments, ligated such that the EcoRl proximal ends 
are joined 

4. Focus on the amino acids with the fewest, codons: Met and Trp. 
The best possibility is the span of DNA from the codon for the 
first Trp residue to the first two nucleotides of the codon for He 
The sequence of the probe would be: 

(5')UGGUA(U/C)UG(U/C)AUGGA(U/C)UGGAU 

The synthesis would be designed to incorporate either U or C 
where indicated, producing a mixture of eight 20-nucleotide 
probes that differ only at one or more of these positions. 

5. Your test would require DNA primers, a heat-stable DNA poly- 
merase, deoxynucleoside triphosphates, and a PCR machine 
(thermal cycler) The primers would be designed to amplify a 
DNA segment encompassing the GAG repeat. The DNA strand 
shown is the coding strand, oriented 5'-»3' left to right. The 
primer targeted to DNA to the left of the repeat would be identi- 
cal to any 25-nucleotide sequence shown in the region to the left 
of the CAG repeat. The primer on the right side must be comple- 
mentary and antiparallel to a 25-nucleotide sequence to the 
right of the CAG repeat Using the primers, DNA including the 
CAG repeat would be amplified by PCR, and its size would be de- 
termined by comparison to size markers after electrophoresis. 
The length of the DNA would reflect the length of the CAG re- 
peat, providing a simple test for the disease 

6. Design PCR primers complementary to DNA in the deleted seg- 
ment, but which would direct DNA synthesis away from each 
other. No PCR product will be generated unless the ends of the 
deleted segment are joined to create a circle. 

7. The plant expressing firefly luciferase must take up luciferin, the 
substrate of luciferase, before it can "glow" (albeit weakly). The 
plant expressing green fluorescent protein glows without requir- 
ing any other compound. 

8. None of the children can be excluded. All have one band that 
could be derived from the father 
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10. Simply for convenience; the 200,000 bp Ti plasmid, even when 
the T DNA is removed, is too large to isolate in quantity and ma- 
nipulate in vitro. It is also too large to reintroduce into a cell by 
standard transformation techniques. The vir genes will facilitate 
the transfer of any DNA between the T DNA repeats, even if 
they are on a separate plasmid. The second plasmid in the two- 
plasmid system, because it requires only the T DNA repeats 
and a few sequences necessary for plasmid selection and 
propagation, is relatively small, easily isolated, and easily mani- 
pulated (foreign DNA easily added and/or altered) . 
A B C 

M 
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1 2 
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1 2i 



B 



12. Cover spot 4, add solution containing activated T, irradiate, 
wash. 



1 A-T 



2 G-T 



3. A-T 



4. G-C 



Cover spots 2 and 4, add solution containing activated G, irradi- 
ate, wash. 



1. A-T-G 



2. G-T 



3. A-T-G 



4 . G-C 



Cover spot 3, add solution containing activated C, irradiate, 
wash. 



L A-T-G-C 2.G-T-C 



3, A-T-G 



4. G-C-C 



Cover spots 1, 3, and 4, add solution containing activated C, irra- 
diate, wash. 



1, A-T-G-C 2. G-T-C-C 3. A-T-G 



4, G-C-C 



13 



14 



Cover spots 1 and 2, add solution containing activated G, irradi- 
ate, wash. 

1. A-T-G-C 2 G-T-C-C 3 A-T-G-C 4. G-C-C-C 

The vectors must be introduced into a cell infected with a helper 
virus that can provide the necessary replication and packaging 
functions but cannot itself be packaged. Once recombinant DNA 
is integrated into the chromosome of the target cell with these 
vectors, the lack of recombination and packaging functions 
makes the integration very stable by preventing the deletion or 
replication of the integrated DNA. 

(a) DNA solutions are highly viscous because the very long 
molecules are tangled in solution. Shorter molecules tend to 
tangle less and form a less viscous solution, so decreased vis- 
cosity corresponds to shortening of the polymers— as caused 
by nuclease activity, (b) An endonuclease. An exonuclease 
removes single nucleotides from the 5' or 3' end and would 
produce TCA-soluble 32 P-labeled nucleotides. An endonuclease 
cuts DNA into oligonucleotide fragments and produces little or 
no TCA-soluble 32 P-labeled material, (c) The 5' end. If the 
phosphate were left on the 3' end, the kinase would 
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incorporate significant 32 P as it added phosphate to the 5' end; 
treatment with the phosphatase would have no effect on this. 
In this case, samples A and B would incorporate significant 
amounts of 32 P. When the phosphate is left on the 5' end, the 
kinase does not incorporate any 32 P: it cannot add a phosphate 
if one is already present, Treatment with the phosphatase re- 
moves 5' phosphate, and the kinase then incorporates signifi- 
cant amounts of 32 P Sample A will have little or no 32 P, and B 
will show substantial 32 P incorporation — as was observed, 
(d) Random breaks would produce a distribution of fragments 
of random size, The production of specific fragments indicates 
that the enzyme is site-specific, (e) Cleavage at the site of 
recognition. This produces a specific sequence at the 5' end of 
the fragments. If cleavage occurred near but not within the 
recognition site, the sequence at the 5' end of the fragments 
would be random, (f) The results are consistent with two 
recognition sequences, as shown below, cleaved where shown 
by the arrows: 



(5') — 
(3')-- 



GTT AAC 
CAA TTG- 



-(3') 
-(5') 



which gives the (5')pApApC and (3')TpTp fragments; and 



(5')- 
(3')- 



GTC GAC - 
CAG CTG - 

T 



(5') 
■(5') 



which gives the (5')pGpApC and (3')CpTp fragments 



Chapter 10 

1. The term "lipid" does not specify a particular chemical structure. 
Compounds are categorized as lipids based on their greater solu- 
bility in organic solvents than in water. 

2. (a) The number of cis double bonds. Each cis double bond 
causes a bend in the hydrocarbon chain, lowering the melting 
temperature, (b) Six different triacylglycerols can be con- 
structed, in order of increasing melting points: 

OOO < OOP = OPO < PPO = POP < PPP 

where 0 = oleic and P = palmitic acid. The greater the content of 
saturated fatty acid, the higher is the melting point, (c) Branched- 
chain fatty acids increase the fluidity of membranes because they 
decrease the extent of membrane lipid packing. 

3. Lecithin, an amphipathic compound, is an emulsifying agent, 
facilitating the solubilization of butter. 




Squalen? 



5. Spearmint is (ft)-carvone; caraway is (5)-carvone 



COOH 
H-C-NH 3 

CH 3 

CR)-2-Aminopropanoic acid 



OH 

H 3 C— C— COOH 
H 

CR)-2-Hydroxypropanoic acid (S)-2-Hydroxypropanoic acid 

7. Hydrophobic units: (a) 2 fatty acids; (b), (c), and (d) 1 fatty 
acid and the hydrocarbon chain of sphingosine; (e) steroid nu- 
cleus and acyl side chain Hydrophilic units: (a) phospho- 
ethanolamine; (b) phosphocholine; (c) D-galactose; (d) several 
sugar molecules; (e) alcohol group (OH) 

8. 



COOH 
H3N-C-H 

CH 3 

(S)-2-Aminopropanoic acid 

COOH 
H 3 C — C^OH 
H 




9. It reduces double bonds, which increases the melting point of 
lipids containing the fatty acids. 

10. The triacylglycerols of animal fats (grease) are hydrolyzed by 
NaOH (saponified) to form soaps, which are much more soluble 
in water than are triacylglycerols. 

11. It could only be a sphingolipid (sphingomyelin). 

12. O 

CH 2 0— O 

I o 

I II 

CH— 0— C- 
0 

I II 

CH 2 — O— P— O— CH 2 — CH— NH 3 

0" COO" 
Phosphatidylserine 

13. Long, saturated acyl chains, nearly solid at air temperature, form 
a hydrophobic layer in which a polar compound such as H 2 0 can- 
not dissolve or diffuse. 

14. (a) The free —OH group on C-2 and the phosphocholine head 
group on C-3 are hydrophilic; the fatty acid on C-l of lysolecithin 
is hydrophobic, (b) Certain steroids such as prednisone inhibit 
the action of phospholipase A 2) inhibiting the release of arachi- 
donic acid from C-2 Arachidonic acid is converted to a variety 
of eicosanoids, some of which cause inflammation and pain. 

(c) Phospholipase A 2 releases arachidonic acid, a precursor of 
other eicosanoids with vital protective functions in the body; it 
also breaks down dietary glycerophospholipids. 

15. The part of the membrane lipid that determines blood type is the 
oligosaccharide in the head group of the membrane sphingolipids 
(see Fig 10-15, p, 355). This same oligosaccharide is attached to 
certain membrane glycoproteins, which also serve as points of 
recognition by the antibodies that distinguish blood groups. 

16. Diacylglycerol is hydrophobic and remains in the membrane. 
Inositol 1,4,5-trisphosphate is highly polar, very soluble in water, 
and more readily diffusible in the cytosol. Both are second 
messengers 

17. Water-soluble vitamins are more rapidly excreted in the urine 
and are not stored effectively. Fat-soluble vitamins have very low 
solubility in water and are stored in body lipids 

18. (a) Glycerol and the sodium salts of palmitic and stearic acids, 
(b) D-Glycerol 3-phosphocholine and the sodium salts of 
palmitic and oleic acids. 
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19. Solubilities in water: monoacylglycerol > diacylglycerol > tria- 
cylglycerol. 

20. First eluted to last eluted: cholesteryl palmitate and triacylglyc- 
erol; cholesterol and rc-tetradecanol; phosphatidylcholine and 
phosphatidylethanolamine; sphingomyelin; phosphatidyls erine 
and palmitate. 

21. (a) Subject acid hydrolysates of each compound to chromatog- 
raphy (GLC or silica gel TLC) and compare the result with 
known standards. Sphingomyelin hydrolysate: sphingosine, 
fatty acids, phosphocholine, choline, and phosphate; cerebroside 
hydrolysate: sphingosine, fatty acids, sugars, but no phosphate, 
(b) Strong alkaline hydrolysis of sphingomyelin yields sphingo- 
sine; phosphatidylcholine yields glycerol. Detect hydrolysate 
components on thin-layer chromatograms by comparing with 
standards or by their differential reaction with FDNB (only 
sphingosine reacts to form a colored product) Treatment with 
phospholipase A 1 or A 2 releases free fatty acids from phos- 
phatidylcholine, but not from sphingomyelin. 

22. Phosphatidylethanolamine and phosphatidylserine. 

23. (a) GM1 and globoside. Both glucose and galactose are hexoses, 
so "hexose" in the molar ratio refers to glucose + galactose. The 
ratios for the four gangliosides are: GM1, 1:3:1:1; GM2, 1:2:1:1; 
GM3, 1:2:0:1; globoside, 1:3:1:0. (b) Yes. The ratio matches GM2, 
the ganglioside expected to build up in Tay-Sachs disease (see 
Box 10-2, Fig. 1). (c) This analysis is similar to that used by 
Sanger to determine the amino acid sequence of insulin. The 
analysis of each fragment reveals only its composition, not its 
sequence, but because each fragment is formed by sequential re- 
moval of one sugar, we can draw conclusions about sequence. 
The structure of the normal asialoganglioside is ceramide— 
glucose-galactose-galactosamine-galactose, consistent with 
Box 10-2 (excluding NeuSAc, removed before hydrolysis), 

(d) The Tay-Sachs asialoganglioside is ceramide-glucose- 
galactose-galactosamine, consistent with Box 10-2. (e) The 
structure of the normal asialoganglioside, GM1, is: ceramide- 
glucose (2 — OH involved in glycosidic links; 1 — OH involved in 
ring structure; 3 —OH (2,3,6) free for methylation)- galactose 
(2 —OH in links; 1 —OH in ring; 3 —OH (2,4,6) free for 
methyUtion)~galactosamine (2 —OH in links; 1 —OH in ring; 
1 — NH 2 instead of an —OH; 2 —OH (4,6) free for methylation)- 
galactose (1 —OH in link; 1 —OH in ring; 4 —OH (2,3,4,6) free 
for methylation) (f) Two key pieces of information are missing: 
What are the linkages between the sugars? Where is NeuSAc 
attached? 

Chapter 11 

1. From the known amount of lipid used, its molecular weight, and 
the area occupied by a monolayer (determined as shown), the 
area per molecule can be calculated 

2. The data support a bilayer of lipid in the dog erythrocytes: a sin- 
gle cell, with surface area 98 ^m 2 , has a lipid monolayer area of 
200 fwi . In the case of sheep and human erythrocytes, the data 
suggest a monolayer, not a bilayer In fact, significant experimen- 
tal errors occurred in these early experiments; recent, more ac- 
curate measurements support a bilayer in all cases. 

3. 63 SDS molecules per micelle 

4. (a) Lipids that form bilayers are amphipathic molecules: they 
contain a hydrophilic and a hydrophobic region. To minimize the 
hydrophobic area exposed to the water surface, these lipids form 
two-dimensional sheets with the hydrophilic regions exposed to 
water and the hydrophobic regions buried in the interior of the 
sheet. Furthermore, to avoid exposing the hydrophobic edges 

of the sheet to water, lipid bilayers close upon themselves. 

(b) These sheets form the closed membrane surfaces that envelop 

cells and compartments within cells (organelles). 

5. 2 nm. Two palmitates placed end to end span about 4 nm, 
approximately the thickness of a typical bilayer. 



11. 



6. Decrease Movement of individual lipids in bilayers occurs much 
faster at 37 °C, when the lipids are in the "fluid" phase, than at 
10°C, when they are in the "solid" phase. 

7. 35 kJ/mol, neglecting the effects of transmembrane electrical 
potential; 0.60 mol 

8. 13 kJ/mol. 

9. Most of the 0 2 consumed by a tissue is for oxidative phosphory- 
lation, the source of most of the ATP. Therefore, about two- 
thirds of the ATP synthesized by the kidney is used for pumping 
K + and Na + . 

10. No. The symporter may carry more than one equivalent of Na + 
for each mole of glucose transported. 

Salt extraction indicates a peripheral location, and inaccessibility 
to protease in intact cells indicates an internal location. X seems to 
be a peripheral protein on the cytosolic face of the membrane. 
The hydrophobic interactions among membrane lipids are nonco- 
valent and reversible, allowing membranes to spontaneously reseal. 
13. The temperature of body tissues at the extremities is lower than 
that of tissues closer to the center of the body. If lipid is to 
remain fluid at this lower temperature, it must contain a higher 
proportion of unsaturated fatty acids; unsaturated fatty acids 
lower the melting point of lipid mixtures. 
The energetic cost of moving the highly polar, sometimes 
charged, head group through the hydrophobic interior of the 
bilayer is prohibitive. 

At pH 7, tryptophan bears a positive and a negative charge, but 
indole is uncharged. The movement of the less polar indole 
through the hydrophobic core of the bilayer is energetically more 
favorable 

3 X 10" 2 s 

17. Treat a suspension of cells with unlabeled NEM in the presence 
of excess lactose, remove the lactose, then add radiolabeled 
NEM. Use SDS-PAGE to determine the M r of the radiolabeled 
band (the transporter) 

18. Construct a hydropathy plot; hydrophobic regions of 20 or more 
residues suggest transmembrane segments. Determine whether 
the protein in intact erythrocytes reacts with a membrane- 
impermeant reagent specific for primary amines; if so, the 
transporter is of type I. 

The leucine transporter is specific for the L isomer, but the 
binding site can accommodate either L-leucine or L-valine. 
Reduction of V mSiX in the absence of Na + indicates that leucine 
(or valine) is transported by symport with Na + , 
^max reduced; K t unaffected, 

-1%; estimated by calculating the surface area of the cell and of 
10,000 transporter molecules (using the dimensions of hemoglo- 
bin (p. 159) as a model globular protein) 



14 



15 



19 



20. 
21. 



22. 
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The amino acids with the greatest hydropathy index (V, L, F, and 
C) are clustered on one side of the helix This amphipathic helix 
is likely to dip into the lipid bilayer along its hydrophobic sur- 
face, while exposing the other surface to the aqueous phase. 
Alternatively, a group of helices may cluster with their polar 
surfaces in contact with one another and their hydrophobic 
surfaces facing the lipid bilayer 

23. —22. To estimate the fraction of membrane surface covered by 
phospholipids, you would need to know (or estimate) the aver- 
age cross-sectional area of a phospholipid molecule in a bilayer 
(e g , from an experiment such as that diagrammed in problem 1 
in this chapter) and the average cross-sectional area of a 50 kDa 
protein, 

24. (a) The rise-per-residue for an a helix (Chapter 4) is about 
1.5 A = 0 15 run To span a 4 nm bilayer, an a helix must contain 
about 27 residues; thus for seven spans, about 190 residues are 
required A protein of M T 64,000 has about 580 residues (b) A 
hydropathy plot is used to locate transmembrane regions. 

(c) Because about half of this portion of the epinephrine receptor 
consists of charged residues, it probably represents an intracellu- 
lar loop that connects two adjacent membrane-spanning regions 
of the protein, (d) Because this helix is composed mostly of 
hydrophobic residues, this portion of the receptor is probably 
one of the membrane-spanning regions of the protein. 

25. (a) Model A: supported The two dark lines are either the pro- 
tein layers or the phospholipid heads, and the clear space is ei- 
ther the bilayer or the hydrophobic core, respectively. Model B: 
not supported This model requires a more-or-less uniformly 
stained band surrounding the cell, Model C: supported, with one 
reservation. The two dark lines are the phospholipid heads; the 
clear zone is the tails. This assumes that the membrane proteins 
are not visible, because they do not stain with osmium or do not 
happen to be in the sections viewed (b) Model A: supported A 
"naked" bilayer (4.5 nm) + two layers of protein (2 nm) sums to 
6.5 nm, which is within the observed range of thickness. Model B: 
neither, This model makes no predictions about membrane 
thickness. Model C: unclear, The result is hard to reconcile with 
this model, which predicts a membrane as thick as, or slightly 
thicker than (due to the projecting ends of embedded proteins), 
a "naked" bilayer The model is supported only if the smallest 
values for membrane thickness are correct or if a substantial 
amount of protein projects from the bilayer (c) Model A: 
unclear, The result is hard to reconcile with this model If the 
proteins are bound to the membrane by ionic interactions, the 
model predicts that the proteins contain a high proportion of 
charged amino acids, in contrast to what was observed, Also, be- 
cause the protein layer must be very thin (see (b)), there would 
not be much room for a hydrophobic protein core, so hydropho- 
bic residues would be exposed to the solvent. Model B: sup- 
ported. The proteins have a mixture of hydrophobic residues 
(interacting with lipids) and charged residues (interacting with 
water) Model C: supported. The proteins have a mixture of 
hydrophobic residues (anchoring in the membrane) and charged 
residues (interacting with water), (d) Model A: unclear. The 
result is hard to reconcile with this model, which predicts a ratio 
of exactly 2.0; this would be hard to achieve under physiologi- 
cally relevant pressures, Model B; neither. This model makes no 
predictions about amount of lipid in the membrane. Model C: 
supported. Some membrane surface area is taken up with pro- 
teins, so the ratio would be less than 2.0, as was observed under 
more physiologically relevant conditions, (e) Model A: unclear. 
The model predicts proteins in extended conformations rather 
than globular, so supported only if one assumes that proteins lay- 
ered on the surfaces include helical segments. Model B: sup- 
ported. The model predicts mostly globular proteins (containing 
some helical segments) Model C: supported. The model predicts 
mostly globular proteins. (J) Model A: unclear. The phosphoryl- 
amine head groups are protected by the protein layer, but only if 
the proteins completely cover the surface will the phospholipids 



be completely protected from phospholipase. Model B: supported. 
Most head groups are accessible to phospholipase. Model C: 
supported. All head groups are accessible to phospholipase 
(g) Model A: not supported. Proteins are entirely accessible to 
trypsin digestion and virtually all will undergo multiple cleavage, 
with no protected hydrophobic segments. Model B: not supported. 
Virtually all proteins are in the bilayer and inaccessible to trypsin 
Model C: supported Segments of protein that penetrate or span 
the bilayer are protected from trypsin; those exposed at the sur- 
faces will be cleaved. The trypsin-resistant portions have a high 
proportion of hydrophobic residues. 

Chapter 12 

1. X is cAMP; its production is stimulated by epinephrine. 

(a) Centrifugation sediments adenylyl cyclase (which catalyzes 
cAMP formation) in the particulate fraction, (b) Added cAMP 
stimulates glycogen phosphorylase. (c) cAMP is heat stable; it 
can be prepared by treating ATP with barium hydroxide 

2. Unlike cAMP, dibutyryl cAMP passes readily through the plasma 
membrane 

3. (a) It increases [cAMP]. (b) cAMP regulates Na + permeability, 
(c) Replace lost body fluids and electrolytes. 

4. (a) The mutation makes R unable to bind and inhibit C, so C is 
constantly active, (b) The mutation prevents cAMP binding to R, 
leaving C inhibited by bound R. 

5. Albuterol raises [cAMP], leading to relaxation and dilation of the 
bronchi and bronchioles . Because ^-adrenergic receptors control 
many other processes, this drug would have undesirable side 
effects To ininimize them, find an agonist specific for the subtype 
of jS-adrenergic receptors found in the bronchial smooth muscle. 

6. Hormone degradation; hydrolysis of GTP bound to a G protein; 
degradation, metabolism, or sequestration of second messenger; 
receptor desensitization; removal of receptor from the cell surface. 

7. Fuse CFP to /3-arrestin and YFP to the cytoplasmic domain of the 
jS-adrenergic receptor, or vice versa. In either case, illuminate at 
433 nm and observe at both 476 and 527 nm. If the interaction oc- 
currs, emitted light intensity will decrease at 476 nm and increase 
at 527 nm on addition of epinephrine to cells expressing the fu- 
sion proteins. If the interaction does not occur, the wavelength of 
emitted light will remain at 476 nm. Some reasons why this might 
fail: the fusion proteins (1) are inactive or otherwise unable to in- 
teract, (2) are not translocated to their normal subcellular 
location, or (3) are not stable to proteolytic breakdown, 

8. Vasopressin acts by elevating cytosolic [Ca 2+ ] to 10" 6 m, activat- 
ing protein kinase C. EGTA injection blocks vasopressin action, 
but should not affect the response to glucagon, which uses 
cAMP, not Ca 2+ , as second messenger 

9. Amplification results as one molecule of a catalyst activates many 
molecules of another catalyst, in an amplification cascade involv- 
ing, in order, insulin receptor, IRS-1, Raf, MEK, ERK; ERK acti- 
vates a transcription factor, which stimulates mRNA production. 

10. A mutation in ras that inactivates the Ras GTPase activity cre- 
ates a protein that, once activated by the binding of GTP, contin- 
ues to give, through Raf, the insulin-response signal. 

11. Shared properties of Ras and G s : both bind either GDP or GTP; 
both are activated by GTP; both, when active, activate a down- 
stream enzyme; both have intrinsic GTPase activity that shuts 
them off after a short period of activation. Differences between 
Ras and G s : Ras is a small, monomeric protein; G s is het- 
erotrimeric. Functional difference between G s and G^ G s acti- 
vates adenylyl cyclase, Gj inhibits it 

12. Kinase (factor in parentheses): PKA (cAMP); PKG (cGMP); 
PKC (Ca 2+ , DAG); Ca 2+ /CaM kinase (Ca 2+ , CaM); cyclin- 
dependent kinase (cyclin); protein Tyr kinase (ligand for the 
receptor, such as insulin); MAPK (Raf); Raf (Ras); glycogen 
phosphorylase kinase (PKA). 
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13. G s remains in its activated form when the nonhydrolyzable ana- 
log is bound. The analog therefore prolongs the effect of 
epinephrine on the injected cell. 

14. (a) Use the a-bungarotoxin-bound beads for affinity purification 
(see Fig. 3-1 7c, p 87) of AChR Extract proteins from the elec- 
tric organs and pass the mixture through the chromatography 
column; the AChR binds selectively to the beads. Elute the AChR 
with a solute that weakens its interaction with a-bungarotoxin 
(b) Use binding of [ 125 I]a-bungarotoxin as a quantitative assay 
for AChR during purification by various techniques. At each step, 
assay AChR by measuring [ 125 I]a-bungarotoxin binding to the 
proteins in the sample Optimize purification for the highest spe- 
cific activity of AChR (counts/min of bound [ 125 I]a-bungarotoxin 
per mg of protein) in the final material 

15. (a) No. If V m were set by permeability to (primarily) K + , the 
Nernst equation would predict a V m of -90 mV, not the observed 
-95 mV, so some other conductance must contribute to V m 

(b) Chloride ion is probably the determinant of V m ; the 
predicted E C{ - is -94 mV. 

16. (a) V m of the oocyte membrane changes from -60 mVto -10 
mV— that is, the membrane is depolarized (b) The effect of KC1 
depends on influx of Ca 2+ from the extraceUular medium. 

17. Hyperpolarization results in the closing of voltage-dependent Ca 2+ 
channels in the presynaptic region of the rod cell. The resulting 
decrease in [Ca 2+ ] iJ1 diminishes release of an inhibitory neuro- 
transmitter that suppresses activity in the next neuron of the 
visual circuit When this inhibition is removed in response to a 
light stimulus, the circuit becomes active and visual centers in the 
brain are excited 

18. (a) This would prevent influx of Na + and Ca 2+ into the cells in 
response to light; the cone cells would fail to signal the brain that 
light had been received Because rod cells are unaffected, the 
individuals would be able to see but would not have color vision, 
(b) This would prevent efflux of K + , which would lead to depo- 
larization of the /3-cell membrane and constitutive release of 
insulin into the blood (c) ATP is responsible for closing this 
channel, so the channels will remain open, preventing depolar- 
ization of the /3-cell membrane and release of insulin. 

19. Individuals with Oguchi's disease might have a defect in 
rhodopsin kinase or in arrestin 

20. Rod cells would no longer show any change in membrane poten- 
tial in response to light This experiment has been done. Illumi- 
nation did activate PDE, but the enzyme could not significantly 
reduce the 8-Br-cGMP level, which remained well above that 
needed to keep the gated ion channels open Thus, light had no 
impact on membrane potential 

21. (a) On exposure to heat, TRPV1 channels open, causing an influx 
of Na + and Ca 2+ into the sensory neuron. This depolarizes the neu- 
ron, triggering an action potential When the action potential 
reaches the axon terminus, neurotransmitter is released, signaling 
the nervous system that heat has been sensed (b) Capsaicin mim- 
ics the effects of heat by opening TRPV1 at low temperature, lead- 
ing to the false sensation of heat The extiemely low EC 5U indicates 
that even very small amounts of capsaicin will have dramatic sen- 
sory effects, (c) At low levels, menthol should open the TRPM8 
channel, leading to a sensation of cool; at high levels, both TRPM8 
and TRPV3 will open, leading to a mixed sensation of cool and 
heat, such as you may have experienced with very strong 
peppermints 

22. (a) These mutations might lead to permanent activation of the 
PGE 2 receptor, leading to unregulated cell division and tumor 
formation, (b) The viral gene might encode a constitutively 
active form of the receptor, causing a constant signal for cell 
division and thus tumor formation (c) E1A protein might bind 
to pRb and prevent E2F from binding, so E2F is constantly 
active and cells divide uncontrollably, (d) Lung cells do not nor- 
mally respond to PGE 2 because they do not express the PGE 2 



receptor; mutations resulting in a constitutively active PGE2 
receptor do not affect lung cells. 

23. A normal tumor suppressor gene encodes a protein that restrains 
cell division. A mutant form of the protein fails to suppress cell 
division, but if either of the two alleles encodes normal protein, 
normal function will continue A normal oncogene encodes a reg- 
ulator protein that triggers cell division, but only when an appro- 
priate signal (growth factor) is present. The mutant version of 
the oncogene product constantly sends the signal to divide, 
whether or not growth factors are present. 

24. In a child who develops multiple tumors in both eyes, every reti- 
nal cell had a defective copy of the Rb gene at birth. Early in the 
child's life, several cells independently undergo a second muta- 
tion that damages the one good Rb allele, producing a tumor A 
child who develops a single tumor had, at birth, two good copies 
of the Rb gene in every cell; mutation in both Rb alleles in one 
cell (extremely rare) causes a single tumor 

25. Two cells expressing the same surface receptor may have 
different complements of target proteins for protein 
phosphorylation 

26. (a) The cell-based model, which predicts different receptors 
present on different cells, (b) This experiment addresses the 
issue of the independence of different taste sensations. Even 
though the receptors for sweet and/or umami are missing, the 
animals' other taste sensations are normal; thus, pleasant and 
unpleasant taste sensations are independent, (c) Yes. Loss of 
either T1R1 or T1R3 subunits abolishes umami taste sensation, 
(d) Both models With either model, removing one receptor 
would abolish that taste sensation, (e) Yes Loss of either the 
T1R2 or T1R3 subunits almost completely abolishes the sweet 
taste sensation; complete elimination of sweet taste requires 
deletion of both subunits. (f) At very high sucrose concentra- 
tions, T1R2 and, to a lesser extent, T1R3 receptors, as homo- 
dimers, can detect sweet taste, (g) The results are consistent 
with either model of taste encoding, but do strengthen the re- 
searchers' conclusions. Ligand binding can be completely sepa- 
rated from taste sensation If the ligand for the receptor in 
"sweet-tasting cells" binds a molecule, mice prefer that molecule 
as a sweet compound 



Chapter 13 

1. Consider the developing chick as the system; the nutrients, egg 
shell, and outside world are the surroundings. Transformation of 
the single cell into a chick drastically reduces the entropy of the 
system. Initially, the parts of the egg outside the embryo (the 
surroundings) contain complex fuel molecules (a low-entropy 
condition) During incubation, some of these complex molecules 
are converted to large numbers of C0 2 and H 2 0 molecules (high 
entropy). This increase in the entropy of the surroundings is 
larger than the decrease in entropy of the chick (the system), 

2. (a) -4.8 kJ/mol (b) 7 56 kj/mol (c) -13 7 kJ/mol 

3. (a) 262 (b) 608 (c) 0.30 

4. K' eil = 21; AG' 0 = -7.6 kJ/mol 

5. -31 kJ/mol 

6. (a) - 1 68 kJ/mol (b) -4 4 kJ/mol (c) At a given temperature, 
the value of AG'° for any reaction is fixed and is defined for 
standard conditions (here, both fructose 6-phosphate and 
glucose 6-phosphate at 1 m) In contrast, AG is a variable 

that can be calculated for any set of reactant and product 
concentrations. 

7. AG'° ~ 0; K' eq ~ 1 

8. Less. The overall equation for ATP hydrolysis can be approxi- 
mated as 

ATP 4 " + H 2 0-*ADP 3 " + HPOr + H + 
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(This is only an approximation, because the ionized species 
shown here are the major, but not the only, forms present ) Un- 
der standard conditions (i.e., [ATP] - [ADP] = [PJ - 1 m), the 
concentration of water is 55 M and does not change during the 
reaction. Because H + ions are produced in the reaction, at a 
higher [H + ] (pH 5.0) the equilibrium would be shifted to the left 
and less free energy would be released. 
9. 10 
10. 0 p 
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AG for ATP hydrolysis is lower when [ATP]/[ADP] is low («1) 
than when [ATP]/[ADP] is high The energy available to the cell 
from a given [ATP] is lower when the [ATP]/[ADP] ratio falls and 
greater when it rises. 

11. (a) 3 85 X IO^m" 1 ; [glucose 6-phosphate] = 8.9 X 10~ 8 M;no. 
(b) 14 M; because the maximum solubility of glucose is less than 
1 M, this is not a reasonable step (c) 837 (AG'° - - 16 7 kJ/mol); 
[glucose] - 1.2 X 10" 7 m; yes (d) No. This would require such 
high [Pi] that the phosphate salts of divalent cations would pre- 
cipitate, (e) By directly transferring the phosphoryl group from 
ATP to glucose, the phosphoryl group transfer potential ("ten- 
dency" or "pressure") of ATP is utilized without generating high 
concentrations of intermediates The essential part of this trans- 
fer is, of course, the enzymatic catalysis - 

12. (a) -12 5 kJ/mol (b) -14 6 kJ/mol 

13. (a) 3 X 10" 4 (b) 68 7 (c) 7.4 X 10 4 

14. -13kJ/mol 

15. 46.7kJ/mol 

16. Isomerization moves the carbonyl group from C-l to C-2, setting 
up a carbon-carbon bond cleavage between C-3 and C-4 With- 
out isomerization, bond cleavage would occur between C-2 

and C-3, generating one two-carbon and one four-carbon 
compound 

17. The mechanism is the same as that of the alcohol dehydrogenase 
reaction (Fig 14-13, p 547) 

18. The first step is the reverse of an aldol condensation (see the al- 
dolase mechanism, Fig 14-5, p. 534); the second step is an aldol 
condensation (see Fig 13-4, p 497) 

19. (a) 46 kJ/mol (b) 46 kg; 68% (c) ATP is synthesized as it is 
needed, then broken down to ADP and P^ its concentration is 
maintained in a steady state, 

20. The ATP system is in a dynamic steady state; [ATP] remains 
constant because the rate of ATP consumption equals its rate of 
synthesis ATP consumption involves release of the terminal (-y) 
phosphoryl group; synthesis of ATP from ADP involves replace- 
ment of this phosphoryl group _ Hence the terminal phosphate 
undergoes rapid turnover. In contrast, the central (j8) phosphate 
undergoes only relatively slow turnover. 

21. (a) 1.7 kJ/mol (b) Inorganic pyrophosphatase catalyzes the 
hydrolysis of pyrophosphate and drives the net reaction toward 
the synthesis of acetyl-CoA 

22. 36kJ/mol 



23. (a) NAD + /NADH (b) Pyruvate/lactate (c) Lactate formation 
(d) -26.1 kJ/mol (e) 3.63 X 10 4 

24. (a) 1 14 V (b) -220 kJ/mol (c) ~4 

25. (a) -0.35 V (b) -0.320 V (c) -0.29 V 

26. In order of increasing tendency: (a), (d), (b), (c) 

27. (c) and (d) 

28. (a) The lowest-energy, highest-entropy state occurs when the 
dye concentration is the same in both cells. If a "fish trap" gap 
junction allowed unidirectional transport, more of the dye would 
end up in the oligodendrocyte and less in the astrocyte This 
would be a higher-energy, lower-entropy state than the starting 
state, violating the second law of thermodynamics. Robinson 

et al.'s model requires an impossible spontaneous decrease in 
entropy. In terms of energy, the model entails a spontaneous 
change from a lower-energy to a higher-energy state without an 
energy input— again, thermodynamically impossible, (b) Mole- 
cules, unlike fish, do not exhibit directed behavior; they move 
randomly by Brownian motion. Diffusion results in net move- 
ment of molecules from a region of higher concentration to a re- 
gion of lower concentration simply because it is more likely that 
a molecule on the high-concentration side will enter the connect- 
ing channel Look at this as a pathway with a rate-limiting step: 
the narrow end of the channel, The narrower end limits the rate 
at which molecules pass through because random motion of the 
molecules is less likely to move them through the smaller cross 
section. The wide end of the channel does not act like a funnel 
for molecules, although it may for fish, because molecules are 
not "crowded" by the sides of the narrowing funnel as fish would 
be The narrow end limits the rate of movement equally in both 
directions. When the concentrations on both sides are equal, the 
rates of movement in both directions are equal and there will be 
no change in concentration (c) Fish exhibit nonrandom 
behavior, adjusting their actions in response to the environment, 
Fish that enter the large opening of the channel tend to move 
forward because fish have behavior that tends to make them pre- 
fer forward movement, and they experience "crowding" as they 
move through the narrowing channel It is easy for fish to enter 
the large opening, but they don't move out of the trap as readily 
because they are less likely to enter the small opening, (d) There 
are many possible explanations, some of which were proposed 
by the letter- writers who criticized the article. Here are two 

(1) The dye could bind to a molecule in the oligodendrocyte 
Binding effectively removes the dye from the bulk solvent, so 
it doesn't "count" as a solute for thermodynamic considera- 
tions yet remains visible in the fluorescence microscope 

(2) The dye could be sequestered in a subcellular organelle 
of the oligodendrocyte, either actively pumped at the expense 
of ATP or drawn in by its attraction to other molecules in that 
organelle 



Chapter 14 

1. Net equation: Glucose + 2ATP -> 2 glyceraldehyde 3~phosphate 
+ 2 ADP + 2H + ; AG'° = 2.1 kJ/mol 

2. Net equation: 2 Glyceraldehyde 3-phosphate + 4ADP + 
2 lactate + 2NAD + ; AG'° = -114 kJ/mol 

3. GLUT2 (and GLUT1) is found in liver and is always present in 
the plasma membrane of hepatocytes, GLUT3 is always present 
in the plasma membrane of certain brain cells. GLUT4 is nor- 
mally sequestered in vesicles in cells of muscle and adipose tis- 
sue and enters the plasma membrane only in response to insulin. 
Thus, liver and brain can take up glucose from blood regardless 
of insulin level, but muscle and adipose tissue take up glucose 
only when insulin levels are elevated in response to high blood 
glucose 

4. CH 3 CHO + NADH + H + CH 3 CH 2 OH + NAD + ; K' m = 1 .45 X 10 4 
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5. -8.6kJ/mol 

6. (a) 14 CH 3 CH 2 OH (b) [3- 14 C]glucose or [4- 14 C]glucose 

7. Fermentation releases energy, some conserved in the form of 
ATP but much of it dissipated as heat. Unless the fermenter con- 
tents are cooled, the temperature would become high enough to 
kill the microorganisms 

8. Soybeans and wheat contain starch, a polymer of glucose The 
microorganisms break down starch to glucose, glucose to pyruvate 
via glycolysis, and— because the process is carried out in the 
absence of 0 2 (i.e., it is a fermentation)— pyruvate to lactic acid 
and ethanol. If 0 2 were present, pyruvate would be oxidized to 
acetyl-CoA, then to C0 2 and H 2 0. Some of the acetyl-CoA, how- 
ever, would also be hydrolyzed to acetic acid (vinegar) in the 
presence of oxygen 

9. C-l This experiment demonstrates the reversibility of the 
aldolase reaction. The C-l of glyceraldehyde 3-phosphate is 
equivalent to C-4 of fructose 1,6-bisphosphate (see Fig. 14-6). 
The starting glyceraldehyde 3-phosphate must have been labeled 
at C-l. The C-3 of dihydroxyacetone phosphate becomes labeled 
through the triose phosphate isomerase reaction, thus giving rise 
to fructose 1,6-bisphosphate labeled at C-3. 

10. No. There would be no anaerobic production of ATP; aerobic 
ATP production would be diminished only slightly. 

11. No. Lactate dehydrogenase is required to recycle NAD + from 
the NADH formed during the oxidation of glyceraldehyde 
3-phosphate 

12. The transformation of glucose to lactate occurs when myocytes 
are low in oxygen, and it provides a means of generating ATP 
under 0 2 -deficient conditions Because lactate can be oxidized 
to pyruvate, glucose is not wasted; pyruvate is oxidized by 
aerobic reactions when 0 2 becomes plentiful. This metabolic 
flexibility gives the organism a greater capacity to adapt to its 
environment. 

13. It rapidly removes the 1,3-bisphosphoglycerate in a favorable 
subsequent step, catalyzed by phosphoglycerate kinase. 

14. (a) 3-Phosphoglycerate is the product, (b) In the presence 
of arsenate there is no net ATP synthesis under anaerobic 
conditions 

(a) Ethanol fermentation requires 2 mol of P t per mole of glu- 
cose, (b) Ethanol is the reduced product formed during reoxida- 
tion of NADH to NAD + , and C0 2 is the byproduct of the 
conversion of pyruvate to ethanol. Yes; pyruvate must be 
converted to ethanol, to produce a continuous supply of NAD + 
for the oxidation of glyceraldehyde 3-phosphate. Fructose 
1,6-bisphosphate accumulates; it is formed as an intermediate in 
glycolysis, (c) Arsenate replaces Pj in the glyceraldehyde 
3-phosphate dehydrogenase reaction to yield an acyl 
arsenate, which spontaneously hydrolyzes. This prevents 
formation of ATP, but 3-phosphoglycerate continues through 
the pathway. 

Dietary niacin is used to synthesize NAD + Oxidations carried 
out by NAD + are part of cyclic processes, with NAD + as electron 
carrier (reducing agent); one molecule of NAD + can oxidize 
many thousands of molecules of glucose, and thus the 
dietary requirement for the precursor vitamin (niacin) is 
relatively small. 

17. Dihydroxyacetone phosphate + NADH + H + Glycerol 
3-phosphate + NAD + (catalyzed by a dehydrogenase) 
Galactokinase deficiency: galactose (less toxic); UDP-glucose: 
galactose 1-phosphate uridylyl deficiency: galactose 
1 -phosphate (more toxic) 

The proteins are degraded to amino acids and used for 
gluconeogenesis. 

20. (a) In the pyruvate carboxylase reaction, 14 C0 2 is added to 
pyruvate, but PEP carboxykinase removes the same C0 2 in the 
next step, Thus, 14 C is not (initially) incorporated into glucose 



15. 
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Fructose 1,6-bisphosphate 

21. 4 ATP equivalents per glucose molecule 

22. Gluconeogenesis would be highly endergonic, and it would be 
impossible to separately regulate gluconeogenesis and glycolysis. 

23. The cell "spends" 1 ATP and 1 GTP in converting pyruvate to PEP. 

24. (a), (b), (d) are glucogenic; (c) (e) are not. 

25. Consumption of alcohol forces competition for NAD + between 
ethanol metabolism and gluconeogenesis. The problem is com- 
pounded by strenuous exercise and lack of food, because at 
these times the level of blood glucose is already low. 

26. (a) The rapid increase in glycolysis; the rise in pyruvate and 
NADH results in a rise in lactate, (b) Lactate is transformed to 
glucose via pyruvate; this is a slower process, because formation 
of pyruvate is limited by NAD + availability, the LDH equilibrium 
is in favor of lactate, and conversion of pyruvate to glucose is 
energy-requiring, (c) The equilibrium for the LDH reaction is in 
favor of lactate formation 

27. Lactate is transformed to glucose in the liver by gluconeogenesis 
(see Figs 14-15, 14-16). A defect in FBPase-1 would prevent 
entry of lactate into the gluconeogenic pathway in hepatocytes, 
causing lactate to accumulate in the blood 

28. Succinate is transformed to oxaloacetate, which passes into the 
cytosol and is converted to PEP by PEP carboxykinase. Two 
moles of PEP are then required to produce a mole of glucose by 
the route outlined in Figure 14-16. 

29. If the catabolic and anabolic pathways of glucose metabolism are 
operating simultaneously, futile cycling of ATP occurs, with extra 
0 2 consumption. 

30. At the very least, accumulation of ribose 5-phosphate would tend 
to force this reaction in the reverse direction by mass action (see 
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Eqn 13^1, p. 493). It might also affect other metabolic reactions 
that involve ribose 5-phosphate as a substrate or product — such 
as the pathways of nucleotide synthesis, 
31. (a) Ethanol tolerance is likely to involve many more genes, and 
thus the engineering would be a much more involved project, 
(b) L-Arabinose isomerase (the araA enzyme) converts an aldose 
to a ketose by moving the carbonyl of a nonphosphorylated sugar 
from C-l to C-2. No analogous enzyme is discussed in this chapter; 
all the enzymes described here act on phosphorylated sugars. An 
enzyme that carries out a similar transformation with phosphory- 
lated sugars is phosphohexose isomerase. L-Ribulokinase (araB) 
phosphorylates a sugar at C-5 by transferring the y phosphate 
from ATP. Many such reactions are described in this chapter, in- 
cluding the hexokinase reaction. L-Ribulose 5-phosphate epimerase 
(araD) switches the — H and — OH groups on a chiral carbon of a 
sugar. No analogous reaction is described in the chapter, but it is 
described in Chapter 20 (see Fig. 20-1, p. 774). (c) The three 
ara enzymes would convert arabinose to xylulose 5-phosphate 
by the following pathway: Arabinose L ' arabinose isomerase > L-ribulose 
L-ribuiokinase > L _ ribu i ose 5-phosphate epimerase > xylulose 
5-phosphate. (d) The arabinose is converted to xylulose 

5- phosphate as in (c), which enters the pathway in Fig. 14-22; 
the glucose 6-phosphate product is then fermented to ethanol 
and C0 2 . (e) 6 molecules of arabinose + 6 molecules of ATP are 
converted to 6 molecules of xylulose 5-phosphate, which feed 
into the pathway in Fig. 14-22 to yield 5 molecules of glucose 

6- phosphate, each of which is fermented to yield 3 ATP (they 
enter as glucose 6-phosphate, not glucose) — 15 ATP in all Over- 
all, you would expect a yield of 15 ATP - 6 ATP = 9 ATP from 
the 6 arabinose molecules. The other products are 10 molecules 
of ethanol and 10 molecules of C0 2 (f) Given the lower ATP 
yield, for an amount of growth (i.e., of available ATP) equivalent 
to growth without the added genes the engineered Z. mobilis 
must ferment more arabinose, and thus it produces more 
ethanol. (g) One way to allow the use of xylose would be to add 
the genes for two enzymes: an analog of the araD enzyme that 
converts xylose to ribose by switching the — H and — OH on C-3, 
and an analog of the araB enzyme that phosphorylates ribose at 
C-5. The resulting ribose 5-phosphate would feed into the existing 
pathway. 



Chapter 15 

1. (a) 0.0293 (b) 308 (c) No. Q is much lower thanif; q , indicating 
that the PFK-1 reaction is far from equilibrium in cells; this 
reaction is slower than the subsequent reactions in glycolysis. 
Flux through the glycolytic pathway is largely determined by the 
activity of PFK-1. 

2. (a) 1.4 X 10~ 9 m (b) The physiological concentration 
(0.023 mM) is 16,000 times the equilibrium concentration; this 
reaction does not reach equilibrium in the cell. Many reactions in 
the cell are not at equilibrium. 

3. In the absence of 0 2 , the ATP needs are met by anaerobic glucose 
metabolism (fermentation to lactate). Because aerobic oxidation 
of glucose produces far more ATP than does fermentation, less 
glucose is needed to produce the same amount of ATP. 

4. (a) There are two binding sites for ATP: a catalytic site and a 
regulatory site. Binding of ATP to a regulatory site inhibits 
PFK-1, by reducing 7 max or increasing K m for ATP at the 
catalytic site (b) Glycolytic flux is reduced when ATP is plentiful, 
(c) The graph indicates that increased [ADP] suppresses the 
inhibition by ATP. Because the adenine nucleotide pool is fairly 
constant, consumption of ATP leads to an increase in [ADP]. The 
data show that the activity of PFK-1 may be regulated by the 
[ATP]/[ADP] ratio. 

5. The phosphate group of glucose 6-phosphate is completely ion- 
ized at pH 7, giving the molecule an overall negative charge. 
Because membranes are generally impermeable to electrically 



charged molecules, glucose 6-phosphate cannot pass from the 
bloodstream into cells and hence cannot enter the glycolytic 
pathway and generate ATP. (This is why glucose, once phospho- 
rylated, cannot escape from the cell.) 

6. (a) In muscle: glycogen breakdown supplies energy (ATP) via 
glycolysis Glycogen phosphorylase catalyzes the conversion of 
stored glycogen to glucose 1-phosphate, which is converted to 
glucose 6-phosphate, an intermediate in glycolysis. During stren- 
uous activity, skeletal muscle requires large quantities of glucose 
6-phosphate, In the liver: glycogen breakdown maintains a 
steady level of blood glucose between meals (glucose 6-phosphate 
is converted to free glucose), (b) In actively working muscle, 
ATP flux requirements are very high and glucose 1-phosphate 
must be produced rapidly, requiring a high F max . 

7. (a) [Pi]/[glucose 1-phosphate] = 3.3/1 (b) and (c) The value of 
this ratio in the cell (>100:1) indicates that [glucose 1-phosphate] 
is far below the equilibrium value. The rate at which glucose 
1-phosphate is removed (through entry into glycolysis) is greater 
than its rate of production (by the glycogen phosphorylase 
reaction), so metabolite flow is from glycogen to glucose 
1-phosphate. The glycogen phosphorylase reaction is probably 
the regulatory step in glycogen breakdown. 

8. (a) Increases (b) Decreases (c) Increases 

9. Resting: [ATP] high; [AMP] low; [acetyl-CoA] and [citrate] inter- 
mediate. Running: [ATP] intermediate; [AMP] high; [acetyl-CoA] 
and [citrate] low. Glucose flux through glycolysis increases 
during the anaerobic sprint because (1) the ATP inhibition of 
glycogen phosphorylase and PFK-1 is partially relieved, (2) AMP 
stimulates both enzymes, and (3) lower citrate and acetyl-CoA 
levels relieve their inhibitory effects on PFK-1 and pyruvate 
kinase, respectively. 

10. The migrating bird relies on the highly efficient aerobic oxidation 
of fats, rather than the anaerobic metabolism of glucose used by 
a sprinting rabbit. The bird reserves its muscle glycogen for short 
bursts of energy during emergencies. 

11. Case A: (f), (3); Case B: (c), (3); Case C: (h), (4); Case D: 
(d), (6) 

12. (a) (1) Adipose: fatty acid synthesis slower. (2) Muscle: glycoly- 
sis, fatty acid synthesis, and glycogen synthesis slower. (3) Liver: 
glycolysis faster; gluconeogenesis, glycogen synthesis, and fatty 
acid synthesis slower; pentose phosphate pathway unchanged, 
(b) (1) Adipose and (3) liver: fatty acid synthesis slower be- 
cause lack of insulin results in inactive acetyl-CoA carboxylase, 
the first enzyme of fatty acid synthesis. Glycogen synthesis 
inhibited by cAMP-dependent phosphorylation (thus activation) 
of glycogen synthase. (2) Muscle: glycolysis slower because 
GLUT4 is inactive, so glucose uptake is inhibited, (3) Liver: 
glycolysis slower because the bifunctional PFK-2/FBPase-2 is 
converted to the form with active FBPase-2, decreasing [fructose 
2,6-bisphosphate], which allosterically stimulates phosphofruc- 
tokinase and inhibits FBPase-1; this also accounts for the 
stimulation of gluconeogenesis. 

13. (a) Elevated (b) Elevated (c) Elevated 

14. (a) PKA cannot be activated in response to glucagon or epineph- 
rine, and glycogen phosphorylase is not activated, (b) PP1 remains 
active, allowing it to dephosphorylate glycogen synthase (activat- 
ing it) and glycogen phosphorylase (inhibiting it), (c) Phosphory- 
lase remains phosphorylated (active), increasing the breakdown of 
glycogen, (d) Gluconeogenesis cannot be stimulated when blood 
glucose is low, leading to dangerously low blood glucose during 
periods of fasting. 

15. The drop in blood glucose triggers release of glucagon by the 
pancreas. In the liver, glucagon activates glycogen phosphorylase 
by stimulating its cAMP-dependent phosphorylation and stimulates 
gluconeogenesis by lowering [fructose 2,6-bisphosphate], thus 
stimulating FBPase-1. 
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16. (a) Reduced capacity to mobilize glycogen; lowered blood 
glucose between meals (b) Reduced capacity to lower blood 
glucose after a carbohydrate meal; elevated blood glucose 
(c) Reduced fructose 2,6-bisphosphate (F26BP) in liver, 
stimulating glycolysis and inhibiting gluconeogenesis (d) Reduced 
F26BP, stimulating gluconeogenesis and inhibiting glycolysis 
(e) Increased uptake of fatty acids and glucose; increased oxida- 
tion of both (f) Increased conversion of pyruvate to acetyl-CoA; 
increased fatty acid synthesis. 

17. (a) Given that each particle contains about 55,000 glucose 
residues, the equivalent free glucose concentration would be 
55,000 X O.OljtM = 550 mM, or 0 55 M. This would present a seri- 
ous osmotic challenge for the cell! (Body fluids have a substan- 
tially lower osmolality.) (b) The lower the number of branches, 
the lower the number of free ends available for glycogen phos- 
phorylase activity, and the slower the rate of glucose release. 
With no branches, there would be just one site for phosphorylase 
to act (c) The outer tier of the particle would be too crowded 
with glucose residues for the enzyme to gain access to cleave 
bonds and release glucose (d) The number of chains doubles in 
each succeeding tier: tier 1 has one chain (2°), tier 2 has two 
(2 1 ), tier 3 has four (2 2 ), and so on. Thus, for t tiers, the number 
of chains in the outermost tier, C A , is 2 ( 1 . (e) The total number 
of chains is 2° + 2 1 + 2 2 +. . . 2'" 1 = 2 ( - 1. Each chain contains 
g c glucose molecules, so the total number of glucose molecules, 
Ct, is g c (%t - 1)- (f) Glycogen phosphorylase can release all but 
four of the glucose residues in a chain of length g c . Therefore, 
from each chain in the outer tier it can release Qg c - 4) glucose 
molecules. Given that there are 2 f_1 chains in the outer tier, the 
number of glucose molecules the enzyme can release, G PT , is 

(#c - 4)(2'~ 1 ). (g) The volume of a sphere is ^^ 3 - In this case, 

r is the thickness of one tier times the number of tiers, or 

(G.12g c + 0 35)t nm. Thus V s = ^7rt 3 (0.12g c + 0 35) 3 nm 3 

(h) You can show algebraically that the value of g c that maxi- 
mizes/ is independent of t. Choosing t = 3: 
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The optimum value of g c (i.e., at maximum/) is 13. In nature, g c 
varies from 12 to 14, which corresponds to/ values very close to 
the optimum. If you choose another value for t, the numbers will 
differ but the optimal g c will still be 13. 

Chapter 16 

1. (a) 

(T) Citrate synthase: 

Acetyl-CoA + oxaloacetate + H 2 0 -» citrate + CoA 

©Aconitase: 

Citrate isocitrate 



(3) Isocitrate dehydrogenase: 

Isocitrate + NAD + -h> a-ketoglutarate + C0 2 + NADH 

(4) a-Ketoglutarate dehydrogenase: 

a-Ketoglutarate + NAD + + CoA — » succinyl-CoA + C0 2 + NADH 
© Succinyl-CoA synthetase: 

Succinyl-CoA + P t 4- GDP -» succinate + CoA + GTP 

© Succinate dehydrogenase: 

Succinate + FAD fumarate + FADH 2 

(j)Fumarase: 

Fumarate + H 2 0 malate 

©Malate dehydrogenase: 

Malate + NAD + oxaloacetate + NADH + H + 

(t>), (c) © CoA, condensation; © none, isomerization; 

© NAD + , oxidative decarboxylation; @NAD + , CoA, and 

thiamine pyrophosphate, oxidative decarboxylation; © CoA, 

substrate-level phosphorylation; © FAD, oxidation; © none, 

hydration; © NAD + , oxidation 

(d) Acetyl-CoA + 3NAD + + FAD + GDP + P t + 2H 2 0 -» 

2C0 2 + CoA + 3NADH + FADH 2 + GTP + 2H + 

2. Glucose + 4ADP + 4P* + 10NAD + + 2FAD -> 

4ATP + 10NADH + 2FADH 2 + 6C0 2 

3. (a) Oxidation; methanol — » formaldehyde + [H — H] 

(b) Oxidation; formaldehyde —> formate + [H — H] 

(c) Reduction; C0 2 + [H— H] -> formate + H + 

(d) Reduction; glycerate + H + + [H— H] -> glyceraldehyde + H 2 0 

(e) Oxidation; glycerol -» dihydroxyacetone + [H— H] 

(f) Oxidation; 2H 2 0 + toluene -> benzoate + H + + 3[H— H] 

(g) Oxidation; succinate fumarate + [H — H] 

(h) Oxidation; pyruvate + H 2 0 acetate + C0 2 + [H— H] 

4. From the structural formulas, we see that the carbon-bound H/C 
ratio of hexanoic acid (11/6) is higher than that of glucose (7/6). 
Hexanoic acid is more reduced and yields more energy on 
complete combustion to C0 2 and H 2 0. 

5. (a) Oxidized; ethanol + NAD + -> acetaldehyde + NADH + H + 

(b) Reduced; 1,3-bisphosphoglycerate + NADH + H + -> 

glyceraldehyde 3-phosphate + NAD + + HPOf" 

(c) Unchanged; pyruvate + H + -> acetaldehyde + C0 2 

(d) Oxidized; pyruvate + NAD + acetate + C0 2 + NADH + H + 

(e) Reduced; oxaloacetate + NADH + H + -> malate + NAD 4 " 

(f) Unchanged; acetoacetate + H + -» acetone + C0 2 

6. TPP: thiazolium ring adds to a carbon of pyruvate, then 
stabilizes the resulting carbanion by acting as an electron 
sink Lipoic acid: oxidizes pyruvate to level of acetate 
(acetyl-CoA), and activates acetate as a thioester. CoA-SH: 
activates acetate as thioester. FAD: oxidizes lipoic acid. NAD + : 
oxidizes FAD. 

7. Lack of TPP inhibits pyruvate dehydrogenase; pyruvate 
accumulates 

8. Oxidative decarboxylation; NAD + or NADP + ; a-ketoglutarate 
dehydrogenase 

9. Oxygen consumption is a measure of the activity of the first two 
stages of cellular respiration: glycolysis and the citric acid cycle. 
The addition of oxaloacetate or malate stimulates the citric acid 
cycle and thus stimulates respiration, The added oxaloacetate or 
malate serves a catalytic role, because it is regenerated in the 
latter part of the citric acid cycle. 

10. (a) 5.6 X 10^ 6 (b) 1.1 X 10" 8 m (c) 28 molecules 

11. ADP (or GDP), P b CoA-SH, TPP, NAD + ; not lipoic acid, which is 
covalently attached to the isolated enzymes that use it 

12. The flavin nucleotides, FMN and FAD, would not be synthesized. 
Because FAD is required in the citric acid cycle, flavin deficiency 
would strongly inhibit the cycle. 
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13. Oxaloacetate might be withdrawn for aspartate synthesis 
or for gluconeogenesis. Oxaloacetate is replenished by the 
anaplerotic reactions catalyzed by PEP carboxykinase, PEP 
carboxylase, malic enzyme, or pyruvate carboxylase (see 
Fig. 16-15, p. 632). 

14. The terminal phosphoryl group in GTP can be transferred to 
ADP in a reaction catalyzed by nucleoside diphosphate kinase, 
with an equilibrium constant of 1.0: GTP + ADP GDP + ATP 

15. (a) ~OOC — CH 2 — CH 2 — COO~ (succinate) (b) Malonate is a 
competitive inhibitor of succinate dehydrogenase, (c) A block in 
the citric acid cycle stops NADH formation, which stops electron 
transfer, which stops respiration, (d) A large excess of succinate 
(substrate) overcomes the competitive inhibition. 

16. (a) Add uniformly labeled [ 14 C]glucose and check for the release 
of 14 C0 2 . (b) Equally distributed in C-2 and C-3 of oxaloacetate; 
an infinite number 

17. Oxaloacetate equilibrates with succinate, in which C-l and C-4 
are equivalent. Oxaloacetate derived from succinate is labeled at 
C-l and C-4, and the PEP derived from it has label at C-l, which 
gives rise to C-3 and C-4 of glucose. 

18. (a) C-l (b) C-3 (c) C-3 (d) C-2 (methyl group) (e) C-4 (f) C-4 

(g) Equally distributed in C-2 and C-3 

19. Thiamine is required for the synthesis of TPP, a prosthetic group 
in the pyruvate dehydrogenase and a-ketoglutarate dehydroge- 
nase complexes. A thiamine deficiency reduces the activity of 
these enzyme complexes and causes the observed accumulation 
of precursors. 

20. No. For every two carbons that enter as acetate, two leave the 
cycle as C0 2 ; thus there is no net synthesis of oxaloacetate. Net 
synthesis of oxaloacetate occurs by the carboxylation of pyru- 
vate, an anaplerotic reaction, 

21. Yes, the citric acid cycle would be inhibited Oxaloacetate is 
present at relatively low concentrations in mitochondria, 
and removing it for gluconeogenesis would tend to shift the 
equilibrium for the citrate synthase reaction toward 
oxaloacetate. 

22. (a) Inhibition of aconitase (b) Fluorocitrate; competes with 
citrate; by a large excess of citrate (c) Citrate and fluorocitrate 
are inhibitors of PFK-1. (d) All catabolic processes necessary for 
ATP production are shut down. 

23. Glycolysis: 

Glucose + 2Pi + 2ADP + 2NAD + 

2 pyruvate + 2ATP + 2NADH + 2H + + 2H 2 0 

Pyruvate carboxylase reaction: 
2 Pyruvate + 2C0 2 + 2ATP + 2H 2 0 

2 oxaloacetate + 2 ADP + 2Pi + 4H + 

Malate dehydrogenase reaction: 

2 Oxaloacetate + 2NADH + 2H + 2 L-malate + 2NAD + 
This recycles nicotinamide coenzymes under anaerobic condi- 
tions. The overall reaction is 

Glucose + 2C0 2 2 L-malate + 4H + 
This produces four H + per glucose, increasing the acidity and 
thus the tartness of the wine. 

24. Net reaction: 2 Pyruvate + ATP + 2NAD + + H 2 0 

a-ketoglutarate + C0 2 + ADP + P i + 2NADH + 3H + 

25. The cycle participates in catabolic and anabolic processes. For 
example, it generates ATP by substrate oxidation, but also 
provides precursors for amino acid synthesis (see Fig. 16-15, 
p, 632). 

26. (a) Decreases (b) Increases (c) Decreases 

27. (a) Citrate is produced through the action of citrate synthase on 
oxaloacetate and acetyl-CoA, Citrate synthase can be used for 
net synthesis of citrate when (1) there is a continuous influx of 
new oxaloacetate and acetyl-CoA and (2) isocitrate synthesis is 



restricted, as in a medium low in Fe 3+ . Aconitase requires 
Fe 3+ , so an Fe 3+ -restricted medium restricts the synthesis of 
aconitase 

(b) Sucrose + H 2 0 glucose + fructose 
Glucose + 2Pj + 2ADP + 2NAD + -> 

2 pyruvate + 2ATP + 2NADH + 2H + + 2H 2 0 
Fructose + 2P 2 + 2ADP + 2NAD + 

2 pyruvate + 2ATP + 2NADH + 2H + + 2H 2 0 
2 Pyruvate + 2NAD + + 2CoA 

2 acetyl-CoA + 2NADH + 2H + + 2C0 2 
2 Pyruvate + 2C0 2 + 2ATP + 2H 2 0 

2 oxaloacetate + 2ADP + 2Pi + 4H + 
2 Acetyl-CoA + 2 oxaloacetate + 2H 2 0 2 citrate + 2CoA 
The overall reaction is 

Sucrose + H 2 0 + 2Pj + 2 ADP + 6NAD + 

2 citrate + 2ATP + 6NADH + 10H + 

(c) The overall reaction consumes NAD + . Because the cellular 
pool of this oxidized coenzyme is limited, it must be recycled by 
the electron-transfer chain with consumption of 0 2 . Conse- 
quently, the overall conversion of sucrose to citric acid is an 
aerobic process and requires molecular oxygen. 

28. Succinyl-CoA is an intermediate of the citric acid cycle; its accu- 
mulation signals reduced flux through the cycle, calling for 
reduced entry of acetyl-CoA into the cycle. Citrate synthase, by 
regulating the primary oxidative pathway of the cell, regulates 
the supply of NADH and thus the flow of electrons from NADH 
to 0 2 

29. Fatty acid catabolism increases [acetyl-CoA], which stimulates 
pyruvate carboxylase. The resulting increase in [oxaloacetate] 
stimulates acetyl-CoA consumption by the citric acid cycle, and 
[citrate] rises, inhibiting glycolysis at the level of PFK-1. In 
addition, increased [acetyl-CoA] inhibits the pyruvate dehydro- 
genase complex, slowing the utilization of pyruvate from 
glycolysis. 

30. Oxygen is needed to recycle NAD + from the NADH 
produced by the oxidative reactions of the citric acid cycle. 
Reoxidation of NADH occurs during mitochondrial oxidative 
phosphorylation, 

31. Increased [NADH]/[NAD + ] inhibits the citric acid cycle by mass 
action at the three NAD + -reducing steps; high [NADH] shifts 
equilibrium toward NAD + . 

32. Toward citrate; AG for the citrate synthase reaction under these 
conditions is about -8 kJ/mol. 

33. Steps © and© are essential in the reoxidation of the enzyme's 
reduced lipoamide cofactor. 

34. The citric acid cycle is so central to metabolism that a serious de- 
fect in any cycle enzyme would probably be lethal to the embryo. 

35. The first enzyme in each path is under reciprocal allosteric 
regulation Inhibition of one path shunts isocitrate into the 
other path. 

36. (a) The only reaction in muscle tissue that consumes significant 
amounts of oxygen is cellular respiration, so 0 2 consumption is a 
good proxy for respiration, (b) Freshly prepared muscle tissue 
contains some residual glucose; 0 2 consumption is due to oxida- 
tion of this glucose (c) Yes. Because the amount of 0 2 con- 
sumed increased when citrate or 1-phosphoglycerol was added, 
both can serve as substrate for cellular respiration in this system, 
(d) Experiment I: citrate is causing much more 0 2 consumption 
than would be expected from its complete oxidation. Each mole- 
cule of citrate seems to be acting as though it were more than 
one molecule. The only possible explanation is that each mole- 
cule of citrate functions more than once in the reaction— which 
is how a catalyst operates. Experiment II: the key is to calculate 
the excess 0 2 consumed by each sample compared with the 
control (sample 1) 
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Substrate(s) 




Excess fiL 


Sample 


added 


absorbed 


0 2 consumed 


1 


No extra 


342 


0 


2 


0 3 mL 0.2 M 


757 


415 










3 


0.15 mL 0.02 m citrate 


431 


89 


4 


0.3 mL 0.2 M 








1 -phosphoglycerol 








+ 0 15 mL 0 02 M citrate 


1,385 


1,043 



If both citrate and 1 -phosphoglycerol were simply substrates for 
the reaction, you would expect the excess 0 2 consumption by 
sample 4 to be the sum of the individual excess consumptions by 
samples 2 and 3 (415 /jlL + 89 fxL = 504 /uL). However, the ex- 
cess consumption when both substrates are present is roughly 
twice this amount (1,043 fxh). Thus citrate increases the ability 
of the tissue to metabolize 1 -phosphoglycerol. This behavior is 
typical of a catalyst. Both experiments (I and II) are required to 
make this case convincing. Based on experiment I only, citrate is 
somehow accelerating the reaction, but it is not clear whether it 
acts by helping substrate metabolism or by some other mecha- 
nism. Based on experiment II only, it is not clear which molecule 
is the catalyst, citrate or 1 -phosphoglycerol. Together, the exper- 
iments show that citrate is acting as a "catalyst" for the oxidation 
of 1 -phosphoglycerol. (e) Given that the pathway can consume 
citrate (see sample 3), if citrate is to act as a catalyst it must be 
regenerated. If the set of reactions first consumes then regener- 
ates citrate, it must be a circular rather than a linear pathway, 
(f) When the pathway is blocked at a-ketoglutarate dehydroge- 
nase, citrate is converted to a-ketoglutarate but the pathway 
goes no further. Oxygen is consumed by reoxidation of the 
NADH produced by isocitrate dehydrogenase. 



(g) 



*Citrate^ 



Glucose— ► 7 9 ? 



Oxaloacetate 



a-Ketoglutarate 

\ 

Succinate 



Fumarate 
"Malonate*^ 

This differs from Fig. 16-7 in that it does not include c^s-aconitate 
and isocitrate (between citrate and a-ketoglutarate), or succinyl- 
CoA, or acetyl-CoA. (h) Establishing a quantitative conversion 
was essential to rule out a branched or other, more complex 
pathway. 

Chapter 17 

1. The fatty acid portion; the carbons in fatty acids are more re- 
duced than those in glycerol. 

2. (a) 4.0 x 10 5 kJ (9.6 X 10 4 kcal) (b) 48 days (c) 0.48 lb/day 

3. The first step in fatty acid oxidation is analogous to the conver- 
sion of succinate to fumarate; the second step, to the conversion 
of fumarate to malate; the third step, to the conversion of malate 
to oxaloacetate. 

4. 7 cycles; the last releases 2 acetyl-CoA. 

5. (a) R— COO" + ATP -> acyl-AMP + PP; 

Acyl-AMP + CoA -> acyl-CoA + AMP 

(b) Irreversible hydrolysis of PP, to 2^ by cellular inorganic 
pyrophosphatase 

6. czs-A 3 -dodecanoyl-CoA; it is converted to cw>-A 2 -dodecanoyl- 
CoA, then /3-hydroxydodecanoyl-CoA. 



7. 4 acetyl-CoA and 1 propionyl-CoA 

8. Yes. Some of the tritium is removed from palmitate during the 
dehydrogenation reactions of /3 oxidation. The removed tritium 
appears as tritiated water, 

9. Fatty acyl groups condensed with CoA in the cytosol are first 
transferred to carnitine, releasing CoA, then transported into the 
mitochondrion, where they are again condensed with CoA. The 
cytosolic and mitochondrial pools of CoA are thus kept separate, 
and no radioactive CoA from the cytosolic pool enters the 
mitochondrion 

10. (a) In the pigeon, /3 oxidation predominates; in the pheasant, 
anaerobic glycolysis of glycogen predominates, (b) Pigeon muscle 
would consume more 0 2 . (c) Fat contains more energy per gram 
than glycogen does. In addition, the anaerobic breakdown of glyco- 
gen is limited by the tissue's tolerance to lactate buildup. Thus the 
pigeon, operating on the oxidative catabolism of fats, is the long- 
distance flyer, (d) These enzymes are the regulatory enzymes of 
their respective pathways and thus limit ATP production rates. 

11. Malonyl-CoA would no longer inhibit fatty acid entry into the 
mitochondrion and j8 oxidation, so there might be a futile cycle 
of simultaneous fatty acid synthesis in the cytosol and fatty acid 
breakdown in mitochondria 

12. (a) The carnitine-mediated entry of fatty acids into mitochon- 
dria is the rate-limiting step in fatty acid oxidation. Carnitine de- 
ficiency slows fatty acid oxidation; added carnitine increases the 
rate, (b) All increase the metabolic need for fatty acid oxidation, 
(c) Carnitine deficiency might result from a deficiency of lysine, 
its precursor, or from a defect in one of the enzymes in the 
biosynthesis of carnitine. 

13. Oxidation of fats releases metabolic water; 1.4 L of water per kg 
of tripalmitoylglycerol (ignores the small contribution of glycerol 
to the mass) 

14. The bacteria can be used to completely oxidize hydrocarbons to 
C0 2 and H 2 0. However, contact between hydrocarbons and bacte- 
rial enzymes may be difficult to achieve. Bacterial nutrients such 
as nitrogen and phosphorus may be limiting and inhibit growth. 

15. (a) M r 136; phenylacetic acid (b) Even 

16. Because the mitochondrial pool of CoA is small, CoA must be re- 
cycled from acetyl-CoA via the formation of ketone bodies. This 
allows the operation of the /3-oxidation pathway, necessary for 
energy production. 

17. (a) Glucose yields pyruvate via glycolysis, and pyruvate is the 
main source of oxaloacetate. Without glucose in the diet, 
[oxaloacetate] drops and the citric acid cycle slows, (b) Odd- 
numbered; propionate conversion to succinyl-CoA provides 
intermediates for the citric acid cycle and four-carbon precursors 
for gluconeogenesis 

18. For the odd-chain heptanoic acid, (B oxidation produces propionyl- 
CoA, which can be converted in several steps to oxaloacetate, a 
starting material for gluconeogenesis. The even-chain fatty acid 
cannot support gluconeogenesis, because it is entirely oxidized 
to acetyl-CoA 

19. j8 Oxidation of (w-fluorooleate forms fluoroacetyl-CoA, which en- 
ters the citric acid cycle and produces fluorocitrate, a powerful in- 
hibitor of aconitase. Inhibition of aconitase shuts down the citric 
acid cycle. Without reducing equivalents from the citric acid cycle, 
oxidative phosphorylation (ATP synthesis) is fatally slowed 

20. Ser to Ala: blocks jS oxidation in mitochondria. Ser to Asp: blocks 
fatty acid synthesis, stimulates f3 oxidation. 

21. Response to glucagon or epinephrine would be prolonged, giving 
a greater mobilization of fatty acids in adipocytes. 

22. Enz-FAD, having a more positive standard reduction potential, is a 
better electron acceptor than NAD + , and the reaction is driven in 
the direction of fatty acyl-CoA oxidation. This more favorable equi- 
librium is obtained at the cost of 1 ATP; only 1 .5 ATP are produced 
per FADH 2 oxidized in the respiratory chain (vs. 2.5 per NADH) . 

23. 9 turns; arachidic acid, a 20-carbon saturated fatty acid, yields 10 
molecules of acetyl-CoA, the last two formed in the ninth turn. 
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24. See Fig 17-11. [3- 14 C]Succinyl~CoA is formed, which gives rise 
to oxaioacetate labeled at C-2 and C-3. 

25. Phytanic acid — » pristanic acid — > propionyl-CoA — > — > succinyl- 
CoA -> succinate -> fumarate -» malate All malate carbons 
would be labeled, but C-l and C-4 would have only half as much 
label as C-2 and C-3 

26. ATP hydrolysis in the energy-requiring reactions of a cell takes 
up water in the reaction ATP + H 2 0 -» ADP + P t ; thus, in the 
steady state, there is no net production of H 2 0. 

27. Methylmalonyl-CoA mutase requires the cobalt-containing 
cofactor formed from vitamin B 12 

28. Mass lost per day is about 0.66 kg, or about 140 kg in 7 months, 
Ketosis could be avoided by degradation of non-essential body 
proteins to supply amino acid skeletons for gluconeogenesis. 

29. (a) Fatty acids are converted to their CoA derivatives by enzymes 
in the cytoplasm; the acyl-CoAs are then imported into mitochon- 
dria for oxidation. Given that the researchers were using isolated 
mitochondria, they had to use CoA derivatives (b) Stearoyl-CoA 
was rapidly converted to 9 acetyl-CoA by the jS-oxidation pathway. 
All intermediates reacted rapidly and none were detectable at sig- 
nificant levels, (c) Two rounds. Each round removes two carbon 
atoms, thus two rounds convert an 18-carbon to a 14-carbon fatty 
acid and 2 acetyl-CoA. (d) The K m is higher for the trans isomer 
than for the cis, so a higher concentration of trans isomer is re- 
quired for the same rate of breakdown Roughly speaking, the 
trans isomer binds less well than the cis, probably because differ- 
ences in shape, even though not at the target site for the enzyme, 
affect substrate binding to the enzyme (e) The substrate for 
LCAD/VLCAD builds up differently, depending on the particular 
substrate; this is expected for the rate-limiting step in a pathway, 
(f) The kinetic parameters show that the trans isomer is a poorer 
substrate than the cis for LCAD, but there is little difference for 
VLCAD Because it is a poorer substrate, the trans isomer accu- 
mulates to higher levels than the cis (g) One possible pathway is 
shown below (indicating "inside" and "outside" mitochondria). 

Elaidoyl-CoA ""»"» u^inan-ic kis ( i elaidoyl-camitine "'""M""^ 
(outside) (outside) 

elaidoyl-camitine n > elaidoyl-CoA ^ l '" l " HlM ' l '' ; " >K!:,!l "" ) 

(inside) (inside) 

5-^rans-tetradecenoyl-CoA 1iti "^ it ' r ^' ■ 5-£ra/2s-tetradecanoic acid tliriu '-"" 
(inside) (inside) 



(h) It is correct insofar as trans fats are broken down less effi- 
ciently than cis fats, and thus trans fats may "leak" out of mito- 
chondria. It is incorrect to say that trans fats are not broken down 
by cells; they are broken down, but at a slower rate than cis fats. 

Chapter 18 



(a) "OOC— CH 2 — C— COO" Oxaioacetate 

O 

(b) " OOC— CH 2 — CH 2 — C— COO" a-Ketoglutarate 

O 

(c) CH 3 — C— COO" Pyruvate 




Phenylpyruvate 



2. This is a coupled-reaction assay. The product of the slow 
transamination (pyruvate) is rapidly consumed in the subse- 
quent "indicator reaction" catalyzed by lactate dehydrogenase, 
which consumes NADH„ Thus the rate of disappearance of 
NADH is a measure of the rate of the aminotransferase 
reaction. The indicator reaction is monitored by observing 
the decrease in absorption of NADH at 340 nm with a 
spectrophotometer. 

3. Alanine and glutamine play special roles in the transport of 
amino groups from muscle and from other nonhepatic tissues, 
respectively, to the liver. 

4. No, The nitrogen in alanine can be transferred to oxaioacetate 
via transamination, to form aspartate. 

5. 15 mol of ATP per mole of lactate; 13 mol of ATP per mole of 
alanine, when nitrogen removal is included 

6. (a) Fasting resulted in low blood glucose; subsequent adminis- 
tration of the experimental diet led to rapid catabolism of gluco- 
genic amino acids (b) Oxidative deamination caused the rise in 
NH 3 levels; the absence of arginine (an intermediate in the urea 
cycle) prevented conversion of NH 3 to urea; arginine is not syn- 
thesized in sufficient quantities in the cat to meet the needs 
imposed by the stress of the experiment. This suggests that argi- 
nine is an essential amino acid in the cat's diet, (c) Ornithine is 
converted to arginine by the urea cycle. 

7. H 2 0 + glutamate + NAD 4 " -> 

a-ketoglutarate + NH4 + NADH + H + 
NH4 + 2ATP + H 2 0 + C0 2 -> 

carbamoyl phosphate + 2 ADP + Pi + 3H + 
Carbamoyl phosphate + ornithine -» citrulline + Pi + H + 
Citrulline + aspartate + ATP -> 

argininosuccinate + AMP + PPi + H + 
Argininosuccinate — > arginine + fumarate 
Fumarate + H 2 0 -> malate 
Malate + NAD + -» oxaioacetate + NADH + H + 
Oxaioacetate + glutamate -> aspartate + a-ketoglutarate 
Arginine + H 2 Q -> urea + ornithine 

2 Glutamate + C0 2 + 4H 2 0 + 2NAD + + 3ATP — » 

2 a-ketoglutarate + 2NADH + 7H + + urea + 2 ADP + 

AMP + PPi + 2Pi (1) 
Additional reactions that need to be considered: 
AMP + ATP -> 2ADP (2) 
0 2 + 8H + + 2NADH + 6 ADP + 6Pi -» 

2NAD + + 6ATP + 8H 2 0 (3) 
H 2 0 + PPi -> 2Pi + H + (4) 
Summing equations (1) through (4), 

2 Glutamate + C0 2 + 0 2 + 2 ADP + 2Pi -> 

2 a-ketoglutarate + urea + 3H 2 0 + 2ATP 

8. The second amino group introduced into urea is transferred 
from aspartate, which is generated during the transamination 
of glutamate to oxaioacetate, a reaction catalyzed by aspartate 
aminotransferase. Approximately one-half of all the amino 
groups excreted as urea must pass through the aspartate 
aminotransferase reaction, making this the most highly active 
aminotransferase . 

9. (a) A person on a diet consisting only of protein must use amino 
acids as the principal source of metabolic fuel. Because the ca- 
tabolism of amino acids requires the removal of nitrogen as urea, 
the process consumes abnormally large quantities of water to di- 
lute and excrete the urea in the urine. Furthermore, electrolytes 
in the "liquid protein" must be diluted with water and excreted. 
If the daily water loss through the kidney is not balanced by a 
sufficient water intake, a net loss of body water results. 

(b) When considering the nutritional benefits of protein, one 
must keep in mind the total amount of amino acids needed for 
protein synthesis and the distribution of amino acids in the di- 
etary protein. Gelatin contains a nutritionally unbalanced distri- 
bution of amino acids. As large amounts of gelatin are ingested 



5-£rcms-tetradecanoic acid 
(outside) 
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and the excess amino acids are catabolized, the capacity of the 
urea cycle may be exceeded, leading to ammonia toxicity. This is 
further complicated by the dehydration that may result from ex- 
cretion of large quantities of urea. A combination of these two 
factors could produce coma and death. 

10. Lysine and leucine 

11. (a) Phenylalanine hydroxylase; a low-phenylalanine diet 

(b) The normal route of phenylalanine metabolism via hydroxy- 
lation to tyrosine is blocked, and phenyalanine accumulates, 

(c) Phenylalanine is transformed to phenylpyruvate by 
transamination, and then to phenyllactate by reduction The 
transamination reaction has an equilibrium constant of 1 0, and 
phenylpyruvate is formed in significant amounts when phenylala- 
nine accumulates (d) Because of the deficiency in production of 
tyrosine, which is a precursor of melanin, the pigment normally 
present in hair 

12. Catabolism of the carbon skeletons of valine, methionine, and 
isoleucine is impaired because a functional methylmalonyl-CoA 
mutase (a coenzyme B 12 enzyme) is absent The physiological 
effects of loss of this enzyme are described in Table 18-2 and 
Box 18-2 

13. The vegan diet lacks vitamin B 12 , leading to the increase in 
homocysteine and methylmalonate (reflecting the deficiencies 
in methionine synthase and methylmalonic acid mutase, 
respectively) in individuals on the diet for several years Dairy 
products provide some vitamin B 12 in the lactovegetarian 
diet 

14. The genetic forms of pernicious anemia generally arise as a 
result of defects in the pathway that mediates absorption of 
dietary vitamin B 12 (see Box 17-2, p 658). Because dietary 
supplements are not absorbed in the intestine, these conditions 
are treated by injecting supplementary B 12 directly into the 
bloodstream 

15. The mechanism is identical to that for serine dehydratase (see 
Fig. 18-20a, p 693) except that the extra methyl group of 
threonine is retained, yielding a-ketobutyrate instead of 
pyruvate 

16. (a) 15 NH 2 -CO- 15 NH 2 

(b) ^00 14 C-CH 2 -CH 2 - 14 COO- 

15 NH 

(c) R~NH-C- 15 NH 2 



O 



(d) R— NH— C— 15 NH 2 

(e) No label 



(f) 00 14 C- 



5 NH 2 
I ' 

-C 
I 

H 



CU 9 



4 cocr 



® ® ® ® © © 

17. (a) Isoleucine > II > IV > I > V > III > 

acetyl-CoA + propionyl-CoA (b) Step ©transamination, no 
analogous reaction, PLP; ® oxidative decarboxylation, analogous 
to .the pyruvate dehydrogenase rearf.ion, NAD 1 , TPP r lipoate. FAB; 
(f) d \id a tioi i \ a: tal rs^oi i&ti) tiw succinate de hyi \ r t.Jtfe t ia se rea<: : V t< » i . 
FAD;^4) hydration, analog ms \u Mie J'nmarase reaction, no cnfac- 
tor; ©oxidatioTL. analogous to the maJate dehydrogenase reaction, 
IVAD 4 ;©thiolysis (reverse aldol condensation), analogous to the 
thiolase reaction, CoA. 



18. A likely mechanism is: 



H OH 

I I 

OOC— C C-H 

I I 
+ NH 3 H 

Serine 




PLP 



H m 





PLP 

The formaldehyde (HCHO) produced in the second step reacts 
rapidly with tetrahydrofolate at the enzyme active site to pro- 
duce N 5 , AA 10 -methylenetetrahydrofolate (see Fig. 18-17) 

19. (a) Transamination; no analogies; PLP. (b) Oxidative decarboxy- 
lation; analogous to oxidative decarboxylation of pyruvate to 
acetyl- CoA prior to entry into the citric acid cycle, and of a- 
ketoglutarate to succinyl-CoA in the citric acid cycle; NAD + , 
FAD, lipoate, and TPP. (c) Dehydrogenation (oxidation); analo- 
gous to dehydrogenation of succinate to fumarate in the citric 
acid cycle, and of fatty acyl-CoA to enoyl-CoA in (3 oxidation; 
FAD (d) Carboxylation; no analogies in citric acid cycle or 

(3 oxidation; ATP and biotin (e) Hydration; analogous to hydra- 
tion of fumarate to malate in the citric acid cycle, and of enoyl- 
CoA to 3-hydroxyacyl-CoA in (3 oxidation; no cofactors 
(f) Reverse aldol reaction; analogous to reverse of citrate 
synthase reaction in the citric acid cycle; no cofactors 

20. (a) Leucine; valine; isoleucine (b) Cysteine (derived from cys- 
tine). If cysteine were decarboxylated as shown in Fig. 18-6, it 
would yield H 3 N + — CH 2 — CH 2 — SH, which could be oxidized to 
taurine (c) The January 1957 blood shows significantly elevated 
levels of isoleucine, leucine, methionine, and valine; the January 
1957 urine, significantly elevated isoleucine, leucine, taurine, and 
valine, (d) All patients had high levels of isoleucine, leucine, and 
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valine in both blood and urine, suggesting a defect in the break- 
down of these amino acids. Given that the urine also contained 
high levels of the keto forms of these three amino acids, the 
block in the pathway must occur after deamination but before 
dehydrogenation (as shown in Fig 18-28). (e) The model does 
not explain the high levels of methionine in blood and taurine in 
urine The high taurine levels may be due to the death of brain 
cells during the end stage of the disease. However, the reason for 
high levels of methionine in blood are unclear; the pathway of 
methionine degradation is not linked with the degradation of 
branched-chain amino acids. Increased methionine could be a 
secondary effect of buildup of the other amino acids, It is impor- 
tant to keep in mind that the January 1957 samples were from an 
individual who was dying, so comparing blood and urine results 
with those of a healthy individual may not be appropriate 
(f) The following information is needed (and was eventually 
obtained by other workers): (1) The dehydrogenase activity is 
significantly reduced or missing in individuals with maple syrup 
urine disease. (2) The disease is inherited as a single-gene 
defect. (3) The defect occurs in a gene encoding all or part of 
the dehydrogenase. (4) The genetic defect leads to production 
of inactive enzyme 

Chapter 19 

1. Reaction (1); (a), (d) NADH; (b), (e) E-FMN; (c) NAD+/NADH 
and E-FMN/FMNH 2 

Reaction (2): (a), (d) E-FMNH 2 ; (b), (e) Fe 3+ ; (c) E-FMN/FMNH 2 
and Fe 3+ /Fe 2+ 

Reaction (3): (a), (d) Fe 2+ ; (b), (e) Q; (c) Fe 3+ /Fe 2+ and 
Q/QH 2 

2. The side chain makes ubiquinone soluble in lipids and allows it to 
diffuse in the semifluid membrane, 

3. From the difference in standard reduction potential (A£"°) for 
each pair of half-reactions, one can calculate AG'° The oxidation 
of succinate by FAD is favored by the negative standard free- 
energy change (AG'° =-3 7 kJ/mol) Oxidation by NAD + would 
require a large, positive, standard free-energy change (AG' 0 = 
68 kJ/mol) 

4. (a) All carriers reduced; CN~ blocks the reduction of 0 2 cat- 
alyzed by cytochrome oxidase, (b) All carriers reduced; in the 
absence of 0 2 , the reduced carriers are not reoxidized. (c) All 
carriers oxidized, (d) Early carriers more reduced; later carriers 
more oxidized, 

5. (a) Inhibition of NADH dehydrogenase by rotenone decreases 
the rate of electron flow through the respiratory chain, which in 
turn decreases the rate of ATP production. If this reduced rate is 
unable to meet the organism's ATP requirements, the organism 
dies (b) Antimycin A strongly inhibits the oxidation of Q in the 
respiratory chain, reducing the rate of electron transfer and lead- 
ing to the consequences described in (a), (c) Because antimycin 
A blocks all electron flow to oxygen, it is a more potent poison 
than rotenone, which blocks electron flow from NADH but not 
from FADH 2 

6. (a) The rate of electron transfer necessary to meet the ATP de- 
mand increases, and thus the P/O ratio decreases, (b) High con- 
centrations of uncoupler produce P/O ratios near zero. The P/O 
ratio decreases, and more fuel must be oxidized to generate the 
same amount of ATP. The extra heat released by this oxidation 
raises the body temperature, (c) Increased activity of the respi- 
ratory chain in the presence of an uncoupler requires the degra- 
dation of additional fuel. By oxidizing more fuel (including fat 
reserves) to produce the same amount of ATP, the body loses 
weight. When the P/O ratio approaches zero, the lack of ATP 
results in death. 

7. Valinomycin acts as an uncoupler. It combines with K + to form a 
complex that passes through the inner mitochondrial membrane, 
dissipating the membrane potential. ATP synthesis decreases, 



which causes the rate of electron transfer to increase. This re- 
sults in an increase in the H + gradient, 0 2 consumption, and 
amount of heat released . 

8. (a) The formation of ATP is inhibited, (b) The formation of 
ATP is tightly coupled to electron transfer: 2,4-dinitrophenol is 
an uncoupler of oxidative phosphorylation, (c) Oligomycin 

9. Cytosolic malate dehydrogenase plays a key role in the transport 
of reducing equivalents across the inner mitochondrial mem- 
brane via the malate -aspartate shuttle. 

10. (a) Glycolysis becomes anaerobic, (b) Oxygen consumption 
ceases (c) Lactate formation increases, (d) ATP synthesis de- 
creases to 2 ATP/glucose, 

11. The steady-state concentration of Pi in the cell is much higher 
than that of ADP. The P t released by ATP hydrolysis changes 
total [PJ very little. 

12. The response to (a) increased [ADP] is faster because the re- 
sponse to (b) reduced p0 2 requires protein synthesis 

13. (a) NADH is reoxidized via electron transfer instead of lactic 
acid fermentation, (b) Oxidative phosphorylation is more effi- 
cient, (c) The high mass-action ratio of the ATP system inhibits 
phosphofructokinase- 1 t 

14. Fermentation to ethanol could be accomplished in the presence 
of 0 2 , which is an advantage because strict anaerobic conditions 
are difficult to maintain. The Pasteur effect is not observed, since 
the citric acid cycle and electron- transfer chain are inactive. 

15. More-efficient electron transfer between complexes. 

16. (a) External medium: 4.0 X 10" 8 M; matrix: 2,0 X 10~ 8 M 

(b) [H + ] gradient contributes 1.7 kJ/mol toward ATP synthesis 

(c) 21 (d) No (e) From the overall transmembrane potential 

17. (a) 0.91 jamol/s • g (b) 5.5 s; to provide a constant level of ATP, 
regulation of ATP production must be tight and rapid. 

18. 53 jitmol/s ■ g, With a steady state [ATP] of 7 0 /Ainol/g, this is 
equivalent to 10 turnovers of the ATP pool per second; the 
reservoir would last about 0 13 s 

19. Reactive oxygen species react with macromolecules, including 
DNA If a mitochondrial defect leads to increased production of 
ROS, the nuclear genes that encode proto-oncogenes (pp, 473, 
474) can be damaged, producing oncogenes and leading to un- 
regulated cell division and cancer. 

20. Different extents of heteroplasmy for the defective gene produce 
different degrees of defective mitochondrial function. 

21. The inner mitochondrial membrane is impermeable to NADH, 
but the reducing equivalents of NADH are transferred (shuttled) 
through the membrane indirectly: they are transferred to oxalo- 
acetate in the cytosol, the resulting malate is transported into 
the matrix, and mitochondrial NAD + is reduced to NADH. 

22. The citric acid cycle is stalled for lack of an acceptor of electrons 
from NADH Pyruvate produced by glycolysis cannot enter the 
cycle as acetyl-CoA; accumulated pyruvate is transaminated to 
alanine and exported to the liver. 

23. Pyruvate dehydrogenase is located in mitochondria; glyceralde- 
hyde 3-phosphate dehydrogenase in the cytosol. The NAD pools 
are separated by the inner mitochondrial membrane. 

24. Complete lack of glucokinase (two defective alleles) makes it im- 
possible to carry out glycolysis at a sufficient rate to raise [ATP] 
to the threshold required for insulin secretion. 

25. Defects in Complex II result in increased production of ROS, 
damage to DNA, and mutations that lead to unregulated cell divi- 
sion (cancer). It is not clear why the cancer tends to occur in the 
midgut. 

26. For the maximum photosynthetic rate, PSI (which absorbs light 
of 700 nm) and PSII (which absorbs light of 680 nm) must be 
operating simultaneously 

27. The extra weight comes from the water consumed in the overall 
reaction. 
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28. Purple sulfur bacteria use H 2 S as the hydrogen donor in photo- 
synthesis. No 0 2 is evolved, because the single photosystem 
lacks the manganese-containing water-splitting complex. 

29. 0.44 

30. (a) Stops (b) Slows; some electron flow continues by the cyclic 
pathway. 

31. During illumination, a proton gradient is established- When ADP 
and Pj are added, ATP synthesis is driven by the gradient, which 
becomes exhausted in the absence of light. 

32. DCMU blocks electron transfer between PSII and the first site of 
ATP production. 

33. In the presence of venturicidin, proton movement through the 
CF 0 CF! complex is blocked, and electron flow (oxygen evolu- 
tion) continues only until the free energy cost of pumping pro- 
tons against the rising proton gradient equals the free energy 
available in a photon. DNP, by dissipating the proton gradient, re- 
stores electron flow and oxygen evolution. 

34. (a) 56 kJ/mol (b) 0.29 V 

35. From the difference in reduction potentials, one can calculate 
that At7'° = 15 kJ/mol for the redox reaction. Figure 19-46 shows 
that the energy of photons in any region of the visible spectrum is 
more than sufficient to drive this endergonic reaction. 

36. 1.35 X 10~ 77 ; the reaction is highly unfavorable! In chloroplasts, 
the input of light energy overcomes this barrier. 

37. -920kJ/mol 

38. No. The electrons from H 2 0 flow to the artificial electron accep- 
tor Fe 3+ ,not to NADP + 

39. About once every 0.1 s; 1 in 10 8 is excited, 

40. Light of 700 nm excites PSI but not PSII; electrons flow from 
P700 to NADP + , but no electrons flow from P680 to replace 
them, When light of 680 nm excites PSII, electrons tend to flow 
to PSI, but the electron carriers between the two photosystems 
quickly become completely reduced. 

41. No The excited electron from P700 returns to refill the electron 
"hole" created by illumination. PSII is not needed to supply elec- 
trons, and no 0 2 is evolved from H 2 0. NADPH is not formed, 
because the excited electron returns to P700. 

42. (a) (1) The presence of Mg 2+ supports the hypothesis that 
chlorophyll is directly involved in catalysis of the phosphoryla- 
tion reaction: ADP + P 2 -» ATP (2) Many enzymes (or other pro- 
teins) that contain Mg 2+ are not phosphorylating enzymes, so 
the presence of Mg 2+ in chlorophyll does not prove its role in 
phosphorylation reactions. (3) The presence of Mg 2+ is essential 
to chlorophyll's photochemical properties: light absorption and 
electron transfer, (b) (1) Enzymes catalyze reversible reactions, 
so an isolated enzyme that can, under certain laboratory condi- 
tions, catalyze removal of a phosphoryl group could probably, un- 
der different conditions (such as in cells), catalyze addition of a 
phosphoryl group. So it is plausible that chlorophyll could be in- 
volved in the phosphorylation of ADP. (2) There are two possible 
explanations: the chlorophyll protein is a phosphatase only and 
does not catalyze ADP phosphorylation under cellular condi- 
tions, or the crude preparation contains a contaminating phos- 
phatase activity that is unconnected to the photosynthetic 
reactions. (3) It is likely that the preparation was contaminated 
with a nonphotosynthetic phosphatase activity (c) (1) This light 
inhibition is what one would expect if the chlorophyll protein 
catalyzed the reaction ADP + Pj + light -> ATP Without light, 
the reverse reaction, a dephosphorylation, would be favored. In 
the presence of light, energy is provided and the equilibrium 
would shift to the right, reducing the phosphatase activity (2) 
This inhibition must be an artifact of the isolation or assay meth- 
ods (3) It is unlikely that the crude preparation methods in use 
at the time preserved intact chloroplast membranes, so the inhi- 
bition must be an artifact, (d) (1) In the presence of light, ATP 

is synthesized and other phosphorylated intermediates are con- 



sumed. (2) In the presence of light, glucose is produced and is 
metabolized by cellular respiration to produce ATP, with changes 
in the levels of phosphorylated intermediates. (3) In the pres- 
ence of light, ATP is produced and other phosphorylated inter- 
mediates are consumed, (e) Light energy is used to produce 
ATP (as in the Emerson model) and is used to produce reducing 
power (as in the Rabinowitch model) , (f ) The approximate stoi- 
chiometry for photophosphorylation (Chapter 19) is that 8 pho- 
tons yield 2 NADPH and about 3 ATP Two NADPH and 3 ATP 
are required to reduce 1 C0 2 (Chapter 20). Thus, at a minimum, 
8 photons are required per C0 2 molecule reduced. This is in 
good agreement with Rabinowitch's value, (g) Because the en- 
ergy of fight is used to produce both ATP and NADPH, each pho- 
ton absorbed contributes more than just 1 ATP for 
photosynthesis. The process of energy extraction from light is 
more efficient than Rabinowitch supposed, and plenty of energy 
is available for this process — even with red light. 

Chapter 20 

1. Within subcellular organelles, concentrations of specific enzymes 
and metabolites are elevated; separate pools of cofactors and in- 
termediates are maintained; regulatory mechanisms affect only 
one set of enzymes and pools. 

2. This observation suggests that ATP and NADPH are generated in 
the light and are essential for C0 2 fixation; conversion stops as 
the supply of ATP and NADPH becomes exhausted. Further- 
more, some enzymes are switched off in the dark. 

3. X is 3-phosphoglycerate; Y is ribulose 1,5-bisphosphate. 

4. Ribulose 5-phosphate kinase, fructose 1,6-bisphosphatase, 
sedoheptulose 1,7-bisphosphatase, and glyceraldehyde 3-phos- 
phate dehydrogenase; all are activated by reduction of a critical 
disulfide bond to a pair of sulfhydryls; iodoacetate reacts irre- 
versibly with free sulfhydryls. 

5. To carry out the disulfide exchange reaction that activates the 
Calvin cycle enzymes, thioredoxin needs both of its sulfhydryl 
groups. 

6. Reductive pentose phosphate pathway regenerates ribulose 

1, 5-bisphosphate from triose phosphates produced during photo- 
synthesis. Oxidative pentose phosphate pathway provides 
NADPH for reductive biosynthesis and pentose phosphates for 
nucleotide synthesis. 

7. Both types of "respiration" occur in plants, consume 0 2 , and pro- 
duce C0 2 . (Mitochondrial respiration also occurs in animals,) 
Mitochondrial respiration occurs continuously; electrons derived 
from various fuels are passed through a chain of carriers in the 
inner mitochondrial membrane to 0 2 . Photorespiration occurs in 
chloroplasts, peroxisomes, and mitochondria. Photorespiration 
occurs during the daytime, when photosynthetic carbon fixation 
is occurring; mitochondrial respiration occurs primarily at night, 
or during cloudy days. The path of electron flow in photorespira- 
tion is shown in Fig. 20-21; that for mitochondrial respiration, in 
Fig. 19-16 

8. This hypothesis assumes directed evolution, or evolution with a 
purpose — ideas not generally accepted by evolutionary biolo- 
gists. Other processes, such as burning fossil fuels and global de- 
forestation, affect the global atmospheric composition, C 4 plants, 
by fixing C0 2 under conditions when rubisco prefers 0 2 as sub- 
strate, also contributes to setting atmospheric C0 2 /0 2 ratios. 

9. (a) Without production of NADPH by the pentose phosphate 
pathway, cells would be unable to synthesize lipids and other 
reduced products, (b) Without generation of ribulose 
1,5-bisphosphate, the Calvin cycle is effectively blocked. 

10. In maize, C0 2 is fixed by the C 4 pathway elucidated by Hatch 
and Slack, in which PEP is carboxylated rapidly to oxaloacetate 
(some of which undergoes transamination to aspartate) and 
reduced to malate. Only after subsequent decarboxylation does 
the C0 2 enter the Calvin cycle. 
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11. Measure the rate of fixation of 14 C carbon dioxide in the light (day- 
time) and the dark. Greater fixation in the dark identifies the CAM 
plant One could also determine the titratable acidity; acids stored 
in the vacuole during the night can be quantified in this way. 

12. Isocitrate dehydrogenase reaction 

13. Storage consumes 1 mol of ATP per mole of glucose 6-phosphate; 
this represents 3.3% of the total ATP available from glucose 
6-phosphate metabolism (i.e., the efficiency of storage is 96.7%). 

14. [PPJ is high in the cytosol because the cytosol lacks inorganic 
pyrophosphatase . 

15. (a) Low [Pi] in the cytosol and high [triose phosphate] in the 
chloroplast (b) High [triose phosphate] in the cytosol 

16. 3-Phosphoglycerate is the primary product of photosynthesis; [P,] 
rises when light-driven synthesis of ATP from ADP and slows 

17. (a) Sucrose + (glucose) n -> (glucose) n+1 + fructose 

(b) Fructose generated in the synthesis of dextran is readily 
imported and metabolized by the bacteria. 

18. Species 1 is C 4 ; species 2, C 3 . 

19. (a) In peripheral chloroplasts (b) and (c) In central sphere 

20. (a) By analogy to the oxygenic photosynthesis carried out by 
plants (H 2 0 + C0 2 -> glucose + 0 2 ), the reaction would be H 2 S + 
0 2 + C0 2 -» glucose + H 2 0 + S. This is the sum of the reduc- 
tion of C0 2 by H 2 S (H 2 S + C0 2 -> glucose + S) and the energy 
input (H 2 S + 0 2 -> S + H 2 0), (b) The H 2 S and C0 2 are pro- 
duced chemically in deep-sea sediments, but the 0 2 , like the 
vast majority of 0 2 on Earth, is produced by photosynthesis, 
which is driven by light energy, (c) In Robinson et al.'s assay, 
3 H labels the C-l of ribulose 1,5-bisphosphate, so reaction with 
C0 2 yields one molecule of [ 3 H]3-phosphoglycerate and one 
molecule of unlabeled 3-phosphoglycerate; reaction with 0 2 
produces one molecule of [ 3 H]2-phosphoglycolate and one mole- 
cule of unlabeled 3-phosphoglycerate. Thus the ratio of [ 3 H]3- 
phosphoglycerate to [ 3 H]2-phosphoglycolate equals the ratio of 
carboxylation to oxygenation, (d) If the 3 H labeled C-5, both 
oxygenation and carboxylation would yield [ 3 H] 3-phosphoglycer- 
ate and it would be impossible to distinguish which reaction had 
produced the labeled product; the reaction could not be used to 
measure H. 

[C0 2 ] 0.00038 



(e) Substituting 



[0 2 



0.2 



0.0019 into 



V, 



carboxylation [C0 2 ] 

= 12 , gives 



carboxylation 



[0 2 ] 



(8.6X0.0019) = 0.016 



' oxygenation 

Therefore, the rate of oxygenation would be roughly 60 times the 
rate of carboxylation! (f) If terrestrial plants had fl = 8,6, car- 
boxylation would occur at a much lower rate than oxygenation. 
This would be extremely inefficient, so one would expect the ru- 
bisco of terrestrial plants to have an H substantially higher than 
8.6. In fact, fl values for land plants vary between 10 and 250. 
Even with these values, the expected rate of the oxygenation re- 
action is still very high, (g) The rubisco reaction occurs with 
C0 2 as a gas. At the same temperature, 13 C0 2 molecules diffuse 
more slowly than the lighter 12 C0 2 molecules, and thus 13 C0 2 
will enter the active site (and become incorporated into sub- 
strate) more slowly than 12 C0 2 . (h) For the relationship to be 
truly symbiotic, the tube worms must be getting a substantial 
amount of their carbon from the bacteria. The presence of rubisco 
in the endosymbionts simply shows that they are capable of 
chemosynthesis, not that they are supplying the host with a signifi- 
cant fraction of its carbon. On the other hand, showing that the 
13 C: 12 C ratio in the host is more similar to that in the endosymbiont 
than that in other marine animals strongly suggests that the tube 
worms are getting the majority of their carbon from the bacteria. 



Chapter 21 

1. (a) The 16 carbons of palmitate are derived from 8 acetyl groups 
of 8 acetyl-CoA molecules. The 14 C-labeled acetyl-CoA gives rise 
to malonyl-CoA labeled at C-l and C-2. (b) The metabolic pool 
of malonyl-CoA, the source of all palmitate carbons except C-l 6 
and C-15, does not become labeled with small amounts of Re- 
labeled acetyl-CoA. Hence, only [15,16- I4 C] palmitate is formed. 

2. Both glucose and fructose are degraded to pyruvate in glycolysis. 
Pyruvate is converted to acetyl-CoA by the pyruvate dehydroge- 
nase complex. Some of this acetyl-CoA enters the citric acid cy- 
cle, which produces reducing equivalents (NADH and NADPH). 
Mitochondrial electron transfer to 0 2 yields ATP. 

3. 8 Acetyl-CoA + 15 ATP + 14NADPH + 9H 2 0 -> 

palmitate + 8CoA + 15ADP + 15Pi + 14NADP + + 2H + 

4. (a) 3 deuteriums per palmitate; all located on C-16; all other 
two-carbon units are derived from unlabeled malonyl-CoA. 
(b) 7 deuteriums per palmitate; located on all e^en-numbered 
carbons except C-16. 

5. By using the three-carbon unit malonyl-CoA, the activated form 
of acetyl-CoA (recall that malonyl-CoA synthesis requires ATP), 
metabolism is driven in the direction of fatty acid synthesis by 
the exergonic release of C0 2 . 

6. The rate-limiting step in fatty acid synthesis is carboxylation of 
acetyl-CoA, catalyzed by acetyl-CoA carboxylase. High [citrate] 
and [isocitrate] indicate that conditions are favorable for fatty 
acid synthesis: an active citric acid cycle is providing a plentiful 
supply of ATP, reduced pyridine nucleotides, and acetyl-CoA. 
Citrate stimulates (increases the F max of) acetyl-CoA carboxy- 
lase (a). Because citrate binds more tightly to the filamentous 
form of the enzyme (the active form), high [citrate] drives the 
protomer filament equilibrium in the direction of the 
active form (b). In contrast, palmitoyl-CoA (the end product 

of fatty acid synthesis) drives the equilibrium in the direction of 
the inactive (protomer) form. Hence, when the end product 
of fatty acid synthesis accumulates, the biosynthetic path 
slows. 



ATP + CoA (cyt) -> acetyl-CoA (cyt) 4- ADP - 



7. (a) Acetyl-CoA Cmi0 
Pi + CoA Cmit) (b) 1 ATP per acetyl group (c) Yes 

8. The double bond in palmitoleate is introduced by an oxidation 
catalyzed by fatty acyl-CoA desaturase, a mixed-function oxi- 
dase that requires 0 2 as a cosubstrate, 

9. 3 Palmitate + glycerol + 7ATP + 4H 2 0 -> 

tripalmitin + 7 ADP + 7P t + 7H + 

10. In adult rats, stored triacylglycerols are maintained at a steady- 
state level through a balance of the rates of degradation and 
biosynthesis. Hence, the triacylglycerols of adipose (fat) tissue 
are constantly turned over, which explains the incorporation of 
14 C label from dietary glucose. 

11. Net reaction: 

Dihydroxyacetone phosphate + NADH + palmitate + oleate 
+ 3ATP + CTP + choline + 4H 2 0 -> 

phosphatidylcholine + NAD + + 2 AMP + ADP + H + + CMP + 5P* 
7 ATP per molecule of phosphatidylcholine 

12. Methionine deficiency reduces the level of adoMet, which is re- 
quired for the de novo synthesis of phosphatidylcholine. The sal- 
vage pathway does not employ adoMet, but uses available 
choline. Thus phosphatidylcholine can be synthesized even 
when the diet is deficient in methionine, as long as choline is 
available. 

13. 14 C label appears in three places in the activated isoprene: 



4 CH 2 

X C- 14 CH 2 
4 CH, 



-CH 2 
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14. (a) ATP (b) UDP-glucose (c) CDP-ethanolamine (d) UDP- 
galactose (e) Fatty acyl-CoA (f) S-Adenosylmethionine 
(g) Malonyl-CoA (h) A 3 -isopentenyl pyrophosphate 

15. Linoleate is required in the synthesis of prostaglandins Animals 
are unable to transform oleate to linoleate, so linoleate is an essen- 
tial fatty acid. Plants can transform oleate to linoleate, and they 
provide animals with the required linoleate (see Fig, 21-12). 

16. The rate-determining step in the biosynthesis of cholesterol is the 
synthesis of mevalonate, catalyzed by hydroxymethylglutaryl-CoA 
reductase. This enzyme is allosterically regulated by mevalonate 
and cholesterol derivatives. High intracellular [cholesterol] also 
reduces transcription of the gene encoding HMG-CoA reductase, 

17. When cholesterol levels decline because of treatment with a 
statin, cells attempt to compensate by increasing expression of 
the gene encoding HMG-CoA reductase. The statins are good 
competitive inhibitors of HMG-CoA reductase activity nonethe- 
less and reduce overall production of cholesterol 

18. Note: There are several plausible alternatives that a student 
might propose in the absence of a detailed knowledge of the lit- 
erature on this enzyme. Thiolase reaction: begins with nucle- 
ophilic attack of an active-site Cys residue on the first acetyl-CoA 
substrate, displacing — S-CoA and forming a covalent thioester 
link between Cys and the acetyl group. A base on the enzyme 
then extracts a proton from the methyl group of the second 
acetyl-CoA, leaving a carbanion that attacks the carbonyl carbon 
of the thioester formed in the first step. The sulfhydryl of the 
Cys residue is displaced, creating the product acetoacetyl-CoA. 
HMG-CoA synthase reaction: begins in the same way, with a co- 
valent thioester link formed between the enzyme's Cys residue 
and the acetyl group of acetyl-CoA, with displacement of the 

— S-CoA. The — S-CoA dissociates as CoA-SH, and acetoacetyl- 
CoA binds to the enzyme. A proton is abstracted from the methyl 
group of the enzyme-linked acetyl, forming a carbanion that at- 
tacks the ketone carbonyl of the acetoacetyl-CoA substrate. The 
carbonyl is converted to a hydroxyl ion in this reaction, and this 
is protonated to create —OH The thioester link with the enzyme 
is then cleaved hydrolytically to generate the HMG-CoA product, 
HMG-CoA reductase reaction- two successive hydride ions de- 
rived from NADPH first displace the — S-CoA, and then reduce 
the aldehyde to a hydroxyl group. 

19. Statins inhibit HMG-CoA reductase, an enzyme in the pathway 
to the synthesis of activated isoprenes, which are precursors of 
cholesterol and a wide range of isoprenoids, including coenzyme 
Q (ubiquinone). Hence, statins might reduce the levels of coen- 
zyme Q available for mitochondrial respiration. Ubiquinone is ob- 
tained in the diet as well as by direct biosynthesis, but it is not 
yet clear how much is required and how well dietary sources can 
substitute for reduced synthesis. Reductions in the levels of par- 
ticular isoprenoids may account for some side effects of statins 

20. (a) 




Lycopene (C-40) 

bend ends around for cyclization 




HO 




Astaxanthin 

(b) Head-to-head. There are two ways to look at this. First, the 
"tail" of geranylgeranyl pyrophosphate has a branched dimethyl 
structure, as do both ends of phytoene. Second, no free —OH is 
formed by the release of PP i; indicating that the two —0—®~® 
"heads" are linked to form phytoene. (c) Four rounds of dehy- 
drogenation convert four single bonds to double bonds, (d) No. 
A count of single and double bonds in the reaction below shows 
that one double bond is replaced by two single bonds — so, there 
is no net oxidation or reduction. 



^-Carotene (C-40) 



(e) Steps® through®. The enzyme can convert IPP and 
DMAP to geranylgeranyl pyrophosphate, but catalyzes no further 
reactions in the pathway, as confirmed by results with the other 
substrates, (f) Strains 1 through 4 lack crtE and have much 
lower astaxanthin production than strains 5 through 8, all of 
which overexpress crtE. Thus, overexpression of crtE leads to a 
substantial increase in astaxanthin production Wild-type E. coli 
has some step (3) activity, but this conversion of farnesyl py- 
rophosphate to geranylgeranyl pyrophosphate is strongly rate- 
limiting, (g) IPP isomerase. Comparing strains 5 and 6 shows 
that adding ispA, which catalyzes steps ® and © , has little ef- 
fect on astaxanthin production, so these steps are not rate-limit- 
ing. However, comparing strains 5 and 7 shows that adding idi 
substantially increases astaxanthin production, so IPP isomerase 
must be the rate-limiting step when crtE is overexpressed. 
(h) A low (+) level, comparable to that of strains 5, 6, and 9. 
Without overexpression of idi, production of astaxanthin is 
limited by low IPP isomerase activity and the resulting limited 
supply of IPP. 

Chapter 22 

1. In their symbiotic relationship with the plant, bacteria supply 
ammonium ion by reducing atmospheric nitrogen, which re- 
quires large quantities of ATP. 

2. The transfer of nitrogen from NH 3 to carbon skeletons can be 
catalyzed by (1) glutamine synthetase and (2) glutamate dehy- 
drogenase. The latter enzyme produces glutamate, the amino 
group donor in all transamination reactions, necessary to the for- 
mation of amino acids for protein synthesis. 

3. A link between enzyme-bound PLP and the phosphohomoserine 
substrate is first formed, with rearrangement to generate the 
ketimine at the a carbon of the substrate , This activates the 

/3 carbon for proton abstraction, leading to displacement of the 
phosphate and formation of a double bond between the jS and y 
carbons. A rearrangement (beginning with proton abstraction at 
the pyridoxal carbon adjacent to the substrate amino nitrogen) 
moves the double bond between the a and /3 carbons, and con- 
verts the ketimine to the aldimine form of PLP. Attack of water 
at the (3 carbon is then facilitated by the linked pyridoxal, fol- 
lowed by hydrolysis of the imine link between PLP and the prod- 
uct, to generate threonine. 

4. In the mammalian route, toxic ammonium ions are transformed 
to glutamine, reducing toxic effects on the brain. 

5. Glucose + 2C0 2 + 2NH 3 -> 2 aspartate + 2H + + 2H 2 0 

6. The amino-terminal glutaminase domain is quite similar in all 
glutamine amidotransferases A drug that targeted this active 
site would probably inhibit many enzymes and thus be prone 
to producing many more side effects than a more specific 
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inhibitor targeting the unique carboxyl-terminal synthetase 
active site. 

7. If phenylalanine hydroxylase is defective, the biosynthetic route 
to tyrosine is blocked and tyrosine must be obtained from the 
diet 

8. In adoMet synthesis, triphosphate is released from ATP. Hydroly- 
sis of the triphosphate renders the reaction thermodynamically 
more favorable. 

9. If the inhibition of glutamine synthase were not concerted, satu- 
rating concentrations of histidine would shut down the enzyme 
and cut off production of glutamine, which the bacterium needs 
to synthesize other products. 

Folic acid is a precursor of tetrahydrofolate (Fig 18-16), re- 
quired in the biosynthesis of glycine (Fig. 22-12), a precursor of 
porphyrins. A folic acid deficiency therefore impairs hemoglobin 
synthesis. 

11. For glycine auxotrophs: adenine and guanine ; for glutamine 
auxotrophs: adenine, guanine, and cytosine;/or aspartate 
auxotrophs: adenine, guanine, cytosine, and uridine, 
(a) See Figure 18-6, step (2) , for the reaction mechanism of 
amino acid racemization. The F atom of fluoroalanine is an excel- 
lent leaving group. Fluoroalanine causes irreversible (covalent) 
inhibition of alanine racemase. One plausible mechanism is (Nuc 
denotes any nucleophilic amino acid side chain in the enzyme 
active site) : 



10 



12 



H 



-OOC-C-CH 2 -F 



HO 




OOC 



HO 



HB Nuc 

00C-C=CH 2 

I 

N 

II 

CH 

CH 3 N 



H Nuc 

I I 
-C-CH 2 
I 



CH 




(b) Azaserine (see Fig. 22-48) is an analog of glutamine. The 
diazoacetyl group is highly reactive and forms covalent bonds 
with nucleophiles at the active site of a glutamine 
amido transferase . 

13. (a) As shown in Figure 18-16, p-aminobenzoate is a component 
of A^^ 10 -methylenetetrahydrofolate, the cofactor involved in 
the transfer of one-carbon units, (b) In the presence of sulfanil- 
amide, a structural analog of p-aminobenzoate, bacteria are un- 
able to synthesize tetrahydrofolate, a cofactor necessary for 
converting AICAR to FAICAR; thus AICAR accumulates, (c) The 
competitive inhibition by sulfanilamide of the enzyme involved 
in tetrahydrofolate biosynthesis is overcome by the addition of 
excess substrate (^-airiinobenzoate). 

14. The 14 C-labeled orotate arises from the following pathway (the 
first three steps are part of the citric acid cycle): 



-OO u C- 14 CH 2 - 14 CH 2 - 14 COO- 
Succinate 



00 14 C H 

\ / 

14C = 14 C 

/ \ 

H 14 COO- 
Fumarate 



OH H 

I I 
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I I 

H H 

Malate 
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15. Organisms do not store nucleotides to be used as fuel, and they do 
not completely degrade them, but rather hydrolyze them to re- 
lease the bases, which can be recovered in salvage pathways. The 
low C:N ratio of nucleotides makes them poor sources of energy. 

16. Treatment with allopurinol has two consequences. (1) It inhibits 
conversion of hypoxanthine to uric acid, causing accumulation of 
hypoxanthine, which is more soluble and more readily excreted; 
this alleviates the clinical problems associated with AMP degrada- 
tion, (2) It inhibits conversion of guanine to uric acid, causing accu- 
mulation of xanthine, which is less soluble than uric acid; this is the 
source of xanthine stones. Because the amount of GMP degrada- 
tion is low relative to AMP degradation, the kidney damage caused 
by xanthine stones is less than that caused by untreated gout. 

17. 5-Phosphoribosyl-l -pyrophosphate; this is the first NH 3 acceptor 
in the purine biosynthetic pathway. 

(a) The a-carboxyl group is removed and an —OH is added to 
the y carbon (b) Btrl has sequence homology with acyl carrier 
proteins. The molecular weight of Btrl increases when incubated 
under conditions in which CoA could be added to the protein. 
Adding CoA to a Ser residue would replace an —OH (formula 
weight (FW) 17) with a 4'-phosphopantetheine group (see 
Fig. 21-5, p. 809). This group has the formula C n H2iN 2 0 7 PS 
(FW 356). Thus, 11,182 - 17 + 356 = 12,151, which is very 
close to the observed M r of 12,153. (c) The thioester could form 
with the a-carboxyl group, (d) In the most common reaction for 
removing the a-carboxyl group of an amino acid (see Fig. 18-6c, 
p. 679), the carboxyl group must be free. Furthermore, it is diffi- 
cult to imagine a decarboxylation reaction starting with a car- 
boxyl group in its thioester form, (e) 12,240 - 12,281 = 41, close 
to the M r of C0 2 (44) Given that BtrK is probably a decarboxy- 
lase, its most likely structure is the decarboxylated form: 



O 



Btrl 

(f) 12,370 - 12,240 = 130 Glutamic acid (C 5 H 9 N0 4 . M T 147), 
minus the —OH (FW 17) removed in the glutamylation reaction, 
leaves a glutamyl group of FW 130; thus, y-glutamylating the 
molecule above would add 130 to its M r BtrJ is capable of y-glu- 
tamylating other substrates, so it may y-glutamylate the struc- 
ture above. The most likely site for this is the free amino group, 
giving the following structure: 
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Chapter 23 



o 



'Antibiotic 



1. They are recognized by two different receptors, typically found 
in different cell types, and are coupled to different downstream 
effects. 

2. Steady-state levels of ATP are maintained by phosphoryl group 
transfer to ADP from phosphocreatine. l-Fluoro-2 t 4-dinitroben- 
zene inhibits creatine kinase. 

3. Ammonia is very toxic to nervous tissue, especially the brain. 
Excess NH 3 is removed by transformation of glutamate to gluta- 
mine, which travels to the liver and is subsequently transformed 
to urea. The additional glutamine arises from the transformation 
of glucose to a-ketoglutarate, transamination of a-ketoglutarate 
to glutamate, and conversion of glutamate to glutamine. 

4. Glucogenic amino acids are used to make glucose for the brain; 
others are oxidized in mitochondria via the citric acid cycle. 

5. From glucose, by the following route: Glucose -> dihydroxyace- 
tone phosphate (in glycolysis); dihydroxyacetone phosphate + 
NADH + H + -> glycerol 3-phosphate + NAD 4 " (glycerol 3- 
phosphate dehydrogenase reaction) 

6. (a) Increased muscular activity increases the demand for ATP, 
which is met by increased 0 2 consumption, (b) After the sprint, 
lactate produced by anaerobic glycolysis is converted to glucose 
and glycogen, which requires ATP and therefore 0 2 . 

7. Glucose is the primarv f;; 1 ^ the brain. TPP-dependent oxida- 
tive decarboxvhcion of pyruvate to acetyl-CoA is essential to 
complete ' b jcose metabolism. 

8. 190 m 

9. (a) Inactivation provides a rapid means to change hormone con- 
centrations, (b) Insulin level is maintained by equal rates of syn- 
thesis and degradation, (c) Changes in the rate of release from 
storage, rate of transport, and rate of conversion from prohor- 
mone to active hormone, 

10. Water-soluble hormones bind to receptors on the outer surface 
of the cell, triggering the formation of a second messenger 
(e.g., cAMP) inside the cell. Lipid-soluble hormones can pass 
through the plasma membrane to act on target molecules or 
receptors directly. 

11. (a) Heart and skeletal muscle lack glucose 6-phosphatase. Any 
glucose 6-phosphate produced enters the glycolytic pathway, 
and under 0 2 -deficient conditions is converted to lactate via 
pyruvate, (b) In a "fight or flight" situation, the concentration of 
glycolytic precursors must be high in preparation for muscular 
activity. Phosphorylated intermediates cannot escape from the 
cell, because the membrane is not permeable to charged species, 
and glucose 6-phosphate is not exported on the glucose 
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transporter. The liver, by contrast, must release the glucose nec- 
essary to maintain blood glucose level; glucose is formed from 
glucose 6-phosphate and enters the bloodstream. 

12. (a) Excessive uptake and use of blood glucose by the liver, lead- 
ing to hypoglycemia; shutdown of amino acid and fatty acid 
catabolism (b) Little circulating fuel is available for ATP require- 
ments. Brain damage results because glucose is the main source 
of fuel for the brain. 

13. Thyroxine acts as an uncoupler of oxidative phosphorylation. 
Uncouplers lower the P/O ratio, and the tissue must increase 
respiration to meet the normal ATP demands. Thermogenesis 
could also be due to the increased rate of ATP utilization by 
the thyroid-stimulated tissue, as increased ATP demands 

are met by increased oxidative phosphorylation and thus 
respiration. 

14. Because prohormones are inactive, they can be stored in quan- 
tity in secretory granules. Rapid activation is achieved by enzy- 
matic cleavage in response to an appropriate signal. 

15. In animals, glucose can be synthesized from many precursors 
(see Fig. 14-15) In humans, the principal precursors are 
glycerol from triacylglycerols and glucogenic amino acids from 
protein. 

16. The ob/ob mouse, which is initially obese, will lose weight. The 
OB/OB mouse will retain its normal body weight, 

17. BMI = 39.3, For BMI of 25, weight must be 75 kg; must lose 
43 kg = 95 lbs 

18. Reduced insulin secretion, Valinomycin has the same effect as 
opening the K + channel, allowing K + exit and consequent 
hyp e rp olarization t 

19. The liver does not receive the insulin message and therefore con- 
tinues to have high levels of glucose 6-phosphatase and gluco- 
neogenesis, increasing blood glucose both during a fast and after 
a glucose- containing meal. The elevated blood glucose triggers 
insulin release from pancreatic )8 cells, hence the high level of 
insulin in the blood. 

20. Some things to consider: What is the frequency of heart attack 
attributable to the drug? How does this frequency compare with 
the number of individuals spared the long-term consequences of 
type 2 diabetes? Are other, equally effective treatment options, 
with fewer adverse effects, available? 

21. Without intestinal glucosidase activity, absorption of glucose 
from dietary glycogen and starch is reduced, blunting the usual 
rise in blood glucose after the meal The undigested oligosaccha- 
rides are fermented by bacteria in the large intestine, and the 
gases released cause intestinal discomfort 

22. (a) Closing the ATP-gated K + channel would depolarize the 
membrane, leading to increased insulin release, (b) Type 2 dia- 
betes results from decreased sensitivity to insulin, not a deficit of 
insulin production; increasing circulating insulin levels will re- 
duce the symptoms associated with this disease, (c) Individuals 
with type 1 diabetes have deficient pancreatic (3 cells, so gly- 
buride will have no beneficial effect, (d) Iodine, like chlorine 
(the atom it replaces in the labeled glyburide), is a halogen, but 
it is a larger atom and has slightly different chemical properties 
It is possible that the iodinated glyburide would not bind to SUR. 
If it bound to another molecule instead, the experiment would 
result in cloning of the gene for this other, incorrect protein, 
(e) Although a protein has been "purified," the "purified" prepa- 
ration might be a mixture of several proteins that co-purify un- 
der those experimental conditions. In this case, the amino acid 
sequence could be that of a protein that co-purifies with SUR. 
Using antibody binding to show that the peptide sequences are 
present in SUR excludes this possibility (f ) Although the cloned 
gene does encode the 25 amino acid sequence found in SUR, it 
could be a gene that, coincidentally, encodes the same sequence 
in another protein In this case, this other gene would most likely 



be expressed in different cells than the SUR gene. The mRNA 
hybridization results are consistent with the putative SUR cDNA 
actually encoding SUR. (g) The excess unlabeled glyburide 
competes with labeled glyburide for the binding site on SUR. As 
a result, there is significantly less binding of labeled glyburide, 
so little or no radioactivity is detected in the 140 kDa protein, 
(h) In the absence of excess unlabeled glyburide, labeled 140 
kDa protein is found only in the presence of the putative SUR 
cDNA. Excess unlabeled glyburide competes with the labeled 
glyburide, and no 125 I-labeled 140 kDa protein is detected. This 
shows that the cDNA produces a glyburide-binding protein of 
the same molecular weight as SUR — strong evidence that the 
cloned gene encodes the SUR protein, (i) Several additional 
steps are possible, such as: (1) Express the putative SUR cDNA 
in CHO (Chinese hamster ovary) cells and show that the trans- 
formed cells have ATP-gated K + channel activity. (2) Show that 
HIT cells with mutations in the putative SUR gene lack ATP- 
gated K + channel activity. (3) Show that experimental animals 
or human patients with mutations in the putative SUR gene are 
unable to secrete insulin. 



Chapter 24 

1. 6.1 X 10 4 nm; 290 times longer than the T2 phage head 

2. The number of A residues does not equal the number of T 
residues, nor does the number of G equal the number of C, so 
the DNA is not a base-paired double helix; the M13 DNA is 
single-stranded. 

3. M T = 3 8 X 10 8 ; length = 200 /xm; Lk 0 = 55,200; Lk = 51,900 

4. The exons contain 3 bp/amino acid X 192 amino acids = 576 bp. 
The remaining 864 bp are in introns, possibly in a leader or signal 
sequence, and/or in other noncoding DNA, 

5. 5,000 bp. (a) Doesn't change; Lk cannot change without break- 
ing and re-forming the covalent backbone of the DNA. (b) Be- 
comes undefined; a circular DNA with a break in one strand has, 
by definition, no Lk. (c) Decreases; in the presence of ATP, 
gyrase underwinds DNA. (d) Doesn't change; this assumes that 
neither of the DNA strands is broken in the heating process. 

6. For Lk to remain unchanged, the topoisomerase must introduce 
the same number of positive and negative supercoils. 

7. a = -0 067; >70% probability 

8. (a) Undefined; the strands of a nicked DNA could be separated 
and thus have no Lk (b) 476 (c) 476; the DNA is already 
relaxed, so the topoisomerase does not cause a net change. 

(d) 460; gyrase plus ATP reduces the Lk in increments of 2. 

(e) 464; eukaryotic type I topoisomerases increase the Lk of un- 
derwound or negatively supercoiled DNA in increments of 1. 

(f) 460; nucleosome binding does not break any DNA strands 
and thus cannot change Lk 

9. A fundamental structural unit in chromatin repeats about every 
200 bp; the DNA is accessible to the nuclease only at 200 bp in- 
tervals. The brief treatment was insufficient to cleave the DNA at 
every accessible point, so a ladder of DNA bands is created in 
which the DNA fragments are multiples of 200 bp. The thickness 
of the DNA bands suggests that the distance between cleavage 
sites varies somewhat. For instance, not all the fragments in the 
lowest band are exactly 200 bp long. 

10. A right-handed helix has a positive Lk; a left-handed helix (such 
as Z-DNA) has a negative Lk. Decreasing the Lk of a closed cir- 
cular B-DNA by underwinding it facilitates formation of regions 
of Z-DNA within certain sequences. (See Chapter 8, p. 281, for a 
description of sequences that permit the formation of Z-DNA.) 

11. (a) Both strands must be covalently closed, and the molecule 
must be either circular or constrained at both ends (b) Forma- 
tion of cruciforms, left-handed Z-DNA, plectonemic or solenoidal 
supercoils, and unwinding of the DNA are favored. (c)i?. coli 
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DNA topoisomerase II or DNA gyrase. (d) It binds the DNA at a 
point where it crosses on itself, cleaves both strands of one of 
the crossing segments, passes the other segment through the 
break, then reseals the break. The result is a change in Lk of -2. 

12. Centromere, telomeres, and an autonomous replicating sequence 
or replication origin 

13. The bacterial nucleoid is organized into domains approximately 
10,000 bp long. Cleavage by a restriction enzyme relaxes the 
DNA within a domain, but not outside the domain. Any gene in 
the cleaved domain for which expression is affected by DNA 
topology will be affected by the cleavage; genes outside the 
domain will not. 

14. (a) When DNA ends are sealed to create a relaxed, closed circle, 
some DNA species are completely relaxed but others are trapped 
in slightly under- or overwound states. This gives rise to a distri- 
bution of topoisomers centered on the most relaxed species, 
(b) Positively supercoiled (c) The DNA that is relaxed despite 
the addition of dye is DNA with one or both strands broken DNA 
isolation procedures inevitably introduce small numbers of 
strand breaks in some of the closed-circular molecules, (d) Ap- 
proximately -0 05 This is determined by simply comparing na- 
tive DNA with samples of known a In both gels, the native 
DNA migrates most closely with the sample of a = -0 049 

15. (a) In nondisjunction, one daughter cell and all of its descen- 
dants get two copies of the synthetic chromosome and are white; 
the other daughter cell and all of its descendants get no copies of 
the synthetic chromosome and are red This gives rise to a half- 
white, half-red colony, (b) In chromosome loss, one daughter 
cell and all of its descendants get one copy of the synthetic chro- 
mosome and are pink; the other daughter and all its descendants 
get no copies of the synthetic chromosome and are red This 
gives rise to a half-pink, half-red colony (c) The minimum func- 
tional centromere must be smaller than 0 63 kbp, since all frag- 
ments of this size or larger confer relative mitotic stability. 

(d) Telomeres are required to fully replicate only linear DNA; a 
circular molecule can replicate without them, (e) The larger the 
chromosome, the more faithfully it is segregated The data show 
neither a minimum size below which the synthetic chromosome 
is completely unstable, nor a maximum size above which stability 
no longer changes 
(f) 



2. In this extension of the Meselson-Stahl experiment, after three 
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As shown in the graph, even if the synthetic chromosomes were 
as long as the normal yeast chromosomes, they would not be as 
stable. This suggests other, as yet undiscovered, elements are re- 
quired for stability. 

Chapter 25 

1. In random, dispersive replication, in the second generation, all 
the DNAs would have the same density and would appear as a 
single band, not the two bands observed in the Meselson-Stahl 
experiment. 



generations the molar ratio of 
2/6 - 0.33. 



>N- I4 N DNA to 14 N- 14 N DNA is 



3. (a) 4.42 X 10 5 turns; (b) 40 min. In cells dividing every 20 min, 
a replicative cycle is initiated every 20 min, each cycle beginning 
before the prior one is complete (c) 2,000 to 5,000 Okazaki frag- 
ments The fragments are 1,000 to 2,000 nucleotides long and 
are firmly bound to the template strand by base pairing. Each 
fragment is quickly joined to the lagging strand, thus preserving 
the correct order of the fragments 

4. A 28,7%; G 21.3%; C 21.3%; T 28 7% . The DNA strand made 
from the template strand: A 32 7%; G 18 5%; C 24 1%; T 24.7%; 
the DNA strand made from the complementary template strand: 
A 24.7%; G 24.1%; C 18.5%; T 32.7%. It is assumed that the two 
template strands are replicated completely. 

5. (a) No Incorporation of 32 P into DNA results from the synthesis 
of new DNA, which requires the presence of all four nucleotide 
precursors, (b) Yes Although all four nucleotide precursors 
must be present for DNA synthesis, only one of them has to be 
radioactive in order for radioactivity to appear in the new DNA. 
(c) No. Radioactivity is incorporated only if the 32 P label is in the 
a phosphate; DNA polymerase cleaves off pyrophosphate— i.e., 
the (3 and y phosphate groups 

6. Mechanism 1: 3'-OH of an incoming dNTP attacks the a phos- 
phate of the triphosphate at the 5' end of the growing DNA 
strand, displacing pyrophosphate. This mechanism uses normal 
dNTPs, and the growing end of the DNA always has a triphos- 
phate on the 5' end. 



5' end 

®-®-®-CH^O N 
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3' end : 



3' end 



Mechanism 2: This uses a new type of precursor, nucleotide 3' 
triphosphates. The growing end of the DNA strand has a 5'-OH, 
which attacks the a phosphate of an incoming deoxynucleotide 
3'-triphosphate, displacing pyrophosphate. Note that this mecha- 
nism would require the evolution of new metabolic pathways to 
supply the needed deoxynucleotide 3 '-triphosphates 
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3' end * 3' end • 

7. Leading strand: Precursors: dATP, dGTP, dCTP, dTTP (also 
needs a template DNA strand and DNA primer); enzymes and 
other proteins: DNA gyrase, helicase, single-stranded DNA- 
binding protein, DNA polymerase III, topoisomerases, and 
pyrophosphatase. Lagging strand: Precursors: ATP, GTP, CTP, 
UTP, dATP, dGTP, dCTP, dTTP (also needs an RNA primer); 
enzymes and other proteins: DNA gyrase, helicase, single- 
stranded DNA-binding protein, primase, DNA polymerase III, 
DNA polymerase I, DNA ligase, topoisomerases, and pyrophos- 
phatase. NAD + is also required as a cofactor for DNA ligase, 

8. Mutants with defective DNA ligase produce a DNA duplex in 
which one of the strands remains in pieces (as Okazaki frag- 
ments). When this duplex is denatured, sedimentation results 
in one fraction containing the intact single strand (the high 
molecular weight band) and one fraction containing the 
unspliced fragments (the low molecular weight band). 

9. Watson-Crick base pairing between template and leading strand; 
proofreading and removal of wrongly inserted nucleotides by the 
3'-exonuclease activity of DNA polymerase III. Yes — perhaps. 
Because the factors ensuring fidelity of replication are operative 
in both the leading and the lagging strands, the lagging strand 
would probably be made with the same fidelity However, the 
greater number of distinct chemical operations involved in mak- 
ing the lagging strand might provide a greater opportunity for 
errors to arise 

10. -1,200 bp (600 in each direction) 

11. A small fraction (13 of 10 9 cells) of the histidine-requiring 
mutants spontaneously undergo back-mutation and regain their 
capacity to synthesize histidine. 2-Aminoanthracene increases 
the rate of back-mutations about 1800-fold and is therefore 
mutagenic. Since most carcinogens are mutagenic, 
2-aminoanthracene is probably carcinogenic. 

12. Spontaneous deamination of 5-methylcytosine (see p. 289) pro- 
duces thymine, and thus a G-T mismatched pair. These are 
among the most common mismatches in the DNA of eukaryotes. 
The specialized repair system restores the G=C pair. 

13. (a) Ultraviolet irradiation produces pyrimidine dimers; in normal 
fibroblasts these are excised by cleavage of the damaged strand 
by a special excinuclease. Thus the denatured single-stranded 



DNA contains the many fragments caused by the cleavage, and 
the average molecular weight is lowered, These fragments of 
single-stranded DNA are absent from the XPG samples, as indi- 
cated by the unchanged average molecular weight, (b) The 
absence of fragments in the single-stranded DNA from the XPG 
cells after irradiation suggests the special excinuclease is defec- 
tive or missing. 

14. During homologous genetic recombination, a Holliday intermedi- 
ate may be formed almost anywhere within the two paired, ho- 
mologous chromosomes; the branch point of the intermediate 
can move extensively by branch migration. In site-specific re- 
combination, the Holliday intermediate is formed between two 
specific sites, and branch migration is generally restricted by het- 
erologous sequences on either side of the recombination sites. 

15. Once replication has proceeded from the origin to a point where 
one recombination site has been replicated but the other has not, 
site-specific recombination not only inverts the DNA between 
the recombination sites but also changes the direction of one 
replication fork relative to the other. The forks will chase each 
other around the DNA circle, generating many tandem copies of 
the plasmid. The multimeric circle can be resolved to monomers 
by additional site-specific recombination events. 




Plasmid with Partially 
recombination replicated 
sites 1 and 2 plasmid 




16. (a) Even in the absence of an added mutagen, background mu- 
tations occur due to radiation, cellular chemical reactions, and so 
forth, (b) If the DNA is sufficiently damaged, a substantial frac- 
tion of gene products are nonfunctional and the cell is nonviable, 
(c) Cells with reduced DNA repair capability are more sensitive 
to mutagens. Because they less readily repair lesions caused by 
R7000, uvr~ bacteria have an increased mutation rate and 
increased chance of lethal effects, (d) In the uvr + strain, the 
excision-repair system removes DNA bases with attached 
[ 3 H]R7000, decreasing the 3 H in these cells over time. In the 
uvr" strain, the DNA is not repaired and 3 H level increases as 
[ 3 H]R7000 continues to react with the DNA. (e) All mutations 
listed in the table except A=T to G=C show significant increases 
over background. Each type of mutation results from a different 
type of interaction between R7000 and DNA. Because different 
types of interactions are not equally likely (due to differences in 
reactivity, steric constraints, etc.), the resulting mutations occur 
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with different frequencies, (f) No. Only those that start with a 
G^C base pair are explained by this model. Thus A^T to C=G 
and A=T to T=A must be due to R7000 attaching to an A or a T. 
(g) R7000-G pairs with A. First, R7000 adds to G=C to give 
R7000-G=C. (Compare this with what happens with the CH 3 -G 
in Fig. 25-28b, p« 1000.) If this is not repaired, one strand is 
replicated as R7000-G—A, which is repaired to T=A. The other 
strand is wild-type. If the replication produces R7000-G=T, a 
similar pathway leads to an A=T base pair, (h) No. Compare 
data in the two tables, and keep in mind that different mutations 
occur at different frequencies. 

A=T to C=G: moderate in both strains; but better repair in the 
uvr + strain 

G=C to A=T: moderate in both; no real difference 

G=C to C=G: higher in uvr + ; certainly less repair! 

G=C to T=A: high in both; no real difference 

A=T to T=A: high in both; no real difference 

A=T to G=C: low in both; no real difference 

Certain adducts may be more readily recognized by the repair 

apparatus than others and thus are repaired more rapidly and 

result in fewer mutations. 

Chapter 26 

1. (a) 60 to 100 s; (b) 500 to 900 nucleotides 

2. A single base error in DNA replication, if not corrected, would 
cause one of the two daughter cells, and all its progeny, to have a 
mutated chromosome. A single base error in RNA transcription 
would not affect the chromosome; it would lead to formation of 
some defective copies of one protein, but because mRNAs turn 
over rapidly, most copies of the protein would not be defective. 
The progeny of this cell would be normal. 

3. Normal posttranscriptional processing at the 3' end (cleavage 
and polyadenylation) would be inhibited or blocked. 

4. Because the template-strand RNA does not encode the en- 
zymes needed to initiate viral infection, it would probably be 
inert or simply degraded by cellular ribonucleases. Replication 
of the template-strand RNA and propagation of the virus could 
occur only if intact RNA replicase (RNA-dependent RNA poly- 
merase) were introduced into the cell along with the template 
strand. 

5. (1) Use of a template strand of nucleic acid; (2) synthesis in the 
5'-»3' direction; (3) use of nucleoside triphosphate substrates, 
with formation of a phosphodiester bond and displacement of 



PPi . Polynucleotide phosphorylase forms phosphodiester bonds, 
but differs in all other listed properties. 

6. Generally two: one to cleave the phosphodiester bond at one 
intron-exon junction; the other to link the resulting free exon 
end to the exon at the other end of the intron, If the nucleophile 
in the first step were water, this step would be a hydrolytic event 
and only one transesterification step would be required to com- 
plete the splicing process. 

7. Many snoRNAs, required for rRNA modification reactions, are 
encoded in introns. If splicing does not occur, snoRNAs are not 
produced. 

8. These enzymes lack a 3'— >5' proofreading exonuclease and have 
a high error rate; the likelihood of a replication error that would 
inactivate the virus is much less in a small genome than in a large 
one 

9. (a) 4 32 = 1.8 X 10 19 (b) 0.006% (c) For the "unnatural selec- 
tion" step, use a chromatographic resin with a bound molecule 
that is a transition-state analog of the ester hydrolysis reaction 
(e.g., an appropriate phosphonate compound; see Box 6-3). 

10. Though RNA synthesis is quickly halted by a-amanitin toxin, it 
takes several days for the critical mRNAs and the proteins in the 
liver to degrade, causing liver dysfunction and death, 

11. (a) After lysis of the cells and partial purification of the con- 
tents, the protein extract could be subjected to isoelectric focus- 
ing. The j3 subunit could be detected by an antibody-based assay. 
The difference in amino acid residues between the normal /3 sub- 
unit and the mutated form (i.e., the different charges on the 
amino acids) would alter the electrophoretic mobility of the mu- 
tant protein in an isoelectric focusing gel, relative to the protein 
from a nonresistant strain, (b) Direct DNA sequencing (by the 
Sanger method). 

12. (a) 384 nucleotide pairs (b) 1,620 nucleotide pairs (c) Most of 
the nucleotides are untranslated regions at the 3' and 5' ends of 
the mRNA. Also, most mRNAs code for a signal sequence (Chap- 
ter 27) in their protein products, which is eventually cleaved off 
to produce the mature and functional protein. 

13. (a) cDNA is produced by reverse transcription of mRNA; thus, 
the mRNA sequence is probably CGG. Because the genomic DNA 
transcribed to make the mRNA has the sequence CAG, the pri- 
mary transcript most likely has CAG, which is posttranscription- 
ally modified to CGG. (b) The unedited mRNA sequence is the 
same as that of the DNA (except for U replacing T) . Unedited 
mRNA has the sequence (* indicates site of editing) 



(5'). . .GUCUCUGGUUUUCCUUGGGUGCCUUUAUGCAGCAAGGAUGCGAUAUUUCGCCAAG. . .(3') 
In step 1, primer 1 anneals as shown: 



(5'). . .GUCUCUGGUUUUCCUUGGGUGCCUUUAUGCAGCAAGGAUGCGAUAUUUCGCCAAG. . .(3') 

I I I I I I I I I I I I I I I I I I I I I I I I 
Primer 1: (3')-CGTTCCTACGCTATAAAGCGGTTC-(5') 

cDNA ( underlined^! is synthesized from right to left: 



(5'). . .GUCUCUGGUUUUCCUUGGGUGCCUUUAUGCAGCAAGGAUGCGAUAUUUCGCCAAG. . .(3') 

I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
(3') ...CAGAGACCAAAAGGAACCCACGGAAATACGT CGTTCCTACGCTATAAAGCGGTTC-(5 f ) 

Then step 2 yields just the cDNA: 



(3 ' ). . ■ C AGAGACC AAAAGGAACCCACGGAAATACGTCGTTCCTACGCTATAAAGCGGTTC-(5 ' ) 

In step 3, primer 2 anneals to the cDNA: 

Primer 2: (5')-CCTTGGGTGCCTTTA-(3') 
I I II I I I I I I I I I I I 

(3')...CAGAGACCAAAAGGAACCCACGGAAATACGTCGTTCCTACGCTATAAAGCGGTTC-(5') 



Abbreviated Solutions to Problems AS-33 



DNA polymerase adds nucleotides to the 3' end of this primer. Moving from left to right, it inserts T, G, C, and A. However, because the A 
from ddATP lacks the 3' — OH needed to attach the next nucleotide, the chain is not elongated past this point. This A is shown in italic; 
the new DNA is underlined : 

Primer 2: (S'hCCTTGGGTGCCTTTATGCA 



(3')- . . C AGAGACC AAAAGGAACCCACGGAAATACGTCGTTCCTACGCTATAAAGCGGTTC-(5 ' ) 

This yields a 19 nucleotide fragment for the unedited transcript. In the edited transcript, the *A is changed to G; in the cDNA this corre- 
sponds to C. At the start of step 3: 

Primer 2: (5')-CCTTGGGTGCCTTTA-(3') 
I I I I I I I I I I I I I I I 

(3')..-CAGAGACCAAAAGGAACCCACGGAAATACGCCGTTCCTACGCTATAAAGCGGTTC-(5') 

In this case, DNA polymerase can elongate past the edited base and will stop at the next T in the cDNA. The dideoxy A is italic; the new 
DNA is underlined . 

Primer 2: (5'hCCTTGGGTGCCTTTATGCGGCA 

I I I I I I I I I I I 1 I I I I I I I I I I 
(3 f ). . .CAGAGACCAAAAGGAACCC ACGGAAATACGCCGTTCCTACGCTATAAAGCGGTTC-(5') 



This gives the 22 nucleotide product, (c) Treatments (pro- 
teases, heat) known to disrupt protein function inhibit the edit- 
ing activity, whereas treatments (nuclease) that do not affect 
proteins have little or no effect on editing. A key weakness of 
this argument is that the protein-disrupting treatments do not 
completely abolish editing. There could be some background 
editing or degradation of the mRNA even without the enzyme, 
or some of the enzyme might survive the treatments, (d) Only 
the a phosphate of NTPs is incorporated into polynucleotides. 
If the researchers had used the other types of [ 32 P]NTPs, none 
of the products would have been labeled, (e) Because only an 
A is being edited, only the fate of any A in the sequence is of 
interest, (f) Given that only ATP was labeled, if the entire nu- 
cleotide were removed all radioactivity would have been re- 
moved from the mRNA, so only unmodified [ 32 P]AMP would be 
present on the chromatography plate, (g) If the base were re- 
moved and replaced, one would expect to see only [ 3 H]AMP. 
The presence of [ 3 H]IMP indicates that the A to I change occurs 
without removal of H at positions 2 and 8. The most likely 
mechanism is chemical modification of A to I by hydrolytic 
deamination (see Fig, 22-34, p. 885). (h) CAG is changed to 
CIG. This codon is read as CGG. 

Chapter 27 

1. (a) Gly-Gln-Ser-Leu-Leu-Ile (b) Leu-Asp-Ala-Pro 

(c) His-Asp-Ala-Cys-Cys-Tyr (d) Met-Asp-Glu in eukaryotes; 
fMet-Asp-Glu in bacteria 

2. UUAAUGUAU, UUGAUGUAU, CUUAUGUAU, CUCAUGUAU, 
CUAAUGUAU, CUGAUGUAU, UUAAUGUAC, UUGAUGUAC, 
CUUAUGUAC, CUCAUGUAC, CUAAUGUAC, CUGAUGUAC 

3. No. Because nearly all the amino acids have more than one 
codon (e.g., Leu has six), any given polypeptide can be coded for 
by a number of different base sequences However, some amino 
acids are encoded by only one codon and those with multiple 
codons often share the same nucleotide at two of the three posi- 
tions, so certain parts of the mRNA sequence encoding a pro- 
tein of known amino acid sequence can be predicted with high 
certainty. 

4. (a) (5')CGACGGCGCGAAGUCAGGGGUGUUAAG(3') 

(b) Arg-Arg-Arg-Glu-Val-Arg-Gly-Val-Lys 

(c) No. The complementary antiparallel strands in double-helical 
DNA do not have the same base sequence in the 5'— >3' direction. 
RNA is transcribed from only one specific strand of duplex DNA. 
The RNA polymerase must therefore recognize and bind to the 
correct strand. 

5. There are two tRNAs for methionine: tRNA^ 6 ^ which is the initi- 
ating tRNA, and tRNA Met , which can insert a Met residue in inte- 



rior positions in a polypeptide. Only fMet-tRNA™^ is recognized 
by the initiation factor IF-2 and is aligned with the initiating AUG 
positioned at the ribosomal P site in the initiation complex AUG 
codons in the interior of the mRNA can bind and incorporate 
only Met-tRNA Met 

6. Allow polynucleotide phosphorylase to act on a mixture of UDP 
and CDP in which UDP has, say, five times the concentration of 
CDR The result would be a synthetic RNA polymer with many 
UUU triplets (coding for Phe), a smaller number of UUC (Phe), 
UCU (Ser), and CUU (Leu), a much smaller number of UCC 
(Ser), CUC (Leu), and CCU (Pro), and the smallest number of 
CCC (Pro). 

7. A minimum of 583 ATP equivalents (based on 4 per amino acid 
residue added, except that there are only 145 translocation 
steps) Correction of each error requires 2 ATP equivalents For 
glycogen synthesis, 292 ATP equivalents, The extra energy cost 
for /3-globin synthesis reflects the cost of the information content 
of the protein. At least 20 activating enzymes, 70 ribosomal pro- 
teins, 4 rRNAs, 32 or more tRNAs, an mRNA, and 10 or more 
auxiliary enzymes must be made by the eukaryotic cell in order 
to synthesize a protein from amino acids. The synthesis of an 
(al^4) chain of glycogen from glucose requires only 4 or 5 
enzymes (Chapter 15). 



8. 



Glycine codons 


Anticodons 


(5')GGU 


(5')ACC, GCC, ICC 


(5')GGC 


(5')GCC,ICC 


(5')GGA 


(5')UCC,ICC 


(5')GGG 


(5')CCC, UCC 



(a) The 3' and middle position (b) Pairings with anticodons 
(5')GCC, ICC, and UCC (c) Pairings with anticodons (5')ACC 
and CCC 

9. (a), (c), (e), and (g) only; (b), (d), and (f) cannot be the result 
of single-base mutations: (b) and (f) would require substitutions 
of two bases, and (d) would require substitutions of all three 
bases. 

10. The two DNA codons for Glu are GAA and GAG, and the four 
DNA codons for Val are GTT, GTC, GTA, and GTG. A single base 
change in GAA to form GTA or in GAG to form GTG could ac- 
count for the Glu -» Val replacement in sickle-cell hemoglobin. 
Much less likely are two-base changes, from GAA to GTG, GTT, 
or GTC; and from GAG to GTA, GTT, or GTC 

11. Isoleucine is similar in structure to several other amino acids, 
particularly valine. Distinguishing between valine and isoleucine 
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in the aminoacylation process requires the second filter of a 
proofreading function Histidine has a structure unlike that of 
any other amino acid, and this structure provides opportunities 
for binding specificity adequate to ensure accurate aminoacyla- 
tion of the cognate tRNA. 

12. (a) The Ala-tRNA synthetase recognizes the G 3 -U 70 base pair in 
the amino acid arm of tRNA^. (b) The mutant tRNA Ala would 
insert Ala residues at codons encoding Pro, (c) A mutation that 
might have similar effects is an alteration in tRNA Pro that allowed 
it to be recognized and aminoacylated by Ala-tRNA synthetase, 
(d) Most of the proteins in the cell would be inactivated, so 
these would be lethal mutations and hence never observed. This 
represents a powerful selective pressure for maintaining the ge- 
netic code, 

13. The amino acid most recently added to a growing polypeptide 
chain is the only one covalently attached to a tRNA and thus is 
the only link between the polypeptide and the mRNA encoding 
it. A proofreading activity would sever this link, halting synthesis 
of the polypeptide and releasing it from the mRNA 

14. The protein would be directed into the ER, and from there the 
targeting would depend on additional signals. SRP binds the 
amino-terminal signal early in protein synthesis and directs the 
nascent polypeptide and ribosome to receptors in the ER Be- 
cause the protein is translocated into the lumen of the ER as it is 
synthesized, the NLS is never accessible to the proteins involved 
in nuclear targeting 

15. Trigger factor is a molecular chaperone that stabilizes an un- 
folded and translocation-competent conformation of ProOmpA, 

16. DNA with a minimum of 5,784 bp; some of the coding sequences 
must be nested or overlapping, 

17. (a) The helices associate through hydrophobic and van der 
Waals interactions, (b) R groups 3, 6, 7, and 10 extend to the 
left; 1, 2, 4, 5, 8, and 9 extend to the right, (c) One possible 
sequence is 

12345 6789 10 

AT-Phe-Ile-Glu-Val-Met-Asn-Ser-Ala-Phe-Gln-C 

(d) One possible DNA sequence for the amino acid sequence in 
(c) is 

Nontemplate strand 

(5 ' )-T T TAT TGAAGT AATGAAT AGTGC AT TC C AG-(3 ' ) 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
(3')-AAATAACTTCATTACTTATCACGTAAGGTC-(5') 
Template strand 



(e) Phe, Leu, He, Met, and Val. All are hydrophobic, but the 
set does not include all the hydrophobic amino acids; Trp, 
Pro, Ala, and Gly are missing, (f) Tyr, His, Gin, Asn, Lys, 
Asp, and Glu. All of these are hydrophilic, although Tyr is less 
hydrophilic than the others. The set does not include all the 
hydrophilic amino acids; Ser, Thr, and Arg are missing, 
(g) Omitting T from the mixture excludes codons starting 
or ending with T — thus excluding Tyr, which is not very 
hydrophilic, and, more importantly, excluding the two possible 
stop codons (TAA and TAG). No other amino acids in the NAN 
set are excluded by omitting T (h) Misfolded proteins are 
often degraded in the cell. Therefore, if a synthetic gene has 
produced a protein that forms a band on the SDS gel, it is 
likely that this protein is folded properly, (i) Protein folding 
depends on more than hydrophobic and van der Waals 
interactions. There are many reasons why a synthesized 
random-sequence protein might not fold into the four-helix 
structure. For example, hydrogen bonds between hydrophilic 
side chains could disrupt the structure. Also, not all sequences 
have an equal propensity to form an a helix 



Chapter 28 

1. (a) Tryptophan synthase levels remain high in spite of the pres- 
ence of tryptophan, (b) Levels again remain high, (c) Levels 
rapidly decrease, preventing wasteful synthesis of tryptophan, 

2. (a) Constitutive, low-level expression of the operon; most muta- 
tions in the operator would make the repressor less likely to 
bind, (b) Either constitutive expression, as in (a), or constant 
repression, if the mutation destroyed the capability to bind to 
lactose and related compounds and hence the response to induc- 
ers, (c) Either increased or decreased expression of the operon 
(under conditions in which it is induced), depending on whether 
the mutation made the promoter more or less similar, respec- 
tively, to the consensus E. coli promoter 

3. 7,000 copies 

4. 8 X 10~ 9 M, about 10 5 times greater than the dissociation con- 
stant. With 10 copies of active repressor in the cell, the operator 
site is always bound by the repressor molecule 

5. (a)-(e) Each condition decreases expression of lac operon genes. 

6. (a) Less attenuation of transcription. The ribosome completing 
the translation of sequence 1 would no longer overlap and block 
sequence 2; sequence 2 would always be available to pair with 
sequence 3, preventing formation of the attenuator structure, 
(b) More attenuation of transcription. Sequence 2 would pair 
less efficiently with sequence 3; the attenuator structure would 
be formed more often, even when sequence 2 was not blocked by 
a ribosome. (c) No attenuation of transcription. The only regula- 
tion would be that afforded by the Trp repressor, (d) Attenua- 
tion loses its sensitivity to Trp tRNA. It might become sensitive 
to His tRNA. (e) Attenuation would rarely, if ever, occur. Se- 
quences 2 and 3 always block formation of the attenuator. 

(f) Constant attenuation of transcription. Attenuator always 
forms, regardless of the availability of tryptophan, 

7. Induction of the SOS response could not occur, making the cells 
more sensitive to high levels of DNA damage. 

8. Each Salmonella cell would have flagella made up of both types 
of flagellar protein, and the cell would be vulnerable to antibod- 
ies generated in response to either protein. 

9. A dissociable factor necessary for activity (e g., a specificity fac- 
tor similar to the cr subunit of the E. coli enzyme) may have 
been lost during purification of the polymerase. 

10. 

Gal4 protein 



Gal4 DNA-binding domain 


Gal4 activator domain 


Engineered protein 


Lac repressor DNA-binding domain 


Gal4 activator domain 



The engineered protein cannot bind to the Gal4 binding site in 
the GAL gene (UAS G ), because it lacks the Gal4 DNA-binding 
domain. Modify the Gal4p DNA binding site to give it the nu- 
cleotide sequence to which the Lac repressor normally binds 
(using methods described in Chapter 9). 

11. Methylamine. The reaction proceeds with attack of water on the 
guanidinium carbon of the modified arginine. 

12. The bed mRNA needed for development is contributed to the 
egg by the mother. The egg develops normally even if its geno- 
type is bcd~/bcd~, as long as the mother has one normal bed 
gene and the bcd~ allele is recessive. However, the adult 
bcd~/bcd~ female will be sterile because she has no normal 
bed mRNA to contribute to her own eggs, 

13. (a) Hydrogen bonds form between the protein and DNA back- 
bone at A106, A110, A118, T119, and A122, and between the 
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protein and DNA bases at A106, T107, A118, and T119. The 
latter four nucleotides contribute directly to DNA sequence 
recognition (b) DNA backbone: A106-Arg 290 , AllO-Ser 212 , 
A118-Arg 199 , T119-Arg 204 , A122-Ser 303 . DNA bases: A106- 
Asn 253 , T107-Asn 253 , A118-Asn 163 , T119-Asn 163 . Asn, Gin, Glu, 
Lys, and Arg are commonly found hydrogen-bonded to bases in 
DNA. The majority of residues in the TATA-binding protein that 
are involved in hydrogen bonds are Arg and Asn. 

(c) TATATATA (residues 103 to 110) 
ATATATAT (residues 122 to 115) 

The TATA-binding protein recognizes A106, T107/T119, A118. 

(d) The hydrophobic interactions are numerous Many binding 
interactions of this type involve the burying of large amounts of 
hydrophobic surface. 

14. (a) For 10% expression (90% repression), 10% of the repressor 
has bound inducer and 90% is free and available to bind the op- 
erator. The calculation uses Eqn 5-8 (p. 156), with 6 = 01 and 
K d = 10" 4 m. 

[IPTG] [IPTG] 
" [IPTG] +K d ~ [IPTG] + 10" 4 m 

[IPTG] 

0.1 = — so 0.9[IPTG] = 10 -5 or [IPTG] = 1.1 X 10" 5 M 

[IPTG] + 10" 4 m 

For 90% expression, 90% of the repressor has bound inducer, 
so 6 = 0.9. Entering the values for 6 and K d in Eqn 5-8 gives 
[IPTG] = 9 x 10~ 4 m. Thus, gene expression varies 10-fold over a 
roughly 10-fold [IPTG] range, (b) You would expect the protein 
levels to be low before induction, rise during induction, and then 
decay as synthesis stops and the proteins are degraded, (c) As 



shown in (a), the lac operon has more levels of expression than 
just on or off; thus it does not have characteristic A. As shown in 
(b), expression of the lac operon subsides once the inducer is 
removed; thus it lacks characteristic B. (d) GFP-on: rep ts and 
GFP are expressed at high levels; rep ts represses OP A , so no LacI 
protein is produced. GFP-off: LacI is expressed at a high level; 
LacI represses OP lac , so rep ts and GFP are not produced, (e) 
IPTG treatment switches the system from GFP-off to GFP-on. 
IPTG has an effect only when LacI is present, so affects only the 
GFP-off state. Adding IPTG relieves the repression of OPi ac al- 
lowing high-level expression of rep ts , which turns off expression 
of LacI, and high-level expression of GFP. (f) Heat treatment 
switches the system from GFP-on to GFP-off. Heat has an effect 
only when rep ts is present, so affects only the GFP-on state. Heat 
inactivates rep ts and relieves the repression of OP A) allowing 
high-level expression of LacI. LacI then acts at OPi ac to repress 
synthesis of rep ts and GFP (g) Characteristic A: the system is 
not stable in the intermediate state. At some point, one repressor 
will act more strongly than the other due to chance fluctuations 
in expression; this shuts off expression of the other repressor 
and locks the system in one state. Characteristic B: once one 
repressor is expressed, it prevents the synthesis of the other; 
thus the system remains in one state even after the switching 
stimulus has been removed, (h) At no time does any cell express 
an intermediate level of GFP — this is a confirmation of character- 
istic A. At the intermediate concentration (X) of inducer, some 
cells have switched to GFP-on while others have not yet made 
the switch and remain in the GFP-off state; none are in between. 
The bimodal distribution of expression levels at [IPTG] = X is 
caused by the mixed population of GFP-on and GFP-off cells. 
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Key: b = boxed material; f = figures; s = structural 
formulas; t = tables; boldface = boldfaced terms 
in text 

a 

A bands, 176, 177f 

A (aminoacyl) binding site, ribosomal, 1089 
A kinase anchoring proteins (AKAPs), 431-432 
AAA+ ATPase, 985 

abasic site, in base-excision repair, 996-997, 997f 

ABC exinuclease, 997-998 

ABC1 protein, 840, 843-844 

ABC transporter, 400, 400f, 840 

abiotic synthesis, 30-31, 31f 

absolute configuration, 74 

absolute temperature, units of, 49 It 

absorbance (A ), 76b 

absorption 

of fat, in small intestine, 648-649, 648f 

of light, 744-749. See also light, absorption of 
absorption spectra 

of cytochrome c, 711, 71 If 

of nucleotides, 276, 276f 

of opsins, 465, 465f 
ACAT (acyl-CoA-cholesterol acyl transferase), 

836, 841 
acceptor control, 733 
acceptor control ratio, 733 
accessory pigments, 747 

in light absorption, 745f, 747 
acetaldehyde, 513s 
acetals, 238, 238f 
acetate, 505s 

activated, in citric acid cycle, 616-617, 616f, 
617f, 619f 

in cholesterol synthesis, 831-832, 832f 

in fatty acid synthesis, 813-814, 813f 

oxidation of, 631 

as source of phosphoenolpyruvate, 638 

transport of, 813-814, 813f 
acetic acid, 505s, 513s 

pK a of, 80, 80f 

titration curve for, 58, 58f 
acetic acid-acetate buffer system, 60, 60f 
acetoacetate, 666, 929s 
acetoacetate decarboxylase, 666 
acetoacetyl-ACP, 809, 81 Of, 81 If 
acetoacetyl-CoA, 298 
acetone, 513s, 666, 929s 

in diabetic ketoacidosis, 929 
acetyl groups 

in fatty acid synthesis, 813-814, 813f 

transport of, 813-814, 813f 
A/"-acetyl-j8-D-glucosamine, 240s 
acetylcholine receptor, 410 

defective, 412t 

open/closed conformation of, 453, 454f 
in signaling, 453, 454f 
structure of, 453, 454f 
synaptic aggregation of, 384 
acetyl-CoA, 13s, 607s 
amino acid degradation to, 695-696, 695f, 696f 
AMPK and, 934 

/3 oxidation yielding, 653f, 654-655 
in cholesterol synthesis, 832^836, 832f 
decarboxylation of pyruvate to, 619, 619f 
in fatty acid synthesis, 805-806, 806f, 807f, 811, 
812, 922 

in glucose metabolism, 922, 923f, 927f, 928 
hepatic metabolism of, 915-916 
hydrolysis of, 505, 505s, 505t 
oxidation of, in citric acid cycle, 655, 656t 
oxidation of pyruvate to, 616-617, 6l6f 



in plant glue oncogenesis, 798 
production of, 813-814 
in citric acid cycle, 616-620, 616f, 619f, 631f 
by pyruvate dehydrogenase complex, 636f, 
650-651 
acetyl-CoA carboxylase 
in bacteria, 814 

in fatty acid synthesis, 805-806, 806f, 807f, 
814,814f 

in plants, 814 
acetyl-CoA-ACP transacetylase, 810f, 81 If 
acetyl-coenzyme A. See acetyl-CoA 
acetylene, 513s 

^-acetylgalactosamine, in glycosaminoglycans, 
249, 250f 

N-acetylglucosamine (GlcNAc), 240, 240f 

in bacterial cell walls, 249 

in glycosaminoglycans, 249, 250f 

in peptidoglycan synthesis, 796, 797f 
N-acetylglutamate, regulator of urea cycle, 685 
Af-acetylglutamate synthase, 685 
Af-acetylmuramic acid (Mur2Ac), 240, 240f 

in peptidoglycan synthesis, 796, 797f 
A^-acetylneuraminic acid (Neu5Ac), 240, 240s, 
258-259, 258t, 259s, 354s 

in gangliosides, 354, 354s 
acetylsalicylate (aspirin), 818 
acetylsalicylic acid (aspirin), 818s 
achiral molecules, 16f 

acid(s) See also specific acid, e g , acetic acid 
amino acids as, 81 
as buffers, 59-63, 61f 
definition of, 57 

dissociation constant (KJ of, 57f-59f, 58-59 
fatty See fatty acid(s) 
relative strength of. See pK & 
strong, 57-58 

titration curve for, 58-59, 58f, 59f 

Henderson-Hasselbalch equation for, 60-61, 80 
weak, 57-59 

acid anhydride, standard free-energy changes of, 493t 
acid dissociation constant (iQ, 57f-59f, 58-59, 

58-59 
acid-base catalysis 

general, 192, 193, 193f 

specific, 192-193 
acid-base pairs, conjugate, 57, 57f 

as buffer systems, 59-63 
acid-base titration, 58-59, 58f, 59f, 79-^81, 79f 
acidemia 

argininosuccinic, 694t 

methylmalonic, 694t, 700b 
acidic activation domain, 1142-1143 
acidic sugars, 240s 
acidosis, 57, 64b, 667 

diabetic, 930 
acivicin, 894, 894f 
aconitase, 622, 625b 
aconitase, moonlighting, 624 
czs-aconitate, 622, 622s 

isocitrate formation via, 622-623, 623f 
aconitate hydratase, 622 
ACP (acyl carrier protein), 81 Of 
acridine, 1033, 1033 
actin, 176-178, 178f 

in ATP hydrolysis, 176, 178-179, 178f 

in muscle contraction, 178-179, 178f 

structure of, 177-178 

in thin filaments, 177-178, 178-179, 178f 
actin filaments, 9, 9f 
a-actinin, 177 
actin-myosin complex 

ATP in, 509-510 

phosphorylation of, 472 



actin-myosin interactions, 178-179, 178f 

actinomycin D, 1033, 1033f 

action spectrum, for photosynthesis, 747 

activation barrier, 25, 25f 

activation energy (AG*), 25, 25f f 187 

of enzymatic reactions, 187 

in membrane transport, 390-391, 390f 

rate constant and, 188 
activators, 1117-1118, 1118f 
active site, 154, 186, 186f 

transition-state complementarity and, 
210b-211b,211f 
active transport, 391 

ATP in, 509-510, 730-731 

primary transporters in, 391 

secondary transporters in, 391 

transporters in, 391 
active transport, ATP and, 509-510, 730-731 
Actos, 808s, 824, 936 
acyclovir, 992 

acyl carrier protein (ACP), 808, 809f, 810f 
acyl phosphate, 536 
acyl-carnitine/carnitine transporter, 651 
acyl-CoA, fatty, 650 

conversion of fatty acids into, 650-651, 651f 
acyl-CoA acetyltransf erase, 654 
acyl-CoA dehydrogenase, 654, 715 

medium-chain, 661 
acyl-CoA synthetases, 650 

in triacylglycerol synthesis, 820, 820f 
acyl-CoA-cholesterol acyl transferase (ACAT), 
836, 841 

acyl-enzyme intermediate, in chymotrypsin 

mechanism, 205-207, 206f, 208f, 209 
AT-acylsphinganine, 829, 831f 
W-acylsphingosine, 829, 831f 
adaptor hypothesis, 1066, 1066f 
adaptor proteins, in signaling, 431, 446-449 
ADARs, 1074 

adenine, 9s, 272, 272t, 502s, 517s, See also 
purine bases 

deamination of, 289, 290f 

evolutionary significance of, 1057, 1057f 
adenine nucleotide translocase, 731 
adenine nucleotides 

biosynthesis of, regulatory mechanisms in, 
885, 886f 

cellular concentrations, 502t 

in metabolic regulation, 575-577 
adenosine, 273s 

anti form of, 280-281, 280f 

as enzyme cofactor, 297-298, 297f 

evolutionary significance of, 297-298 

methylation of, 292 

synform of, 280-281, 280f 
adenosine 2'-monophosphate, 274s 
adenosine 2',3'-cyclic monophosphate, 274s 
adenosine 3',5'-cyclic monophosphate. See cAMP 

(adenosine 3', 5 '-cyclic monophosphate) 
adenosine deaminase, 892 
adenosine deaminase deficiency, 893 

gene therapy for, 335-336 
adenosine diphosphate (ADP). See ADP 

(adenosine diphosphate) 
adenosine 3 '-monophosphate, 274s 
adenosine S'-monophosphate, 274s. See also AMP 

(adenosine monophosphate) 
adenosine monophosphate (AMP). See AMP 

(adenosine monophosphate) 
adenosine phosphoribosyltransferase, 893 
adenosine triphosphate (ATP) . See ATP 

(adenosine triphosphate) 
S-adenosylhomocysteine, 690 
adenosylmethionine, as mutagen, 291f 




S-adenosylmethionine, 689, 689s 

as mutagen, 292 

synthesis of, 691f 
adenylate, 272t, 273s 
adenylate kinase, 510, 576, 888 
adenylyl cyclase 

activation of, 424f, 431 

in ^-adrenergic pathway, 426 
adenylyl group, ATP and, 508 
adenylyl transfer, 508f 
adenylylation, 508 
adenylylation, enzyme, 223, 223f 
adenylyltransferase, 858 
adhesion receptors, in signaling, 422, 422f 
adipocytes, 346, 346f, 916, 916f 

NADPH synthesis in, 812, 812f 
adipokines, 930-931 
adiponectin, 934r-936, 935, 935f 
adipose tissue 

brown, 736, 917, 917f 
heat generated by, in oxidative phosphorylation 

regulation, 736, 736f 
mitochondria in, 736 

endocrine functions of, 909f, 916-917 

in fasting/starvation, 927f, 928f 

fatty acid release from, 821-822, 822f, 823, 
916-917 

fatty acid synthesis in, 916-917 See also fatty 
acid synthesis 

glucagon and, 926, 927t 

glyceroneogenesis in, 822-823, 822f, 823f 

leptin synthesis in, 930-931, 931f 

metabolic functions of, 909f, 916-917 

triacylglycerol mobilization in, 648f, 650f 

triacylglycerol recycling in, 822, 822f 

white, 916, 916f 
A-DNA, 281, 281f 
adoMet See adenosylmethionine 
ADP (adenosine diphosphate) 

ATP stabilized relative to, on Fi component, 726f 

in fatty acid synthesis, 810f, 811 

phosphoryl transfer to 
from 1,3-bisphosphoglycerate, 536-537 
from phosphoenolpyruvate, 538 

in photosynthesis, 783 

synthesis of, 24 
ADP-glucose, 791 

in glycogen synthesis, 791-792, 791f 

in starch synthesis, 791-792, 79 If 
ADP-glucose pyrophosphorylase, in starch 

synthesis, 793, 794f 
ADP-ribosylation, enzyme, 223, 223f 
adrenal, 909f 

adrenaline, See epinephrine 
^-adrenergic receptor 

desensitization of, 430^31, 430f 

in rafts, 449 

^-adrenergic receptor kinase Q3ARK), 430f, 431 
adrenergic receptors, 423-431 
adrenocortical hormones, 906t, 908 
adrenoleukodystrophy, X-linked, 662 
adsorption chromatography, 364-365, 364f 
AE (anion exchange) protein, 395 
Aequorea victoria, fluorescent proteins in, 
434b^36b 

aerobic metabolism, of small vertebrates, 548b 
aerobic organisms, evolution of, 32-33, 33f 
affinity, receptor-ligand, 419-420, 420f 
affinity chromatography, 87f, 88, 88t, 313-314 

tags in, 313-314, 313t 
African sleeping sickness, 880b-881b 
agar, 249 

agarose, 249, 249f, 251t 

agarose gels, in electrophoresis, 249 

aggrecan, 254 

aging, mitochondrial DNA damage and, 739 
agonists, receptor, 423 
Agre, Peter, 404, 404f 
agriculture See also plant (s) 

genetic engineering in, 330-334 
Agrobacterium tumefaciens, in plant cloning, 
330-332, 330f, 331f 



AIDS See human immunodeficiency virus infection 
AKAPs (A kinase anchoring proteins), 431-432 
alanine, 9s, 15f, 15s, 75, 75s, 692, 865 

degradation of to pyruvate, 692, 692f 

properties of, 73t, 75 

stereoisomers of, 72-73, 72f 

structure of, 15, 15f 

in transport of ammonia to liver, 681 
alanine aminotransferase (ALT), 681 

measurement of, 678b 
alanylglutamylglycyllysine, 83s 
Alberts, Alfred, 843b, 843f 
albinism, 694t 

albumin, serum, 649-650, 916 
alcohol(s), 513s, 556s 

fermentation of, 528, 530f, 547-549, 547f 

hemiacetals and, 238, 238f 

hemiketals and, 238, 238f 

in lipid extraction, 363-364, 364f 
alcohol dehydrogenase, 518t, 547, 664 

reaction mechanism of, 547f 
aldehyde (s), 13 

hemiacetals and, 238, 238f 

hemiketals and, 238, 238f 
aldehyde dehydrogenase, 664 
aldohexoses, 237, 237f 
aldol condensation, 497, 497s 
aldolase, 575t, 783 

in Calvin cycle, 779 

in glycolysis, 533 
aldolase reaction mechanism, class I, 534f 
aldonic acids, 240 
aldopentoses, 236 

structure of, 236f, 239, 239f 
aldoses, 236, 236f 

D isomers of, 237 

l isomers of, 237, 237f 

structure of, 237-238, 237f 
aldosterone, 359, 359s 

algae, cell walls of, heteropolysaccharides in, 

249, 249f 
alkaline phosphatase, 305t 
alkalosis, 57, 64b 
alkaptonuria, 694t, 698 
AlkB protein, in DNA repair, 1000, lOOOf 
alkene, 513s 

alkylating agents, as mutagens, 291f, 292 
allantoin, 892 
alleles, 168 

alligator, anaerobic metabolism and 

movement of, 548b 
allopurinol, 894 

xanthine oxidase inhibition by, 894, 894f 
allose, 237s 

allosteric effectors, 220 

allosteric enzymes, 220-222, 221 f, 222f 
conformational changes in, 220-221, 221f 
feedback inhibition by, 221-222, 221f 
kinetics of, 222, 222f 

allosteric modulators, 220 

allosteric proteins, 162 
ligand binding of, 162, 162f See also protein- 
ligand interactions 

allosteric regulation 
of acetyl-CoA production, 635-636, 636f 
of amino acid biosynthesis, 870f, 872-873, 872f 
of aspartate transcarbamoylase, 887, 887f 
of carbohydrate metabolism, 606-608, 608f 

lipid metabolic integration with, 608 
of fat metabolism, 608 
of glutamine synthetase, 857-858 
of glycogen phosphorylase, 603-604, 603f 
of phosphofructokinase-1, 585-587, 586f 
of pyruvate kinase, 579-580, 589f 

all- trans -retinal, 463 

Alper, Tikvah, 147 

a cells, pancreatic, 924, 924f 

a chains, in collagen, 125, 125f 

a helix, 117-120, 118f, 120f, 121-122, 
122f, 123t 
in amino acids, 119, 119t 
in keratin, 123-124, 124f 



in membrane proteins, 376, 377f, 378 

in myoglobin, 130 

in polysaccharides, 248, 249f 

in protein folding, 135, 148 

in small globular proteins, 131 1 
a helix-type channels, 391, 392t 
a oxidation, 664^665 See also oxidation 

in peroxisomes, 664-665, 665f 
a-actin, 1764 
a-amanitin, 1033 

a-amino groups, transfer of to a-ketoglutarate, 677, 

677f, 679f 
aifi barrel, 135-136, 136f 
a-A-glucose 1-phosphate, 391, 595-596, 595f 
a-oxoglutarate See a-ketoglutarate 
ALT See alanine aminotransferase (ALT) 
Altman, Sidney, 1048 
altrose, 237s 
Alzheimer's disease 

amyloid deposits in, 147b 

apolipoprotein in, 839 
Ames, Bruce, 993 
Ames test, 993, 993f 

amide, standard free-energy changes of, 493t 
amines, 13 

as products of amino acid decarboxylation, 

878-879, 879f, 880f 
amino acid(s), 9f, 72-81. See also protein (s); 

specific amino acids 
in a helix, 117-119, 118f, 119t, 122f 
abbreviations for, 73t 
acid-base properties of, 81 
activation of, in protein synthesis, 1081-1084, 

1082f-1084f 
addition of, in protein synthesis. See protein 

synthesis 
ammonium assimilation into, 857 
as ampholytes, 79 

aromatic, as precursors of plant substances, 

878, 878f 
in /3 sheet, 121, 12 If, 122f 
in 0 turns, 121, 121f 
biosynthesis of, 860-873, 861f 

a-ketoglutarate in, 861-864, 862f, 863f 

allosteric regulation of, 870f, 872-873, 872f 

chorismate in, 865-869, 868f-870f 

histidine in, 869-872, 871f 

interlocking regulatory mechanisms in, 
872-873, 872f 

metabolic precursors in, 861, 861t 

oxaloacetate and pyruvate in, 865 

3-phosphoglycerate in, 863-864, 863f, 864f 

reactions in, 859, 861f 
branched-chain, not degraded in liver, 701, 701f 
as buffers, 80, 80f 
carbon designations for, 73-74 
codons for, 1066-1074 See also codons 
conversion of 

to glucose and ketone bodies, 688-689, 688f 

to a-ketoglutarate, 698-699, 698f 

to succinyl-CoA, 699-700, 699f 
degradation of 

to acetyl-CoA, 695-696, 695f, 696f 

to pyruvate, 692-694, 692f, 693f, 694t 
discovery of, 73 
electric charge of, 80-81 
essential, 686, 686t, 860 

biosynthesis of, 865, 866f-^867f 
in expanded genetic code, 1085b-1087b 
found in bacteria, 877-878 
in general acid-base catalysis, 193, 193f 
glucogenic, 557, 557t, 688 
hepatic metabolism of, 914-915, 915f 
ketogenic, 688 
light absorption by, 76b, 76f 
molecular weight of, 84 
molecules derived from, 873-882 
nonessential, 686t, 860 

biosynthesis of, 865 
number of, 83t, 84, 84t 
in peptide synthesis, 100-102, lOlf 
pK a of, 79-80, 79f-81f 
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in plant glue oncogenesis, 798 
polarity of, 75 

polymers of See peptide(s); protein(s) 
as precursors of creatine and glutathione, 

876-877, 877f 
R groups of, 73, 73t, 74-77 
relative amounts of, 84, 84t 
stereoisomerism in, 72-73, 72f 
structure of, 72-74, 72f, 72s, 79f 
symbols for, 73t 

titration curves of, 79-81, 79f, 8lf 
uncommon, 77 

zwitterionic form of, 79-80, 79f 
amino acid arm, of tRNA, 1081 
amino acid catabolism, 673-682, 675f 

acetyl-CoA in, 695-696, 695f, 696f 

a-ketoglutarate in, 698f 

ammonia in, 681-682 
alanine transport of, 681, 681f 
glutamate release of, 677-680, 680f 
glutamine transport of, 680-681, 680f 

asparagine in, 701-702, 702f 

aspartate in, 701-702, 702f 

enzyme cofactors in, 689-692, 689f, 
690f, 691f 

enzyme degradation of protein in, 674-677, 693f 
genetic defects in, 694t, 696-698, 697f 
glucose and ketone bodies in, 688-689, 688f 
a-ketoglutarate in, 698-699 
overview of, 687-688, 688f 
oxaloacetate in, 701-702, 702f 
pathways of, 673-674, 674f, 687-702, 913t 
pyruvate in, 692-694, 692f, 693f, 694t 
succinyl-CoA in, 699-700, 699f 
transfer of a-amino groups to a-ketoglutarate in, 
677, 677f, 679f 

amino acid metabolism, 570f, 913t 

amino acid oxidation, 673-702 
fates of amino groups in, 673-682 
nitrogen excretion and urea cycle in, 
682-687 

pathways of degradation in, 687-702 
amino acid residues, 72, 77, 78s 
ammo-terminal, 82 
carboxyl-terminal, 82 
in consensus sequences, 225-226, 226t 
farnesylation of, 1097, 1097f 
isoprenylation of, 1097, 1097f 
number of, 83, 83t, 84t 
phosphorylation of, 223f, 224-226 
sequence of, 92-93, 92f 
amino acid sequences, 28f, 29, 92-93, 92f, 102 
a helix and, 117-119, 118f, 120f, 121-122 
determination of, 93-100 See also amino 

acid sequencing 
evolutionary significance of, 102-106 
in genetic code, 1067-1068 See also codons 
homologous, 104-105, 104f, 105f 
hydrophobicity of, 73t, 378 
identification of, in probe design, 311, 31 If 
membrane protein topology and, 376f-379f, 

378-379 

nucleic acid sequences and, 97f, 98-100,948, 948f 
protein function and, 93 
protein structure and, 102 
signature, 105, 105f 
amino acid sequencing, 93-100 
bond cleavage in, 95-96, 96f, 96t 
bond location in, 97-98 
computerized, 104-106 
DNA sequencing and, 97f, 98-100 
Edman degradation in, 95, 94f 
in evolutionary analysis, 102-106, 104f-106f 
historical perspective on, 93-94 
for homologs, 104-105, 104f, 105f 
for large polypeptides, 95-96 
mass spectrometry in, 98-100, 98b-100b, 

99f-100f, lOOf 
peptide cleavage in, 95-96, 96f, 96t, 97f 
reagents for, 94-95, 95s 
for small polypeptides, 94-95 
steps in, 94f 



amino acid substitutions, 104 

in homologs, 104, 105f 
amino groups, transfer of, to a-ketoglutarate, 677, 

677f, 679f 
amino sugars, 240s 

aminoacyl (A) binding site, ribosomal, 1089 
aminoacyl-tRNA, 1066 

binding of, 1091, 1092f 
sites of, 1089, 1089f 

formation of, 1081-1084, 1082f-1084f 
aminoacyl-tRNA synthetases, 1066 

in protein synthesis, 1081-1084, 1082f-1084f 

reaction mechanism of, 1082f 
y-aminobutyric acid (GABA), receptor for, as ion 

channel, 410 
6-aminolevulinate, biosynthesis of, 874f 
aminopeptidase, 676 
aminopterin, 894-895 
amino-terminal residues, 82 

protein half-life and, 1107, 1109t 
aminotransferase, 677. See also transaminase 

PLP as prosthetic group of, 677, 695f 
ammonia 

in amino acid catabolism 
alanine transport of, 681, 681f 
glutamate release of, 677-680, 680f 
glutamine transport of, 680-681, 680f 

from amino acid metabolism, 914-915 

in nitrogen metabolism, 857 

reduction of nitrogen to, 852-854 

solubility of in water, 47, 47t 

toxicity of, 681-682 

urea production by, enzymatic steps in, 682-684, 
683f , 684f 

ammonium cyanide, adenine synthesis from, 

1056-1057, 1057f 
ammonotelic species, disposal of, 682 
amoxicillin, 218 

AMP (adenosine monophosphate) 

allosteric enzyme modification by, 227 

concentration of, 575-576 

relative changes in, 576t 

variant forms of, 274, 274s 
AMP-dependent protein kinase (AMPK), 576, 576f 

adiponectin and, 934-936, 935, 935f 

leptin and, 934, 934 
amphibolic pathway, 631 
amphipathic compounds, 46t 

solubility of in water, 46t, 48-49 
amphitropic proteins, 375 
ampholytes, amino acids as, 79 
amphoteric substances, amino acids as, 79 
AMPK See AMP-dependent protein kinase (ADPK) 
amplification, signal, 420, 420f, 429 
amylase, 246, 543 

amylo (l->4) to (l->6) transglycosylase, 

601, 601f 
amyloid, 145 
amyloidoses, 145-147 
amylopectin, 245-246, 245f, 251t, 792, 

See also starch 
amyloplast, 774-775, 775f 
amylose, 245-246, 245f, 248, 248f, 251t 
See also starch 

structure of, 245f, 248, 248f, 249f 
anabolic pathways, 773, 913t 

energy carriers in, 487f 

in glycogen metabolism, 595-596 
anabolism, 25, 26f, 487, 487f 

citric acid cycle in, 631, 632f 
anaerobic bacteria, incomplete citric acid 

cycle in, 631, 631f 
anaerobic metabolism, of coelacanths, 548b 
analytes, 98 
anammox, 852, 852f 
anammoxosome, 854, 854f 
anandamide, 442s 

anaplerotic reaction, in citric acid cycle, 631-633, 

632f, 632t 
Andersen's disease, 599t 
androgens, 908 
synthesis of, 844-845, 844f 



anemia 
megaloblastic, 691 

pernicious, role of vitamin B 12 , 659, 691 
Anfmsen, Christian, 141 
angina pectoris, nitro vasodilators for, 446 
angstrom (A), 117 

anion exchanger, 90. See also ion-exchange 

chromatography 
anion-exchange (AE) protein, 395 
ankyrin, 384f 

annealing, of DNA, 287-288, 287f 
annotated genome, 34 
annular lipids, 376, 377f 
anomeric carbon, 239 
anomers, 239 
anorexigenic neurons, 933 
antagonists, receptor, 423 

antenna chlorophylls, in light-driven electron flow, 

752, 753f, 754-755 
antenna molecule, 747-748 
antennapedia, 1151 
anterior pituitary, 909, 909f 
antibiotics 

mechanism of action of, 216-219, 404, 404f, 896, 
960b-961b 

protein glycosylation inhibition by, 1102 

resistance to 
ABC transporters and, 400 
plasmids in, 949 

topoisomerase inhibitor, 960b-961b 

transcription inhibition by, 1033 

translation inhibition by, 1098-1099 
antibodies, 170. See also immunoglobulin (s) 

in analytic techniques, 173-174, 174f 

antigens and, 170-175, 171f-173f. See also 
antigen-antibody interactions 

binding sites on, 171-173, 172f 

binding specificity of, 170, 173, 173f 

catalytic, 211 

diversity of, 170 
recombination and, 1014 

monoclonal, 173 

polyclonal, 173 
anticodon(s), 1070-1072, 1071f 

wobble base of, 1072 
anticodon arm, of tRNA, 1081 
antidiuretic hormone (ADH), 911s 
antigen(s), 171 

epitope of, 171 

T-cell binding of, 170 
antigen-antibody interactions, 170-175, 
171f-173f 

binding affinity in, 173, 173f 

binding sites for, 171-173, 172f 

conformational changes in, 173, 173f 

haptens in, 171 

induced fit in, 154, 173, 173f 

specificity of , 170, 173 

strength of, 173 
antigenic determinant, 171 
antigenic variation, 1135t 
antiporter(s),395,395f 

Na + K + -ATPase, 398-399, 398f, 403, 403f 
in membrane polarization, 450, 450f 
in membrane transport, in neurons, 920 
in neurons, 920 
in retina, 463 

for triose phosphates, 783-784, 784f 
antiviral agents, mechanism of action of, 

218-219 
AP endonucleases, 997 
AP site, in base-excision repair, 996, 997, 997f 
Apaf-1 (apoptosis protease activating factor-1), 738 
APC mutation, 477, 477f 
apo-aconitase, 625b 
apoB-48, gene for, RNA editing in, 1074 
apoB-100, 837f, 838, 840-841 

gene for, RNA editing in, 1074 
APOBEC, 1074 

APOE, in Alzheimer's disease, 839b 
apoE, LDL receptor binding to, 841 
apoenzyme, 184 
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apolipoproteins, 649, 836-837, 837f 

in Alzheimer's disease, 839b 

gene for, RNA editing in, 1074 
apoprotein, 184 
apoptosis, 477^78, 478f, 737 

cytochrome P-450 in, 737-738, 737f 

mitochondria in, 737-738, 737f 
apoptosis protease activating factor-1 (Apaf-1), 738 
apoptosome, 738 

appetite, hormonal control of, 930-937 
App(NH)p (j8,y-imidoadenosine 5 '-triphosphate), 

727s 
aptamers, 1058b 

aquaporins (AQPs), 404^406, 405f, 406t 

classification of, 406t 

distribution of, 406t 

microscopy of, 385b 

permeability of, 406t 
aqueous environments, adaptation to, 65-66 
aqueous solutions See also water 

amphipathic compounds in, 46t, 48-49 

buffered, 59-63, 61f 

colligative properties of, 51, 51f 

hydrophilic compounds in, 46^8, 47f 

hydrophobic compounds in, 10, 46, 46-49, 47f, 48f 

hypertonic, 52, 52f 

hypotonic, 52, 52f 

ionic interactions in, 46-47 

isotonic, 52, 52f 

osmolality of, 52, 52f 

pH of, 55-57, 56f, 56t 

weak acids/bases in, 57-63 

weak interactions in, 43-54 See also water 
Arabidopsis thaliana 

aquaporins in, 404 

cellulose synthesis in, 795-796, 795f 

signaling in, 458-460, 459t 
arabinose, 237s, 238s 
arachidic acid, 344t 
arachidonate, 815f, 817, 818f See also 

arachidonic acid 
arachidonic acid, 344t, 358, 358s See also 

arachidonate 
Arber, Werner, 304 
archaea, 4^5, 4f, 5f 

membrane lipids of, 352, 353f 
arginase, 684 

arginine, 75s, 77, 699, 861 

biosynthesis of, 861-863, 862f 

conversion of, to a-ketoglutarate, 698-699, 698f 

in nitric oxide biosynthesis, 882, 882f 

properties of, 73t, 77 
argininemia, 694t 
argininosuccinate, 684 
argininosuccinate synthetase, 684 

reaction mechanism of, 684f 
argininosuccinic acidemia, 694t 
Arnon, Daniel, 759, 759f 
aromatic amino acids. See also amino acid(s) 

as precursors of plant substances, 878, 878f 
arrestin 1 , 465 
arrestin 2, 430^31,431 
j8-arrestin (jSarr), 430f, 431 
ARS (autonomously replicating sequences), 991 
artificial chromosomes, 310f, 953 

bacterial, 309, 310 

human, 953 

yeast, 310, 310f, 953 
ascorbate, 127s 

ascorbic acid See vitamin C (ascorbic acid) 
asparaginase, 701 
asparagine, 75s, 77, 701, 865 

degradation of, to oxaloacetate, 701-702, 702f 

properties of, 73t, 77 
aspartame, 83s 

stereoisomers of, 19f 
aspartate, 9s, 75s, 77, 702, 865 

in C 4 pathway, 789-790, 789f 

degradation of, to oxaloacetate, 701-702, 702f 

properties of, 73t, 77 

pyrirnidine nucleotide synthesis from, 
886-887, 886f 



aspartate aminotransferase (AST), 678b 
aspartate transcarbamoylase, 220-221, 221f, 887 
aspartate-argininosuccinate shunt, 684 
Aspergillus niger, citric acid production by, 633b 
aspirin, 818, 818s. See also acetylsalicylate; 

acetylsalicylic acid 
association constant (/Q, 156 

in Scatchard analysis, 421b 
Astbury, William, 117 
asymmetry, molecular, 16, 16f 
atherosclerosis, 842-843 

trans fatty acids and, 348 
atom(s) 

electronegativity of, 208 

hydrogen, electron transfer and, 514 
atomic force microscopy, 385b 
atomic mass unit, 14b 
ATP (adenosine triphosphate), 23s 

in active transport, 509-510 

0 oxidation yielding, 654-655, 656t 

in Calvin cycle, 782-783, 782f-785f 

cAMP synthesis from, 426^27, 428f, 429f 

concentration of, 575 

energy by group transfer and, 506-507, 506f, 507f 
in glycolysis, 528-533 

balance sheet for, 538-539 

payoff phase, 535-538, 536f, 537f 

preparatory phase, 531-535 

in preparatory phase, 534f, 535f 
in heart muscle, 919 
hydrolysis of See ATP hydrolysis 
in hypoxia, 733 

inhibition of pyruvate kinase by, 579-580, 589f 
magnesium complexes of, 502, 502f 
in metabolism, 25, 26f 

in muscle contraction, 509-510, 918-919, 919f 
nucleophilic displacement reaction of, 508, 508f 
in nitrogen fixation, 856 
phosphoryl group transfers and, 501-511 
in photosynthesis, 782-786 

synthesis of, 782-783, 784, 785f 

transport of, 783-784, 784f 
in proteolysis, 1106f, 1108f 
regulation of, oxidative phosphorylation in, 

732-735, 734f, 735f 
relative changes in, 576t 
in replication initiation, 985-986, 985f 
supply of, 575 

synthesis. See ATP synthesis 
yield of, from oxidation of glucose, 733t 
ATP hydrolysis, 296-297, 296f 
actinin, 178-179, 178f 
biochemical equation for, 511 
chemical equation for, 511 
equilibrium constant for, 495 
free energy of, within cells, 503 
free-energy change associated with, chemical 

basis for, 502f 
free-energy change for, 501-503, 502f, 504f 
during ischemia, inhibitory proteins in, 733, 733f 
in membrane transport, 397^00, 398f 
in muscle contraction, 178-179, 178f, 918-919 
myosin in, 178-179, 178f 
as two-step process, 506, 506f 
ATP synthase(s), 723, 725 
jS subunits of, conformations of, 725f-726f, 

726-728 
binding-change model for, 729f 
of chloroplasts, 760-761, 760f 
functional domains of, 725 
in membrane transport, 399 
ATP synthasome, 731 
ATP synthesis, 723-732 
ATP synthase in 
conformations of (3 subunits of, 725f-726f, 

726-728 
functional domains of, 725 
binding-change mechanism for, rotational 

catalysis in, 728-729, 729f, 730f 
chemiosmotic theory of, 707, 707-708, 723, 

723-724, 723f 
coupling of electron transfer and, 723-724, 723f 



cytosolic NADH oxidation in, shuttle systems 
conveyance of, 731-732, 731f, 732f 

equation for, 723 

halophilic bacteria in, 762-764 

0 2 consumption and, nonintegral stoichiometrics 
of, 729-730 

by photophosphorylation, 759-761, 759f, 760f 
proton gradient in, 724, 725f, 726, 726f 
proton-motive force and active transport in, 

730-731, 730f 
stabilization of, 725-726, 726f 
standard free-energy change for, 725-726 
stoichiometry of, 630t 
ATPase(s) 
AAA+,985 

F-type, 399, 399f , See also ATP synthase(s) 
in membrane transport, 397-399, 397f-399f, 

403, 403f 
Na + K + -, 398-399, 398f, 403, 403f 
in membrane polarization, 450, 450f 
in membrane transport, in neurons, 920 
in retina, 463 
P-type, 396-397 
V-type, 399 
ATP-binding casette (ABC) transporter, 400, 

400f, 840 
ATP-gated K + channels, 924 
ATP-producing pathways, regulation of, 734, 735f 
atrial natriuretic factor (ANF), 444 
attractants, 457 
autocrine hormones, 906 
autonomously replicating sequences (ARSs), 991 
autophosphorylation, 439. See also 
phosphorylation 
of insulin-receptor tyrosine kinase, 439, 440f 
in receptor enzyme activation, 439, 440f 
in signaling, 439 
in bacteria, 457^58, 458f 
in plants, 460, 460f 
autotrophs, 4f, 5 
auxins, 331f, 459s, 878 
Avandia, 808s, 824, 936 
Avery, Oswald T., 278 
Avery-MacLeod-McCarty experiment, 278 
avian sarcoma virus, 1052, 1052f 
avidin, 633-634 
Avogadro's number (N), 491t 
azaserine, 894, 894f 

Azotobacter vinelandii, nitrogen fixation by, 856 
AZT, 1053b 

b 

BACs (bacterial artificial chromosomes), 309, 309f 
bacteria, 4-5, 4f See also Escherichia coli 
amino acids found in, 877-878 
anaerobic, incomplete citric acid cycle in, 

631,631f 
antibiotic-resistant, 217-218, 218f 

ABC transporters and, 400 

plasmids in, 949 
cell structure of, 5-7, 5f 
cell walls of 

heteropolysaccharides in, 249 

synthesis of, 797f 
cellulose synthesis in, 796 
DNA replication in, 977-991 
endosymbiotic, 33, 34f 

chloroplasts evolved from, 761 

in eukaryotic evolution, 33, 34f 

mitochondria evolved from, 739, 762f 
evolution of, 32-33, 33f, 761-764 
fatty acid synthesis in, 806-811, 815 
gene regulation in, 1126-1136 
genes in, 952 
genetic map of, 976f 
glycogen synthesis in, 792 
gram-negative, 5f 
gram-positive, 5f 

green sulfur, photochemical reaction center 

in, 749, 750f 
halophilic, in ATP synthesis, 762-764, 763f 
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lectins and, 260, 260f, 262f 
lipopolysaccharides of, 256-257, 257f 
nitrogen-fixing, symbiotic relationship with 

leguminous plants, 856-857, 856f 
nucleoids in, 970, 970f 
peptidoglycan synthesis in, 796, 797f 
photochemical reaction centers in, 749-751, 

750f, 751f 
potassium channel in, 408-409, 408f 
purple 

photochemical reaction center in, 749-751, 

750f, 751f 
photoreaction center of, 75 If 
photosynthetic reaction center of, 376, 377f 
signaling in, 457-458, 458f, 459t 
structure of, 5-7, 5f 
bacterial artificial chromosomes (BACs), 309, 309f 
bacterial DNA, 949, 950f, 950t 
packaging of, 970 
topoisomerases and, 959 
bacterial genes, 952 
mapping of, 976f 
naming conventions for, 976 
bacterial genome, 949, 950t 
sequencing of, 322, 323f 
bacteriophage A vector, 305t, 307, 308, 309f, 

1012, 1012f 
bacteriorhodopsin, 376, 762 
light-driven proton pumping by, 762-764, 763f 
microscopy of, 385b 
structure of, 376, 377f 
ball-and-stick model, 15, 15f 
Ballard, John, 822 
Baltimore, David, 1050, 1051f 
Banting, Frederick G , 903b 
basal transcription factors, 1139t 
base(s) 
amino acids as, 81 
in buffer systems, 59-63, 62f 
nucleotide/nucleic acid, 271-277, 272f-274f, 

272s-274s, 272t, 273s, 274s 
alkylated, repair of, 999, lOOOf 
anti form of, 280-281, 280f 
Chargaff s rules for, 278 
chemical properties of, 276-277 
in codons, 1067-1068. See also codons 
deamination of, 289-290, 290f 
in DNA, 276-277, 277f, 278-279, 279f, 280f 
estimation of via denaturation, 287-288, 287f 
functional groups of, 276-277 
in genetic code, 1067-1068. See also genetic code 
hydrogen bonds of, 277, 277f, 278-279, 279f, 

280f, 285, 286f 
methylation of, 292 
minor, 274, 274f 
nitrous acid and, 291, 291f 
pairing of, 277, 277f, 278-280, 279f, 280f See 

also base pairs/pairing 
in replication, 979f, 980f 
in RNA, 276, 284, 285, 286f 
structure of, 9s, 271-274, 272-274, 272f-274f, 

272t, 276-277 
syn form of, 280-281, 280f 
tautomeric forms of, 276, 276f 
variant forms of, 276, 280-281, 280f 
weak interactions of, 277, 277f, 279, 280f 
wobble, 1072 
relative strength of See pK a 
weak, 57 
base pairs/pairing, 277 
in DNA, 278-280, 279f See also DNA structure 

in replication, 980, 981f 
in DNA-protein binding, 1121-1122, 1121f 
in RNA, 284-285, 285f, 286f 
wobble in, 1072 
base stacking, 276-277 
in DNA, 278-279, 279f 
in replication, 979f, 980f 
in RNA, 276,284 
base-excision repair, 994t, 996-997, 997f 
basic helix-loop-helix, 1124, 1125 
B-DNA, 281, 281f 



Beadle, George W., 624b, 947, 948f 
beads-on-a-string formation, 963f, 964, 964f 
beer brewing, fermentation in, 549 
beeswax, 349, 349f 
benzoate, 687s 

for hyperammonemia, 686 
benzoyl-CoA, 686, 687s 
Berg, Paul, 303, 303f 
Bergstrom, Sune, 358f, 359 
Berson, Solomon, 902 
Best, Charles, 903b 

/3 adrenergic response, in signaling, termination 

of, 430 
)3 barrel, 131f, 379, 379f 

in membrane transport, 379, 379f 
j3 cells, 170, 170t, 921 

pancreatic, 923-924, 924f 

recombination in, 1015-1016, 1016f 
jS conformation, 120, 120f-123f 

in small globular proteins, 131t 

structural correlates of, 123t 
j3 conformation sheet See j8 sheet 
/3 oxidation, 647-649 See also oxidation 

in bears, 655b 

enzymes of, 663-664, 663f 

in peroxisomes, 662-663, 662f 

plants and, 662-663, 663f 

steps in, 653-655, 653f 

yielding acetyl-CoA and ATP, 653f, 654-655 
j8 sheet, 120-121, I20f-123f 

in large globular proteins, 135, 135f 

in protein folding, 135-136, 135f, 148 

structural correlates of, 123t 

twisted, 135, 135f 
j8 sliding clamp, 983-984, 984f, 988, 988f 
j8 subunits, of ATP synthase, different 

conformations of, 725f-726f, 726-728 
/Hum, 121, 122f, 123, 123t 
j3-a-j8 loop, 131, 131f, 136f 
^-adrenergic receptor, 423^431 

structure of, 424, 424f 
/3-adrenergic receptor kinase Q3ARK), 431 
^-adrenergic signaling pathway, 423^431, 424f, 

428f-430f 
jS-carotene, 361, 361s, 745s, 747 
/3-lactam antibiotics, resistance to, 217-218, 218f 
jS-lactamases, 217-218, 218f 
bicarbonate 

as buffer, 57f, 59-63,61-63 

formation of, 165-166 
bicoid, 1148-1149 
bilayer, lipid, 372, 374, 374f 

formation of, 374, 374f 

gel phase of, 381 

liquid-disordered (fluid) state of, 381 

liquid-ordered state of, 381 
bile acids, 357, 357s, 836 
bile pigment, heme as source of, 875-876, 876f 
bilirubin, 875 

breakdown products of, 876f 
binary switches, G protein(s) as, 425b^27b 
binding, cooperativity in, 160-165, 161f, 162f 
binding energy (4G B ), 189-192 

in enzymatic reactions, 189-192 

enzyme specificity and, 191-192 
binding sites, 153, 162 See also specific types 

antibody, 171-173, 172f 

characteristics of, 170 
binding specificity, antibody, 170-171, 173, 173f 
biochemical reactions 

bond cleavage in, 496-497 

common types of, 495-501 

electrophilic, 496, 496f 

eliminations in, 497-498 

free-radical, 498 

group transfer, 499-500, 499f 

internal rearrangements in, 497^198 

isomerization in, 497^498 

nucleophilic, 496, 496f 

oxidation-reduction See oxidation-reduction 
reactions 

vs chemical reactions, 500-501 



biochemical standard free-energy change {AG' 0 }, 

186, 187f 
biochemistry 

fundamental principles of, 2 

whole cell, 98-100 
bioenergetics, 22-24, 490-495 See also energy 

oxidation-reduction reactions and, 
512-521 

phosphorylation and, 501-511 

thermodynamics and, 490-495 
bioinformatics, 103 
biologic waxes, 348, 349, 349f 
biological energy transformation, in thermodynamics, 

490^91, 491t 
bioluminescence cycle, of firefly, 509b 
biomolecules, 1. See also under molecular 

amphipathic, 46t 

asymmetric, 16, 16f 

average behavior of, 28 

chirality of, 16, 16f 

conformation of, 18, 18f 

derived from amino acids, 873-882 

functional groups of, 12f, 13, 13f 

interactions between, 10-12 

light absorption by, 76b, 76f 

macromolecules, 14 

molecular mass of, 14b 

molecular weight of, 14 

nonpolar, 45, 46t 

origin of, 30-31, 31f See also evolution 
polar, 45, 46t 
size of, 10-11, lOf 
small, 10-11 

stereospecificity of , 15, 18, 18f, 19f 

in supramolecular complexes, 29 
bioorganisms 

classification of, 4-5, 4f, 5f 

distinguishing features of, 1-2, 2f 
biosignaling See signaling 
biotin, 554, 633 

deficiency of, 633-634 

in acetyl-CoA carboxylase reaction, 806f 

in phosphoenolpyruvate synthesis from pyruvate, 
554, 554f 

in pyruvate carboxylase reaction, 554, 554f, 633, 
634f, 635f 
Bishop, Michael, 1052 
1,3-bisphosphoglycerate (BPG), 504s, 535 
hydrolysis of, 504, 540s 
phosphoryl transfer from, to ADP, 

536-537 
synthesis of, 728f, 778-780, 779f, 
782-783 

2,3-bisphosphoglycerate (BPG), in hemoglobin- 
oxygen binding, 167, 168f 
bisulfites, as mutagens, 291 
Blobel, Giinter, 1100, llOOf 
Bloch, Konrad, 835, 835f 
blood 

buffering of, 57f, 61-63 
composition of, 921, 921f 
electrolytes of, 921 
glutamine transport of ammonia in, 

680-681, 680f 
metabolic functions of, 920-922 
normal volume of, 921 
pH of, 62-63 

transport functions of, 920-921 
blood clotting 

integrins in, 456 

proteolysis in, 227 
blood groups, sphingolipids in, 354, 355f 
blood plasma, 921, 921f 
blue fluorescent protein (BFP), 434b-436b 
blunt ends, 306, 307 
boat conformation, 239f 
body mass. See also fat, body 

regulation of, 821-822, 930-937 
leptin in, 930-931, 931f-934f 
body mass index, 930 
Bohr, Christian, 166 
Bohr effect, 166, 166 
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Bohr, Niels, 166 
boiling point 

of common solvents, 44t 

of water, 43, 44t 
Boltzmann constant (k), 491t 
bond(s). See also weak interactions 

carbon, 11-13, 12f 

carbon-carbon See carbon-carbon bond 
bond(s) 

carbon-hydrogen, cleavage of, 496-497, 496f 
covalent, 10 
in enzymatic reactions, 189 
heterolytic cleavage of, 496, 496f 
homolytic cleavage of, 496, 496f 
of phosphorus, 499-500, 499f 
disruption of 
in amino acid sequencing, 95, 96f 
energy for, 114 
glycosidic, 243 
phosphorolysis vs hydrolysis reactions of, 
595-596 
hydrogen See hydrogen bonds 
JV-jS-glycosyl, 272 

hydrolysis of, 290, 290f 
noncovalent, 10, 50-51 See also weak 

interactions 
O-glycosidic, 243 
peptide See peptide bonds 
phosphate, high-energy, 506 
phosphorus-oxygen, 499-500, 499f 
bond dissociation energy, 44 
bovine F r ATPase, structure of, 733, 733f 
bovine spongiform encephalopathy, 147-148 
Boyer, Herbert, 303, 303f 
Boyer, Paul, 728, 729f 

BPG (2,3-bisphosphoglycerate), in hemoglobin- 
oxygen binding, 167, 168f 
brain 

glucose supply for, 926-928, 927f 

metabolism in, 920, 920f 
branch migration, 1005-1006, 1006f, 1008f 
brassinolide, 360s, 459s 
brassinosteroids, 459, 459s, 460 
BRCA1/S, breast cancer and, 1003b 
breast cancer, 1003b 

brown adipose tissue (BAT), 736, 917, 917f 

heat generated by, in oxidative phosphorylation 
regulation, 736, 736f 

mitochondria in, 736 
Brown, Michael, 841, 841f 
Buchner, Eduard, 184, 184f, 528, 577 
buffers, 59-63, 59-66, 61f 
bulges, in RNA, 285, 285f 
bungarotoxin, 411-412 
trans-A 2 -butenoyl-ACP, 809, 81 Of, 811f 
butyryl-ACP, 809, 81 Of, 811f 
bypass reaction, in gluconeogenesis 

fructose 1,6-bisphosphate to fructose 
6-phosphate conversion as, 553t, 556 

glucose 6-phosphate to glucose conversion as, 
553t, 556 

pyruvate to phosphoenolpyruvate conversion as, 
553-557, 553t, 554f, 555f 

C 

C. See carbon entries 
C 2 cycle, 789 
C 4 pathway, 789 
C 3 plants, 776 

photosynthesis in, 776, 789, 790 
C 4 plants, photosynthesis in, 789-790 
Ca 2+ channels 

defective, diseases caused by, 412 1 

in glucose metabolism, 924, 924f 

in signaling, 451-452, 451f, 453-454 
Ca 2+ concentration, in cytosol vs. extracellular 
fluid, 45 It 

oscillation of, 437, 438f 
Ca 2+ ion channel, 406, 409-410 
Ca 2+ pump, 397-398, 397f 

Ca 2+ /calmodulin-dependent protein kinases, 437, 437f 



Cairns, John, 978, 982 

calcitonin, synthesis of, 1040, 1041f 

calcitonin gene, alternative processing of, 1040, 1041f 

calcitonin gene-related peptide, synthesis of, 

1040, 1041f 
calcitriol, 360, 360s, 906t, 908 
calcium 

blood levels of, 921 

IP 3 and, 432^33, 433f, 447 

in muscle contraction, 178 

regulation of, 908 

in vision, 464, 464f, 465 
calmodulin (CaM), 436, 437t 
calnexin, 261 
Calvin cycle, 774 

ATP in, 782-786, 782f-785f 

1,3-bisphosphoglycerate in, 778, 779f 

in C 4 plants, 789-790, 789f 

in CAM plants, 790 

carbon-fixation reaction in, 775-778, 775f 

2- carboxyarabinitol 1 -phosphate in, 778 
glyceraldehyde 3-phosphate synthesis in, 773, 

776f, 777f, 778 
NADPH in, 773, 775f, 778, 782-786, 782f-785f 

3- phosphoglycerate in 

conversion of to glyceraldehyde 3-phosphate, 

775, 778 
synthesis of, 773, 776-778 
rubisco in, 776-778, 776f-778f 

regeneration of, 778-780, 780f, 781f 
stoichiometry of, 782f 
triose phosphates in 
1 ,5-bis phosphate regeneration from, 778-780 
regeneration of rubisco from, 778-780, 780f, 781f 
synthesis of, 779f 
Calvin, Melvin, 774, 774f 
CaM kinases, 437 
CAM plants, 790 

cAMP (adenosine 3',5'-cyclic monophosphate), 
274, 298, 298s 
degradation of, by cyclic nucleotide phosphodi- 
esterase, 424f, 430 
hormonal regulation of, 431^32 
measurement of, by FRET, 434b^36b 
in protein kinase A activation, 424f, 428, 428f 
as second messenger, 298, 424f, 431-432, 431t 
in /3-adrenergic pathway, 426-127, 428f, 430f 
structure of, 298s, 424f 
synthesis of, adenylyl cyclase in, 426^27, 
428f, 429f 
cAMP receptor protein (CRP), 1029 

in gene regulation, 1126-1127 
cAMP response element (CRE), 1144 
cAMP response element binding 

protein (CREB), 431 
c AMP -dependent protein kinase See protein 

kinase A (PKA) 
camptothecin, 961 
cancer 

citric acid cycle mutations in, 637-638 

detection by PET scanning, 541 

DNA repair in, 1003b 

as gene therapy complication, 336 

gene therapy for, 336 

glucose catabolism in, 539, 540-541 

integrins in, 456 

mutations in, 474-475, 477f, 637, 993, 1003b 
oncogenes and, 473-474, 477f 
retroviruses and, 1051-1052, 1052f 
selectins and, 259 
treatment of 
chemotherapeutic agents in, 328, 894-896, 

960b-961b 
microarrays in, 328 
protein kinases in, 475b^76b 
targeting enzymes in nucleotide biosynthesis 

in, 894-896, 894f, 895f 
topoisomerase inhibitors in, 960b-961b 
transcriptional profiles in, 328 
tumor suppressor genes and, 474^75, 477f 
cannabinoids, 442, 442s 
CAP. See cAMP receptor protein (CRP) 



cap-binding complex, 1035, 1066f 

capsid, viral, icosahedral symmetry of, 139-140, 140f 

carbaminohemoglobin, 166s 

formation of, 166 
carbamoyl glutamate, 687s 
carbamoyl phosphate, pyrimidine nucleotide 

synthesis from, 886-887, 887f 
carbamoyl phosphate synthetase I, 684 

activation of, 685, 686f 

deficiency of, 694t 

reaction mechanism of, 684f 
carbamoyl phosphate synthetase II, 886 
carbamyl phosphate See carbamoyl phosphate 
carbanion, 496, 782f 
carbocation, 496 
carbohydrate(s), 235-265 

analysis of, 263-265, 264f 

chemical synthesis of, 263-265 

classification of, 235 

disaccharide, 235, 243-244 

dolichols and, 362 

Fischer projection formulas for, 236, 236f 

glycoconjugates of, 235 

intermediates of, 240f, 241 

lectin binding of, 258-261, 258t, 259f, 261f, 262f 

monosaccharide, 235, 235-241 See also mono- 
saccharides 

nomenclature of, 235-236, 236-238, 240-241 

oligosaccharide See oligosaccharides 

overview of, 235 

oxidation of, 241, 243-244 

polysaccharide, 235, 244-252. See also 
polysaccharide(s) 

size classes of, 235 
carbohydrate anabolism, pathways of, 913t 
carbohydrate catabolism 

in cellular respiration, 615, 616f 

pathways of, 913t 
carbohydrate metabolism, 570f 

anabolic, 913t 

catabolic, 913t 

in diabetes mellitus, 542f 

enzymes of, 574, 575t 

gene expression in, insulin and, 606 

hepatic, 914, 915f 

metabolic control analysis applied to, 581-582 
pathways of, 913t 

in plants, 773-786, 797-800, 798f. See also 

Calvin cycle 
regulation of 

allosteric and hormonal, 606-608, 608, 608f 

lipid metabolic integration with, 608 

in liver, 606-608 

in liver and muscle, 608f 

in muscle, 608 

xylulose 5-phosphate in, 588 
carbohydrate response element binding protein 

(ChREBP), 591-592, 591f 
carbohydrate synthesis, 552, 552f, 773-800 See also 

gluconeogenesis 
C 4 pathway in, 789-790 
cellulose synthesis and, 795-796, 797f 
glycolate pathway in, 787-789, 787f 
integrated processes in, 797-800, 798f, 799f 
pentose phosphate pathway in, 775, 

785-786, 798 
peptidoglycan synthesis and, 796-797, 797f 
photorespiration in, 786-790 
photosynthetic, 773-786 See also photosynthesis 
starch synthesis in, 791-792, 793 
sucrose synthesis in, 792-794 
carbon 
anomeric, 239 
asymmetric, 16, 16f 
oxidation of, 496f, 500, 513f 
carbon bonds, 11-13, 12f 
carbon dioxide, 513s 
cycling of, 486, 486f 
hemoglobin transport of, 166 
in hemoglobin-oxygen binding, 166 
oxidation of a-ketoglutarate to, 623 
oxidation of glucose to, 516 
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oxidation of isocitrate to, 623, 623f 

oxidation of pyruvate to, 616-617, 616f 

partial pressure of, 62 

in photosynthesis. See Calvin cycle 

solubility of in water, 47, 47t 
carbon dioxide assimilation, 774, 774f, 775-780. 
See also Calvin cycle 

in Qj plants, 789, 790 

in C 4 plants, 789-790, 789f 

in CAM plants, 790 
carbon fixation, 664, 774f, 775-778, 775f See also 
Calvin cycle 

in C 4 plants, 789-790 
carbon monoxide, 513s 

hemoglobin binding of, 155, 158, 
162, 168 

physiological effects of, 168 
carbon monoxide poisoning, 168 
carbon-assimilation reactions, 742, 783, 
785-786, 785f 

in C a plants, 789-790, 789f 

in C 4 plants, 789-790, 789f 
carbonates, hydrolysis of, transition state in, 

211, 211f 
carbon- carbon bond 

cleavage of, 496, 496f 

reactions of, 496-497, 504f 
carbon-carbon bonds, cleavage of, 496-497, 497f 
carbon-fixation reactions, 742 
carbon-hydrogen bond, cleavage of, 

496-497, 496f 
carbonic anhydrase, 166 
carbonyl groups, 497, 497f 
2-carboxyarabinitol 1 -phosphate, 778s 

in Calvin cycle, 778 
carboxybiotinyl-enzyme, 634s 
y-carboxyglutamate, 77, 78s 
carb oxyhemoglobin, 168 

physiological effects of, 168 
carboxylic acids, 13, 513s 
carboxyl-terminal residues, 82 

posttranslational modification of, 1096 
carboxypeptidase A, 676 
carboxypeptidase B, 676 
cardiac muscle, 919, 919f 
cardiolipin, 35 If, 370f, 372 

synthesis of, 826f, 827 
caretaker genes, 477 
camauba wax, 349 
carnitine, 651 

carnitine acyltransf erase I/II, 651 
carnitine shuttle, 650 
carotenoid, 747 

carriers, 391, 391f See also transporter (s) 

Caruthers, Marvin, 294 

carvone, stereoisomers of, 19f 

casein kinase II, 605 

CASP competition, 143 

caspase, in apoptosis, 478 

caspases, 738 

catabolic pathways, 773, 913t 

energy delivery in, 487f 

in glycogen metabolism, 595-596 
catabolism, 25, 26f, 487, 487f 

amino acid, 673-674, 687-702. See also amino 
acid catabolism 

glucose, in cancerous tissue, 539, 540-541 

protein, fat, and carbohydrate, in cellular 
respiration, 616f 

purine nucleotide, 892, 893f 

pyrimidine, 892, 893f 
catabolite repression, 1126 
catalase, 662 

catalysis, 25, 26f, 186-194 See also 

enzymatic reactions 
acid-base 

general, 192, 193f 

specific, 192-193 
covalent, 193 

in membrane lipid diffusion, 382t 
metal ion, 193-194, 213, 213f 
regulation of, 220-227 



rotational, 728-729 

vs. specificity, 191 
catalytic antibodies, 211 
catalytic site, 154 
catalytic triad, 207 
catecholamines, 906t, 907, 907 

as hormones, 902, 906t, 907 

as neurotransmitters, 902 
catenanes, 991 

cation-exchange chromatography, 86, 87f, 88t See 

also ion-exchange chromatography 
caudal, 1149-1150, 1150f 
caveolae, 386, 386f 
caveolin, 386, 386f, 449 
CB! receptor, 442 
C-C bond. See carbon-carbon bond 
CCA(3'), 1044-1045 

CCAAT-binding transcription factor 1 (CTF1), 

proline-rich activation domains of, 1143 
CDK9, 1032 

cDNA (complementary DNA), 316 

cloning of, 1060-1061 

in hybridization, 289, 290, 1051 
cDNA library, 316, 316f, 1060-1061 
CDP (cytidine diphosphate), 824 
CDP-diacylglycerol, in lipid biosynthesis, 824-827, 

825f, 826f 
Cech, Thomas, 1037, I037f 
cell(s),2-ll 

bacterial, 5-7, 5f 

death of See apoptosis 

energy needs of, in oxidative phosphorylation 
regulation, 733 

fat cells in, 346, 346f 

free-energy of ATP hydrolysis within, 503 
metabolic transformations in, 495-496 
origin of, 30-31 See also evolution 
size of, 3 

sources of free energy for, 491 
structure of 
in bacteria, 5-7, 5f 
in eukaryotes, 6f, 7-9 
hierarchical, 10-11, lOf 
cell cycle 
chromosome changes in, 962, 963f 
meiosis in, 1004-1005, 1004f 
in eukaryotes, 1004-1005, 1004f 
recombination in, 1004-1005, 1006f 
regulation of 
cyclin-dependent protein kinases in, 470f-472f, 

470f-473f 
cytokines in, 47 If, 472 
growth factors in, 471f, 472 
replication in See DNA replication 
retinoblastoma protein in, 472, 473f 
stages of, 469, 469f 
cell death, programmed, 477-478, 4781 See 

also apoptosis 
cell envelope, 5f, 6 
cell fractionation, 8-9, 8f, 52 
cell membrane. See membrane(s) 
cell wall polysaccharides, synthesis of 
in bacteria, 795-796, 797f 
in plants, 795-796 
cell-cell interactions/adhesion, 388-389 
cadherins in, 388 

integrins in, 254, 255f, 388, 455-456, 455f 
lectins in, 260, 262f 

proteoglycans/oligosaccharides in, 252-255, 
257-262 

selectins in, 388 
cell-cell signaling See signaling 
cellular differentiation, 571 
cellular functions, of proteins, 324, 325-329 
cellular immune system, 170. See also 

immune system 
cellular respiration, 615 

stages in, 615, 616f 
cellulase, 247, 247f 

cellulose, 246-247, 251t. See also polysaccharide(s) 
conformations of, 248f 
function of, 246, 248, 251t 



structure of, 246, 246f, 248, 248f, 251t, 
794, 794f 

synthesis of, 795-796 

tensile strength of, 248 
cellulose synthase, 795, 795 
central dogma, 945, 945f, 1050 
centrifugation, differential, 8-9 
centromere, 953, 953f 
ceramide, 353f, 354 

in cell regulation, 358 
cerebrosides, 350f, 353f, 354, 355f, 
829, 831f 

synthesis of, 829 
ceruloplasmin, 258-259 
CFTR, 401,410, 840 

cGMP (guanosine 3',5'-cyclic monophosphate), 
273s, 298, 298s, 445^46 
as second messenger, 445^46, 445f 
in signaling, 445^46 
structure of, 298s 

as second messenger, 445-446 
in vision, 463-464, 464f, 465 
cGMP PDE,446 

cGMP-dependent protein kinase (protein 

kinase G), 445-446 
C-H bond See carbon-hydrogen bond 
chair conformation, 239f 
Changeux, Jean-Pierre, 165 
channels See ion channels 
chaperones, 143-145, 144f, 145f, 1104, 1106f 
chaperonins, 144, 145f 
Chargaff, Erwin, 278 
Chargaff s rules, 278 
charge 

of amino acids, 80-81, 81f 

pH and, 80-81, 81f 
Chase, Martha, 278 
chemical elements 

essential, 1 If 

trace, llf, 13 
chemical models 

ball-and-stick, 15-16, 15f 

space-filling, 15, 15f 
chemical potential energy, of proton-motive 
force, 720 

chemical reaction(s), 19-26. See also under 
reaction 
activation energy in, 25, 25f 
direction of, 22, 24 
driving force of, 25 
dynamic steady state and, 20 
endergonic, 22, 24, 26f 
energy conservation in, 20, 20f 
energy sources for, 22 
energy-coupled, 22-24, 23f 
enzymatic, 183-227 See also catalysis; 
enzyme (s) 

enzyme-catalyzed, 25, 26f, 186-194 See also 

enzymatic reactions 
at equilibrium, 23 
equilibrium constant for, 24 
exergonic, 22, 24, 25, 26f 

in citric acid cycle, 636-637 

in conversion of pyruvate to phosphoenolpyru- 
vate, 553-557, 553t, 554f, 555f 

coupled with endergonic reactions, 
22-24, 23f 
free energy in, 22, 23-24, 186-187, 187f 
ground state in, 186 
intermediates in, 209 
oxidation-reduction, 22 
rate constant for, 188 
rate equation for, 188 
rate-limiting steps in, 187 
reaction intermediates in, 187 
sequential, 494-495 

standard free-energy changes of, 492, 492t 

at pH7 0and 25 °C, 493t 
steady-state, 21 

transition state in, 25, 187, 189-191, 207, 209, 
210b-211b 
chemical reaction mechanisms, 208, 208f 
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chemical reactions, vs biochemical reactions, 

500-501 
chemical synthesis, 263-265 
ofDNA, 294, 295f 

phosphoramidite method in, 294, 295f 
chemical uncouplers, of oxidative phosphorylation, 
724, 724f 

chemicals, industrial, fermentation yielding, 
549, 549f 

chemiosmotic coupling, of 0 2 consumption and 

ATP synthesis, 729-730 
chemiosmotic theory, 707, 723-724 
of ATP synthesis, 723 
prediction of, 724, 725f 
chemistry, prebiotic, 30-31 
chemoheterotrophs, 32 
chemotaxis, two-component signaling in, 

457-^58, 458f 
chemotherapeutic agents 
targeting enzymes in nucleotide biosynthesis, 

894-896, 894f, 895f 
transcriptional profiles for, 328 
chemotherapy, topoisomerase inhibitors in, 

960b-961b, 961s 
chemotrophs, 4f, 5 
chiral center, 16, 16f, 72 
chiral molecules, 16, 16f, 72-73 

optical activity of, 17, 73 
chitin, 247, 247f, 251t 
chloramphenicol, 1098 
chloride, blood levels of, 921 
chloride ion channels 
in cystic fibrosis, 401, 401f, 410 
in signaling, 453-454 
chloride ion concentration, in cytosol vs, 

extracellular fluid, 451t 
chloride-bicarbonate exchanger, 384f, 395, 395f 
chloroform, in lipid extraction, 363-364, 364f 
chlorophyll, 745 
antenna, in light-driven electron flow, 752, 

754-755, 754f 
in light absorption, 745-747, 745f, 746f 
excitation transfer and, 747-749, 748f 
chlorophyll a, 745f 
chlorophyll b, 745f 
chloroplast(s), 773, 774, 775f 
ATP synthase of, 760-761, 760f 
ATP synthesis in, 707-708 See also 

ATP synthesis 
evolution of, 761-762 
fatty acid synthesis in, 812, 812f 
integration of photosystems I and II in, 752-754 
lipid metabolism in, 812, 812f 
membrane lipids of, 352, 352f 
NADPH synthesis in, 812 
photosynthesis in, 743-744, 743f 

electron flow in, 743-744 
protein targeting to, 1104 
starch synthesis in, 791f, 792 
chloroplast DNA (cpDNA), 951 
cholecalciferol (vitamin D 3 ), 360, 360f, 360s 
cholecystokinin, 675 
cholera toxin, 258t, 260, 426^27 
cholesterol, 355, 355f See also sterols 
esterification of, 836, 836f 
excess production of, 843 
fate of, 836 

in isoprenoid synthesis, 845, 845f 

membrane, 372t 
microdomains of, 384-386, 386f 

receptor-mediated endocytosis of, 840f, 841 

in steroid hormone synthesis, 359, 359s, 
844-845, 844f 

structure of, 355-357, 355f, 839f 

synthesis of, 832-836, 832f-836f 
regulation of, 841-842, 841f, 842f 

trans fatty acids and, 348 

transport of, 836-841, 838f, 839f 
cholesterol intermediates, synthesis of, 844f, 845 
cholesterol-lowering drugs, 842b-843b, 843 
cholesteryl esters, 836 

receptor-mediated endocytosis of, 840f, 842 

transport of, 836-841, 839f 



chondroitin sulfate, 250, 250s, 253f 
chorismate, in amino acid biosynthesis, 865-869, 
868f-870f 

ChREBP (carbohydrate response element binding 

protein), 591-592, 591f 
chromatids, sister, 962, 1004f, 1005, 1005f 
chromatin, 962, 965f 

in 30 nm fiber, 966, 967f 

acetylation/deacetylation of, 1137-1138 

assembly of, 965f 

beads-on-a string appearance of, 963f, 964f 
condensed, 1136 
euchromatin, 1136 
hetero chromatin, 1136 

histones and, 963, 963f, 966b-967b, 1137-1138 
See also histone(s) 

nuclear scaffolds and, 968, 968f 

nucleosomes in, 964f, 965f, 967f 

remodeling of, 1137-1138, 1137t 

transcriptionally active vs inactive, 1136-1137 

transcription-associated changes in, 
1136-1138, 1137t 
chromatography, 173 

adsorption, 364-365, 364f 

affinity, 86, 87f, 88, 88t, 313-314, 313t 

column, 85-88, 86, 86f 

gas-liquid, 364f, 365 

high-performance liquid, 88 

ion-exchange, 86, 87f, 88t 

in lipid analysis, 364-365 

size-exclusion, 87, 87f 

thin-layer, 364-365, 364f 
chromatophore, 759 
chromosome(s) , 947-953 

artificial, 953 

bacterial artificial, 309, 309f 

cell-cycle changes in, 962-963, 963f 

condensation of, 962, 963f, 969, 970f 

contiguous dimeric, repair of, 1013, 1013f 

daughter, 469 

definition of, 962 

elements of, 947-953 

eukaryotic, 949-951, 951f 

genes in, 947, 948. See also gene(s) 

partitioning of, in bacteria, 991 

replication of, 962, 963f 

structure of, 962-971 

yeast artificial, 310, 310f, 953 
chylomicrons, 649, 837f, 837t, 838, 838f 

molecular structure of, 649f 
chymo trypsin, 676 

catalytic activity of, 194, 204f, 205-207, 
207f-209f 

transition-state complementarity and, 207, 
210b-211b, 210f 
reaction mechanism of, 208f-209f, 499 
structure of, 113f, 205, 206f 
synthesis of, proteolytic cleavage in, 
226-227, 227f 
chymotrypsinogen, 225, 227f, 676f 
cimetidine, 879 
ciprofloxacin, 960b, 960s 
circular dichromism (CD) spectroscopy, 
122, 122f 

cis configuration, of peptide bonds, 121, 121f 

cis-trans isomers, 15 

citrate, 622, 622s 
asymmetric reaction of, 629b, 629f 
in fatty acid synthesis, 813-814, 8l3f, 814, 814f 
formation of, in citric acid cycle, 622, 622f, 623f 

citrate lyase, 813 

citrate synthase, 622, 622s, 633b, 813 

reaction mechanism of, 623f 

structure of, 622f 
citrate transporter, 813, 813f 
citric acid, Aspergillus niger production of, 633b 
citric acid cycle, 615, 615-638 

activated acetate in, 616-620, 616f, 
617f, 618f, 634f 

anaplerotic reaction replenishing, 631-633, 
632f, 632t 

citrate formation in, 622, 622f, 623f 

components of, 631, 632f 



conversion of succinyl-CoA to succinate in, 626, 626f 
energy of oxidations in, 629f, 630t 
in glucose metabolism, 914-915, 914t, 915f, 
927f, 928 

glyoxylate cycle and, 639-640, 639f, 640f 
in hepatic metabolism, 914-915, 915f, 
927f, 928 

incomplete, anaerobic bacteria in, 631, 63 If 
intermediate 

anaplerotic reaction in, 631-633, 632f, 632t 

in gluconeogenesis, 557, 927f, 928 
isocitrate formation in, 622-623, 623f 
in lipid metabolism, 915f, 916 
oncogenic mutations in, 637-638 
oxidation of acetate in, 631, 631f 
oxidation of acetyl-CoA in, 655, 656t 
oxidation of isocitrate to a-ketoglutarate and 

C0 2 in, 623, 624f 
oxidation of malate to oxaloacetate in, 628-630 
oxidation of succinate to fumarate in, 628 
of oxidations in, 630-631 
products of, 630f 

pyruvate carboxylase reaction in, biotin and, 633, 

634f, 635f 
reactions of, 620-635, 636f 
regulation of, 635-638 
allosteric and covalent mechanisms in, 

635-636, 636f 
exergonic steps in, 636-637 
glyoxylate cycle in, 639-640, 639f, 640f 
multienzyme complexes in, 637, 637f 
role of, in anabolism, 631, 632f 
steps in, 622-630 
urea cycle links to, 684-685, 685f 
citrulline, 77, 78s 
CP . See chloride 

Claisen ester condensation, 497, 497s 

clathrates, 48 

clathrin, 1106, 1107f 

Clausius, Rudolf, 21b 

clavulanic acid, 217-218 

clonal selection, 170 

clone(s), 173 

definition of, 304 

in DNA libraries, 315f, 316f, 317 
cloned genes, expression/alteration of, 

312,312f 
cloning, 304, 304-314 

in animals, 332-334 

in bacteria, 304-312 

blunt ends in, 306 

cDNAin, 290f, 1051, 1060-1061 

DNA cleavage in, 304-307, 304f, 305t, 306f 

DNA fragment size in, 306-307 

DNA hybridization in, 310-311, 3llf 

DNA ligases in, 304, 304f, 306f, 307, 351f 

electr op oration in, 307 

enzymes in, 305t 

in eukaryotic cells, 315-316 

fusion proteins in, 313 

gene expression and, 312, 312f 

linkers/polylinkers in, 306f, 307 

nucleic acid probes in, 311, 31 If 

oligonucleotide -directed mutagenesis and, 
312, 313f 

in plants, 330-332, 330f-333f 

polymerase chain reaction in, 317, 318f 

procedures used in, 304, 304f 

restriction endonucleases in, 304-307, 304f, 
305t, 306f 

site-directed mutagenesis and, 312 

steps in, 304, 304f 

sticky ends in, 306, 306f, 307 

transformation in, 307 

in vitro packaging in, 308 
cloning vectors, 304, 305t, 307-310, 307f-310f 

bacterial artificial chromosome, 309, 309f 

in DNA library creation, 316 

expression, 312 
A, 309f 

A phage, 305t, 307, 308, 1012, 1012f 
for plants, 330-332, 330f-333f 
plasmid, 307, 308f, 309f 
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shuttle, 310 

Ti plasmid, 330-331 

viral, 333, 334f 

yeast artificial chromosome, 309, 310 
closed system, in thermodynamics, 20 
closed-circular DNA, 955, 955f 

linking number for, 956-958, 956f, 957f, 965 
Clostridium acetobutyricum, in fermentation, 549 
Clostridium botulinum, 388 
clotting 

proteolysis in, 227 

selectins in, 388 
CLV1 peptide, 460 

CMP (cytidine 5 '-monophosphate), 273s, 824 

C0 2 See carbon dioxide 

CoA (coenzyme A), 297f, 298, 607s. See also 

specific type, eg, succinyl-CoA 
coactivators, 1139-1140, 1139f 
coagulation 

proteolysis in, 227 

selectins in, 388 
coated pits, 1106, 1107f 
cobalarnin, See vitamin B 12 (cobalamin) 
cobrotoxin, 411 

coding strand, in transcription, 1024, 1024f 
CODIS database, 320, 320t 
codons, 31 If, 1066-1074, 1066f 

assignment of, 1068-1069, 1069f 
variations in, 1070b-1071b 

base composition of, discovery of, 1067 

base sequences in, discovery of, 1067-1069 

dictionary of, 1069, 1069f 

initiation, 1069, 1069f 
in protein synthesis, 1088-1091, 1090f 

reading frame for, 1038f, 1067, 1067f, 1072 

termination, 1069, 1069f, 1070b-1071b 
coelacanth, anaerobic metabolism of, 548b 
coenzyme(s), 3, 184, 185t. See also enzyme(s) 

examples of, 184t 

flavin nucleotide, 520t 

NAD + or NADP + as, stereospecincity of 
dehydrogenase employing, 518t 

prosthetic -group, 184 

in pyruvate dehydrogenase complex, 617, 617f 

as universal electron carrier, 516 
coenzyme A (CoA), 297f, 298, 607s. See also 

specific type, e g , succinyl-CoA 
coenzyme Bi 2 , 657, 658b-659b, 658f, 659f 
coenzyme Q, 362, 710, 710f 
coenzyme reduction, stoichiometry of, 630t 
cofactors 

enzyme, 184, 194t 

of nitrogenase complex, 855f 
Cohen, Stanley, 303, 303f 
cohesins, 969 
coiled coils 

collagen as, 124f, 125 

a-keratinas, 123-124, 124f 
cointegrate, 1014 
collagen 

amino acids in, 124f, 125, 126-127, 128 

ascorbic acid and, 126-127 

disorders of, 126-128 

proteoglycans and, 254 

structure of, I22f, 123t, 124-128, 124f, 125f 

triple heUx of, 122f, 123t, 124-125, 124f, 126-127 

types of, 125 
colligative properties, solute concentration and, 

51, 51f 
Collins, Francis, 317, 322f 
Collip, J,B.,903b 
colon cancer, 1003b 

mutations in, 475, 477f 
color vision, 465, 465f 
column chromatography, 85-88 See 

chromatography 
comparative genomics, 35-36, 325, 326, 328, 329f 
competitive inhibitor, 201-203, 202f, 203f, 203t 
complementary DNA See cDNA (complementary 
DNA) 

Complex I: NADH to ubiquinone 
flow of electrons and protons through, 719f 
in oxidative phosphorylation, 712-714, 714f 



Complex II: succinate to ubiquinone 
flow of electrons and protons through, 719f 
in oxidative phosphorylation, 715, 715f 

Complex III: ubiquinone to cytochrome c, 
716f, 717f 

flow of electrons and protons through, 719f 

in oxidative phosphorylation, 715-716, 716f 
Complex IV: cytochrome c to 0 2 , 716-718, 718f 

flow of electrons and protons through, 719f 

path of electrons through, 7l8f 
complex transposons, 1013 
concentration gradient, 389-390, 
389f, 390f 

in membrane polarization, 450, 450f 
concerted inhibition, 872 
concerted model, of protein-ligand binding, 

165, 165f 
condensation, 65f 

chromosomal, 962, 963f 

in peptide bond formation, 82, 82f 
condensation reacton, 65 
condensins, 969 
cone cells, 462-465, 462f 

in color vision, 465, 465f 
configuration, 15 

isomeric, 15-18, 15f, 16f 

vs. conformation, 239 
conformation 

boat, 239f 

chair, 239f 

molecular, 18, 18f 

native, 29, 114 

protein, 113-114 

vs. configuration, 239 
conjugate acid-base pairs, 57, 57f 

as buffer systems, 60, 60f, 61-62 
conjugate redox pair, 512 

electron transfer of, 514-515, 514f 
conjugated proteins, 84, 85t 
consensus sequences, 102 

oriC, 985f, 986-987 

of promoters, 1025, 1025f, 1117f 

of protein kinases, 225-226, 226t 
consensus tree of life, 106f 
conservation of energy, 20, 20f 
constitutive gene expression, 1116 
contig, 315 

contiguous dimeric chromosome, repair of, 

1013, 1013f 
contractile proteins, 175-179, 176f-179f 
cooperativity, binding, 160-165, 161f, 162f 
copper ions, in Complex IV, 716-718, 718f 
Corey, Robert, 115f, 117, 124, 130 
Cori, Carl F, 548b, 598, 598f 
Cori cycle, 548b, 552, 919, 919f 
Cori, Gerty T„ 548b, 598, 598f 
Cori's disease, 599t 
Cornforth, John, 835, 835f 
coronary artery disease 

hyperlipidemia and, 842-843 

trans fatty acids and, 348 
corrin ring system, 658 

corticosteroids, 359, 359f, 359s, 906t, 908. See also 
under steroid hormone (s) 

synthesis of, 844-845, 844f 
Cortisol, 359, 359s, 823s, 911, 929 

in glucose metabolism, 823 
cotransport systems, 395, 395f 
coupling, operational definition of, 723 
covalent bonds, 10 

in enzymatic reactions, 189 

heterolytic cleavage of, 496, 496f 

homolytic cleavage of, 496, 496f 

of phosphorus, 499-500, 499f 
covalent catalysis, 193 

covalent modification, of regulatory enzymes, 220, 

223-227, 223f, 226f, 226t, 227f 
COX (cyclooxygenase), 817, 818 

inhibitors of, 818, 818f 
COX4, 734, 734f 
cpDNA (chloroplast DNA), 951 
C-protein, 177 

Crassulaceae, photosynthesis in, 790 



creatine, 505s, 876-877, 918s 
amino acids as precursors of, 876-877 
biosynthesis of, 877f 
in muscle contraction, 918, 918f 

creatine kinase, 510, 678 

Creutzfeldt-Jakob disease, 147-148, 147f 

Crick, Francis, 93, 123, 277, 277f, 278-279, 945, 
977, 1056, 1056f, 1066 

crocodile, movement of, 548b 

crossing over, 1004f, 1005, 1005f 

CRP See cAMP receptor protein (CRP) 

cruciform DNA, 282, 282f, 958, 958f 

crude extract, 85 

cryptochrome, 520 

crystal structures, bound water molecules in, 

50-51, 51f 
culture, tissue, 332 

cyan fluorescent protein (CFP), 434b-436b 
cyanobacteria, 5f 

evolution of, 33, 34f 

nitrogen fixation by, 852 

phosphorylation in, 762, 762f 
cyanocobalamin, 658 

cyclic AMP. See cAMP (adenosine 3 ',5 '-cyclic 

monophosphate) 
cyclic GMP See cGMP (guanosine 3',5'-cyclic 

monophosphate) 
cyclic nucleotide phosphodiesterase, 

424f, 430 
cyclic symmetry, 139, 139f 
cyclin, 469, 469f 

degradation of, 471-472, 471f, 1109 

synthesis of, 472 
cyclin-cyclin-dependent kinase complex, 469-473, 

470f, 471f 
cyclin-dependent kinase 9, 1032 
cyclin-dependent protein kinases, 469^73, 470f 

in cell cycle regulation, 469-473, 470f 

inhibition of, 472^73 

oscillating levels of, 437, 438f 

phosphorylation of, 470-473, 470f, 471f, 473f 

regulation of, 470^73, 470f-472f 

synthesis of, 472 
cycloheximide, 1098, 1098s 
cyclooxygenase (COX), 817, 818 
cyclooxygenase 1/2 inhibitors, 817, 818, 818f 
cycloserine, 878 
cystathionine a-synthase, 864 
cystathionine y-lyase, 864 
cysteine, 9s, 75s, 77, 77s, 692, 864 

biosynthesis of, 864, 864f 

degradation of, to pyruvate, 692, 692f 

properties of, 73t, 77 
cystic fibrosis 

defective ion channels in, 401, 410 

protein misfolding in, 146, 147 
cystine, 77, 77s 
cytidine, 273s 

cytidine diphosphate (CDP), 824, 825f 
cytidine 5 '-monophosphate (CMP), 273s, 824 
cytidylate, 272t, 273s 
cytidylate synthetase, 887 
cytochrome(s), 710-711 

prosthetic groups of, 710-711, 710f 
cytochrome b 5 , 815 
cytochrome b 5 reductase, 815 
cytochrome bc u 716-717, 716f 
cytochrome bc Y complex, 715, 716. See also 

Complex III 
cytochrome b Q f, dual roles of, 762f 
cytochrome b G f complex, photosystems I and II 

linkage by, 755-756, 755f 
cytochrome c 

absorption spectra of, 711, 711f 

in apoptosis, 737-738, 737f 

structure of, 131t 
cytochrome/ water chain in, 50-51, 51f 
cytochrome oxidase, 716. See also 
Complex IV 

subunits of, 717f 
cytochrome P-450, 736-737, 816, 916 

mitochondria and, 736 

in xenobiotic metabolism, 736-737 
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cytokines, 443 

in cell cycle regulation, 47 If, 472 
cytokinins, 331f 
cytoplasm, 3, 3f 

filaments in, 9-10, 9f 
cytosine, 9s, 272, 272t, 273f See also 
pyrimidine bases 

deamination of, 289, 290f 

methylation of, 292 
cytoskeleton, 9-10, 9f 
cytosol, 3, 3f 

cellulose synthesis in, 795-796 

lipid synthesis in, 812, 812f 

sucrose synthesis in, 772, 827f 
cytotoxic T cells (T c cells), 170, 170t 



d 

D arm, of tRNA, 1080f, 1081 
dabsyl chloride, 94, 95s 
Dalgarno, Lynn, 1088 
daltons, 14b 
Dam, Henrik, 361, 361f 
dam methylase, 986, 986t 

in mismatch repair, 994 
dAMP (deoxyadenosine 5 '-monophosphate), 273s 
dansyl chloride, 94, 95s 
databases 

genomic, 323 

for protein structure, 136-138, 136f-137f 
daughter chromosomes, 469 
Dayhoff, Margaret Oakley, 72, 72f 
dCMP (deoxycytidine 5'-monophosphate), 273s 
DDI, 1053b 

De Duve, Christian, 599 
de novo pathway, 882 
deamination 

of nucleotide bases, 289-290, 290f 

oxidative, 679 
debranching enzyme, in glycogen metabolism, 

544, 596 
decarboxylation 

of amino acids, 878-879, 879f, 880f 

of 0-keto acid, 497s 

free radical-initiated, 498, 499f 

oxidative, 616 
degeneracy, of genetic code, 1069 
dehydration, of 2 -phosphogly cerate to 

phosphoenolpyruvate, 538 
dehydroascorbate, 127s 
7-dehydrocholesterol, 360, 360s 
dehydrogenase, 513 

stereospecificity of, employing NAD 4 " or NADP + 
as coenzymes, 516-519, 518t 
dehydrogenation, 513 

oxidations involving, 513-514, 513f 
dehydrohydroxylsinonorleucine, 128s 
deletion mutations, 993 
AG (free-energy change) See free-energy 

change (AG) 
AS (entropy change), 22, 22 
denaturation 

of DNA, 287-288, 287f, 288f 

of proteins, 140-142, 141f 

of RNA-DNA hybrids, 288 
denaturation mapping, 952 
dendrotoxin, 411 
denitrincation, 852 
deoxy sugars, 240s 
deoxyadenosine, 273s 

deoxyadenosine 5 '-monophosphate (dAMP), 273s 
5'-deoxyadenosyl group, 658 
5'-deoxyadenosylcobalamin, 657, 658 
deoxyadenylate, 273s 
2-deoxy-ai-D-ribose, 9s 
deoxycytidine, 273s 

deoxycytidine 5 '-monophosphate (dCMP), 273s 
/3-2'-Deoxy-D-ribofuranose, 274f 
deoxy-D-ribose, 274 
2-deoxy-D-ribose, 236s, 272 
deoxyguanosine, 273s 



deoxy guanosine 5 '-monophosphate (dGMP), 273s 

deoxyguanylate, 273s 

deoxyhemoglobin, 164 

deoxynucleoside triphosphate, regulation of 

ribonucleotide reductase by, 890, 909f 
deoxyribonucleotides, 272t, 273-274, 273f, See 

also nucleotides 
ribonucleotides as precursors of, 888-890, 888f, 

889f, 909f 
deoxy thy cytidylate, 273s 
deoxythymidine, 273s 
deoxythymidine 5 '-monophosphate 

(dTMP),273s 
deoxythymidylate, 273s 
depurination, 290, 290f 
dermatan sulfate, 250 
desaturases, 815-817, 817f 
desensitization, receptor, 420, 420f, 42 If, 

430-431, 430f 
desmin, 177 
desmosine, 77, 78s 

desolvation, in enzymatic reactions, 192, 192f 
development 

gene regulation in, gene silencing in, 
1145-1146, 1146f 

pattern-regulating genes and, 1147-1152 
dexamethasone, 823s 

in glucose metabolism, 823 
dextran, 246, 248f, 251t See also 
polysaccharide (s) 

synthetic, 246 
dextrose, 235, 236s See also glucose 
AG % (activation energy). See activation 

energy (AG X ~) 
DGDG (digalactosyldiacylglycerol), 352, 352f 
dGMP (deoxyguanosine 5 '-monophosphate), 273s 
diabetes insipidus, 394b 
diabetes mellitus, 929-930 

acidosis in, 63-64 

carbohydrate metabolism in, 542f 

defective glucose and water transport in, 394b 

diagnosis of, 241b, 930 

early studies of, 903b 

fat metabolism in, 542f 

fatty acid synthesis in, 821-822 

glucose metabolism in, 927f, 929-930 

glucose testing in, 241, 241b-242b 

ketosis/ketoacidosis in, 667, 929-930 

mature onset of the young (MODY), 593b-594b 

mitochondrial mutations and, 741 

pathophysiology of, 929-930 

sulfonylurea drugs for, 925 

treatment of, 808, 824, 936 

type 1, 929 

type 2, 929, 938-940 
drug therapy for, 808, 824 
insulin insensitivity in, 929, 935-936, 938-940 
management of, 839-840 
diabetic ketoacidosis, 63-64, 667, 929 
diacylglycerol, 357, 432, 433, 433f 

in triacylglycerol synthesis, 821f 
diacylglycerol 3-phosphate, in triacylglycerol 

synthesis, 821, 82 If 
dialysis, protein, 85 
diastereomers, 16, 16f 
dichromats, 465, 465 

dideoxy method, for DNA sequencing, 292-294, 

293f, 294f 
dideoxyinosine (DDI), 1053b 
dideoxynucleotides, in DNA sequencing, 292-294, 

293f, 294f 
dielectric constant, 46 
2,4-dienoyl-CoA reductase, 657 
differential centrifugation, 8-9, 8f 
diffusion 

facilitated, 390-391, 3901 See also 

transporter (s) 
hop, 384 

of membrane lipids, 380-833, 382f-384f 
simple, 389f, 390, 390f 
of solutes, 389-391, 389f, 390f See also 
membrane transport 



dihedral angles 

in secondary structures, 121-122, 122f 

secondary structures and, 122f 
dihedral symmetry, 139, 139f 
dihydroacetone phosphate, 533 
dihydrobiopterin reductase, 697 
dihydrofolate reductase, 890-891 

substrate binding to, 189f 
dihydrogen phosphate, as buffer, 58-59, 59f 
dihydrolipoyl dehydrogenase, 618 
dihydrolipoyl transacetylase, 618 
dihydroxyacetone, 236, 236s, 237s 
dihydroxyacetone phosphate, 533s, 545s 

catalysis of, 191 
in Calvin cycle, 779, 780f, 782f, 783 

in glyceroneogenesis, 822f, 823 

in glycolysis, 530, 539f 

Pi exchange for, 783-784, 783f, 784f 

transport of, 783-784, 784f 
1,25-dihydroxycholecalciferol, 360, 360s, 906t, 908 
dimethylallyl pyrophosphate, in cholesterol 

synthesis, 833, 833f, 834f 
dimethylnitrosamine, as mutagen, 291f, 292 
dime thy lsulfate, as mutagen, 291f, 292 
dinitrogenase, 855 
dinitrogenase reductase, 855 
Dintzis experiment, 1088 
Dintzis, Howard, 1088 
dioxygenases, 816b 
diphtheria toxin, 1098-1099 
direct transposition, 1014, 1014f 
disaccharides, 235, 243-244. See also 
carbohydrates; oligosaccharides 

conformations of, 247-248, 248f 

formation of, 243, 243f 

hydrolysis of, 243-244 
to monosaccharides, 543, 544f 

nomenclature of, 243-244 

oxidation of, 243-244 

reducing, 243-244 

structure of, 243-244, 243f, 244f 
dissociation constant (acid) (KJ, 57f-59f, 58-59 
dissociation constant (K d ), 156-157, 161t 

for enzyme -substrate complex, 198 

in Scatchard analysis, 421b 
dissociation energy, 44 
distal His, 168 

disulfide bonds, in amino acid sequencing, 95, 96f 
d,l system of stereochemical nomenclature, 17, 
72f, 74 

DNA, 14, 14t, 27, 28f, 29f. See also nucleic acids 
A-form, 281, 281f 
annealing of, 287-288, 287f 
bacterial, 949, 950f, 950t 

packaging of, 970 

topoisomerases and, 959 
base pairs in, 277, 277f, 278-280, 279f, 280f. See 

also base(s), nucleotide/nucleic acid; base 

pairs/pairing 
B-form, 281, 281f, 955 
Chargaffs rules for, 278 
chemical synthesis of, 294, 295f 
chloroplast, 951 
cleavage of, 304-307, 304f 

blunt ends in, 306, 306f, 307 

restriction endonucleases in, 304^-307, 304f, 
305t, 306f 

sticky ends in, 306, 306f, 307 
closed-circular, 955, 955f, 956 
coding, 952 

compaction of, 963-968, 970f 

complementary See cDNA (complementary DNA) 

damaged, 993-1002, 1000, lOOOf, 1001-1002, 

lOOlf See also DNA repair; mutation(s) 

repair of, 993-1002. See also DNA repair 

SOS response and, 1001, 1002t, 
1130-1131, 1130f 
degradation of, 979-980 
denaturation mapping of, 978 
denaturation of, 287-288, 288f, 290f 
double helix of, 278-280, 279f, 280f See also 

DNA structure 
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superceding and, 954-962, 955f See also DNA, 
supercoiling of 

in transcription, 1023f 

underwinding of, 955-958, 955f 

unwinding of/rewinding of, 287-288, 287f, 288f, 
978-979, 979f . See also DNA replication 

variations of, 280-281, 281f 
double-strand breaks in 

in recombination, 1006-1007, 1015f, 1016f 

repair of, 1004-1009, 1006f, 1016f 
early studies of, 278-280, 278f 
enzymatic degradation of, 979 
eukaryotic, 949-951, 950t 
evolutionary stability of, 27 
folding of, 968 
highly repetitive, 953 
histones and, 963-968, 964f 
human, 949-951, 950t 
hybridization of, 288-289, 290f 
hydrophilic backbone of, 275, 275f, 278 
junk, 1060 

light absorption by, 287-288 
linker, in nucleosome, 963f, 964-965 
linking number of, 956-958, 956f, 
957f, 965 

topoisomerases and, 958-961, 959f 
melting point for, 288, 288f 
methylation of, 292 
microinjection of, 333 

in animal cloning, 333 
mitochondrial, 738f, 951 See also mitochondria 

aging and, 739 
mutagenic changes in 

from alkylating agents, 291f, 292 

from nitrous acid, 291, 291f 

from radiation, 290, 291f 
noncoding regions of, 953 See also introns 
nontemplate (coding) strand of, in transcription, 

1023f, 1024 
nucleosomes and, 963, 964f, 965f, 967f 
nucleotide sequences in, amino acid sequences 

and, 97f, 98-100 
nucleotides of, 272t, 273-274, 273f, See also 

nucleotides 

packaging of, 947, 947f, 954-962, 962-971 See 

also DNA, supercoiling of 

in bacteria, 970 

in eukaryotes, 963-968, 970f 
phosphodiester linkages in, 274-275, 275f 
recognition sequences in, 305-306, 305t 
recombinant, 304. See also cloning 
regulatory sequences in, 948 
relaxed, 954, 955, 955f, 958f 
repetitive, 952f, 953 

centromeric, 953, 953f 

telomeric, 953, 953f 
replication of, 28, 28f 
satellite, 953 

sequence of See also DNA sequencing amino 

acid sequence and, 97f, 98-100 
simple sequence, 952f, 953 
as single molecule, 28 
size of, 949-952, 950f-951f 
specific linking difference for, 956-958 
stability of, 32 

strand length in, 949-952, 950f-951f 
structure of, 28-29 

tertiary, 947 
supercoiling of, 954-962, 955f, 964-965, 964f 

causes of, 932 

chromatin assembly and, 965f 

condensins and, 969, 970f 

degree of, 955f 

negative, 957, 957f 

plectonemic, 961 

positive, 957, 957f 

solenoidal, 961-962, 962f 

topoisomerases and, 958-961, 958f-961f 

in transcription, 1023f 
superhelical density of, 956 
superhelix of, 954-962, 955f 
synthesis of See DNA replication 



template strand of, in transcription, 1023f, 
1024, 1024f 

topoisomers of, 957 

topology of, 955 

transcription of, 28f 

as transcriptional template, 1022, 1023f 

transposon, 952f, 953 

viral, 948-949, 949t, 950t 

weak interactions in, 277, 277f, 279, 279f, 280f 

yeast, 949, 950t 

Z-form, 281,281,281f 
DNA amplification, by polymerase chain reaction, 

317, 318f 
DNA chips, 325-326, 327f 
DNA cloning, See cloning 
DNA fingerprinting, 319-321 
DNA glycolases, in base-excision repair, 

996-997, 997f 
DNAgyrase, 986t,989t 
DNA helicase II, in mismatch repair, 995 
DNA helix See DNA, double helix of 
DNA hybridization 

cDNAin, 289, 290f, 1051 

in cloning, 310-311, 31 If 

in DNA library creation, 315 
DNA libraries, 315-317, 315f, 316f 
DNA ligases, 304, 304f, 305t, 306f, 307, 984, 989t 

in mismatch repair, 995, 996f 

in nick translation, 983f, 984, 989, 990f, 994t, 
997, 998f 

DNA metabolism, 975-1016 See also DNA 

recombination; DNA repair; DNA replication 

nomenclature of, 976 

overview of, 975-976 
DNAmicroarrays, 325-326, 327f 
DNA mismatch repair, 292, 994-996, 994f-996f 
DNA photolyases, 994t, 998-999, 999f 

reaction mechanism of, 999f 
DNA polymerase (s), 293f, 305t, 317 

3'-»5' exonuclease activity of, 981f, 982 

5'->3' exonuclease activity of, 982, 983f 

dissociation/reassociation of, 980 

functions of, 982-984, 989t 

in nick translation, 983 

processivity of, 980 

proofreading by, 981-982, 981f 

properties of, 982t, 983t 

reaction mechanism of, 980f 

in replication, 979-984. See also replication 
in bacteria, 977-991 
in eukaryotes, 992 

RNA-dependent, 1050-1051, 1050f 

template for, 980, 980f 

types of, 982-984, 982t, 983t 

viral, 993 
DNA polymerase a, 992 
DNA polymerase j8, 1002 
DNA polymerase 6, 992 
DNA polymerase 7, 1002 
DNA polymerase 77, 1002, 1003b 
DNA polymerase 4, 1002 
DNA polymerase A, 1002 
DNA polymerase I, 982, 982f 

discovery of, 979-980 

functions of, 982, 989t 

large (Klenow) fragment in, 982, 982f 

in nick translation, 983, 983f 

structure of, 982, 982f 
DNA polymerase II, 982, 982t, 983t 
DNA polymerase III, 982, 982t, 983-984, 983t, 987, 
988, 988f 

functions of, 989t 

in mismatch repair, 995, 996f 

subunits of, 982t, 983t, 984f 
DNA polymerase IV, 982, 1001 
DNA polymerase V, 982, 1001 
DNA primases, 984, 986t, 987, 988f 
DNA probe, 311, 31 If 
DNA profiling, 319-321 
DNA recombination, 1003-1016 

branch migration in, 1005-1006, 1006f 
crossing over in, 1004f, 1005, 1005f 



in DNA repair, 1004-1007, 1006f, 1009-1010, 

1012f, 1013, 1013f 
DNA strand exchange reaction in, 1007-1008, 1009f 
double-strand break repair model for, 

1006-1007, 1006f 
enzymology of, 1033-1034, 1034f~l036f 
functions of, 1004-1005 
homologous genetic, 1003-1005, 1004-1007, 

1009-1010 

functions of, 1004-1005 

site-specific, 1004, 1010-1013, 101 If, 1012f 
in immunoglobulin genes, 1014-1016, 1015f, 1016f 
meiosis events and, 1004-1005, 1004f 
proteins in, 1007-1009, 1007f-1009f 
site-specific, 1004, 1010-1013, lOllf, 1012f 
transposition in, 1013-1014, 1014f 
DNA repair, 993-1002 
in bacteria, 994t 

base-excision, 994t, 996-997, 997f 
cancer and, 1003b 

in contiguous dimeric chromosome, 1013, 1013f 
cyclin- dependent protein kinases in, 471, 

472-173, 473f 
direct, 998-1000, 999f 
DNA photolyases in, 994t, 998-1000, 999f 
error-prone translesion DNA synthesis in, 

1001-1002, 1002t 
mismatch, 292, 982, 994-996, 994f-996f, 994t, 995f 
nick translation in, 983, 983f, 984, 989, 990f, 

994t, 997, 998f 
nucleotide-excision, 994t, 997-998, 998f, 1003b 
0 6 -methylguanine-DNA methyltransf erase in, 

999, lOOOf 
phosphorylation in, 471, 472-473, 473f 
proofreading in, 981-982, 981f 
recombinational, 1004-1009, 1006f, 1009-1010, 

1012f, 1013, 1013f 
SOS response in, 1001, 1002t, 1130-1131 
TLS polymerases in, 1001-1002, 1003b 
DNA replicase system, 984 
DNA replication, 28, 28f, 977-993 
accuracy of, 980-982, 98 If 
automated, 293f, 294, 294f 
in bacteria, 977-991, 979-991, 982-991, 982f, 

987-991 
base pairing in, 980, 980f, 981f 
base stacking in, 979f , 980f 
chain elongation in, 979-980, 980f, 986t, 987, 

987f, 988f 
directionality of 

in bacteria, 978, 978f, 979f 

in eukaryotes, 992 
DNA polymerases in, 293f, 979-980, 982-984. 

See also DNA polymerase (s) 
DNA-binding proteins in, 984, 985f 
enzymology of, 979-984, 980f, 989t See also 

specific enzymes 
error-prone translesion, 1001-1002, 1002t 
in eukaryotes, 991-992 
helicases in, 984, 985f 
initiation of, 985-987, 985f, 986t 
lagging strand in, 979, 979f 

synthesis of, 987-988, 987f, 988f 
leading strand in, 979, 979f 

synthesis of, 987, 987f, 988f 
mistakes in, 980-981, 981f 
nick translation in, 983, 983f, 984, 989, 990f 
nucleophilic attack in, 979, 980f 
Okazaki fragments in, 979, 979f, 987-989, 987f 
phosphorylation in, 979, 980f 
primer in, 980, 981f, 984 
proofreading in, 981-982, 981f 
rate of 

in bacteria, 989 

in eukaryotes, 992 
replication fork in, 978, 978f, 979f, 987-989, 987f 

in bacteria, 978, 978f, 979f, 987-989, 987f 

damage from, 976-977, lOOlf 

in eukaryotes, 992, 995f 

repair of, 1004-1007, 1006f, 1012f, 1013, 1013f. 
See also DNA repair 

stalled, 989-991, 990f 
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DNA replication ( continued) 
replisomes in, 984, 988, 989 
reverse transcriptase in, 1050-1051 
RNA-dependent, 1050-1059 See also 

RNA replication 
rules of, 977-979 
semiconservative, 977 
site-specific, 1010-1013 
strand synthesis in, 979, 979f 
structural aspects of, 279-280, 280f 
template for, 977, 980, 980f 
Ter sequences in, 989-991, 990f 
termination of, 989-991 
topoisomerases in, 958-961, 958f-961f, 

968, 984 

Tus-Ter complex in, 989-991, 990f 

vs transcription, 1022 
DNA replication origin (oriC), 985f, 986-987 

in bacteria, 978, 978f 
DNA sequencing, 292-294, 293f 

amino acid sequencing and, 97f, 98-100 

automated, 294, 294f 
DNA strand exchange reaction, 1007-1008, 1009f 
DNA structure, 277-283 

antiparallel orientation in, 279 

in bacteria, 970f, 1003b 

base stacking in, 276-277, 278-279, 
279f, 280f 

compacted, 963-968, 970f 

cruciforms in, 282, 282f, 958, 958f 

early studies of, 278-280, 278f 

folding in, 968, 968f 

functional correlates of, 278, 279-280, 280f 
hairpins (loops) in, 282, 282f, 968, 968f 

in replication fork, 978, 978f 
helical. See DNA, double helix of 
Hoogsteen positions/pairings in, 282, 283f 
inverted repeats in, 281, 282f 
linking number and, 956-958, 956f, 957f, 965 
major/minor groove and, 279, 279f 
mirror repeats in, 282, 282f 
nuclear scaffolds and, 968, 968f 
palindromes in, 281, 282, 282f 
primary, 277-278 
ribbon model of, 958f 
secondary, 278 

strand complementarity in, 279, 279-280 
strand separation in, 956-958, 956f, 957f 
supercoiling and, 954-962. See also DNA, 

supercoiling of 
tetraplex, 282 
three-dimensional, 280-281 
topoisomerases and, 958-961, 958f, 985f 
triplex, 282, 283f 
twist in, 957 

underwinding and, 955-958, 955f 
variations in, 280-283, 281f 
Watson-Crick model for, 278-279, 

279f, 280f 
writhe in, 957 

x-ray diffraction studies of, 278, 279, 279f 
DNA synthesis. See DNA replication 
DNA transposition, 1004 
DNA typing, 319-321 

in forensic medicine, 319b-321b 
DNA unwinding element (DUE), 985 
DnaA protein, in replication initiation, 985f, 

986, 986t 
DnaB helicase 

in mismatch repair, 996f 

in replication initiation, 985f, 986, 986t, 

987, 988f, 989t 
DNA-binding domains/motifs 

in gene regulation, 1121-1124, 1122f 

helix-turn-helix, 1122 

homeodomain, 1123-1124, 1124f 

zinc finger, 1122-1123, 1123f 
DNA-binding proteins, 984, 985f 
DnaC protein, in replication initiation, 985f, 
986, 986t 

DNA-dependent RNA polymerase, 1022-1024, 
1023f, 1117 



DNA-dependent transcription, 1002-1030 

See also transcription 
DnaG protein, 986t, 987, 988f, 989t 
DnaK/DnaJ, in protein folding, 144, 144f 
DNases, 979 

Dobzhansky, Theodosius, 29 
dodecanoic acid, 344t 
Doisy, Edward A , 361, 36 If 
dolichols, 362s, 363 
domains, 135 
dopamine, 878 
double helix, 28, 28f 
DNA, 278-280, 279f, 280f. See also 
DNA structure 

supercoiling and, 954-962, 955f See also DNA, 

supercoiling of 
in transcription, 1023f 
underwinding of, 955-958, 955f 
unwinding of/rewinding of, 287-288, 287f, 288f, 

978-979, 979f See also DNA replication 
variations of, 280-281, 281f 
RNA, 284-285, 284f, 285f 
double-displacement mechanism, 200, 200f 
double-reciprocal plot, 197b, 198 

for enzyme inhibitors, 202b 
double-strand break repair model, 1016 
double- strand breaks 
error-prone translesion DNA synthesis repair for, 
1006-1007 

in recombination, 1006-1007, 1006f, 1016, 1016f 
recombinational DNA repair for, 1004r-1009, 1006f 
doxorubicin, 961b, 961s 
Drosophila melanogaster 
development in 
gene regulation in, 1146-1151 
pattern-regulating genes in, 1147-1152 
genome of, 949, 950t 
life cycle of, 1147, 1147f 
drug development, genomics/proteomics in, 
335-337 

drug metabolism, cytochrome P-450 in, 
816b, 916 

drug resistance 

ABC transporters and, 400 

plasmids in, 949 
drug therapy See specific drugs 
dTMP (deoxythymidine 5 '-monophosphate), 273s 
dynamic steady state, 20 
dyneins, 175 

e 

E (exit) binding site, ribosomal, 1089 

E coli See Escherichia coli 

El, E2, E3 enzymes, in protein degradation, 1107, 

1108f, 1109 
eating, hormonal control of, 930-937 
Edelman, Gerald, 172 
editing, mRNA, 1073-1074 
Edman degradation, 94f, 95 
Edman, Pehr, 95 
effectors, 1117 
Ehlers-Danlos syndrome, 128 
eicosanoic acid, 344t 

eicosanoids, 358-359, 358f, 906t, 907-908 See also 
leukotrienes; prostaglandin(s); thromboxanes 

in signaling, 819 

synthesis of, 817-819, 818f 
eicosatrienoate, synthesis of, 815f 
eIF4E binding proteins, 1145 
elasticity, of enzyme, 579b 
elasticity coefficient (e), 579b, 580 

relation of, to enzyme response, 580-581, 580f 
elastin, proteoglycans and, 254 
electrical charge 

of amino acids, 80-81 

pH and, 80-81 
electrical potential energy, of proton-motive 

force, 720f 
electrochemical gradient, 389f, 390, 390f 

free-energy change for, equation in, 720 

in membrane polarization, 450, 450f 



electrogenic transport, 396 
electrolytes, plasma levels of, 921 
electromagnetic radiation, 744, 744f 
electromotive force (emf), 508, 512 
electron(s) 

reduction potential affinity for, 514-515, 
514f, 515t 

transfer of, 512-513 
electron acceptors 

pyruvate as, in lactic acid fermentation, 546-547 

universal, 709, 709t 
electron carriers: See also electron-transfer reactions, 
mitochondrial 

soluble, NADH and NADPH as, 516-519, 51 7f, 
518f, 518t 

specialized, 516 

universal, coenzymes and proteins as, 516 
electron flow 
light-driven, 749-759 
acting in tandem, 752-754, 752f 
antenna chlorophylls in, 752, 754-755, 754f 
bacterial photochemical reaction center in, 

749-751, 750f, 751f 
in chloroplasts, 743-744 
cytochrome b 6 f complex links, 755-756, 755f 
kinetic and thermodynamic factors in, 751-752 
water split in, 756-758 
proton gradient in, 759-760, 760f 
through cytochrome b 6 f complex, 755f 
electron pushing, 208, 208f 
electronegativity, atomic, 208 
electroneutral transport, 395 
electron-transfer reactions 
ATP synthesis in, 723-724, 723f 
mitochondrial, 708-722 
alternative mechanism of NADH oxidation in, 

721,722b 
Complex I, 712-714, 713t, 714f 
Complex II, 715, 715f 
Complex III, 715-716, 716f 
Complex IV, 716-718, 718f 
in multienzyme complexes, 712-718 
in plants, 721 

through membrane-bound carriers, 710-712, 
710f-712f, 712t 
protein components of, 712, 71 3t 
proton gradient conservation of energy in, 

718-720, 720f 
universal electron acceptors in, 709, 709t 
electron-transferring flavoprotein (ETF), 654 
electrophiles, in enzymatic reactions, 208f, 

496, 496f 
electrophoresis, 88-90, 89f 
in cloning, 310 

in DNA sequencing, 292-294, 293f, 294f 
polyacrylamide gel, 88-89, 89f 
pulsed field gel, 310 
two-dimensional, 90, 91f 
two-dimensional gel, 325 
electroporation, 307 

electrospray ionization mass spectrometry, 

98-99, 99f 
electrostatic interactions, with water, 

46-47, 47f 
elements, essential, 11, llf 
elimination reaction, 497^98, 498f 
Elion, Gertrude, 894, 894f, 992 
ELISA, 174, 174f, 904 
ellipticine, 961b, 961s 
elongation, 1091 
elongation factors, 1091 

transcriptional, 1032, 1032t 
Elvehjem, Conrad, 519 
Embden, Gustav, 528 

embryonic development, gene regulation in, 
1146-1151 

gene silencing in, 1145-1146, 1146f 
enantiomers, 16, 16f, 72-73, 236 

monosaccharide, 236, 236f 
encephalomyopathy, mitochondrial, 740-741 
endergonic reactions, 22, 24, 25 

coupled with exergonic reactions, 22-24, 23f 
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Endo, Akira, 842b, 843f 

endocrine glands, 906, 909-911, 909f 

hormone release from, 909-910, 91 If 
endocrine hormones See also hormone (s) 
endocytosis, 9 

receptor-mediated, 840f, 841, 1106-1107, 1106f 
endomembrane system, 9 
endonucleases, 979 

in mismatch repair, 995 

restriction See restriction endonucleases 
endoplasmic reticulum 

fatty acid synthesis in, 812f, 814, 815 

lipid metabolism in, 812f, 814, 815 

co oxidation of fatty acids in, 664, 664f 

protein targeting in, 1100-1101, HOlf 
endosymbiosis, 33 

in eukaryotic evolution, 33, 34f 
endosymbiotic bacteria, chloroplast evolution 

from, 761-762 
energy, 20-25 

activation, 25, 25f, 187, 187f 
in membrane transport, 390, 390f 
rate constant and, 188 

ATP, 296-297, 296f, 506-507, 507f, 560f 

binding, 189 

biological transformations of, in thermodynamics, 

490-491, 491t 
bond dissociation, 44 
cellular, in oxidative phosphorylation 

regulation, 733 
chlorophyll funneled absorbed, to reaction 

centers by excitation transfer, 747-749, 748f 
conservation of, 20, 20f 
for dissolution, 47-48 
entropy and, 22,24 
free See free energy (G) 
Gibbs free energy, 490. See also 

free energy (G) 
informational macromolecules requirement 

for, 508 
interconversion of, 20f 
light, chlorophyll absorption of, 745-747, 

745f, 746f 

oxidative, in citric acid cycle, 630-631, 630f, 630t 

potential, of proton-motive force, 720 

preventing dissipation of, kinetic and thermody- 
namic factors in, 751-752 

of protein conformation, 114 

for protein folding, 114, 143, 143f 

proton gradient conservation of, in mitochondrial 
electron-transfer reactions, 718-720, 720f 

solar, 742, 743f 

sources of, 20 
energy metabolism, 570f 
energy-coupled reactions, 22-24, 23f 
enhancers, 1117, 1139f 
enolase, 538, 575t 

catalytic activity of, 213, 21 3f 

reaction mechanism of, 213f 
enoyl-ACP reductase, 809, 81 Of, 811f 
enoyl-CoA hydratase, 654 
enoyl-CoA isomerase, 657 
enteropeptidase, 676f 
enthalpy (H), 22,490 
enthalpy change (AH), 490^91 

units of, 491t 
entropy (S), 20f, 21b, 22, 490 

increase in, 22 

negative, 21 

protein stability and, 114-115 
solubility and, 47, 49, 49f 
entropy change (AS), 22 
entropy reduction, in enzymatic reactions, 

192, 192f 
env, 1051, 1051f, 1052, 1053 
enzymatic pathways, 25 

feedback inhibition in, 221-222, 22 If 
enzymatic reaction(s), 25, 25f, 26f, 186-194 
acid-base catalysis in 
general, 192, 193, 193f 
specific, 192-193 
activation energy of, 187, 189 



binding energy of, 189-192 
bisubstrate, 200, 200f 

chymotrypsin-catalyzed, 194, 204f, 205-209, 

207f-209f, 209f 
coordinate diagram for, 186-187, 187f 
covalent interactions in, 189 
desolvation in, 192, 192f 

double-displacement (Ping-Pong) mechanism in, 

200, 200f 
electron pushing in, 208, 208f 
enolase-catalyzed, 213, 213f 
entropy reduction in, 192, 192f 
equilibria of, 186-188, 188t 
evolution of, 31-32, 3 If 
first-order, 188 

free-energy change in, 23-24, 186, 187-188, 
187f, 188t, 191-192 

ground state in, 186 

hexokinase-catalyzed, 212, 212f 

inhibition of 
competitive, 201-203, 20 If, 202f, 203t 
reversible, 201-203, 201f, 202f, 203t 

intermediates in, 209 

lysozyme-catalyzed, 213-216, 214f, 215f 

mechanism-based inactivators in, 204 

mechanisms of, 208-209, 208f-209f, 209f 

metal ion catalysis in, 193-194 

noncovalent interactions in, 189 

pre-steady state, 195 

in protein purification, 88t, 91-92, 91f 

rate constants for, 188, 198, 199t 

rate equation for, 188 

rate of, 186-188, 187f See also enzyme kinetics 

acceleration of, 186-188, 188t 

measurement of, 201 
rate-limiting steps in, 188, 198 
reaction intermediates in, 187 
regulation of, 220-227, 571-577 See also 

enzyme (s), regulatory 

metabolic, 574 
in RNA processing, 1046f-1048f, 1047-1049 
steady-state, 195, 198 
steps in, 208-209, 208f-209f, 209f 
suicide inactivators in, 204 
transition-state, 25, 187, 189-191, 207, 209n, 

210b-211b, 210f 
weak interactions in, 49, 49f, 50, 189-191, 190f 
enzymatic reaction mechanisms, 208-209, 

208f, 209f 
chymotrypsin, 194, 204f, 207f-209f, 

210b-211b,2l0f 
definition of, 208 
enolase, 213, 213f 
hexokinase, 2l2,212f 
lysozyme, 213-216, 214f, 215f 
Phillips, 213-216 
enzyme(s), 25, 183-227, 184. See also specific en- 

zyme, e g , glucose 6-phosphate dehydrogenase 
active site of, 186, 186f 

transition- state complementarity and, 
210b-211b, 211f 
activity of, 88t, 91-92, 91f, 186-194 

catalytic, 25, 26f, 186-194 See also catalysis; 
enzymatic reactions 

evolution of, 31-32, 31f 

specific, 88t, 91-92, 91f 
adenylation of, 223, 223f 
ADP-ribosylation of, 223, 223f 
allosteric, 220-222, 22lf, 222f 

conformational changes in, 220-221, 221f 

feedback inhibition by, 221-222, 221f 

kinetics of, 222, 222f 
in carbohydrate metabolism, 574, 575t 
classification of, 184-185, 185t 
coenzymes and, 3, 184, 185t See also 

coenzyme (s) 
cofactors for. See enzyme cofactors 
contributing to flux, 578, 578f 
debranching, 544, 596 
definition of, 184 
early studies of, 184 
elasticity of, 579b 



in fatty acid oxidation, 649-652, 650f-652f, 663f 

evolution of, 663-664 
feedback inhibition of, 221-222, 221f 
functions of, 25, 26, 26f 
glycine cleavage, 864 
hepatic, 912-916 
in hydrolysis reactions, 65 
lysosomal, targeting of, 260 
methylation of, 223, 223f 
microbial target, in drug development, 337 
in nitrogenase complex, 854-857, 855f, 856f 
nomenclature of, 184-185, 184t, I85t, 

816b-817b 

in nucleotide biosynthesis, chemotherapeutic 

agents targeting, 894-896, 894f, 895f 
overview of, 183-185 
pH of, 204, 204f 
pH optimum of, 63 

phosphorylation of, 223f, 224-226, 226f, 

226t, 227f 
prosthetic groups and, 184 
in protein degradation to amino acids, 

674-677, 676f 
in protein folding, 144 
proteolytic, regulation of, 227, 227f 
purification of, 88t, 91-92 
in pyruvate dehydrogenase complex, 618-619 
receptor, 439-446. See also receptor enzymes 
in recombinant DNA technology, 305t, 337 
in redox reactions, 816b-817b 
regulatory, 220-227 

allosteric, 220-222, 221f, 222f 

complex, 227 

covalent modification of, 220, 223-227, 223f , 

226f, 226t, 227f 
functions of, 220, 221-222 
kinetics of, 222, 222f 

phosphorylation and, 223f, 224-226, 226f, 

226t, 227t 
proteolytic cleavage and, 227, 227f 
unique properties of, 220 
response of 
elasticity coefficient related to, 579b-580b, 

580f 

to metabolite concentration, 574, 575t 

restriction See restriction endonucleases 

RNA, 31-32, 31f, 1033-1034, 1036-1039, 
1046f-1048f, 1047-1049, 1056-1059, 1078b 

selectivity of, 25 

in signaling, 420, 420f 

specificity of, 191 

structure of, 184 

turnover number for, 198, 199t 

unique characteristics of, 154 

in urea synthesis, 682-684, 683f, 684f 
genetic defects and, 686-687, 687f 

uridylylation of, 223, 223f 
enzyme activity 

change in, in metabolite flux, 578-580, 
578f, 580f 

factors determining, 571-574, 572f 
enzyme cascade, 420, 420f, 431, 603 

MAPK, 440, 441f 

in plants, 460, 460f 
enzyme cofactors, 184, 184t 

adenosine as, 297-298, 297f 

in amino acid catabolism, 689-692, 689f, 690f 

lipids as, 357, 361 
enzyme inhibitors, 201-204, 201f 

competitive, 201-203, 201f, 202f, 203t 

in feedback inhibition, 221-222, 221f 

irreversible, 203-204 

mixed, 20 If, 202f, 203, 302t 

reversible, 201-203, 201f, 202f, 203t 

uncompetitive, 201f, 202f, 203, 203t 
enzyme kinetics, 194-205 

comparative, 199 

initial velocity and, 194^195, 194f, 195f 
maximum velocity and, 195-198, 195f 
mechanism-based inactivators in, 204 
Michaelis-Menten, 197-200 
Michaelis-Menten equation for, 196-197, 197f 




enzyme kinetics ( continued ) 
pH and, 204, 204f 
pre-steady- state, 195 
rate equation for, 196-197, 197f 
rate measurement and, 201 
of regulatory enzymes, 222, 222f 
steady-state, 195, 198 
steady-state assumption and, 196 
substrate concentration and, 194-197, 
195f, 197f 

reaction rate and, 194-197, 195f, 197f 
for regulatory enzymes, 222, 222f 
suicide inactivators in, 203-204 
transition-state, 25, 187, 189-191, 207, 209n, 
210b-211b, 210f 
enzyme multiplicity, 872 
enzyme-linked immunosorbent assay (ELISA), 

174, 174f, 903 
enzyme-substrate complex, 186, 186f, 189-191 
active site in, 154, 186, 186f 
transition-site complementarity and, 
210b-211b, 211f 
binding energy of, 49, 49f, 50, 51f, 189-192 
rate acceleration and, 189-191 
specificity and, 191-192 
bisubstrate, 200, 200f 
dissociation constant for, 198 
induced fit in, 192, 212, 212f 
lock-and-key configuration in, 189, 189f 
ordered water and, 49, 49f 
substrate concentration in 
reaction rate and, 194-197, 195f, 197f See also 

enzyme kinetics 
in regulatory enzymes, 197f, 222, 222f 
in transition state, 189-191 
transition-state analogs for, 210b-211b, 21 If 
turnover number for, 198, 199t 
weak interactions in, 49, 49f, 50, 189-191, 190f 
enzyme-transition-state complementarity, 207, 

210b-211b 
E Q (standard reduction potential), 514 
of biologically important half-reactions, 515t 
to calculate free-energy change, 515-516 
measurement of, 514, 514f 
epidermal growth factor receptor, 449 

oncogenic mutations in, 472f, 474, 475 
epigenetics, 966b-967b 
epimers, 237f, 238 
epinephrine, 423s, 878, 907 
cascade mechanism of, 604f 
in glucose metabolism, 928-929, 928t 
in lipid metabolism, 928-929, 928t 
as neurotransmitter vs hormone, 902 
regulation of, 423-431, 424f 
as signal amplifier, 905 
synthetic analogs of, 424f 
epinephrine cascade, 428-429, 429f 
epitope, 171 

epitope tagging, 316f, 317, 328 
equilibrium, 23 

equilibrium constant (if eq ), 24, 55, 188 
for ATP hydrolysis, 495 
calculation of, 491^92, 495 
in carbohydrate metabolism, 574, 575t 
free-energy change and, 187-188, 188t, 491^93, 

492t, 493t 
for two coupled reactions, 495 
for water ionization, 55 

equilibrium expression, 155 

erbB2 oncogene, 474 

ergocalciferol (vitamin D 2 ), 360, 360f 

ERK, 430, 441f 

erlotibin, 476b 

error-prone translesion DNA synthesis, 

1001-1002, 1002t 
erythrocytes, 921, 921f 
aquaporins of, 404, 406t 
chloride-bicarbonate exchanger in, 384f, 

395, 395f 
culling of, 259 
formation of, 158 

glucose transport in, 391-393, 392f, 394f 



glycolytic reactions in, free-energy 

changes of, 553t 
glycophorin in, 376, 376f 
G6PD-deficient, Plasmodium falciparum 

inhibition of, 559 
membrane proteins of, 376, 376f 
shape of, 168f 

synthesis of, JAK-STAT pathway in, 
443-444, 443f 
erythropoietin, recombinant, 337 
erythropoietin receptor, 443-444, 443f 
erythrose, 237s 

erythrose 4-phosphate, in Calvin cycle, 

779, 780f, 781f 
erythrulose, 237s 

ES complex. See enzyme -substrate complex 
Escherichia coli See also bacteria 
citric acid and glyoxylate cycles in, 640 
cloning in, 304-312 
DNA in, 949, 950f, 950t, 970 
packaging of, 970, 970f 
topoisomerases and, 959 
DNA replication in, 987-991 
fatty acids in, 382t 

synthesis of, 806-811 
F 0 Fj complexes of, 739 
genetic map of, 976f 
genome of, 949, 950f 
sequencing of, 323f 
inorganic polyphosphate in, 511 
lac operon in, 1119-1121, 1119f 

regulation of, 1126-1127, 1127f 
lactose transporter in, 400t, 402, 402f 
lipopolysaccharides of, 256-257 
membrane proteins of, structure of, 379f 
metabolism in, 26 
as model organism, 5-6 
phospholipid synthesis in, 825, 826f, 829f 
protein folding in, 144, 144f, 145f 
protein targeting in, 1104-1106, 1106f 
ribosome of, 6f, 1076-1079, 1077f 
RNase P of, 285f 
signaling in, 457-458, 458f 
structure of, 5-7, 6f 
transcription in, 1022-1030, 1023f 
Escherichia coli expression vector, 312, 312f 
Escherichia coli plasmid vector, 307-308, 

307f, 308f 
ESI MS, 98-99, 99f 
essential amino acids, 860. See also 

amino acid(s) 
essential fatty acids, 815-817 
esters 

hydrolysis of, transition state in, 211, 211f 

oxygen, free-energy hydrolysis of, 505-506, 506f 

standard free-energy changes of, 493t 
estradiol, 359, 359s, 459s, 908 
estrogens, 359, 359s, 459s, 908 

synthesis of, 844-845, 844f 
ETF: ubiquinone oxidoreductase, 713f, 715 
ethane, 513s 
ethanol See alcohol (s) 
ethanolamine kinase, mutations in, 829 
ether lipids, 350, 351f 
ethylene receptor, in plants, 460f 
etoposide, 961b 
euchromatin, 966, 1136 
eukarya, 4-5, 4f 
eukaryotes 

cell structure in, 6f, 7-9 

evolution of, 33, 33f 
eukaryotic DNA, 949-951, 950t 
evaporation, 44t, 45 
evolution, 29-36 

adaptation to aqueous environments in, 65-66 

adenine in, 1057, 1057f 

amino acid sequences in, 102-106, 104f-106f 

amino acid substitutions in, 104 

of bacteria, 32-33, 33f, 761-764 

cellular specialization in, 33 

chemoheterotrophs in, 32 

of chloroplasts, 761-762 



divergence of /3-oxidation enzymes during, 
663-664 

endosymbiosis in, 33, 34f, 739, 761-762 

eukaryotic, 33, 33f, 34f 

in Galapagos finches, 1152b-1153b 

genetic divergence in, 34-35 

genomics and, 33-36, 35t 

of immune system, 1014-1016 

introns in, 1052-1053 

lateral gene transfer in, 104 

Miller-Urey experiment in, 30-31, 31f 

mitochondrial, 33, 34f 

from endosymbiotic bacteria, 739 
molecular, 102-106 See also evolution 

amino acid sequences and, 102-106, 
104r-106f 

amino acid substitutions in, 104 

homologs in, 104-105 

lateral gene transfer in, 104 
of molecular parasites, 1058-1059 
mutations in, 29-30, 30f, 33-35, 1152b-1153b 
nucleic acids in, 30-32, 1056-1059 
photosynthesis in, 32, 34f, 761-764, 763f 
prebiotic, 31-32 
prokaryotic, 32-33, 33f 
protein families and, 138 
protein homologs in, 104-105 
proteins in, 1056-1059 
retrotransposons in, 1052-1053 
retroviruses in, 1052-1053 
RNA world hypothesis and, 3 If, 32, 

1057-1059 
time line of, 33, 33f 

in vitro, 1056-1059, 1058b, 1058b, 1078b 
evolutionary trees, 105-106, 106f 
excitation transfer, light absorption funneled to 

reaction centers by, 747-749, 748f 
excited state, 744 
exciton, 744 
exciton transfer, 745 
exergonic reactions, 22, 24, 25 

in citric acid cycle, 636-637 

in conversion of pyruvate to phosphoenolpyru- 
vate, 553-557, 553t, 554f, 555f 

coupled with endergonic reactions, 22-24, 23f 
exinucleases, 997-998 
exit (E) binding, ribosomal, 1089 
exocytosis, 9 
exons, 952, 1035 

transcription of, 1035 
exonuclease(s), 979 

in mismatch repair, 995, 996f 
exonuclease III, 305t 
expressed sequence tag (EST), 315 
extracellular matrix 

glycosaminoglycans in, 249-251, 250f, 
251f, 252 

proteoglycan aggregates in, 254, 255f 
proteoglycans in, 252-255, 255f 

f 

F x component, of ATP synthase, 725, 726f 
stabilization of ATP relative to ADP on, 
725-726, 726f 
Fab fragment of immunoglobin, 171, 171f, 173f 
Fabry's disease, 356 

facilitated diffusion, 390-391, 390f. See also 

transporter (s) 
F-actin, 176, 176f 

in muscle contraction, 178 
FAD (flavin adenine dinucleotide), 297s, 520s 
familial hypercholesterolemia, 843-844, 843b 
Fanconi-Bickel disease, 599t 
Faraday constant, 396, 49lt 
farnesyl groups, membrane attachment of, 380f 
farnesyl pyrophosphate, in cholesterol synthesis, 

834, 834f 
Fas receptor, in apoptosis, 478, 478f 
fasting state, glucose metabolism in, 925-928, 926f, 

92 7f, 928f 
fast-twitch muscle, 918 
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fat(s), body See also Adipose tissue 

body See also adipose tissue; body mass 
heat generated by, in oxidative phosphorylation 
regulation, 736, 736f 

metabolic pathways of, 913t 

rancid, 348 
fat cells See adipocytes 
F r ATPase, 725 

structure of, 733, 733f 
fatty acid(s), 343-349 See also lipid (s); 
triacylglycerol (TAG) 

activation and transport of, 650-652, 651f 

as amphipathic compounds, 48-49, 48f 

analysis of, 364f, 365 

(3 oxidation of, 809, 820 

body stores of, 92 7t 

cleavage of, 355, 355f 

desaturation of, 815-817, 815f, 81 7f 

double bonds of, 344, 344t 

mE, coli cells, 382t 

essential, 815-817 

free, 345, 649 

in glycerophospholipids, 350-352 
as hydrocarbon derivatives, 343 
as lipid anchors, 379-380, 380f 
melting point of, 344t, 345, 348f 
mobilization of, 821-822, 822f, 823, 

916-917 

glucagon in, 926 
nomenclature for, 343-344, 343f, 344t 
omega-3, 345 
omega-6, 345 
packing of, 345, 345f 
physical properties of, 344t, 345 
polyunsaturated, 345 
reesterification of, 821-822 
saturation of, 348, 348f 
in signaling, 819 
solubility of, 344t, 345 
structure of, 343-344, 344t 
synthesis of, 805-817 
trans, 347-348, 348f, 348t 
in triacylglycerols, 345f, 346-348, 346f, 348f. 

See also triacylglycerol (s) 
fatty acid metabolism 
in adipose tissue, 821-822, 822f, 823, 909f, 

916-917, 927f, 928f 
in brain, 920 
Cortisol in, 929 
in diabetes mellitus, 542f 
epinephrine in, 928-929, 928t 
in fasting/starvation, 927f, 928f 
in liver, 914-915, 915f 
in muscle, 918-919, 918f 
pathways of, 913t 
fatty acid oxidation, 647, 652-665 
a, 664-665 

in peroxisomes, 664-665, 665f 

in bears, 655 

enzymes of, 663-664, 664f 

in peroxisomes, 662-663, 662f 

plants and, 662-663, 663f 

steps of, 653-655, 653f 

yielding acetyl-CoA and ATP, 654-655, 656t 
complete, 657, 658b-659b 
enzymes of, 649-652, 650f-652f, 650t-652t 
of monounsat urate d fat, 656-657, 656f 
odd-number, 657, 660f 
w, in endoplasmic reticulum, 664, 664f 
of polyunsaturated fats, 656-657, 657f 
regulation of, 660 
stages of, 653f 

of unsaturated fats, 656-657, 656f, 657f 
fatty acid synthase, 806-811 

associated proteins of, 808-809 

in plants, 812 

variants of, 806-808, 807f 
fatty acid synthesis, 916-917 

acetoacetyl-ACP in, 809, 81 Of, 81 If 

acetyl-CoA carboxylase in, 805-808, 806f, 814, 
814f, 822f 



acetyl-CoA in, 805-808, 806f, 811, 822f, 922 
acetyl-CoA-ACP transacetylase in, 81 Of, 811f 
acyl carrier protein in, 808, 809f, 810f 
adipose tissue in, 916-917 
in bacteria, 806-811, 815 
jS-hydroxyacyl-ACP dehydratase in, 809, 
810f, 81 If 

j8-ketoacyl-ACP reductase in, 809, 810f, 81 If 
j3-ketoacyl-ACP synthase in, 809, 810f, 81 If 
butyryl-ACP in, 809, 810f, 81 If 
carbonyl group reduction in, 809, 81 Of, 

811, 811f 
condensation in, 809, 81 Of 
incytosol, 811,812f 
dehydration in, 809, 8l0f 
in diabetes mellitus, 821-822 
double-bond reduction in, 809, 810f 
in endoplasmic reticulum, 812f, 814, 815 
enoyl-ACP reductase in, 809, 81 Of, 811f 
fatty acid synthase in, 806-811 
fatty acyl chain elongation in, 806-808, 814, 

822f, 823f 
in hepatocytes, 815 
insulin in, 922 

malonyl/acetyl-CoA-ACP transacetylase in, 809 
malonyl/acetyl-CoA-ACP transferase in, 81 If 
malonyl-CoA in, 814 
malonyl-CoA-ACP transferase in, 810f 
palmitate in, 808, 811, 81 If, 814, 814f 
palmitate synthesis in, 808, 808f, 811 
in plants, 812, 812f 
regulation of, 814, 814f, 821-822 
steps in, 806-808, 809, 81 Of, 822f 
£rcms-A 2 -butenoyl-ACP in, 809, 810f, 81 If 
in vertebrates, 814f 
in yeast, 814f 

fatty acyl-CoA, 650 
conversion of fatty acids into, 650-651, 651f 
in triacylglycerol synthesis, 820-821, 820f 

fatty acyl-CoA dehydrogenase, genetic defects in, 
661,662 

fatty acyl-CoA desaturase, 815 

fatty-acyl-CoA synthetase, reaction mechanism of, 
650-651 

Fc region of immunoglobin, 171, 171f, 172, 172f 
feedback inhibition, 26 

allosteric, 221-222, 221f 

in purine nucleotide biosynthesis, 885f, 886-887 

in pyrimidine nucleotide biosynthesis, 887, 887f 

sequential, 873 
Fehling's reaction, 241b 
fermentation, 528, 547, 547-551 

in beer brewing, 549 

ethanol (alcohol), 528, 530f, 547-549, 547f 

foods resulting from, 550-551 

industrial-scale, 549f, 551 

lactic acid, 530 
pyruvate in, 546-547 

thiamine pyrophosphate in, 549-550, 
550f, 551t 
ferredoxin, 751, 753, 856 
ferredoxin:NADP + oxidoreductase, 754 
ferredoxin-thioredoxin reductase, 785, 785f 
ferritin, 624b 

ferrous ion, oxidation of, 512-513 
Fe-S reaction center, 751 See also 

iron-sulfur entries 
fetus, hemoglobin-oxygen binding in, 167 
fibrin, 361 
fibrinogen, 227,361 

fibroblast growth factor, 25 If, 254, 254f 
fibroin, 128, 128f 

fibronectin, proteoglycans and, 254, 255f 
fibrous proteins, 123-128 

coiled coils in, 123-124, 124f 

collagen as, 124-128, 124f 

fibroin as, 128, 128f 

in hair, 123-124, 124f, 125, 125f 

a-keratinas, 123-124 

polypeptide chain arrangement in, 123 

structure of, 124-128, 124f 
functional correlates of, 123t 



secondary, 117-122, 118f, 120f-122f, 123t 

tertiary, 123-131, 124f, 125f, 128f 
fight-or-flight response, epinephrine in, 

928-929, 928t 
filaments, cytoskeletal, 9, 9f 
firefly bioluminescence cycle, 509b 
first law of thermodynamics, 20 
Fischer, Emil, 74, 189 
Fischer projection formulas, 236, 236f 
5' cap, 1034-1035, 1066f 
5' end, 275, 275f 

5'->3' exonuclease activity, in DNA polymerases, 

983-984, 983f 
fixation, in nitrogen cycle, 852 
flagella, 6f 
flagellar motion, 175 

flavin adenine dinucleotide (FAD), 297s, 520s 
flavin mononucleotide (FMN), 520f 
flavin nucleotides, 519-520, 709 

bound to flavoproteins, 519-520, 520f, 520t 
flavopiridol, 476b 

flavoproteins, 84, 85t, 519-520, 709 
electron-transferring, 654 
flavin nucleotides bound to, 519-520, 
520f, 520t 

flickering clusters, 44, 48f 

flip-flop diffusion, of membrane lipids, 
381-382, 382f 

flippases, 382, 382f, 400 

fioppases, 382f, 383 

fluid mosaic model, 373, 374f 

fluorescence, 745 

fluorescent resonance energy transfer (FRET), 

435b-436b 
L-fluoroalanine, 878 
fluorouracil, 89fs, 894 
flux (J), 571, 578, 579b-580b 

enzymes contributing to, 578 

glycolytic, 578f 

increased, metabolic control analysis prediction 
of, 579b-580b 

metabolite, change in enzyme activity on, 
578-580, 578f, 580f 

response coefficient effect on, 579b-580b 
flux control coefficient (C), 578, 579b, 580b 
fMet-tRNA, in protein synthesis, 1088 

posttranslational modification of, 1096 
FMN (flavin mononucleotide), 520f 
F 0 component, of ATP synthase, 725 

rotation of, 729, 730f 
FqFx complex, of Escherichia coli, 739 
folate, deficiency of, 691-692, 891 
folate metabolism, as chemotherapy target, 895f 
folds, 131-138, 136f-137f. See also protein folding 
food 

fermentation yielding, 550-551 

trans fatty acids in, 347-348, 348t 
footprinting, 1026b 
Forbes' disease, 599t 
forensic medicine, DNA fingerprinting in, 
319-321 

forkhead box other (FOXOl), 592, 592f 
formaldehyde, 513s 
formic acid, 513s 

A^-formylmethionyl-tRNA fMet , in protein 
synthesis, 1088 

posttranslational modification of, 1096 
FOXOl (forkhead box other), 592, 592f 
fractionation 

cellular, 8-9, 8f, 52 

protein, 85-90, 86f 
frameshifting, 1072 
Franklin, Rosalind, 278, 278f 
FRAP technique, 383, 383f 
free energy (G), 23, 490 

absolute temperature and, 23 

cell sources of, 491 

enthalpy and, 22 

entropy and, 22 

Gibbs, 490 

of hydrolysis, 501-503, 504f, 505-506, 505f, 
505t, 506f 
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free fatty acids (FFAs), 345, 649. See also 

fatty acid(s) 
free radical reaction, 498 
free radicals, 498, 499f 
free-energy change {AG), 23, 23-24, 186, 

187f, 490 

in ATP hydrolysis, 502-503, 502f, 504f 
in carbohydrate metabolism, 575t 
of electrochemical gradient, equation 
for, 720 

in enzymatic reactions, 23-24, 186-187, 187-188, 

187f, 188t, 191-192 
of esterification, 493t 

of glycolytic reactions in erythrocytes, 553t 

in membrane transport, 396 

in protein synthesis, 1095 

vs standard free-energy change, 493-494 See 
also standard free-energy change 
free-living bacteria, nitrogen-fixing, 856-857 
FRET (fluorescent resonance energy transfer), 

435b^36b 
fructokinase, 545 
fructose, 235, 236s, 237s, 238 
fructose 1,6-bisphosphatase, 556 

light activation of, 784-785, 785f 
fructose 2,6-bisphosphatase, 588 
fructose 1,6-bisphosphate, 532s, 533s 

in Calvin cycle, 779, 780f 

cleavage of, 533-534, 534f 

conversion of to fructose 6-phosphate in 
gluconeogenesis, 553t, 556 

in glycolysis, 529f, 530 

phosphorylation of fructose 6-phosphate 
to, 532 

regulation of, 585-586, 587f 
fructose 2,6-bisphosphate (F26BP), 587, 587s, 
793, 793f 

in regulation of glycolysis and gluconeogenesis, 
587-588, 587f 

in sucrose synthesis, 793, 793f 
fructose 1,6-bisphosphate aldolase, 533 
fructose 1 -phosphate, 545s 
fructose 6-phosphate, 532, 532s 

in Calvin cycle, 779, 780f, 78lf 

conversion of fructose 1,6-bisphosphate to, in 
gluconeogenesis, 553t, 556 

conversion of glucose 6-phosphate to, 
532, 533f 

phosphorylation of, to fructose 
1,6-bisphosphate, 532 

in sucrose synthesis, 793, 793s, 798f, 799 
fructose 1-phosphate aldolase, 545 
j3-D-fructuranose, 239 
fruit fly 

development of 
gene regulation in, 1146-1151 
pattern-regulating genes in, 1147-1152 

genome of, 949, 950t 

life cycle of, 1147, 1147f 
F-type ATPase. See also ATP synthase (s) 

in membrane transport, 399, 399f 
fiz, 1150, 1150f 
fucose, 240s 
fumarase, 628 
fumarate, 628 

glucogenic amino acids in, 557t 

oxidation of succinate to, 628 
fumarate hydratase, 628 
fumaric acid, 15s 
functional genomics, 35 
functional groups, 12f, 13, 13f 
fungi, cellulase in, 247, 247f 
furanoses, 239, 239f 
Sushi tarazu, 1150, 1150f 
fusion proteins, 313, 387 
futile cycles, 583, 822 

triacylglycerol cycle as, 822 

9 

G (free energy) See free energy (G) 
y light chains, immunoglobulin, 172 



Gprotein(s),425b^27b 
binary switches in, 425b-427b 
disease-causing defects in, 426b-427b 
Gj (inhibitory), 431 
G olf , 466-467, 466f 
G q , 432 

G s (stimulatory), 424-427, 424f 
adenylyl cyclase and, 426 
self-inactivation of, 426-427 
Ras-type, 425b, 440, 441f 
small, 440 
Ras-type, 425b, 440, 441f 
in signaling, 440 
G protein-coupled receptor(s), 422, 422f, 423, 
423^31,467-168 
jS-adrenergic receptor as prototype of, 423—131 
evolutionary significance of, 467 
heptahelical, 424 
serpentine, 424 
G protein-coupled receptor kinases (GRKs), 431 
G tetraplex, 282 
y turns, 121 

GABA (y-aminobutyric acid), 878 

receptor for, as ion channel, 410 
G-actin, 176, 176f 
gag, 1051, 1051f 

frameshifting and, 1072 
gag-pol, frameshifting and, 1072 
GAL genes, regulation of, 1141-1142 
D-galactitol, 546s 
galactokinase, 545 
galactolipids, 352f 
galactosamine, 240, 240s 
galactose, 237, 237s, 238 

conversion of to glucose 1-phosphate, 554f 

epimers of, 237s 

oxidation of, 240-241, 240f 
galactose metabolism genes, regulation of, 

1141-1142, 1141f 
galactosemia, 545 

/3-galactosides, lac operon and, 1119-1121 
Galapagos finches, beak evolution in, 

1152b-1153b 
Gal4p 

acidic activation domain of, 1142-1143 

in yeast two-hybrid analysis, 329 
-y-aminobutyric acid (GABA), 878 

receptor for, as ion channel, 410 
ganglioside(s), 256, 350f, 353f, 354, 355f, 829 

functions of, 354-355 

lysosomal degradation of, 355, 355f 

structure of, 354s 

synthesis of, 829 
ganglioside GM2, 353f 

in Tay-Sachs disease, 356, 356f 
gangliosidosis, 356 
gap genes, 1147, 1150 
GAPs (GTPase proteins), 426b 
gas constant (R), 491t 
gases, solubility of, 47, 47t 
gas-liquid chromatography, 364f, 365 
gastric ulcers, 260, 260f 
gastrin, 675 

gastrointestinal tract, 676f 
GATC sequences 

in mismatch repair, 994-996, 994f-996f 

in replication, 986 
Gaucher's disease, 356 
GCN5- AD A2 - AD A3 , 1137t 
GDGT (glycerol dialkyl glycerol tetraether), 

352, 353f 
GDP (guanosine 5 '-diphosphate) 

in ^-adrenergic pathway, 424, 424f 

in olfaction, 466^67, 466f 

in vision, 463-464, 464f 
gel electrophoresis See electrophoresis 
gene(s), 271, 948, 948. See also protein(s) 

bacterial, 952 
mapping of, 976f 
naming conventions for, 976 

caretaker, 477 

chromosome population of, 949 



cloned. See also cloning 

alteration of, 312-313, 313f 

expression of, 312, 312f 
definition of, 948 

biochemical, 948 

molecular, 948 
defintion of, 273 

evolutionary divergence of, 34-35 
exons in, 952, 1035 

transcription of, 1035 
functional analysis of, 324-329. See also 

protein function 
functional classification of, 35 
functionally related, identification of, 328 
gap, 1147, 1150 

homeotic, 1148, 1150-1151, 1151f 
homologous, 34 
housekeeping, 1116 

immunoglobulin, recombination of, 1014-1016, 

1015f, 1016f 
introns in. See introns 
jumping, 1004, 1013 

maternal, 1147, 1148-1150, 1148f, 1149f 
mutation of See mutations 
naming conventions for, 976 
orthologous, 34, 325 
pair-rule, 1147 
paralogous, 34, 325 
pattern-regulating, 1147-1152 
segment polarity, 1147-1148, 1150 
segmentation, 1147 
size of, 948, 948f 
stability, 477 
gene expression, 1116 
of cloned genes, 312, 312f, 313f 
constitutive, 1116 
definition of, 315 
induction of, 1116 

regulated, 1116. See also gene regulation 

repression of, 1116 
gene inactivation studies, 324 
gene products 

inducible, 1116 

repressible, 1116 
gene regulation, 1115-1154 

acidic activation domain in, 1142-1143 

activators in, 1117-1118, 1117f 

antigenic variation in, 1135t 

catabolite repression in, 1126 

chromatin in, 1136-1138 

coactivators in, 1139, 1139f 

in development, 1146-1151 
gene silencing in, 1145-1146, 1146f 

DNA-binding domains in, 1121-1124, 
1122f-1124f 

effectors in, 1117 

enhancers in, 1117, 1139 

in eukaryotes, 1136-1154 
steps in, 1140 

histone in, 1136-1138 

hormonal, 456-157, 1143-1144, 1144f 

host range, 1135t 

inducers in, 1120-1121 

induction in, 1116 

insulin in, 440, 441f, 606, 1144 

mating-type switch in, 1135t 

mRNA concentration in, 1049-1050 

mRNAin, 1132-1134 

negative, 1117, 1118f 

operators in, 1117 

operons in, 1119-1121 
regulation of, 1126-1127, 1127f 

phase variation in, 1134, 1135f 

positive, 1118, 1118f, 1138 
in eukaryotes, 1138 

principles of, 1115-1125 

in prokaryotes, 1126-1136 

proline-rich activation domains in, 1143 

protein-protein interaction domains in, 1124-1125 

recombinational, 1134, 1135t 

regulons in, 1127 

repression in, 1116 
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repressors in, 1026, 1029, 1117, lllSf, 1122, 
1140-1141 

translational, 1131-1132, 1144-1145, 1148 
riboswitches in, 1133-1134 
RNA interference in, 1145-1146, 1146f 
second messengers in, 1132 
signaling in, 1132, 1143-1144 
site-specific recombination in, 1004, 1010-1013, 

101 If, 1012f 
SOS response in, 1001, 1002t, 1130-1131, 1130f 
specificity factors in, 1117 
stringent factor in, 1132, 1132f 
stringent response in, 1132, 1132f 
subunit mixing in regulatory proteins 

in, 1125 

TATA binding protein in, 1138, 1139, 1140 

transcription activators in, 1139, 1142-1144 

transcriptional attenuation in, 1127-1128, 
1127f-1129f 

translational repression in, 1131-1132, 1131f, 
1140-1141, 1148 

translational repressor in, 1131-1132 

upstream activator sequences in, 1140 

in yeast, 1141-1142, 1141f 
gene silencing, by RNA interference, 

1145-1146, 1146f 
gene therapy, 335-336 
gene transfer, lateral, 104 
general acid-base catalysis, 193, 193f 
general recombination See homologous genetic 

recombination 
general transcription factors, 1030-1031, 

1031f, 1032t 
genetic code, 1066-1074 

base composition in, 1067 

base sequences in, 1067 

codons in, 1066-1067, 1066f 

cracking of, 1066-1069 

degeneracy of , 311f, 1069, 1070t 

expansion of, 1085b-1087b 

overlapping, 1066f 

reading frames in, 1067, 1067f 

second, 1083-1084 

triplet (nonoverlapping), 1066f, 1067 

universality of, 1066f, 1069, 1070t 

variations in, 1070b-1071b, 1094b 

wobble and, 1072 
genetic counseling, for inborn errors of 

metabolism, 356 
genetic defects 

in amino acid catabolism, 694t, 696-698, 697f 

in fatty acyl-CoA dehydrogenase, 661, 662 

in urea cycle, 686-687 
treatment of, 687f 
genetic diseases 

gene therapy for, 335-336 

genetic counseling for, 356 

inborn errors of metabolism in, 356 

protein misfolding in, 145-148 
genetic engineering, 304. See also cloning 

practical applications of, 330-338 
genetic map, of E coli, 976f 
genetic mutations. See mutations 
genetic recombination See also DNA recombination 

functions of, 1004-1005 

homologous, 1003-1005 

site-specific, 1004, 1010-1013, lOllf, 1012f 
genetics, overview of, 27-29 
genome, 3, 33 

annotated, 34 

bacterial, 949, 950t 
sequencing of, 323, 323f 

components of, 323, 323f, 952, 952f 

eukaryotic, 949-951, 950t 

evolution of, 33-36 

mapping of, 317-324, 322f 

sequencing of, 33-35, 35t, 317-324, 322f, 323f 
by mapping, 317-324, 322f 
polymerase chain reaction in, 317 
shotgun, 322 

syntenyin, 325, 325f 

viral, 947, 949, 949t, 950t 



yeast, 949, 950t 
sequencing of, 323, 323f 
genomic library, 315 
genomic mapping, for E. coli, 976f 
genomics, 303, 315 

applications of, 35-36, 335-338 

comparative, 35-36, 326, 328, 328f 

functional, 35 
geometric isomers, 15 

geranyl pyrophosphate, in cholesterol synthesis, 

832f, 834, 834f 
geranylgeranyl groups, membrane attachment of, 

380, 380f 

germination, seed, triacylglycerols in, 663, 663f 

ghrelin, 932f, 937, 937f 

Gi (inhibitory G protein), 431 

Gibbs free energy (G), 490. See also free 

energy (G) 
Gibbs, J. Willard, 22 
Gilbert, Walter, 292 
Gilman, Alfred G , 424, 425, 425f 
Gleevec, 476b 

globin(s), 155, 155f. See also hemoglobin; 
myoglobin 

structure of, 138 
nuclear magnetic resonance studies of, 

133-134, 133f, 134f 
x-ray diffraction studies of, 132b-133b, 
132f-133f 
globosides, 350f, 353f, 354, 355f 
globular proteins, 123, 129-136 

£ turns in, 121, 121f 

diversity of, 129 

folding of, 128, 129-135, 130f 

functions of, 129-136 

hydrophobic interactions in, 130, 130f 

in large proteins, 131-138, 135f, 136f 

myoglobin as, 129-131, 130f 

polypeptide chain arrangement in, 123 

small, structure of, 129-136 

in small proteins, 129-135, 131t 

structure of, 128, 129-138, 130f 
glucagon, 587, 922, 923 

cascade mechanism of, 604f 

in cholesterol regulation, 842, 842f 

in fatty acid mobilization, 926 

in glucose regulation, 925-926, 926f 
glucocorticoids, 359, 359f, 359s, 906t, 908 See also 
under steroid 

synthesis of, 844-845, 844f 
glucogenic amino acid, 688 
glucokinase, 599, 914 

in glucose regulation, 924, 924f 

kinetic properties of, 584-585, 585f 

regulation of, 584-585, 585f 
gluconate, 240s 

gluconeogenesis, 551-558, 552, 582-583, 595. See 
also glucose metabolism 
amino acid metabolism in, 915 
amino acids in, 557, 557t, 915 
bypass reactions in 
fructose 1,6-bisphosphate to fructose 

6-phosphate conversion as, 553t, 556 
glucose 6-phosphate to glucose conversion as, 
553t, 556 

pyruvate to phosphoenolpyruvate conversion 
as, 553-557, 553t, 554f, 555f 
carbohydrate synthesis and, 552, 552f 
in chloroplast, 793, 793f 
citric acid cycle and, 557, 927f, 928 
in fasting/starvation state, 926-928, 927f 
in germinating seeds, 798-799 
glycolysis and See also glycolysis 

fructose 2,6-bisphosphate in, 587-588, 587f 

opposing pathways of, 552, 553, 553f, 583f 

regulation of, 557-558, 582-594 
in liver, 914, 925-926, 927f, 927t 
in muscle, 916-917 

regulation of, 582-594, 587f, 589f, 822-823 

glycolysis and, 582-594 
sequential reactions in, 556-557, 556t 
in well-fed state, 922, 923f 



glucono-fi-lactone, 240s 
glucopyranose, 239f 
glucosamine, 240, 240s 
glucose, 236s, 532s 
a form of, 238f, 239 
(3 form of, 238f, 239 
blood levels of, 921-922 
in diabetes, 930 
reference ranges for, 927t 
regulation of, 914, 914f, 922-930 See also 
glucose metabolism 
blood tests for, 241, 241b-242b 
body stores of, 927t 
in cellulose synthesis, 795-796 
conversion of amino acids to, 688-689, 688f 
conversion of glucose 6-phosphate to, in 

gluconeogenesis, 553t, 556 
degradation of See glycolysis 
anaerobic. See fermentation 
epimers of, 237s 

hexokinase catalysis of, 212, 212f 
lac operonand, 1126-1127, 1127f 
membrane transport of See glucose transporters 
in muscle contraction, 918-919, 918f, 919f 
in myocytes, control of glycogen synthesis from, 
581-582 

oxidation of See glucose oxidation 
phosphorylation of, 241, 532 
as reducing sugar, 241, 241f 
regulation of, 914, 914f, 922-930. See also 

glucose metabolism 
in starch synthesis, 791, 791f 
storage of, 922, 923f 

in glycogen, 244, 246, 922, 923f 

in starch, 244, 245-246 
structure of, 9s, 212s, 213s, 236, 237s, 238-239 
synthesis of, 915 

triacylglycerol conversion to, 663, 663f 

UDP. See UDP-glucose 

urine tests for, 241 

utilization of, 527-528 
glucose carbon, in formation of glyceraldehyde 

3-phosphate, 535f 
glucose catabolism, 913t 

in cancerous tissue, 539, 540-541 
glucose metabolism, 922-930 

in adipose tissue, 916-917, 916f 

in brain, 920, 920f 

Cortisol in, 929 

in diabetes melUtus, 929-930 

epinephrine in, 928-929, 928t 

in fasting state, 926-928, 926f, 927f, 928f 

glucagon in, 925-926 

insulin in, 922-924, 923f, 923t 

in liver, 914, 915f, 923f, 925-928, 927t 

in muscle, 918-919, 918f, 919f 

neuronal, 920, 920f 

pancreas in, 922-926, 923t, 924f 

in starvation, 926-928, 927f, 928f 

in well-fed state, 922, 923f 
glucose oxidation, 24, 240-241, 240f 

ATP yield from, 733t 

cellular, to carbon dioxide, 516 

energy-coupled reactions in, 23f, 24 

neuronal, 920, 920f 

pentose phosphate pathway of, 558-563, 560f. 
See also pentose phosphate pathway 
glucose 6-phosphatase, 556 
hepatic metabolism of, 914, 914f 
hydrolysis of glucose 6-phosphate by, 596, 597f 
glucose 1-phosphate, 599 
conversion of galactose to, 554f 
glycolysis of, 596, 597f, 598b, 599t 
in starch synthesis, 791 
glucose 6-phosphate, 212s, 240f, 241, 532, 569, 
575t, 599, 914, 914f 
conversion of 
to fructose 6-phosphate, 532, 533f 
to glucose in gluconeogenesis, 553t, 556 
fate of, 914, 914f 
in glycolysis, 563, 563f 
hepatic metabolism of, 914, 914f 
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glucose 6-phosphate (continued ) 
hexokinase catalysis of, 212 
hydrolysis of, by glucose 6-phosphatase, 596, 597f 
insulin regulation of, 922 

nonoxidative recycling of pentose phosphates to, 

560-563, 561f, 562f, 563f 
in pentose phosphate pathway, 563, 563f, 914 
glucose 6-phosphate dehydrogenase (G6PD), 
518t, 559 
deficiency of, 559 
light inactivation of, 785-786 
glucose tolerance test, 930 
glucose transporters, 608 
defects in, in diabetes, 394b 
erythrocyte (GLUT1), 391-393, 391f, 393t 
intestinal (GLUT2), 393, 393t, 403, 
403f, 924 

muscle (GLUT4), 393t, 394b, 441-442, 442f 
in diabetes, 394b 

Na + -glucose symporter, 403, 403f 

types of, 393t 
glucose-alanine cycle, 681, 681f, 915 
glucosuria, 929 
glucosylcerebroside, 353f, 354 
glucuronate, 240f, 240s, 250, 250s 
GLUT1 transporter, 391-393, 391f, 393t 
GLUT2 transporter, 393, 393t, 403, 403f, 584^585, 
585f, 924 

GLUT4 transporter, 393t, 394, 441^42, 442f 

in diabetes, 394b 
glutamate, 75s, 77, 453s, 698-699, 857, 861 

ammonia released by, 677-680, 680f 

biosynthesis of proline and arginine from, 
861-863, 862f 

biosynthetic pathway of, 857 

catabolic pathways for, 698f 

in nitrogen metabolism, 674 

properties of, 73t, 77 

titration curve for, 81, 81f 
L-glutamate dehydrogenase, 518t, 679 

oxidative deamination catalyzed by, 680, 680f 
glutamate synthase, reaction of, 857 
glutamate-oxaloacetate transaminase 

(GOT), 678b 
glutamate-pyruvate transaminase (GPT), 678b 
glutaminase, 680-681 
glutamine, 75s, 77, 686, 687s, 698-699, 
857, 861,894f 

ammonia transported in bloodstream as, 
680-681, 680f 

biosynthetic pathway of, 857-859 

catabolic pathways for, 698f 

in nitrogen metabolism, 674 

properties of, 73t, 77 
glutamine aminotransferase, 859 

proposed reaction mechanism for, 879f 
glutamine synthetase, 227, 680, 857 

allosteric regulation of, 857-858, 858f 

in nitrogen metabolism, 857-859 

reaction of, 857 

subunit structure of, 857-858 
glutamine -rich domains, 1143 
glutaredoxin, 888 
glutathione, 876 

amino acids as precursors of, 876-877 

biosynthesis of, 877f 

in cell protection against oxygen derivatives, 
559, 559f 

metabolism of, 877f 
glutathione peroxidase, 721, 877 
glycans, 244. See also polysaccharide(s) 
glycated hemoglobin, 242b 
glyceraldehyde, 236, 237s, 545s 

isomers of, 236-237, 236f 
glyceraldehyde 3-phosphate, 533, 533s 

catalysis of, 191 
in Calvin cycle, 779, 780f, 781f 

glucose carbons in formation of, 535f 

in glycolysis, 529f, 530 

oxidation of, to 1,3-bisphosphoglycerate, 
535-536, 536f 

synthesis of, 773, 778, 782f, 783, 797f 



glyceraldehyde 3-phosphate dehydrogenase, 518t, 
535, 575t, 778, 779f, 782-783 

light activation of, 785, 785f 

reaction mechanism of, 536, 536f 
glycerol, 346s 

in archaebacteria membrane lipids, 352, 353f 

chiral forms of, 350, 350f 

in galactolipids, 352f 

in phospholipids, 349, 350, 350f, 35 If 

structure of, 346s, 350s 

in triacylglycerol synthesis, 820f 

in triacylglycerols, 345f, 346-348, 346f, 348f 
glycerol dialkyl glycerol tetraethers (GDGTs) , 

352, 353f 
glycerol kinase, 650, 820, 820f 
glycerol 3-phosphate 

in carbohydrate synthesis, 798, 799f 

in lipid synthesis, 820, 820f, 822-823, 824, 826f 

synthesis of, 822, 822f 
glycerol 3-phosphate dehydrogenase, 715, 
820, 820f 

glycerol 3-phosphate shuttle, 732, 732f 
glyceroneogenesis, 557, 822-823, 822f, 823f. See 

also glucose metabolism 
glycerophospholipid(s), 349, 350-352, 350f, 351f. 

See also triacylglycerol (s) 
fatty acids in, 350 

head groups of, 349, 824, 826f, 827, 829 
nomenclature of, 350, 350s, 351f 
structure of, 350, 350s 
synthesis of, 820-822, 820f 

head group attachment in, 824, 826f 
transport of, 830 
glycine, 75, 75s, 692, 864 
in a helix, 119, 121, 121f, 122f 
in 0 sheet, 12 If 
in (3 turn, 121f 
biosynthesis of, 863f, 864 
as buffer, 80, SOf 
in collagen, 124f, 125 

degradation of to pyruvate, 692-694, 692f, 693f 
in photosynthesis, 787f, 788, 806f 
ptf a of, 79-80, 80f 

as precursor of porphyrins, 873, 874f 

properties of, 73t 

receptor for, as ion channel, 410 

in secondary structures, 121, 122f 

titration curve for, 79-81, 79f 
glycine cleavage enzyme, 692, 864 

reaction mechanism of, 693f 
glycine decarboxylase complex, 787f, 788, 806f 
glycine synthase, 864 
glycobiology, 257 

glycoconjugates, 235, 252, 252-257 
glycolipids, 252, 256-257, 257f 
glycoproteins, 252, 255-256, 257f 
proteoglycans, 252-255, 253f 
glycogen, 236, 245-248, 246, 251t. See also 

polysaccharide (s) 
biosynthesis of, in bacteria, 792 
body stores of, 927t 
branch synthesis of, 601f 
degradation of 

glycogen phosphorylase in, 553f, 595-596, 
595f, 597f 

in glycolysis, 542-543 
glucose removal from, 246 
glucose storage in, 244, 246 
glycogenin priming of sugar residues in, 

601,601f 
granular form of, 246 
inhepatocytes, 246 
hydrolysis of, 246 
metabolism of, 594-602 

glucose 1-phosphate in, 596, 597f 

glycogen breakdown in, 595-596, 597f, 653f 

glycogenin in, 601, 601f, 616f 

phosphoprotein phosphatase 1 in, 606, 607f 

sugar nucleotide UDP-glucose in, 596-601, 
600f, 601f 
in muscle, 918-919, 918f, 919f 
reducing end of, 246 



storage of, 595, 595f 

structure of, 245f, 246, 247-248, 249f, 251t 
synthesis of, 441^42, 442f, 600f 

control vs regulation of, 581-582 

regulation of, 582-594 

sugar nucleotide in, 596-601 
glycogen granule, in hepatocyte, 595, 595f 
glycogen phosphorylase, 224-226, 544, 922 
allosteric modification of, 227 
alb forms of, 223f, 224, 603 
breakdown of glycogen by, 553f, 595-596, 

595f, 597f 
covalent modification of, 223f, 225-226 
in glycogen breakdown, 224-225, 227 
interconvertible forms of, 603 
phosphorylation of, 224-226, 233f 
regulation of, 224, 225f, 603 

allosteric and hormonal, 604f, 61 7f 
glycogen storage diseases, 598-599, 

598b-599b, 599t 
glycogen synthase, 441, 442f, 600, 922 
phosphorylation of, 224-225 
primer for, 601 

regulation of, 605-606, 605f, 606f 
glycogen synthase a, 605 
glycogen synthase b, 605 

glycogen synthase kinase 3 (GSK3), 441, 442f, 447, 
447f, 605 

effects of, on glycogen synthase activity, 

605, 605f 
insulin activation of, 605, 606, 606f 
glycogenosis, 595 
glycogenin, 601 
and glycogen particle, 616f 
structure of, 601,601f 
sugar residues in glycogen and, 601 
glycogenolysis, 595 
glycogen-targeting subunits, 606 
glycolate pathway, 787-789, 787f 
glycolipids, 252, 256-257, 257f, 348. See also 
lipid (s) 
as glycoconjugates, 252 
neutral, 350f, 353f, 354, 355f 
synthesis of, 829, 831f 
transport of, 830 
glycolysis, 528-543, 650f, 922, 923f See also 
glucose metabolism 
ATP formation coupled to, 530-531 
in chloroplast, 793f 
feeder pathways for, 543-546, 552f 
glycogen and starch degradation in, 544 
monosaccharides in, 545, 554f 
polysaccharide and disaccharide hydrolysis in, 
543, 552f 

free-energy changes of, in erythrocytes, 553t 
gluconeogenesis and. See also glue oncogenesis 
fructose 2,6-bisphosphate in, 587-588, 587f 
opposing pathways of, 552, 553, 553f, 583f 
regulation of, 557-558, 582-594 
of glucose 1-phosphate, 596, 597f, 

598b-599b, 599t 
glucose 6-phosphate in, 562f, 563, 563f 
in limiting oxygen concentration, 539 
payoff phase of, 529f, 530 
ATP and NADH in, 535-538, 536f, 537f 
conversion of 3-phosphoglycerate to 

2-phosphoglycerate in, 537-538, 537f 
dehydration of 2-phosphoglycerate to phospho- 

enolpyruvate in, 538 
oxidation of glyceraldehyde 3-phosphate to 
1,3-bisphosphoglycerate in, 535-536, 536f 
phosphoryl transfer from 1,3-bisphosphoglycerate 

to ADP in, 531-532 
phosphoryl transfer from phosphoenolpyruvate 
to ADP in, 538 
phosphorylated hexoses in, 531 
preparatory phase of, 529f, 530 
ATP in, 531-535, 533f, 534f, 535f 
cleavage of fructose 1 ,6-bisphosphate in, 

533-534, 534f 
conversion of glucose 6-phosphate to fructose 
6-phosphate in, 532, 533f 
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glycogen, starch, disaccharides, and 

hexoses in, 552f 
interconversion of those phosphates in, 

534-535, 535f 
phosphorylation of fructose 6-phosphate to 

fructose 1 ,6-bisphosphate in, 532 
phosphorylation of glucose in, 532 
regulation of, 539, 822 
in solid tumors, 539, 540-541 
regulation of, 582-594 
gluconeogenesis and, 582-594 
glycolytic flux, 578f 

glycolytic pathway, glycerol entry into, 578f, 650f 
glycomics, 256 
glycophorin, 373, 376, 384f 

topology of, 376, 376f 
glycoproteins, 84, 85t, 252, 255-256, 257f 

as glycoconjugates, 252 

ligand binding of, 373 

membrane, 256, 3731 See also 
membrane proteins 

oligosaccharide linkage to, 255-256, 255f, 
1101-1104, 1102f, 3731 
in protein targeting, 1101-1104 

sugar moieties of, 373 

topology of, 375-376, 376f 
glycosaminoglycans, 249-251, 250f, 251t 

in proteoglycans, 252-255, 253f 
glycosidases, retaining, 216 
glycoside, standard free-energy changes of, 493t 
glycosphingoUpids, 350f, 353f, 354, 355f 
iV-0-glycosyl bonds, nucleotide, 272 

hydrolysis of, 290, 290f 
glycosylated derivatives of phosphatidylinositol 

(GPI), as lipid anchor, 379-380, 380f, 384-385 
glycosylation, in protein targeting, 

1101-1104, 1102f 
glyoxylate, 638-640 
glyoxylate cycle, 638-640, 639f 

four-carbon compound production from, 638 

in plants, 798, 798f 

regulation of citric acid cycle and, 639-640, 
639f, 640f 
glyoxysomes, 798, 798f 
(3 oxidation in, 662f 
plants and, 662-663, 663f 
glyphosate-resistant plants, 332, 333f 
glypicans, 253, 253f 

GMP (guanosine 5 '-monophosphate), 273s, 

298, 298s 
Goldberger, Joseph, 519 
Goldstein, Joseph, 841, 841f 
G 0 i f , 466f, 467 
Golgi complex 

lectins and, 260-261 

protein sorting in, 1102-1103 

transport vesicles of See transport vesicles 
GOT (glutamate-oxaloacetate transaminase), 678b 
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G6PD See glucose-6-phosphate 
dehydrogenase (G6PD) 
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sitol), as lipid anchor, 379-380, 380f, 384-385 

GPR14, 337 
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grana, 743 

Grb2, 439, 441f, 442 
SH2 domain of, 439, 441f 
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GRKs (G protein-coupled receptor kinases), 431 

GroEL/GroES, in protein folding, 144, 144f, 145f 
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growth factors, 472, 472f 
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in olfaction, 466, 466f 
in vision, 463-464, 464f 
GTPase, G s as, 426 
GTPase proteins (GAPs), 425b^27b 
guanine, 9s, 272, 273f, 275t. See also 
purine bases 
deamination of, 289, 290f 
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mechanisms in, 885-886, 885f 
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in splicing, 1036, 1036f, 1057 
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guanosine nucleotide exchange factors, 426 
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guanylate, 273s, 275t 
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guanylyl cyclase, in vision, 464f, 465 
guanylyl cyclases, 445-446, 445f 
guide RNA, 1073 
Guillemin, Roger, 902 
gulose, 237s 

gustation, signaling in, 467, 468f 
gustducin, 467 
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H + See hydrogen ion (s) 

//(enthalpy), 22, 490 

H 4 folate (tetrahydrofolate), 689, 689f 

conversions of one-carbon units in, 690f 

substrate binding to, 189f 
haem. See heme 
hair 

a-keratin in, 123-124, 124f 

coiled coils in, 124f 

permanent waving of, 125 
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inDNA, 282,282f 
in replication fork, 978, 978f 

in RNA, 285, 285f, 1029f, 1049 
Haldane, J B S , 64, 184, 184f, 190 
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standard reduction potentials of, 515t 
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bacteriorhodopsin in, 376, 377f, 385b 
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762-764, 763f 
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Hanson, Richard, 822 
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hashish, 442 

HAT (histone acetyltransferases), 1137-1138 
Hatch, Marshall, 789 

Haworth perspective formulas, 239, 239f 
HDLs. See high-density lipoproteins (HDLs) 
head group exchange reaction, in phospholipid 

synthesis, 827, 828f 
heart attack, 920 
heart disease 

angina in, nitrovasodilators for, 335 

atherosclerotic, 842-844 

hyperlipidemia in, 842b-843b 

trans fatty acids and, 348, 348t 
heart muscle, 919-920, 919f 
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production of. See thermogenesis 

randomization of, 21 
heat of vaporization, 43, 44t 

of water, 44-45, 44t 
heat shock gene promoters, 1028 
heat shock proteins, in protein folding, 143, 144f 
heavy chains, immunoglobulin, 171, 171f 

recombination in, 1015, 1016 
helical symmetry, 139, 140, 140f 
helicases, 984 

in mismatch repair, 995, 995f, 996f 

in replication, 984, 986, 986t 
Helicobacter pylori, lectins and, 260, 260f, 262f 
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a, 117-120, 118f, 120f, 121-122, 122f, 123t 

of a-keratin, 123-124, 124f 

of membrane proteins, 376, 378 

of myoglobin, 130 

of polysaccharides, 248, 249f 

protein folding and, 135 

of small globular proteins, 131t 
double 

DNA, 28, 28f, 278-280, 279f, 280f. See also 

DNA structure 
supercoiling and, 954-962, 955f. See also DNA, 

supercoiling of 
in transcription, 1023f 
underwinding of, 955-958, 955f 
unwinding of/rewinding of, 287-288, 287f, 288f, 

978-979, 979f See also DNA replication 
variations of, 280-281, 281f 
RNA, 284-285, 284f, 285f 
triple, 282, 283f 
of collagen, 122f, 123t, 124-125, 124f, 

126-127 
of DNA, 282,283f 
helix-loop-helix, 1125, 1125f 
helix-turn-helix, 1122, 1123f 
helper T cells (T H cells), 170, 170t 
hematocytoblasts, 158 
heme, 154 
from S-aminolevulinate, biosynthesis 

of, 874f 
definition of, 154 
free, 154-155 

in oxygen binding, 154-155 See also 

hemoglobin-oxygen binding 
as source of bile pigment, 875-876, 876f 
structure of, 154-155, 155f 
heme A, 711s 

heme b, of Complex II, 715 
heme C, 711s 

heme cofactors, of cytochromes, 710f, 711 
heme group, 130, 130f 
hemiacetals, 238, 238f 
hemiketals, 238, 238f 
hemin-controlled repressor (HCR), 1145 
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amino acids of, 9f, 10 

genetic variations of, 168 

glycated, 242b 

as oligomer, 138, 139f 

R-state, 160, 161f, 167, 168f 

sickle-cell, 168, 168f 

structure of, 138, 139f, 159-160, 159f, 
168, 168f 

conformational changes in, 160, 
161f, 162f 
subunits of, 159-160, 159f 
T-state, 160, 161f, 167, 168f 
hemoglobin A, structure of, 168, 168f 
hemoglobin glycation, 242b 
hemoglobin S, 168 
hemoglobin transport 
of hydrogen, 166 

of oxygen, 158-159 See also hemoglobin-oxygen 
binding 

hemoglobin-carbon dioxide binding, 166 
hemoglobin-carbon monoxide binding, 158, 159, 

162-165, 168 
hemoglobin-hydrogen binding, 166 
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hemoglobin-oxygen binding, 154-170, See also 
protein-ligand interactions 
2,3-bisphosphoglycerate in, 167, 168f 
Bohr effect in, 166, 166 
carbon dioxide in, 166 
in carbon monoxide poisoning, 163-164 
conformational changes in, 160, 161f, 162f 
cooperative, 160-165, 162f 
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heme in, 154-155, 155f 
hemoglobin transport and, 158-163 
models of, 165, 165f 
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epinephrine cascade in, 428^129, 429f 
fatty acid metabolism in, 914-915, 9l5f 
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glucose metabolism in, 914, 915f 
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glycogen in, 246 
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heptahelical receptors, 424 
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herbicide-resistant plants, 332, 333f 
hereditary nonpolyposis colon cancer, 1003b 
hereditary optic neuropathy, Leber's, 741 
herpes simplex virus, DNA polymerase of, 992 
Hers' disease, 599t 
Hershey, Alfred D., 278 
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heterolytic cleavage, of covalent bonds, 496, 496f 
heteroplasmy, 740 

heteropolysaccharides, 244, 249-251, 251 1. See 

also polysaccharide (s) 
heterotrophs, 4f, 5 

as earliest cells, 32 
heterotropic ligand binding, 162 
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hexokinase, 212s, 532, 575t 
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regulatory, 583-585, 604f 
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kinetic properties of, 584-585, 585f 
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regulation of, 584-585, 585f 
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derivatives of, 240-241, 240f 

phosphorylated, in glycolysis, 531 

structure of, 236f 
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pentose phosphate pathway 
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HIF (hypoxia-inducible transcription factor), in 
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1139f, 1141f 
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deficiency of, 840 
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histone(s), 963, 963-968, 963f, 963t 
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chromatin and, 963, 964f, 966b-967b, 1136-1137, 
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in chromatin remodeling, 1137-1138 

in gene regulation, 1136-1138 
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in nucleosomes, 964f, 965f, 967f, 1136-1137 

positioning of, 965 

properties of, 963-964, 963t 

types of, 963-964, 963t 

variant forms of, 964 
histone acetyltransf erases (HATs), 1137-1138 
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remodeling, 1138 
histone variants, 966b-967b 
Hitchings, George, 894, 894f 
HIV (human immunodeficiency virus), as 

retrovirus, 1052 
HIV (human immunodeficiency virus) infection, 

drugs for, 218-219, 1053b 
HMG proteins, 1139, 1139f, 1141f 
HMG-CoA, 666 
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homocystinuria, 694t 
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lipid metabolic integration with, 608 
catecholamine, 906t, 907 See also catecholamines 
in cholesterol regulation, 842, 842f 
classification of, 906-909, 906t 
discovery of, 902-904, 903b 
diversity of, 906-909, 906t 
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receptor for, 1143-1144, 1144f 

synthesis of, 844-845, 844f 
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in transcription regulation, 456^57, 457f 
transport of, in blood, 920 
in triacylglycerol synthesis, 649-650, 650f, 651f, 
821-822, 822f 
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human immunodeficiency virus (HIV) infection, 

drugs for, 218-219, 1053b 
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immune system 
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hyaluronate, 250s, 251t, 255f 
hyaluronic acid, 250, 250s, 251t, 255f 
hyaluronidase, 250 
hybrid duplexes, 289 

hybridization See cloning; DNA hybridization 
hydride ion, 514 
hydrocarbons, 13 

hydrogen, hemoglobin transport of, 166 
hydrogen bonds, 10, 44, See also bond(s) 

directionality of, 46, 46f 

examples of, 46t 

formation of, 45-46, 114-115 

in ice, 44-45, 45f 

low-barrier, 207 

of nucleic acid base pairs 
in DNA, 277, 277f, 278, 279f, 280f 
in RNA, 284, 286f 

number of, 114 

with polar hydroxyl groups, 45-46 

in polysaccharides, 245f, 247-248, 248f 

properties of, 44-46, 45f 

in water, 43^6, 44f-47f, 46-49, 48f, 49f 

as weak interactions, 50-51, 50t, 51f 
hydrogen donors, in photosynthesis, 762 
hydrogen ion(s) 

concentration of, 55-57, 56f, 56t 
See also pH 

from water ionization, 54-57 
hydrogen sulfide, solubility of in water, 

47, 47t 
hydrolases, 65 
hydrolysis, 65f 

of acetyl-CoA, 505, 505f, 505t 

ATP See ATP hydrolysis 

of 1,3-bisphosphoglycerate, 504, 504s 

of disaccharides, 243-244 

free energy of, 501-503, 504f, 505-506, 505f, 
505t, 506f 

of glucose 6-phosphate, by glucose 
6-phosphatase, 596, 597f 

of glycosidic bonds, vs phosphorolysis reaction, 
595-596 

of oxygen esters, 506f 

of phosphocreatine, 504, 505f 

of phosphoenolpyruvate, 504, 504f 

of phosphorylated compounds, 504-505, 505t 

of polysaccharides and disaccharides, to 
monosaccharides, 543, 552f 

of thioesters, 505, 505f 

transition state in, 211, 21 If 
hydrolysis reaction, 65 
hydronium ions, 77 
hydropathy index, 378 
hydrophilic compounds, 46, 46t 
hydrophobic compounds, 10, 46, 46t 

solubility of in water, 46-49, 48f, 49f 
hydrophobic interactions, 49, 50t, 114. See also 
weak interactions 

in amphipathic compounds, 48^9, 48f 

in globular proteins, 130, 130f 

in protein stability, 114-115 
jS-hydroxyacyl-ACP dehydratase, 809, 81 If, 826f 
/3-hydroxyacyl-CoA (3-hydroxyacyl CoA), 654 
j3-hydroxyacyl-CoA dehydrase, 654 
/3-hydroxyacyl-CoA dehydrogenase, 654 
j3 - hydroxy-j3-methylglutaryl-CoA. 

See HMG-CoA 
0-hydroxybutyrate, 666 

in brain metabolism, 920 
hydroxylases, 816b 
5-hydroxylysine, 77, 78s 

in collagen, 126-127 
5-hydroxymethylcytidine, 274s 

3- hydroxyproline, in collagen, 126-127 

4- hydroxyproline, 77, 78s, 126-127, 127s 
in collagen, 124f, 125, 126-127 

hyperammonemia, 686 
hypercholesterolemia, 843-844 
hyperchromic effect, 288 
hyperglycemia, insulin secretion in, 922, 923f 
hyperinsulinism-hyperammonemia 
syndrome, 680 



hyperlipidemia, 843-844, 843b 

in heart disease, 842b-843b 

trans fatty acids and, 347-348, 348f, 348t 
hypertonic solutions, 52, 52f 
hypochromic effect, 287 
hypoglycemia, 921-922, 92 If 
hypothalamic-pituitary axis, 909-910, 91 If 
hypothalamus, 909, 909f, 910f 

in body mass regulation, 931-932, 931f 
hypotonic solutions, 52, 52f 
hypoxanthine, from adenine deamination, 290f 
hypoxanthine-guanine 

phosphoribosyltransferase, 893 
hypoxia, 167, 530, 733-734 

adaptive responses in, 733-734 

reactive oxygen species and, 733-734 
hypoxia-inducible transcription factor (HIF), 
734, 734f 

in cancerous tissue, 540 
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I bands, 176, 177f 

ibuprofen, 818, 818s 

ice, hydrogen bonds in, 44^5, 45f 

icosahedral symmetry, 139-140, 139f, 140f 

icosatetraenoic acid, 344t 

idose, 237s 

iduranate, 250, 250s 

IgA, 172 

IgD, 172 

IgE, 172 

IgG, 171-172, 171f, 172f 

IgG genes, recombination of, 1014-1016, 1015f 

IgM, 172, 172f 

imatinib, 476b 

immune system, 170-175 

antigen-antibody interactions in, 170-175 

cells of, 170, 170t 

cellular, 170 

clonal selection in, 170 

evolution of, 1014-1016 

humoral, 170 

integrins in, 456 

oligosaccharides in, 259, 259f 

in plants vs, animals, 460, 461f 

selectins in, 259, 259f 
immunization, viral vaccines in, 1052 
immunoblot assay, 174, 174f 
immunodeficiencies 

drugs for, 1053b 

gene therapy for, 335-336 
immunoglobulin (s) (Ig), 170. See also antibodies 

classes of, 172 

heavy chains of, 171f, 172 
recombination in, 1015, 1016 

light chains of, 171f, 172 
recombination in, 1014-1016, 1015f, 1016f 

structure of, 171f, 172, 1015 
immunoglobulin A, 172 
immunoglobulin D, 172 
immunoglobulin E, 172 
immunoglobulin fold, 172 
immunoglobulin G, 171-172, 171f, 172f 
immunoglobulin genes, recombination of, 

1014-1016, 1015f 
immunoglobulin M, 172, 172f 
IMP. See inosinate (IMP) 
importins, 1104 

in vitro evolution (SELEX), 1058b, 1078b 

in vitro packaging, 308, 309f 

in vitro studies, limitations of, 10 

inborn errors of metabolism, 356 

indirubicin, 476b 

indole-3-acetate, 450s 

induced fit, 153 
in antigen-antibody binding, 173, 173f 
in enzyme-substrate binding, 192, 212, 212f 

inducers, 1120-1121 

inducible gene products, 1116 

induction, 1116 

industrial-scale fermentation, 549f, 551 



influenza 

drug therapy for, 259-260 

selectins and, 259-260 
influenza virus, lectins of, 258t, 259 
information theory, 21 
informational macromolecules, 14 
inhibition 

concerted, 872 

sequential feedback, 873 
inhibitory G protein, 431 
inhibitory proteins, in ATP hydrolysis during 

ischemia, 733, 733f 
initial velocity (rate) (F 0 ), 194-195, 195f 
initiation codons, 1069, 1069f. See also codons 

in protein synthesis, 1088-1091, 1090f 
initiation complex 

in bacteria, 1088-1089, 1089f 

in eukaryotes, 1089-1091, 1091t 
initiation factors, 1091t, 1144 
initiator sequences, 1138-1139, 1139f, 1141f 
Inman, Ross, 978 

inorganic phosphate (Pi). See phosphate, inorganic 

inorganic pyrophosphatase (PPJ, in plants vs. 
animals, 792 

inosinate (IMP), 883 
inanticodons, 1071-1072, 1071f 
biosynthesis of AMP and GMP from, 885f 
purine ring of, 883f 

inosine, 274s, 1073b-1074b 

inositol, in lipid synthesis, 826f, 827, 
827f, 829f 

inositol 1,4,5-trisphosphate (IP 3 ), 357-358, 
432^33, 433f 

in plants, 357 
Inr (initiator) sequences, 1138-1139, 1139f, 1141f 
insertion mutations, 993 
insertion sequences, 1013 
insulin, 906, 907s 

amino acid sequence of, 93-94 

in cholesterol regulation, 842, 842f 

in diabetes mellitus, 929 

discovery of, 903b 

in gene regulation, 439, 441, 441f, 1144 
in carbohydrate and fat metabolism, 606-608 

in glucose regulation, 439, 441, 922-924, 
923t, 924f 

in glucose transport, 394b 

glycogen synthase kinase 3 mediation of, 605, 
606, 606f 

in glycogen synthesis, 441^42, 442f 

in lipid metabolism, 842, 842f 

as peptide hormone, 906 

signal sequence of, 906, 907f 

in signaling, 439-443, 441 

synthesis of, 906, 907f 

in triacylglycerol synthesis, 821-822, 821f 

in weight regulation, 934, 935f 
insulin insensitivity, 935-936, 938-939 
insulin pathway, 439, 441, 441f 
insulin receptor, 439^43, 441f, 442f 
insulin receptor substrate-1 (IRS-1), 439, 
441f, 442 

integral membrane proteins, 376 See also 

membrane proteins 
integrins, 254, 255f, 388, 455f, 465^66 
intermediate, 209 

intermediate filament proteins, 123-124 

intermediate filaments, 9, 9f 

internal guide sequences, 1047, 1047f 

intervening sequences, 952, 952f 

intestinal glucose transporter (GLUT2), 393, 393t, 

403, 403f, 924 
intrinsic factor, 659 
introns, 286f, 952, 952f, 1035 

enzymatic properties of, 1047-1048, 1047f 

evolutionary significance of, 1052-1053 

horning by, 1053, 1054f 

as mobile elements, 1053, 1054f 

self-splicing, 1036, 1058 
enzymatic properties of, 1047-1049, 1047f 
evolutionary significance of, 1058 

spliceosomal, 1037, 1038f 
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introns (continued ) 

splicing of, 1034, 1034f, 1036-1039, 1036f-1038f 

transcription of, 1035 
inverted repeat DNA, 281-282, 282f 
ion(s) 

blood levels of, 921 

hydride, 514 

as intracellular messengers, 453-454 
ion channels, 389f, 391, 391f, 406-412 
a helix-type, 391 
Ca 2+ , 406, 409-410 
Cl- 
in cystic fibrosis, 401, 401f, 410 
in signaling, 453-454 
current measurement in, 407, 407f 
defective, 410-412 

in cystic fibrosis, 401 
K + , 407-409, 408f, 451, 45 If 
defective, diseases caused by, 412t 
in signaling, 451, 451f, 453-454 
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fluid, 45 It 
ion gradients, 400-404, 402f-404f 
ion product of water (X w )> 55> 56 
ion pumps See ATPase(s); transporter (s) 
ion transporters See ATPase(s); membrane trans- 
port; transporter (s) 
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lecithin-cholesterol acyl transferase (LCAT), 839 
lectins, 258-262, 258t, 259f-262f 
Leder, Philip, 1068 
leghemoglobin, 856 
Lehninger, Albert, 620, 708 
Leloir, Louis, 597, 598 
leptin, 930-931, 931f-934f 
leptin receptor, 931 
Lerner, Richard, 211 
Lesch-Nyhan syndrome, 893 
Letsinger, Robert, 294 
leu operon, 1130 
leucine, 75, 75s, 695, 865 
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lysozyme, 249 

catalytic activity of, 213-216, 214f, 215f 

reaction mechanism of, 215f 

structure of, 131 1 
lyxose, 237s 

m 

Mline (disk), 176, 177f 
Mackinnon, Roderick, 407, 407f 
MacLeod, J J R , 903b 
macrocytes, 691 



[ 1-24 J Index 



macromolecules 

informational, 14 
energy requirements of, 508 

weak interactions in, 50-51, 51f 
macrophages, 170, 170t, 172, 172f 
mad cow disease, 147-148 
magnesium complex, ATP and, 502, 502f 
magnesium ions, in Calvin cycle, 776f-778f, 777, 
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in JAK-STAT pathway, 443f, 444 

in plants, 460, 460f, 461f 
MAPKs, 439-440, 440-441, 44 If, 442 
maple syrup urine disease, 694t, 701 
mapping 

denaturation, 978 

genetic, ofE. coli, 976f 
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flexibility of, 381 

fluid mosaic model of, 373, 373f 

fluidity of, 380f, 381 

functions of, 371 

lipid bilayer of, 373f, 374 
formation of, 374 

overview of, 371 

permeability of, 373 

plasma, 3, 3f 
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membrane transport, 389-416 
activation energy for, 390, 390f 
active, 395-404 

primary, 389f, 395, 395f 

secondary, 389f, 395f, 396 
active, ATP energy in, 509-510 
aquaporins in, 404^06, 405f, 406t 
ATP synthases in, 399 
ATP-driven, 395-400, 397f-399f 
Ca 2+ pump in, 397-398, 397f 
chloride-bicarbonate exchanger in, 384, 384f, 

395, 395f 
cotransport systems in, 395, 395f 
in diabetes, 394 
direction of, 389f 

electrochemical gradient in, 389f, 390, 390f 

electrogenic, 396 

electroneutral, 395 

by facilitated diffusion, 389f See also 

transporter(s) 
free-energy change in, 396 
of glucose 

in erythrocytes, 391-393, 391f 

transporters for, 391-393, 391f, 392f, 393t, 394f 
ion channels in, 389f, 406-412 See also 

ion channels 
ion gradients in, 400-104, 402f-404f 




ionophore-mediated, 389f, 404 

kinetics of 
in active transport, 395-396 
in passive transport, 390-391, 390f, 
391-392, 392f 

lectins in, 260-261 

of lipids, 830 

membrane potential in, 389f, 390 
modes of, 389f, 390f, 41 It 
Na + K + ATPase in, 398-399, 398f 
neuronal, 920 
passive, 389-395, 390-391 
porins in, 379, 379f 
rate of, 390, 390f, 391-392, 392f 
SERCA pump in, 397-398 
by simple diffusion, 389f, 390 
in targeting, 1102-1103 See also protein 
targeting 

transporters in, 391-395 See also transporter(s) 
membrane -bound carriers, mitochondrial electron 
passage through, 710-712, 710f-712f 

sequence of, 711-712, 71 2f 

standard reduction potential of, 711-712, 712t 
menaquinone (vitamin K 2 ), 361 
Menten, Maud, 195, 195f 
Merrifield, R Bruce, 101, lOlf, 294 
Meselson, Matthew, 977 
Meselson-Stahl experiment, 977, 977f 
mesophyll cells, in C 4 plants, 789f, 790 
messenger RNA See mRNA (messenger RNA) 
metabolic acidosis, 57, 64b 
metabolic alkalosis, 57, 64b 
metabolic control, 574 
metabolic control analysis, 577-582, 578 

applied to carbohydrate metabolism, 581-582 

increased flux prediction by, 579b-580b 

quantitative aspects in, 579b-580b 
metabolic fuels, body stores of, 927t 
metabolic pathways, 25, 487-488, 570f, 913t See 
also specific pathways, e g , pentose 
phosphate pathway 

anabolic 

in glycogen metabolism, 595-596 

vs catabolic pathway, 487f 
catabolic 

in glycogen metabolism, 595-596 

vs anabolic pathway, 487f 
glycolytic See glycolysis 
regulation of, 487-488, 570-577 

adenine nucleotides in, 575-577 

enzyme activity in, 571-574, 572f 

maintenance of steady state in, 571 

mechanisms of, 569-608, 570f 

near- equilibrium and nonequilibrium steps in, 
574, 574f, 575t 
metabolic regulation, 574 
metabolic syndrome, 938-939 
metabolic water, 65 
metabolism, 25, 486 
in adipose tissue, 909f, 916-917 
aerobic, of vertebrates, 548b 
amino acid, 570f 
anaerobic, of coelacanths, 548b 
ATP in, 25, 26f 
bioenergetics and, 490-495 
blood in, 920-922 
in brain, 920, 920f 

carbohydrate See carbohydrate metabolism 
carbon-carbon bond reactions in, 496-497, 

496f, 504f 
cellular, transformations in, 495-496 
energy, 570f 

feedback inhibition in, 26 
folate, as chemotherapy target, 895f 
free radical reactions in, 498 
glutathione, 877f 

glycogen. See glycogen, metabolism of 
group transfer reactions in, 499-500, 499f 
hepatic, 912f, 913-916 
intermediary, 486 

internal rearrangements in, 497^498, 498f 
lipid See lipid metabolism 



in muscle, 918-919, 918f, 919f 
nitrogen See nitrogen metabolism 
nucleotide, 570f 
overview of, 485-488 

oxidation-reduction reactions in, 500, 500f 

regulation of, 26, 574 
metabolite (s), 3, 13-14, 486 

secondary, 13 
metabolite concentration, regulatory enzyme 

response to, 574, 574f, 575t 
metabolite flux, change in enzyme activity on, 

578-580, 578f 
metabolite pools, in plants, 799-800, 799f 
metabolome, 14, 573 
metabolon, 637 

metal ion catalysis, 193-194, 213, 213f 
metalloproteins, 84, 85t 
metamerism, 1147 
methane, 513s 

methanol, in lipid extraction, 363-364, 364f 
methanol poisoning, 202-203 
methionine, 75, 75s, 699, 865 

biosynthesis of, 865, 866f-867f 

conversion of, to succinyl-CoA, 699, 699f 

properties of, 73t, 75 

synthesis of, 691f 
methionine adenosyl transferase, 689 

synthesis of, 691f 
methotrexate, 894, 895s 
1-methyladenine, demethylation of, 1000, lOOOf 
methyladenosine, 274s 
methylamine, pK R of, 80, 80f 
methylation 

in DNA mismatch repair, 994-996, 994f, 
995f, 996f 

enzyme, 223, 233f 

of nucleotide bases, 292 

5- methylcytidine, 274s, 292 
1-methylcytosine, demethylation of, 1000, lOOOf 
7-methylguanosine, 274s 

6- A^-methyllysine, 77, 78s 
methylmalonic acidemia (MMA), 694t, 700b 
methylmalonyl-CoA, 657 
methylmalonyl-CoA epimerase, 657 
methylmalonyl-CoA mutase, 657 
Mevacor, 842-843 

mevalonate, in cholesterol synthesis, 832-833, 

832f, 833, 833f, 841 
Meyerhof, Otto, 528 
Mg 2+ , in Calvin cycle, 776f-778f, 777 
MGDG (monogalactosyldiacylglycerol), 352f 
mice, transgenic, 334 
micelles, 48-49, 48f, 374, 374f 
Michaelis constant (iT m ), 196-200, 197f, 199, 199t 
apparent, 203t 

interpretation of, 197, 198, 198t 

Michaelis, Leonor, 195, 195f 

Michaelis-Menten equation, 196-197, 197f 

Michaelis-Menten kinetics, 197-200, 573 

microinjection, of DNA, in animal cloning, 333 

micro-RNA (miRNA), 1045, 1145 

microscopy, atomic force, 385b 

microtubules, 9, 9f, 175-176 

Miescher, Friedrich, 278 

mifepristone, 456, 456s 

Miller-Urey experiment, 30-31, 31f 

Milstein, Cesar, 173, 173f 

mineralocorticoids, 908 
synthesis of, 844-845, 844f 

minichromosome maintenance (MCM) proteins, 991 

Minkowski, Oskar, 903b 

minor groove, 279, 279f 

miRNA (micro-RNA), 1045, 1145 

mirror repeat DNA, 282, 282f 

mismatch repair, 292, 994-996, 994f-996f, 994t 

Mitchell, Peter, 707, 723, 723f 

mitochondria, 951 
aging and, 739 
in apoptosis, 737-738, 737f 
ATP synthase complex in, 725f-726f 
ATP synthesis in, 707-708 See also ATP 
synthesis 



£ oxidation in, 662f 
biochemical anatomy of, 708, 708f 
chemiosmotic theory applied to, 723f 
classes of cytochromes in, 710-711, 71 If 
DNA in, 738f 

electron-transfer reactions in, oxidative phospho- 
rylation and, 708-721. See also electron- 
transfer reactions, mitochondrial 

evolution of, 33, 34f 
from endosymbiotic bacteria, 738f, 739 

functions of, 735-738 

genetic code variations in, 1070b-1071b 

heteroplasmy and, 740 

homoplasmy and, 740 

hypoxic injury of, 734 

lipid metabolism in, 812, 812f, 813-814 

matrix of, 708-709, 708f 

membranes of, 708, 708f 

mutations in, 738f, 739-741 
apoptosis and, 737-738 
disease associated with, 740-741 
endosymbiotic bacteria and, 738f, 739 
nitrous acid and, 291f, 294 
oxidative stress and, 720-721, 721f 

plant, alternative mechanism for NADH oxidation 
in, 720, 722b 

protein targeting to, 1104 

reactive oxygen species and, 720-721, 721f 

inrespirasomes, 718 

respiratory proteins and, 739t 

shuttle systems convey cytosolic NADH into, 
731-732, 731f, 732f 

in thermogenesis, 736 

transport of fatty acids into, 650-652, 651f 

uncoupled, heat generated by, 736, 736f 

in xenobiotics, 736-737 
mitochondrial DNA (mtDNA). See mtDNA 

(mitochondrial DNA) 
mitochondrial encephalomyopathy, 740-741 
mitochondrial inheritance, 739-740 
mitochondrial respiration, 786 
mitosis, 469, 469f 

mixed inhibitor, 201f, 202f, 203, 203t 
mixed-function oxidases, 664, 697, 
816, 816b 

mixed-function oxygenases, 816 
M?tRNA Tyr , 1086f, 1087b 
M/TyrRS, 1086f, 1087b 
MMA (methylmalonic acidemia), 694t 
mobile elements, 1004, 1013 

introns as, 1053, 1054f 
modulators, in protein-ligand binding, 162 
MODY (mature onset diabetes of the young), 

593b-594b, 741 
molecular biology, central dogma of, 

945, 945f 

molecular chaperones, 143-144, 144f, 145f, 
1104, 1106f 

molecular evolution, 102-106. See also evolution 

amino acid sequences and, 102-106, 
104f-106f 

amino acid substitutions in, 104 

homologs in, 104-105 

lateral gene transfer in, 104 
molecular function, of proteins, 324, 328-329 
molecular logic of life, 2 
molecular mass, 14b 

molecular parasites, evolution of, 1058-1059 

molecular weight, 14b 

molecules. See biomolecules 

molten globule, 143 

monocistronic mRNA, 284 

monoclonal antibodies, 173 

Monod, Jacques, 165, 283, 1119, 1119f 

monogalactosyldiacylglycerol (MGDG), 352f 

monooxygenases, 816b 

monophosphates, nucleoside, conversion of, to 

nucleoside triphosphates, 888 
monosaccharides, 235, 235-241. See also 
carbohydrate(s) 

abbreviations for, 243t 

aldose, 236, 236f 
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monosaccharides ( continued) 
anomeric, 239 
in aqueous solutions, 238 
chiral centers of, 236-237, 236f, 237f 
conformations of, 239, 239f 
d isomers of, 237-238, 237f 
derivatives of, 240-241, 240f 
enantiomers of, 236, 236f 
epimer, 237f, 238 
families of, 236, 236f 
in glycolysis, 545, 554f 
Haworth perspective formulas for, 239, 239f 
hemiacetal, 238, 238f 
hemiketal, 238, 238f 
heptose, 236 
hexose, 236 

hydrolysis of polysaccharides and disaccharides 
to, 543, 552f 

intermediates of, 240f, 241 

ketose, 236, 236f 

l isomers of, 237, 238 

nomenclature of, 236, 237-238, 240-241 

oxidation of, 240-241, 240f, 241 

pentose, 236 

phosphorylation of, 241 

pyranose, 239, 239f 

reducing, 241, 241f 

stereoisomers of, 236-237, 236f 

structure of, 237f-241f, 238-239 

tetrose, 236 

triose, 236, 236f 
moonlighting enzymes, 624 
morphogens, 1147 
motifs, 131-138, 136f-137f See also 

protein folding 
motor proteins, 175-179, 176f-179f 
M-protein, 177 

mRNA (messenger RNA), 271, 1021. 
See also RNA 
5' cap of, 1034-1035, 1066f 
artificial, in genetic code studies, 1067 
base pairing of with tRNA, 1070-1071, 1071f 
degradation of, 1048-1049, 1096 
early studies of, 283 
editing of, 1040, 1072, 1073-1074 
functions of, 283-284 
hairpin loops in, 1029f, 1049 
half life of, 1096 
length of, 284 

maternal, 1147, 1148-1150 

monocistronic, 284, 284f 

polycistronic, 284, 284f 

polypeptide coding by, 283-284 

poly(A) tail of, 1039-1040, 1039f 

processing of, 1033-1040 See also RNA 
processing in translation, 

stability of, 571 

synthesis of, rate of, 571-572 
a-MSH (a-melanocyte-stimulating hormone), 933 
mtDNA (mitochondrial DNA), 951 

genetic code variations in, 1070b-1071b 
mucins, 256 
Mullis, Kary, 317 

multidrug transporter (MDR1), 400 
multienzyme complex(es) 
in oxidative phosphorylation 
Complex I: NADH to ubiquinone, 712-714, 
713t, 714f 

Complex II: succinate to ubiquinone, 715, 715f 
Complex III: ubiquinone to cytochrome c, 

715-716, 716f, 71 7f 
Complex IV: cytochrome c to 0 2 , 716-718, 

717f, 718f 
electron carriers in, 712-718 
substrate channeling through, 637, 637f 
multifunctional protein (MFP), 664 
mul timer, 138 

multisubunit proteins, 83t, 84 
muramic acid, 240f 
muscle 

alanine transport of ammonia from, 681, 681f 
energy sources for, 918-919, 918f, 919f 



fast-twitch, 918 
heart, 919, 919f 

metabolism in, 918-919, 918f, 919f 
red, 918 

in regulation of carbohydrate metabolism, 
608, 608f 

slow-twitch, 918 

structure of, 176-177, 177f 

white, 918 
muscle contraction, 178-179, 178f 

ATP in, 509-510 

fuel for, 918-919, 9l8f,919f 
muscle fibers, structure of, 176-177, 177f 
muscle proteins, 175-179, 176f-179f 
mutagenesis 

oligonucleotide-directed, 312-313, 313f 

site-directed, 312 
mutarotation, 239 
mutase, 544 

phosphoglycerate, 537, 575t 
reaction of, 537f 
mutations, 29, 289, 947, 993. See also 
genetic defects 

alkylating agents and, 291f, 292 

apoptosis and, 737-738 

cancer-causing, 474-475, 477f, 637, 
993, 1003b 

citric acid cycle, in cancer, 637-638 

deletion, 993 

DNA damage from, 999, lOOOf 

endosymbiotic bacteria and, 739 

in error-prone translesion DNA synthesis, 

1001-1002 
in evolution, 29-30, 30f, 34-35, 1152 
in fatty acyl-CoA dehydrogenase, 

661, 662 
insertion, 993 

in mitochondria. See mitochondria, 

mutations in 
nonsense, 1094b 

oncogenic, 474-475, 477f, 637, 993, 1003b 
oxidative stress and, 720-721, 721f 
radiation-induced, 290, 291f 
silent, 993 
substitution, 993 
suppressor, 1094b 
wild-type, 30 
MutH, in DNA mismatch repair, 995-996, 995f, 
996f, 1003b 

MutL, in DNA mismatch repair, 995-996, 995f, 996, 
996f, 1003b 

MutS, in DNA mismatch repair, 995, 995f, 996, 

996f, 1003b 
MWC model, of protein-ligand binding, 

165, 165f 

myelin sheath, components of, 372, 372t 

myocardial infarction, 920 

myoclonic epilepsy and ragged-red fiber disease 

(MERRF),741 
myocyte, 918 
glucose in, control of glycogen synthesis from, 
581-582 
myofibrils, 176, 177f 
myoglobin, 129-131, 130f, 155, 155f 
heme group in, 130, 130s 
in oxygen binding, 155, 157-158, 157f . See also 

hemoglobin-oxygen binding 
structure of, 129-131, 130f, 155, 155f, 
159-160, 159f 

nuclear magnetic resonance studies of, 

133-134, 133f, 134f 
x-ray diffraction studies of, 132b-133b, 
132f-133f 
subunits of, 159-160, 159f 
myosin, 175-176, 175-176, 176f See also 
actin-myosin entries 
coiled coils in, 124 

in muscle contraction, 178-179, 178f 
phosphorylation of, 472 
structure of, 175-176, 176f 
in thick filaments, 176, 176f, 
178-179, 178f 



myosin-actin interactions, 178-179, 178f 
myristic acid, 344t 

myristoyl groups, membrane attachment of, 
380, 380f 

n 

N 2 See nitrogen 

N (Avogadro's number), 491t 

Na + . See sodium ion(s) 

NAD + (nicotinamide adenine dinucleotide), 297s 
in Calvin cycle, 783 
reduced form of See NADH 
NADH, 584b 
cytosolic, shuttle systems acting on, 731-732, 

731f, 732f 
dehydrogenase reactions and, 517-519, 
517f, 518f 

oxidation of, in plant mitochondria, 720, 722b 

in payoff phase of glycolysis, 535-538, 536f, 537f 
NADH dehydrogenase, 654, 713, 713t 
NADH ubiquinone oxidoreductase, 712-713. See 
also Complex I 

in oxidative phosphorylation, 714f 
NADP + ,517s 

dehydrogenase reactions and, 516-519, 517f, 
518f, 518t 

reduced form of See NADPH 

vitamin form of, deficiency of, 519 
NADPH, 26f, 584b 

in anabolic reactions, 812 

in Calvin cycle, 773, 776f, 778 

in cell protection against oxygen derivatives, 
559, 559f 

dehydrogenase reactions and, 516-519, 

517f, 518f 
in fatty acid synthesis, 811, 826f 
in glucose oxidation, 559, 560, 560f 
in glyceraldehyde 3-phosphate synthesis, 

773, 776f, 778 
in partitioning of glucose 6-phosphate, 

563, 563f 

in photosynthesis, 782-786, 782f-785f, 812 

synthesis of 
in adipocytes, 812, 812f 
in chloroplasts, 812 
in cytosol, 812, 812f 
in hepatocytes, 812, 812f 
Na + -glucose symporter, 403, 403 
Na + K + ATPase, 398-399, 398f, 403, 403f 

in membrane polarization, 450, 450f 
in retina, 463 

in membrane transport, in neurons, 920 
nalidixic acid, 960b, 960s 
nanos, 1148-1150 
naproxen, 818, 818s 
native conformation (protein), 29, 114 
native protein, 29, 114 
ncRNA (noncoding RNA), 1146 
near- equilibrium steps, in metabolic pathway, 

574, 574f 
nebulin, 177 
Neher, Erwin, 407, 407f 
NeuSAc, 258-259, 262 
neural transmission, steps in, 451f 
neuroendocrine system, 901 
neuron(s) 

anorexigenic, 933 

gustatory, 467, 468f 

membrane transport in, 920 

Na + channels in, 409-410, 451f, 452-453 

olfactory, 465-467, 466f 

orexigenic, 932 

photosensory, 462-463, 462f 

visual, 462^63, 462f 
neuronal signaling, 902, 902f 
neuropathy, optic, Leber's hereditary, 741 
neuropeptide Y (NPY), 932 
neurotransmitters 

biosynthesis of, from amino acids, 
878-882, 879f 

as hormones, 902 
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receptors for, 453^154 

release of, 453^54 
membrane fusion in, 388, 388f 
neutral fats See triacylglycerol(s) 
neutral glycolipids, 350f, 353f, 354, 355f 
neutral pH, 55 

neutrophilic displacement reaction, of ATP, 508, 508f 

newborn, PKU screening in, 698 

Nexavar, 476b 

NH 3 . See ammonia 

niacin (nicotinic acid), 519, 519s 

dietary deficiency of, 519 
nick translation, 983, 983f, 984, 989, 990f, 994t, 

997, 998f 
nicotinamide, 519s 

nicotinamide adenine dinucleotide (NAD + ), 297s 

reduced form of. See NADH 
nicotinamide adenine dinucleotide phosphate. 

See NADP + 
nicotinamide nucleotide-linked 
dehydrogenases, 709 
reactions catalyzed by, 709t 
nicotine, 519s 

nicotinic acetylcholine receptor, 410 
defective, 412t 

open/closed conformation of, 453, 454f 

in signaling, 453, 454f 

synaptic aggregation of, 384 
nicotinic acid (niacin), 519, 519s 

dietary deficiency of, 519 
Niemann-Pick disease, 356 
Nirenberg, Marshall, 1067-1068, 1067f 
nitric oxide (NO), 446 

as hormone, 906t, 909 

synthesis of, arginine as precursor in, 882, 882f 
nitric oxide (NO) synthase, 446, 909 
nitrification, 852 
nitrogen 

available, nitrogen cycle maintenance of, 

852, 852f 
cycling of, 486, 486f 

enzyme fixation of, nitrogenase complex in, 
852-857, 855f, 856f 

excretion of, 682-687, 683f, 684f, 685f, 686f, 687f 

reduction of, to ammonia, 852-854 

solubility of in water, 47, 47t 
nitrogen cycle, 852 

available nitrogen in, 752f, 852 
nitrogen metabolism, 851-860 

ammonia in, 857 

available nitrogen in, 852, 852f 

biosynthetic reactions in, 859, 879f 

enzyme fixation in, 852-857, 855f, 856f 

glutamate and glutamine in, 674 

glutamine synthetase in, 857-859, 858f 
nitrogen mustard, as mutagen, 291f, 292 
nitrogenase complex, 854^-857 

enzymes of, 855f 

nitrogen fixation by, 855, 855f, 856f 

nitrogen-fixing nodules in, 856, 856f 
nitrogen-fixing nodules, 856, 856f 
nitroglycerin, 446 
nitrous acid, as mutagen, 291, 291f 
nitrovasodilators, 446 

AT-linked oligosaccharides, 255-256, 255f, 373, 

1101-1104, 1102f 
NLS (nuclear localization sequence), 1104, H05f 
NMR spectroscopy See nuclear magnetic 

resonance spectroscopy 
NO. See nitric oxide (NO) 
NO synthase, 446, 909 
nocturnal inhibitor, 778 
nodules, nitrogen-fixing, 856, 856f 
nomenclature systems 

D,L, 17 

R,S, 17 

Nomura, Masayasu, 1076, 1076f 
noncoding RNA (ncRNA), 1146 
noncompetitive inhibitor, 201f, 202f, 203, 203t 
noncovalent bonds, 10 See also weak interactions 
nonequilibrium steps, in metabolic pathway, 
574, 574f 



nonessential amino acids, 860. See also 

amino acid(s) 
nonoxidative reaction, of pentose phosphate 

pathway, 560-563, 56 If 
nonpolar compounds. See hydrophobic compounds 
nonreducing sugars, 243, 244 
nonsense codons, 1069, 1069f See also codons 
nonsense mutations, 1094b 
nonsense suppressors, 1094b 
nonsteroidal anti-inflammatory drugs (NSAIDs), 

358-359, 818, 818s 
norepinephrine, 878, 907 

as neurotransmitter vs hormone, 902 
Northrop, John, 184 
NPY (neuropeptide Y), 932 
NS domains, 253-254, 253f, 254f 

in proteoglycans, 255f 
NSAIDs (nonsteroidal anti-inflammatory drugs), 

358-359,818, 818s 
NuA4, in chromatin remodeling, 1137t 
nuclear localization sequence (NLS), 1104, 1105f 
nuclear magnetic resonance (NMR) spectroscopy 

in carbohydrate analysis, 263, 264f 

protein structure determination, 133-134, 
I33f, 134f 

nuclear proteins, targeting of, 1104, 1105f 
nuclear receptors, 422, 422f, 905 
nucleases, 979 

nucleic acid(s), 14, See also DNA; RNA 
3' end of, 275 
5' end of, 275 

bases of. See base(s), nucleotide/nucleic acid; 

base pairs/pairing 
chemical synthesis of, 294, 295f 
components of, 9f 
evolution of, 30-32, 1056-1059 
5' end of, 275f 

hydrophilic backbones of, 275, 275f, 278 
long, 276 

nomenclature of, 272, 272t, 274 
nonenzymatic transformation of, 289-292 
nucleotides of, 271-277 See also nucleotide(s) 
phosphate bridges in, 274-276, 275f 
phosphodiester linkages in, 274-276, 275f 
polarity of, 275, 275f 
pyrimidine bases of, 272-274, 273, 274f 
short, 276 

structure of, 277-287 
base properties and, 276-277, 276f 
in DNA, 277-283. See also DNA structure 
overview of, 277-280 
in RNA, 284-285, 284f-286f 
schematic representation of, 276 
synthesis of. See DNA replication; translation 
nucleic acid probes, 311, 31 If 
nucleic acid sequences 
amino acid sequence and, 97f, 98-100, 948, 948f 
determination of, 98-100 
in evolutionary studies, 105 
nuclein, 278 

nucleoids, 3, 3f, 6, 6f, 970 

nucleophiles, in enzymatic reactions, 208f, 496, 496f 
nucleoside (s), 271 

nomenclature of, 273t, 274, 275t 
nucleoside diphosphate kinase, 510, 627, 888 

Ping-Pong mechanism of, 510f 
nucleoside diphosphates, 296, 296f 
nucleoside monophosphate kinase, 888 
nucleoside monophosphates, 296, 296f 

conversion of, to nucleoside triphosphates, 888 
nucleoside triphosphates, 296, 296f 

hydrolysis of, 296-297, 296f 

nucleoside monophosphate conversion to, 888 

in RNA synthesis, 508 
nucleosomes, 963, 964-965, 964f, 965f, 967f 

in 30 run fiber, 966, 968f 

acetylation of, 1137-1138 

positioning of, 965 
5 '-nucleotidase, 892 
nucleotide(s), 271 

abbreviations for, 272t, 273f, 296f 

absorption spectra of, 276, 276f 



in ATP hydrolysis, 296-297, 296f 

bases of, 271-274, 271-277, 272s-274s. See also 

base(s), nucleotide/nucleic acid 
components of, 271-274, 272f-274f 
depurination of, 290, 290f 
evolution of, 30-32, 1056-1059 
flavin, 519-520, 709 

bound to flavoproteins, 519-520, 520f, 520t 
functions of, 296-298 
linkage of, 274-276, 275f 
metabolism of, 570f, 913t 
AT-/3-glycosyl bonds of, hydrolysis of, 290, 290f 
nomenclature of, 272, 272t, 273f, 274 
nonenzymatic reactions of, 289-292, 290f, 291f 
nonenzymatic transformation of, 289-292 
phosphate groups of, 271-272, 271f, 272t, 

273f, 296 
purine See purine nucleotides 
pyrimidine, biosynthesis of, 886-887, 886f, 888f 

regulation of, 887, 887f 
regulatory, 298, 298f 
as second messengers, 298, 298f 
sequences of, schematic representation of, 

276, 276f 

structure of, 271-274, 272f-274f, 272s, 

272t, 273s 
sugar, 596-601 
formation of, 598, 600f 
in glycogen synthesis, 596-601, 600f, 624f 
synthesis of, 859, 879f, 882-896 
enzymes in, chemotherapeutic agents 
targeting, 894-896, 894f, 895f 
transphosphorylations, 511f 
transphosphorylations between, 510-511 
variant forms of, 274, 274s, 280-281, 281f 
nucleotide sequences 
amino acid sequence and, 97f, 98-100, 

948, 948f 
determination of, 98-100 
in evolutionary studies, 105 
schematic representation of, 276, 276f 
nucleotide sugar, 791 
nucleotide triplets See codons 
nucleotide -binding fold, 298 
nucleotide-excision repair, 994t, 997-998, 997f, 
998f, 1003b 
in bacteria, 994t, 997-998, 998f 
in humans, 998f 
nucleus, 3, 3f 

protein targeting to, 1104, H05f 
NURF, in chromatin remodeling, 1138 
Nusslein-Volhard, Christiane, 1147, 1149f 
nutrients, transport of, in blood, 920-921 

0 

0 2 . See oxygen 

obesity, 930-937 See also body mass 

Ochoa, Severo, 598, 1049, 1049f 

octadecadienoic acid, 344t 

octadecanoic acid, 344t 

odered water, 49, 49f 

O-glycosidic bond, 243 

Okazaki fragments, 979, 979f, 987-989 

synthesis of, 987-988, 987f 
oleate, 656-657, 815, 817f 

synthesis of, 815, 815f 
oleic acid, 344t 

olfaction, signaling in, 465-467, 466f 
oligo glucanotransf erase, 596. See also 

debranching enzyme 
oligo (al-»6) to (a->4) glucan-transferase, 596 
oligomers, 14, 83t, 84, 138 
oligonucleotide, 276 
oligonucleotide-directed mutagenesis, 

312-313, 313f 
oligopeptides, 82, 86-87 

oligosaccharides, 235. See also carbohydrate (s); 
disaccharides 
analysis of, 263-265, 264f 
chemical synthesis of, 263-265 
conformations of, 248, 248f 
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oligosaccharides (continued) 
diversity of, 257-258 

glycoprotein linkage to, 255-256, 255f, 373, 

1101- 1103, 1102f 

as informational molecules, 14, 257-262, 262f 
lectin binding of, 159f-262f, 258-261, 258t 
AMinked, 255-256, 255f, 373, 1101-1104, 1102f 
nomenclature of, 243-244 
O-linked, 255-256, 255f, 373, 1102-1103, 1102f 
in peptidoglycan synthesis, 797f 
separation and quantification of, 263, 264f 
structure of, 263, 264f 
synthesis of, 1101-1102, 1102f 
O-linked oligosaccharides, 255-256, 255f, 373, 

1102- 1103, 1102f 
omega-3 fatty acids, 345 
omega-6 fatty acids, 345 
(o oxidation, 664 

0 6 -methylguanine, mutation from, 999, lOOOf 
0 6 -metnylguanine-DNA methyltransferase, 

999, lOOOf 
OmpLA, structure of, 379f 
OmpX, structure of, 379f 
oncogenes, 473^74, 478f 
oncogenic mutations, 474^75, 477f, 637, 
993, 1003b 

in citric acid cycle, 637-638 
one gene-one enzyme hypothesis, 624, 948 
one gene-one polypeptide hypothesis, 948 
open reading frame (ORF), 1069 
open system, 20 
operators, 1117 
operons, 1119-1121 

his, 1130 

lac, 1119-1121, 1119f, H20f 
leu, 1130 
phe, 1130 

regulation of, 1126-1127, 1127f 
trp, 1127-1130, 1128f, 1129f 
opines, 33 If 

opsins, 463. See also rhodopsin 

absorption spectra of, 465, 465f 
optic neuropathy, Leber's hereditary, 741 
optical activity, 17, 73 
ORC (origin replication complex), 991 
orexigenic neurons, 932 
ORF (open reading frame), 1069 
organelles, 6f, 7-9 

cytoskeleton and, 9-10, 9f 

in plants, 774-775, 775f 
organic solvents, in lipid extraction, 363-364, 364f 
organotropic, 4f, 5 
Orgel, Leslie, 1056, 1056f 
oriC (DNA replication origin), 875f, 986-987 

in bacteria, 978, 978f 

in eukaryotes, 991 
origin replication complex (ORC), 991 
origin-independent restart of replication, 1009 
ornithine, 77, 78s 
ornithine 5 -aminotransferase, 863 
ornithine S -aminotransferase reaction, 863f 
ornithine decarboxylase, 879, 880b-881b, 881f 
ornithine transcarbamoylase, 684 
orotate, 882-883 
orthologs, 34, 104, 325 
oseltamivir, 259-260, 260s 
osmolarity, 51f, 52, 52f 
osmosis, 51f, 52 
osmotic lysis, 52 
osmotic pressure, 51f, 52 
osteogenesis imperfecta, 128 
outer membrane phospholipase A, structure 

of, 379f 
ovary, 909f 

overweight, 930-937. See also body mass 
oxaloacetate, 590, 622, 813, 813f 

in amino acid biosynthesis, 865, 866f-867f 

asparagine and aspartate degradation to, 
701-702, 702f 

in C 4 pathway, 789, 789f 

glucogenic amino acids in, 557t 

inglyceroneogenesis, 821f 



in glyoxylate cycle, in plants, 798, 798f 
oxidation of malate to, 628-630 
oxidases, 816b 

mixed-function, 664, 697, 816, 816b 
oxidation 
a, 664-665 See also a oxidation 

in endoplasmic reticulum, 665f 

in peroxisomes, 664-665, 665f 
of acetate, 631, 631f 
of amino acids, 673-702 See also amino 

acid oxidation 
ATP yield from, 733t 
/3, 647. See also fi oxidation 
of carbon, 513f 

in citric acid cycle, energy of, 630-631, 630f, 630t 
of fatty acids, 652-665. See also fatty 

acid oxidation 
of glucose, 24, 558-563. See also glucose oxidation 
of glyceraldehyde 3-phosphate to 

1,3-bisphosphoglycerate, 535-536, 536f 
of isocitrate to a-ketoglutarate and C0 2 , 

623-625, 624f 
of a-ketoglutarate to succinyl-CoA and C0 2 , 

625-626 

of malate to oxaloacetate, 628-630 

<o, in endoplasmic reticulum, 664, 664f 

of pyruvate to acetyl-CoA and C0 2 , 616-617, 616f 

of succinate to fumarate, 628 
oxidation-reduction reactions, 22, 500, 500f 

bioenergetics and, 512-521 

dehydrogenation in, 513-514, 513f 

electromotive force and biologic work in, 512 

enzymes in, 816b-817b 

half-reactions in, 512-513 

reduction potentials in, 514-515, 514f, 515t 

standard reduction potentials in, 515-516, 515t 
oxidative deamination, 679 
oxidative decarboxylation, 616 
oxidative pentose phosphate pathway, 775, 

785-786, 798 
oxidative phosphorylation, 708-741. 
See also phosphorylation 

ATP hydrolysis inhibition in, 733, 733f 

ATP synthesis in, 723-732, See also 
ATP synthesis 

ATP yield in, 733t 

ATP-producing pathways in, 734-735, 735f 
brown adipose tissue heat production in, 736 
brown fat heat production in, 736f 
cellular energy needs in, 733 
chemical uncouplers of, 724, 724f 
chemiosmotic theory of, 707 
in heart muscle, 919-920 
mitochondria in, 707-708 
electron-transfer reactions of, 708-722. See 

also electron- transfer reactions, mitochondrial 
gene mutations of, 740-741 
apoptosis and, 737-738 
disease associated with, 740-741 
endosymbiotic bacteria and, 739, 739f 
oxidative stress and, 720-721, 721f 
photophosphorylation and, 707-708. See also 

photophosphorylation 
reactive oxygen species and, 720-721, 721f 
regulation of, 732-735 
oxidative photosynthetic carbon cycle, 788-789 
oxidative reactions, of pentose phosphate pathway, 

559-560, 560f 
oxidative stress, role of mitochondria in,720-72l, 721f 
oxidoreductase, 518 
oxygen 
cycling of, 486, 486f 
electron transfer through, 513-514 
hemoglobin binding of, 154-170 See also 

hemoglobin -oxygen binding 
partial pressure of, 158 
solubility of in water, 47, 47t 
oxygen concentration, limiting, glycolysis in, 539 
oxygen consumption, ATP synthesis and, 

chemiosmotic coupling in, 729-730 
oxygen ester, free-energy hydrolysis of, 505, 506f 
oxygen transport, in blood, 920-921 



oxygenases, 816b 

mixed-function, 816 
oxygen-binding proteins, 154-170 
oxygen-evolving complex, 758 

water split by, 756-758, 758f 
oxytocin, 911s 

P 

P (peptidyl) binding site, ribosomal, 1089 

p53 mutations, 477 

Pace, Sidney, 1048 

pair-rule genes, 1147, 1150 

Palade, George, 1100, HOlf 

palindromic DNA, 281, 281, 282, 282f 

palmitate 

desaturation of, 815, 815f 

in fatty acid synthesis, 814, 814f 

release of, 811 

synthesis of, 808, 808f, 811, 811f 
palmitic acid, 344t 
palmitoleate, synthesis of, 815, 815f 
palmitoleic acid, 344t 

palmitoyl groups, membrane attachment of, 

380, 380f 
palmitoyl- Co A, 814 

oxidation of, ATP in, 654-655, 656t 
pancreas, 909f 

in glucose regulation, 922-926 
pancreatic a cells, 924, 924f 
pancreatic jS cells, 923-924, 924f 
pancreatic trypsin inhibitor, 676 
pancreatitis, acute, 677 
paracrine hormones, 906 

eicosanoid, 358-359, 358f 
paralogs, 34, 104, 325 
paralysis, toxic, 410-412 
Paramecium, membrane components 

in, 372t 
paramyosin, 177 

parasites, molecular, evolution of, 1058-1059 
parathyroid, 909f 

Parkinson's disease, protein misfolding in, 146 
passive transporters, 391 
Pasteur, Louis, 17, 184, 539, 569 
patch-clamp technique, 407, 407f 
pattern-regulating genes, 1147-1152 
Pauling, Linus, 103, 115f, 117, 123, 130, 190, 211 
PCNA (proliferating cell nuclear antigen), 992 
PCR (polymerase chain reaction), 317, 318f 
PDI (protein disulfide isomerase), in protein 

folding, 144 
PDK1 (pyruvate dehydrogenase kinase 1), 441 
P/2e ratio, 729 
pellagra, 519 

penicillin See also antibiotics 

mechanism of action of, 216-218 

plasmids in, 949 
pentose(s),236, 236s 

conformations of, 272, 273f 

nucleic acid, 272, 273f 

nucleotide, 236, 236s, 271-274, 272f-274f 
ring numbering conventions for, 271f, 274 
pentose phosphate pathway, 558, 721, 775, 
785-786, 798, 914 

general scheme of, 558, 560f 

glucose 6-phosphate in, 560-563, 561f, 
562f, 563f 

glycolysis and, 563, 563f 

NADPH in, 559-560, 560, 560f 

in NADPH synthesis, 812, 812f 

nonoxidative reactions of, 560-563, 561f 

oxidative, 562, 775, 785-786, 798 

oxidative reactions of, 559-560, 560f 

reductive, 562, 775 

Wernicke-Korsakoff syndrome in, 563 
pentose phosphates 

movement of, 799-800, 799f 

synthesis of, in Calvin cycle, 779-780, 780f-781f, 
781f, 782-783, 782f 
PEP See phosphoenolpyruvate (PEP) 
pepsinogen, 675 



Index 1-29 



peptic ulcers, 260, 260f 
peptide(s), 82. See also polypeptide(s); 
protein(s) 

amino acid residues in, 77, 78s, 82 

ionization behavior of, 82-83 

naming of, 82f 

size of, 83 

standard free-energy changes of, 493t 

structure of, 82, 82f, 83f 

synthesis of, 100-102, lOlf, 102t 

titration curves of, 82-83 
peptide bonds, 82f, 85, 115-116, H8f. 
See also bond(s) 

in a helix, 118f, 119-120 

cis configuration of, 121, 121f 

electric dipole in, 116f, 119-120 

formation of, in protein synthesis, 
1091, 1092f 

properties of, 115-116 

trans configuration of, 12 If 
peptide group, 115, 116f 
peptide hormones, 906, 906t, 907f 
peptide prolyl cis-trans isomerase (PPI), in protein 

folding, 144 
peptide translocation complex, 1101, llOlf 
peptidoglycans, 251t, 796 

bacterial synthesis of, 796, 797f 

in penicillin, 216-217 

structure of, 796, 796f 
peptidyl (P) binding site, ribosomal, 1089 
peptidyl transferase, 1091 
perilipin, 649 
permanent waves, 125 
permeability transition pore complex, 737 
permeases, 391. See also transporter (s) 
pernicious anemia, 659, 691 
peroxisome (s), 662 

j8 oxidation in, 662-663, 662f 
plants and, 662-663, 663f 

lipid metabolism in, 812f 
peroxisome proliferator-activated receptors 

(PPARs), 660-661, 936-937, 937f 
pertussis toxin, 427b 
Perutz, Max, 138, 138f 
PFK-1. See phosphofructokinase-1 (PFK-1) 
pH, 55-57, 56f, 56t 

of aqueous solutions, 55-57, 56f, 56t 

of blood, 62-63 

buffering and, 59-63, 60f, 61f 

enzymatic activity and, 56-57, 63f, 68 

enzyme, 204, 204f 

functional importance of, 56, 57 

in hemoglobin-oxygen binding, 166, 166f 

Henderson-Hasselbalch equation for, 60-61 

isoelectric, 73t, 80-81 
determination of, 90, 90f 

measurement of, 56-57 

neutral, 55 

scale for, 56-57, 56t 

standard free-energy changes and, 493t 

in titration curve, 58-59, 58f, 59f, 79-80, 79f 
pH optimum, 63 
phagocytosis, 172, 172f 
pharmaceuticals See also under drug and 
specific drugs 

genomics/proteomics in, 335-337 
phase variation, 1134, 1135f 
phe operon, 1130 
phenotype, 947 

phenotypic functions, of protein, 324 
phenylacetate, 686, 687s 
phenylacetyl-CoA, 686, 687s 
phenylacetylglutamine, 686-687, 687s 
phenylalanine, 75-76, 75s, 695, 869 

biosynthesis of, 869, 871f 

catabolic pathways for, 695f, 696f, 697f 
alternative, 697-698, 697f 

degradation of, to acetyl-CoA, 695-696, 695f 

genetic defects in, 696-698, 697f 

properties of, 73t, 75-76 
phenylalanine hydroxylase, 697, 869 

role of tetrahydrobiopterin in, 697, 697f 



phenylketonuria (PKU), 694t, 697 

newborn screening for, 698 

phenylalanine in, alternative catabolic pathways 
for, 697-698, 697f 
phenylpyruvate, 697 
pheophytin, 749 

pheophytin-quinone reaction center, 

749-751, 750f 
<j> angle, secondary structures and, 121, 122f 
Phillips, David, 213 
Phillips mechanism, 213-216 
phorbol esters, 436 
phosphagens, 511 
phosphatase, 627b 
phosphate 
as buffer, 57f, 58-59, 59f, 61 
inorganic 

in cells, 502t 

in glucose oxidation, 24 

in photosynthesis, 783-784, 783f 

in starch synthesis, 793, 794f 
in nucleotides, 271-272, 271f 

variant forms of, 274, 274f 
triose, interconversion of, 534-535, 535f 
phosphate bond, high-energy, 506 
phosphate translocase, 731 
phosphatidate, in triacylglycerol synthesis, 

821,821f 

phosphatidic acid, 350, 351f, 374, 377f, 821 

synthesis of, 820f, 821,824 

in triacylglycerol synthesis, 821, 821f 
phosphatidic acid phosphatase, in triacylglycerol 

synthesis, 821, 82 If 
phosphatidylcholine, 350, 351f, 352, 354s, 815, 
817f, 827, 839s 

membrane distribution of, 374, 377f 
phosphatidylethanolamine, 350, 351f 

in lipid synthesis, 824, 826f, 827, 828f 

membrane distribution of, 374, 377f 

synthesis of, 820-823, 820f 
phosphatidylglycerol, 350, 351f 

in lipid synthesis, 824, 826f 
phosphatidylinositol (s), 357-358 

membrane distribution of, 374, 377f 

synthesis of, 826f, 827-829, 827f, 828f, 829f 
in yeast, 827, 828f , 829f 
phosphatidylinositol 4,5-bisphosphate (PIP2), 350, 
351f, 357-358 

membrane distribution of, 374, 377f 

in plants, 359 

in signaling, 359 
phosphatidylinositol kinase, 826f, 827, 
827f, 829f 

phosphatidylinositol pathway, 432-433, 433f 
phosphatidylinositol 4-phosphate, membrane 

distribution of, 374, 377f 
phosphatidylinositol 3,4,5-trisphosphate (PIP 3 ), 
357-358 

in signaling, 433f, 441443 
phosphatidylserine, 827 

in lipid synthesis, 824, 826f, 827, 828f 

membrane distribution of, 374, 377f 
phosphoanhydrides, 297f 
phosphocreatine, 504s, 876, 918s 

cellular concentration of, 502t 

hydrolysis of, 504, 504s 

in muscle contraction, 918, 918f 
phosphodiester linkages, in nucleic acids, 

274-275, 275f 
phosphodiesterase, in vision, 463, 464f 
phosphoenolpyruvate (PEP), 213s, 537s, 538 

acetate as source of, 638 

dehydration of 2-phosphoglycerate to, 538 

enolase catalysis of, 213 

in gluconeogenesis, 553-557, 553t, 554f, 
555f, 926 

in glyceroneogenesis, 821f, 823 

in glyoxylate cycle, in plants, 798, 798f 

hydrolysis of, 504, 504f 

synthesis of, from pyruvate, 553-556, 554f 

transfer of phosphoryl group from, to 
ADP, 538 



phosphoenolpyruvate carboxykinase, 554s, 
555,917 

in triacylglycerol synthesis, 821f, 823, 823f, 
824, 917 

phosphoenolpyruvate carboxylase, in C 4 pathway, 
789, 789f 

phosphofructokinase-1 (PFK-1), 532, 533, 574, 575t 

regulation of, 585-587, 586f, 587f 
phosphofructokinase-2 (PFK-2), 588 
phosphoglucomutase, 596 

as catalyst in glucose metabolism, 597f 
6-phosphogluconate dehydrogenase, 560 
phosphogluconate pathway, 558. See also pentose 

phosphate pathway 
phosphoglucose isomerase, 532, 575t 
phosphoglutamase, 544 

2- phosphoglycerate, 213s, 537s 
conversion of 3-phosphoglycerate to, 

537-538, 537f 
dehydration of, to phosphoenolpyruvate, 538 
enolase catalysis of, 213, 213f 

3- phosphoglycerate, 504s, 537-538, 537s 
in amino acid biosynthesis, 863-864, 863f 
conversion of 

to glyceraldehyde 3-phosphate, 775, 777f, 

778, 782f 
to 2-phosphoglycerate, 537-538 
in glycolate pathway, 787f, 788 
Pi exchange for, 783-784, 783f, 784f 
in starch synthesis, 793 
synthesis of, 775, 776-778, 777f 
phosphoglycerate kinase, 536-537, 779f 
3-phosphoglycerate kinase, 778 
phosphoglycerate mutase, 537, 575t 

reaction of, 537f 
3-phosphoglyceric acid, 504s 
phosphoglycerides, 350f See glycerophospholipids 
2-phosphoglycolate, 786, 787f 

in glycolate pathway, 787-789, 787f, 788f 
phosphohexose isomerase, 532 
reaction mechanism of, 533f 
phosphoinositide 3-kinase (PI-3K), 

441^42, 442f 
phospholipase, 355, 355f, 365 
phospholipase A 
in eicosanoid synthesis, 817, 818f 
outer membrane, structure of, 379f 
phospholipase C, 357-358, 365, 432, 443f 
phospholipid(s), 349. See also lipid(s) 
head groups of, 349, 824, 826f, 827, 829 
lysosomal degradation of, 355, 355f 
membrane. See membrane lipids 
synthesis of, 824-831 
cytidine nucleotides in, 824, 825f, 826f 
in Escherichia coli, 825, 826f, 829f 
in eukaryotes, 827-829, 827f-829f 
head group attachment in, 824, 825f, 826f 
head group exchange reaction in, 

827, 828f 
salvage pathways in, 827-829, 828f 
steps in, 824 

in vertebrates, 827-829, 828f, 829f 
in yeast, 827, 828f, 829f 
transport of, 830 
phospholipid bilayer, 374, 374f 

formation of, 374, 374f 
phospholipid head group attachment, 824, 826f 
phospholipid head group exchange reaction, 

827, 828f 
phosphomannose isomerase, 545 
phosphopantetheine, 80f, 808 
phosphopentose isomerase, 560 
phosphoporin E, structure of, 379f 
phosphoprotein phosphatase 1, 603 
in glycogen metabolism, 606, 607f 
phosphoprotein phosphatase 2A, 588, 589f 
phosphoproteins, 84, 85t 
phosphoramidite method, of DNA synthesis, 

294, 295f 
5-phosphoribosylamine, 883 
5-phosphoribosyl- 1 -pyrophosphate 
(PRPP),861 
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phosphorolysis, 544 

of glycogen and starch, 544 

of glycosidic bonds, vs hydrolysis, 595-596 
phosphorus, covalent bonds of, 

499-500, 499f 
phosphorus-oxygen bond, 499-500, 499f 
phosphoryl group 

ATP and, 508 

function of, in glycolysis, 529f, 531 
phosphoryl group transfer, 499-500, 508f 
ATP in, 501-511 

chemical basis for, 501-503, 502f 
from 1,3-bisphosphoglycerate to ADP, 

536-537 

from inorganic polyphosphate, 502t, 511 
from phosphoenolpyruvate to ADP, 538 

phosphorylase, 627b 

phosphorylase a phosphatase, 603 

phosphorylase b kinase, 603 

phosphorylase kinase, 224, 233f 

phosphorylase phosphatase, 224, 233f 

phosphorylated compounds, free-energy hydrolysis 
of, 504, 504f, 505-506, 505f, 505t, 506f 
ranking of, 507f 

phosphorylation. See also autophosphorylation 
of acetyl-CoA carboxylase, 814, 814f 
of amino acid residues, 224-226 
of ATPase transporters, 397^00, 398f 
bioenergetics and, 501-511 
consensus sequences in, 225-226, 226f 
of cyclin-dependent protein kinases, 470-173, 

470f, 471f, 473f 
in DNA repair, 471, 472-473, 473f 
in DNA replication, 979-980, 980f 
in enzyme cascades, 420, 420f, 441, 441f 
in enzyme regulation, 224-226, 226f, 226t, 

227f, 233f 
of fructose 6-phosphate to fructose-1,6 

bisphosphate, 532 
in gene regulation, 1144 
of glucose, 532 

of glycogen phosphorylase, 223f , 224-226 

in heart muscle, 918 

multiple, 225-226, 226f 

oxidative, 708-741 See also oxidative 
phosphorylation 

postradiational, 1096f, 1097 

in protein targeting, 1103, 1103f 

proton gradient in, 759-760, 781f 

respiration-linked, 537 

of retinoblastoma protein, 472, 473f 

reversibility of, 226 

of rhodopsin, 464f, 465 

in RNA processing, 1049 

in signaling, 439-446. See also signaling 
in bacteria, 457-458, 458f 
in ^-adrenergic pathway, 423^31, 424f, 

428f-430f, 429t 
by jS-adrenergic receptor kinase, 431 
by CAMP, 431^32 
by cGMP, 445^46, 445f 
by G-protein-coupled receptor kinases, 431 
in IP 3 /diacylglycerol pathway, 432^33, 443f 
in JAK-STAT pathway, 443^44, 443f 
multivalent adaptor proteins in, 446-449 
in PI-3K-PKB pathway, 441-442, 442f 
in plants, 458^60 

by protein kinase A, 424f, 428^29, 429t, 431 
of receptor tyrosine kinases, 439—143, 
440f-443f 
substrate-level, 537 
in sucrose synthesis, 793, 794f 
in transcription, 1032, 1144 
phosphorylation potential (/K? p ), 502 
phosphotyrosine phosphatases (PTPases), 449 
phosphotyrosine-binding domains, 446—147 
photochemical event, central, in photophosphoryla- 

tion, 749-759, 750f-758f 
photochemical reaction centers, 747, 749-751 
bacterial types of, 749-751, 750f, 751f 
Fe-S, 751 

in plants, acting in tandem, 752-754, 752f, 753f 



photolithography, 326, 326f 
photolyases, 520 

DNA, 994t, 998-1000, 999f 

reaction mechanism of, 999f 
photon, 744 

absorbed, energy conversion of, 747-749, 748f 
photophosphorylation 

ATP synthesis by, 759-761, 759f, 760f, 798f 

central photochemical event in, 749-759, 
750f-758f 

chemiosmotic theory of, 707 

chloroplasts in, 707-708 

general features of, 742-744 

light absorption in, 744-749, 746f 

oxidative phosphorylation and, 707-708 See also 
oxidative phosphorylation 

stoichiometry of, 760 
photopigments, 465, 465f See also pigments 

absorption of visible light by, 746f 

primary and secondary, 745f 
photorespiration, 786-790, 787f, 788f, 789f 

in C 4 plants, 789-790, 789f 
photosensory neurons, 462-463, 462f 
photosynthesis, 742-764, 773-786 

action spectrum for, 747, 747f 

ATP in, 782-786, 782f-785f 

C 2 cycle in, 789 

in C 4 plants, 789-790 

in CAM plants, 790 

carbon fixation in, 774, 775-778, 794f. See also 
Calvin cycle 

electron flow in, 742-743 
carbon-assimilation reactions in, 742-743, 743f, 

783, 784-786, 785f 
C0 2 assimilation in, 752, 752f, 775-780 See also 

Calvin cycle 
dark reactions of, 783 
in evolution, 32, 34f, 761-764, 763f 
glycolate pathway in, 787-789, 787f 
light energy for, chlorophyll absorption of, 

745-747, 745f, 746f 
light-dependent reactions in, 742-743, 743f 
NADPH in, 782-786, 782f-785f 
photophosphorylation and, 742-744 See also 

photophosphorylation 
photorespiration and, 786-790, 787f, 

788f, 789f 
Pi-triose antiporter in, 783-784, 784f 
reductive pentose phosphate cycle in, 775 
starch synthesis in, 783-784, 791-792, 

791f, 793 
state transitions in, 756 
sucrose synthesis in, 244, 783-784, 792-794, 

792f, 793f 
water in, 65 
photosynthetic carbon reduction cycle, 774 
photosynthetic organisms, diversity of, 761 
photosynthetic reaction center, in purple 

bacteria, 376 
photo system, 747 
photosystem I/II, 752-754 
cytochrome b 6 f complex links of, 755-756, 755f 
integration of, in chloroplasts, 752-754, 752f 
supramolecular complex of, 754-755, 754f 
phototrophs, 4f, 5 
phycobilin, 746 
phycobiliprotein, 746 
phycobilisome, 74 7f 
phycoerythrobilin, 745s 
phylloquinone [vitamin KO, 361, 362s, 753 
phytanic acid, a oxidation of, 664-665, 665f 
phytol, 745-746 
Pi See phosphate, inorganic 
pi (isoelectric point), 73t, 80-81, 90t 

determination of, 90, 90f 
pigments. See also photopigments 
accessory, 747 

in light absorption, 745f, 747 
bile, heme as source of, 875-876, 876f 
in color vision, 465, 465f 
light-absorbing, of Halobacterium 
salinarum, 762 



PI-3K-PKB pathway, 441-442, 442f 
pili, 6f 

Ping-Pong mechanism, 200, 200f 
pioglitazone (Actos), 808s, 824, 936 
PIP 2 . See phosphatidylinositol 

4,5-bisphosphate (PIP 2 ) 
PIP 3 (phosphatidylinositol 3,4,5-trisphosphate), 
357-358 

in signaling, 433f, 442-443 
Pi-triose antiporter, 783-784, 784f 
pituitary 

anterior, 909, 909f, 910f 

posterior, 909, 909f, 9l0f 
pituitary hormones, 911s 
PKA. See protein kinase A (PKA) 
pfiT a (relative strength of acid), 58, 59f 

of R groups, 83 

in titration curve, 58-59, 59f 
p#a (relative strength of acid/base), 58f 

of amino acids, 73t, 79-81, 79f-81f 

effects of chemical environment on, 80, 80f 

in Henderson-Hasselbalch equation, 60-61 

of R groups, 73t 

in titration curve, 58f, 79-81, 79f 
PKB See protein kinase B (PKB) 
PKC See protein kinase C (PKC) 
PKG (protein kinase G), 445-446 
PKU See phenylketonuria (PKU) 
plant(s) 

aquaporins in, 404 

C 3 , 776 

C 4 , photosynthesis in, 789-790 
C 4 pathway in, 789-790 
CAM, 790 

carbohydrate metabolism in, 773-786, 797-800, 

798f See also Calvin cycle 
cell structure in, 6f 
cell wall synthesis in, 795-796 
cloning in, 330-332, 330f-333f 
desaturases in, 815, 817f 
DNA in, 951 

ethylene receptor in, 460, 460f 
gluconeogenesis in, 792f, 793, 793f 
glycolate pathway in, 787-789, 787f 
glycolysis in, 793, 793f 
growth hormones of, 330-331, 331f 
immune response in, 460, 461f 
leguminous, symbiotic relationship of 

nitrogen-fixing bacteria and, 856, 856f 
membrane components in, 372t 
membrane lipids of, 351f, 352 
metabolite pools in, 799, 799f 
mitochondria in, alternative mechanism for 

NADH oxidation in, 720, 722b 
mitochondrial respiration in, 786 
NADPH synthesis in, 812 
organelles in, 774-775, 775f 
osmotic pressure in, 52 

pentose phosphate pathway in, 775, 785-786, 798 
photochemical reaction centers in, acting in 

tandem, 752-754, 752f, 753f 
photorespiration in, 786-790, 787f, 788f, 789f 
photosynthesis in See photosynthesis 
signaling in, 359, 819, 906 
vascular, 359 
plant glyoxysome, 0 oxidation in, 663, 663f 
plant peroxisome, j8 oxidation in, 663, 663f 
plant substances, biosynthesis of, from amino 

acids, 878, 878f 
plasma, 921, 92 If 

plasma lipoproteins, transport of, 836-841, 839f 
plasma membrane, 3, 3f . See also membrane (s) 

bacterial, 6, 6f 

Ca 2+ pump, 397-398, 397f 

composition of, 370f, 372-373, 372t 

lipid rafts in, 384r-386, 386f 

lipopolysaccharides of, 256, 257f 

microdomains of, 384-386, 386f 

neuronal, transport across, 920 

permeability of, 52 

protein targeting to, 1103-1104, 1103f 
syndecan in, 253, 253f 
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plasma proteins, 921 
plasmalogens, 350, 35 If, 829, 830f 

double bond of, 829, 830f 

synthesis of, 829, 830f 
plasmid vectors, 307-308, 307f, 309f 
plasmids, 6, 307-308, 949, 950f 

antibiotic resistance -coding, 949 
plasmodesmata, 789-790 
Plasmodium falciparum, inhibition of, 559 
plastids, 774-753, 775f 

evolution of, 33, 34f 
plastocyanin, 753 
plastoquinone, 362s 
plastoquinone (PQa), 753 
platelet, 921, 921f 
platelet-activating factor, 350, 351f 

synthesis of, 829, 830f 
platelet-derived growth factor receptor, 449 
PLC (phospholipase C), 432 
plectonemic supercoiling, 961, 962f 
PLP. See pyridoxal phosphate (PLP) 
P/O ratio, 729 
poisons 

ion channels and, 410-412 

translation inhibition by, 1098-1099 
pol, 1051, 1051f 

frameshifting and, 1072 
pol II See RNA polymerase II and related entries 
Polanyi, Michael, 190 
polar lipids, transport of, 830 
polarity 

of amino acids, 74 

in embryonic development, 1147 

hydrophilicity/hydrophobicity and, 46^49, 46t, 
47f, 47t, 48f 
polio virus, icosahedral symmetry of, 
140, 140f 

polyacrylamide gel electrophoresis (PAGE), 88-89, 

89f See also electrophoresis 
polyadenylate polymerase, 1039f, 1040 
polycistronic mRNA, 284, 284f 
polyclonal antibodies, 173 
polylinkers, 306f, 307 

polymerase chain reaction, in DNA fingerprinting, 
320b-321b 

polymerase chain reaction (PCR), 317, 318f 
polymorphic protein, 93 
polynucleotide (s), 276 

synthetic, in genetic code studies, 1067 
polynucleotide kinase, 305t 
polynucleotide phosphorylase, 1049, 1067 
polypeptide (s), 82. See also peptide (s) 

size of, 83 

vs proteins, 82 
polypeptide chain elongation, in protein synthesis, 
1076, 1091-1094. See also protein synthesis, 
elongation in 
polyphosphate, inorganic, 502t, 511, 511s 

as phosphoryl donor, 502t, 511 
polyphosphate kinase-1 (PPK-1), 511 
polyphosphate kinase-2 (PPK-2), 511 
polysaccharide(s), 14, 235, 244-252. See also 
carbohydrate (s) 

in cell communication, 252-257, 253f 

classification of, 244-245, 251t 

conformations of, 248, 248f 

in extracellular matrix, 249 

folding of, 247-248, 248f 

fuel storage in, 244-252 

functions of, 245-247, 251t, 252 

glycoconjugate, 252-257, 253f 

heteropolysaccharides, 249-251 

homopolysaccharides, 244-248 

hydrolysis of, 246, 543, 552f 

molecular size of, 251t 

molecular weight of, 245 

repeating unit in, 25H 

structure of, 244-245, 245f, 246f, 247-248, 248f, 
249f, 251t 

weak interactions in, 247-248 
poly(A) site choice, 1040 
polysomes, 1095, 1096f 



poly(A) tail, 1039-1040, 1039f 
polyunsaturated fatty acids (PUFAs), 345 
Pompe's disease, 599t 
Popj'ak, George, 835, 835f 
porins, 379, 708, 708f 

/3-barrel structure of, 379, 379f, 391 
porphobilinogen, 873 
porphyrias, 873, 875b 
porphyrin(s), 873 

glycine as precursor of, 873, 874f 
porphyrin ring, 154, 155f 
Porter, Rodney, 172 
porters. See also transporter (s) 
positive -inside rule, 379 
posterior pituitary, 909, 909f 
posttranslational modifications, in protein 

synthesis, 1075t, 1076, 1096-1098 See also 
protein synthesis 
potassium, blood levels of, 921 
potassium ion channels, 407^09, 408f, 451, 451f 

ATP-gated, 924 

defective, diseases caused by, 412t 
in glucose metabolism, 924, 924f 
in signaling, 451, 451f, 453^54 
structure of, 379f 
potassium ion concentration, in cytosol vs. 

extracellular fluid, 451t 
potassium ion transport, Na + K + ATPase in, 

398-399, 398f 
potential energy, of proton-motive force, 720 
PPARs [peroxisome proliferator-activated 

receptors), 660-661, 936-937, 937f 
ppGpp (guanosine tetraphosphate), 298, 298s 
PPi (inorganic pyrophosphatase), in plants vs. 
animals, 792 

PPI (peptide prolyl cis-trans isomerase), in protein 

folding, 144 
PPK-1 (polyphosphate kinase-1), 511 
PPK-2 (polyphosphate kinase-2), 511 
Prader-Willi syndrome, 937 
pravastatin (Pravachol), 842b-843 
pRb (retinoblastoma protein), 472, 473f 
prebiotic chemistry, 30-31 
prednisolone, 359, 359s 
prednisone, 359, 359s 
prenylation, 830f, 845 
preproinsulin, 906, 907f 
pre-replicative complexes (pre-RCs), 991 
preribosomal rRNA (pre-rRNA), processing of, 

1042, 1043f 
pre-steady state, 195 
primary structure, of proteins, 92, 93-100 
primary systemic amyloidosis, 146 
primary transcript, 1034 

splicing of, 1038f 
primases, 985, 986t, 987-988, 989t 
primer 

in DNA replication, 980f, 981f, 984, 1051 

in RNA replication, 1051 
primer terminus, 980, 981f 
priming, 605, 605f 

of glycogen synthase kinase 3 phosphorylation, 
605, 605f 
primosome, 987 

replication restart, 1009 
prion diseases, 147-148, 148f, 149f 
prion protein (PrP), 148 
probes 

fluorescent, 434b-436b 

nucleic acid, 311, 31 If 
procarboxypeptidase A, 676 
procarboxypeptidase B, 676 
processivity, of DNA polymerases, 980 
prochiral molecules, 629 
proenzymes, 227 

progesterone, synthesis of, 844-845, 844f 
programmed cell death, 477^78, 478f See 

also apoptosis 
prohormones, 906 
proinsulin, 906, 907f 
prokaryotes, 3 See also bacteria 
proliferating cell nuclear antigen (PCNA), 992 



proline, 75, 75s, 698-699, 861 

in a helix, 119, 121f 

in 0 sheet, 121, 121f 

in /3 turns, 121, 121f 

biosynthesis of, 861-863, 862f 

in collagen, 124f, 125, 126-127 

conversion of, to a-ketoglutarate, 698-699, 698f 

properties of, 73t, 75 
proline-rich activation domains, 1143 
prolyl 4-hydroxylase, 127 
promoters, 1025-1028, 1025f, 1027f, 1031f, 
1116-1118 

expression vectors and, 312, 312f 

specificity factors of, 1025, 1117 
proofreading 

in transcription, 981-982, 981f, 1024 

in translation, 1081-1083, 1093-1094 
proopiomelanocortin (PMOC), 906, 907f 
propionate, 657 

Propionibacteriumfreudenreichii, in 

fermentation, 551 
propionyl-CoA, 657 

oxidation of, 657, 660f 
propionyl-CoA carboxylase, 657 
proplastid, 775 
propranolol, 423s 
proproteins, 227 

prostaglandin(s), 358f, 359. See also eicosanoid(s) 

synthesis of, 817, 818f 
prostaglandin Ej, 431, 459s 
prostaglandin G 2 , 817 
prostaglandin H 2 , 817, 818f, 819 
prostaglandin H 2 synthase, 817 
prostaglandin inhibitors, 818-819, 818s, 907 
prosthetic groups, 84, 85t, 184 

heme as, 154 

posttranslational addition of , 184, 1098 
protease (s) 

in amino acid sequencing, 95-96, 96t 

regulation of, 227 
protease inhibitors, 218-219, 219f, 1052 
proteasomes, 3, 471, 1108 
protein(s), 14, 82. See also gene(s); 
specific proteins 

alios teric, 162 

amino acid composition of, 84, 85t See also 

amino acid(s) 
amphitropic, 375 
body stores of, 927t 
bound water molecules in, 50-51, 51f 
catalytic, 25 

cellular concentration of, regulation of. 

See gene regulation 
conformation of, 113-114 
conjugated, 84, 85t 
crude extract of, 85 
culling of, 259 
degradation of, 1107-1109 
denaturation of, 140-142, 141-142, 141f 
enzyme. See enzyme (s) 

enzyme degradation of, in amino acid catabolism, 

674-677, 676f 
evolution of, 31-32, 31f, 136-138. 

See also evolution 
evolutionary significance of, 1058 
fibrous, 123-128. See also fibrous proteins 
flavoprotein, 519-520, 519-520, 520f, 520t 
folding of See protein folding 
functional classification of, 35 
functions of. See protein function 
fusion, 313, 387 
G. See G protein(s) 
globular, 123, 129-136. See also 

globular proteins 
as glucose source, 927t, 928 
half-life of, 572, 572t, 1107 
homologous, 34, 104 
immune system, 170-175 
inhibitory, in ATP hydrolysis during ischemia, 

733, 733f 
intermediate filament, 123-124 
iron-sulfur, 711 
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protein (s) (continued) 

Fe-S centers of, 71 If 
isoelectric point of, 73t, 80-81, 90t 

determination of, 90, 90f 
membrane See membrane proteins 
misfolded, 145-148 

in mitochondrial electron-transfer chain, 713t 
molecular weight of, 83t 
estimation of, 89, 90f 
motor, 175-179, 176f-179f 
multifunctional, 664 
multimeric, 138 
multisubunit, 83t, 84 
naming conventions for, 976 
native, 29, 114 

nuclear, targeting of, 1104, 1105f 
oligomeric, 83t, 84, 138 
orthologous, 34, 104 
overview of, 71 
oxygen-binding, 154-170 
paralogous, 34 
phosphoproteins, 85t 
phosphorylation and dephosphorylation 

of, 573f 
plasma, 921 
polymorphic, 93 

polypeptide chains in, 83-84, 83t See also 

polypeptide (s) 
posttranslational processing of, 1075t, 1076, 

1096-1098 See also protein synthesis; protein 

synthesis, 
proproteins, 227 

proteolytic activation of, 227, 227f 
protomeric, 138 
RAG, 1015-1016, 1016f 
regulatory See also gene regulation 

DNA-binding domains of, 1121-1124 
renaturation of, 141f 
respiratory, 739t 

mitochondrial gene encoding of, 739t 
ribosomal, 1076t 

synthesis of, rRNA synthesis and, 1131-1132, 
1131f, 1132f 
Rieske innvsulfur, 711, 716f 
scaffold, in chromatin, 968, 968f 
separation/purification of, 85-92 

column chromatography in, 85-88, 86f, 88t 

dialysis in, 85 

electrophoresis in, 88-89, 88-90, 89f-91f 

for enzymes, 88t, 91-92, 91f 

fractionation in, 85-88, 86f 

isoelectric focusing in, 90, 90f 

protocols for, 88t 
signaling See signaling proteins 
structure of, See protein structure 
in supramolecular complexes, 29 
synthesis of See protein synthesis 
transport of. See membrane transport; 

transporter (s) 
trifunctional, 654 
uncoupling (thermogenin), 736 
as universal electron carrier, 516 
protein binding See protein-ligand interactions 
protein catabolism, in cellular respiration, 616f 
protein chips, 328 

Protein Data Bank (PDB), 113f, 129 
protein disulfide isomerase (PDI), in protein 

folding, 144 
protein domains, 135 
protein families, 138 

protein folding, 29, 29f, 1075t, 1076 See also 
tertiary structure 
assisted, 143-145, 144f, 145f 
chaperones in, 143-144, 144f, 145f 
chaperonins in, 144, 144f, 145f 
domains and, 135, 135f 
energy for, 114-115 
errors in, 145-148 

in globular proteins, 124-125, 129-135, 130f, 

131-135 
glycoproteins and, 256 
molten globule in, 143 



motifs (folds) in, 131-138, 136f-137f 

oligosaccharides and, 256 

patterns in, 136-138, 136f-137f 

protein disulfide isomerase in, 144 

representation of, 130f 

rules for, 135-136 

secondary structures in, 131-136, 
135f-137f, 148 

steps in, 142-143, 142f 

thermodynamics of, 143, 143f 
protein function, 71f, 153-179 See also protein- 
ligand interactions 

amino acid sequence and, 93 

analysis of, 324-329 
comparative genomics in, 325, 326, 

-328, 328f 
DNA microarrays in, 325-326, 327f 
epitope tagging in, 316f, 317, 328 
gene inactivation in, 324 
protein chips in, 328 
protein tags in, 328 

two-dimensional gel electrophoresis in, 325 
yeast two-hybrid analysis in, 329, 329f 

catalytic See enzyme (s) 

cellular, 324, 325-329 

conformational changes and, 154 

expression patterns and, 324-325 

levels of, 324 

molecular, 324, 328-329 

multiple, 624b-625b 

noncatalytic, 154 

phenotypic, 324 

principles of, 153-154 

structural correlates of, 113, 123, 129, 
324-325 

protein kinase(s), 224. See also specific types 
AMP-activated, 576 
autoinhibition of, 447, 447f 
^-adrenergic receptor, 430f, 431 
Ca 2+ /calmodulin-dependent, 437, 437t 
cAMP-dependent See protein kinase A (PKA) 
in cell cycle regulation, 469^73 
cGMP-dependent, 445-446 
consensus sequences for, 225-226, 226t 
cyclin-dependent, 469^73, 470f, 471f 
G protein-coupled, 431 
His-specific, 457, 458f 
phosphorylation by, 224-226, 441, 441f 
receptorlike, in plants, 460, 460, 461f 
in signaling, 429t, 431, 437, 437t, 439-446, 

446-449, 447f 
substrate specificity of, 225, 226t 
in transcription, 1032 
tyrosine-specific, 439, 439^43, 440f-443f 

epidermal growth factor receptor as, 
442, 449 

insulin receptor as prototype of, 439-443, 
440f-443f 

platelet-derived growth factor receptor as, 

442, 449 
in rafts, 449 
protein kinase A (PKA), 428 
activation of, 428, 428f, 429f, 441, 4245 
AKAPs and, 431-432 
in ^-adrenergic pathway, 424, 428-429, 
428f, 429f 

enzymes/proteins regulated by, 428, 429t, 431 

inactivation of, 424f, 428, 428f 

measurement of, by FRET, 434b-436b 
protein kinase B (PKB) 

activation of, 441, 442f 

in signaling, 441^42, 442f 
protein kinase C (PKC), 357-358, 433 

activation of, 357-358 
protein kinase G (PKG), 445-446 
protein kinases, in cancer treatment, 

475b-476b 
protein moonlighting, 624b-625b 
protein phosphatases, 224r-226 
protein sequences. See amino acid sequences; 

nucleic acid sequences 
protein sorting, 1103-1104 



protein structure, 113-148 
a helix and, 117-119, 118f, 120f, 121 
amino acid sequences and, 28f, 29, 102 
analysis of, 93-100. See also amino acid 

sequencing 
£ conformation and, 121-122, 121f, 122f 
classification of, 136-138, 136f-137f 
covalent, 93-100 

database for, 113f, 136-138, 136f-137f 
free energy of, 114 

functional correlates of, 113, 123, 129, 140-141, 

324-325 
glycoproteins and, 256 
helical symmetry in, 139, 140, 140f 
key concepts for, 113 
ligand binding and, 158 

motifs (folds) in, 131-138, 136f-137f. See also 

protein folding 
nuclear magnetic resonance studies of, 133-134, 

133f, 134f 
oligosaccharides and, 256 
overview of, 113-117 
primary, 92f, 93-100 

quaternary, 92f, 123, 138-140, 139f, 140f.5ee 

also quaternary structure 
Ramachandran plots for, 117, 117f, 122f 
representations of, 130f 
rotational symmetry in, 139-140, 139f 
secondary, 92f, 117-122, 118f, 120f-122f See 

also secondary structure 
stability of, 114-115 
supersecondary, 131-138 
tertiary, 92f, 123-138. See also tertiary structure 
three-dimensional, 29, 29f, 113-148 
weak interactions and, 50-51, 51f, 114-115 
x-ray diffraction studies of, 129, 132b-133b, 

132f-133f 
protein superfamilies, 138 
protein synthesis, 100-102, lOlf, 102t, 1075-1099 
amino acid activation in, 1075, 1075t, 1081-1084, 

1082f-1084f 
aminoacyl-tRNA binding in, sites of, 1089, 1089f 
aminoacyl-tRNA formation in, 1081-1084, 

1082f-1084f 
codons in, 1066-1067, 1068-1074. 

See also codons 
elongation in, 1075t, 1076, 1091-1094, 

1091-1094 

aminoacyl-tRNA binding in, 1091 

direction of, 1088, 1088f 

elongation factors in, 1091 

peptide bond formation in, 1091, 1092f 
errors in, 1081-1083 
evolutionary significance of, 1078b 
fMet-tRNA fflet in, 1088 
frameshifting in, 1072 
inhibition of, 1098-1099 
initiation of, 1075t, 1076, 1088-1091 

in bacteria, 108S-1089, 1089f 

in eukaryotes, 1089-1091, 1090f 

initiation complex in, 1089, 1089f, 1090f 

initiation factors in, 1091t, 1144 

Shine-Dalgarno sequences in, 1089-1090, 1090f 
mRNA degradation in, 1096 
overview of, 1075-1076, 1075t 
polypeptide release in, 1075t, 1076 
polysomes in, 1095, 1096f 
posttranslational modifications in, 1075t, 

1096-1098, 1976 

amino acid modifications, 1097, 1097f 

ammo-ternimaVcarboxyl-terminal 
modification, 1096 

carbohydrate side chain attachment, 1097 

isoprenyl group addition, 1097-1098, 1097f 

loss of signal sequences, 1096-1097 

prosthetic group addition, 1098 

proteolytic processing, 1098 
proofreading in, 1081-1083, 1093-1094 
protein folding in, 1075t, 1076, 1096 
rate of, 142, 1065 
regulation of See gene regulation 
ribosome as site of, 1065-1066, 1076 
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ribozyme-catalyzed, 31f, 32, 1056-1059, 1078b 

RNA world hypothesis and, 31f, 32, 1057-1059 

rRNA synthesis and, 1131-1132, 1131f, 1132f 

simultaneous, 1095, 1096f 

steps in, 1075-1076, 1075t 

termination of, 1075t, 1076, 1094, 1095f 

thermodynamics of, 1095 

transcription coupled with, 1095-1096, 1096f 

transcriptional. See transcription 

translational. See translation 

translocation in, 1091-1093, 1093f 
protein tagging, 316-317, 316f, 328 
protein targeting, 1100-1109 

to chloroplasts, 1104 

in endoplasmic reticulum, 1100-1101, HOlf 
glycosylation in, 1101-1104, 1103f 
in Golgi complex, 1102-1103, 1103f 
lectins in, 260-261 
tolysosomes, 1103-1104, 1103f 
to mitochondria, 1104 
to nucleus, 1104, 1105f 
peptide translocation complex in, 1101, HOlf 
to plasma membrane, 1103-1104, 1103f 
receptor-mediated endocytosis in, 1106-1107 
signal recognition particle in, 1101, HOlf 
transport mechanisms in, 1102-1103, 1103f, 
See also membrane transport 
protein turnover, 572 

protein tyrosine kinases. See tyrosine kinases 
protein-ligand interactions, 153-154 

allosteric, 162, 162f 

binding equilibrium in, 156 

binding sites for, 153, 162, 170 

complementary, 170-175 

conformational changes in, 154, 161f, 162, 162f 

cooperative binding in, 160-165, 162f 

enzyme. See enzyme(s) 

graphical representations of, 156f, 157f 

heterotropic, 162 

homotropic, 162 

in immune system, 170-175 

induced fit and, 153 

ligand concentration and, 156-157 

models of 
MWC (concerted), 165, 165f 
sequential, 165, 165f 

modulators in, 162 

of oxygen-binding proteins, 154-170 See also 
hemoglobin-oxygen binding 

principles of, 153-154 

protein structure and, 158 

quantitative descriptions of, 155-158, 157f, 
162-165 

regulation of, 154 

reversibility of , 153 

specificity of, 153 
protein-protein interactions, analytic techniques 

for, 325-329 
proteoglycan aggregates, 254 
proteoglycans, 250, 252-255, 253f 
proteolysis, 1107-1109, 1108f 

ATP-dependent, 1107 

in protein activation, 227, 227f 

ubiquitin-dependent, 1107-1109, 1108f 
proteolytic enzymes, regulation of, 227, 227f 
proteome, 14, 100, 324, 572 
proteomics, 303, 324 

applications of, 335-338 
prothrombin, 227, 362 
protomers, 84, 138 

proton flow, through cytochrome b 6 f complex, 755f 

proton gradient 
in ATP synthesis, 724, 725f, 726, 726f 
conservation of, in mitochondrial electron- 
transfer reactions, 718-720, 720f 
in electron flow and phosphorylation, 759-760 

proton hopping, 51, 51f, 54f, 55 

proton pumps See ATPase(s); transporter (s) 

proton transfer, in acid-base catalysis, 192-193, 193f 

proton-motive force, 720, 720f 
bacterial flagella rotation by, 739, 739f 
energizes active transport, 730-731, 730f 



proto-oncogenes, 473-474, 474f 
protoporphyrin, 154, 873 
proximal His, 168 

PRPP (5-phosphoribosyl-l-pyrophosphate), 861 
Prusiner, Stanley, 148 
pseudoinosine, 274s 
pseudouridine, 1045 
psicose, 237s 

^ angle, secondary structures and, 121, 122f 
PTB domains, 447 
PTEN, 442 

P-type ATPases, 396-397, 397f, 449 
P-type Ca 2+ pump, 397-398, 397f 
puffer fish poisoning, 410, 412s 
pulsed field gel electrophoresis, 310 

in cloning, 310 
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self-splicing introns, 1035, 1058 

enzymatic properties of, 1047-1049, 1047f 

evolutionary significance of, 1057f, 1058 
semiconservative replication, 977 
sensory neurons 

gustatory, 467, 467f 

olfactory, 465^68, 466f 

visual, 462-465, 462f 
sequenator, 95 

sequence polymorphisms, 319 

in DNA fingerprinting, 319b-321b 
sequence-tagged sites, 316 
sequential feedback inhibition, 873 
sequential model, of protein-ligand binding, 165, 165f 
sequential reactions, in gluconeogenesis, 

556-557, 556t 
SERCA pump, 397-398 
serine, 9s, 75s, 77, 692, 863 

biosynthesis of, 863-864, 863f 

chymotrypsin acylation/deacylation of, 205-207 

in chymotrypsin mechanism, 207 

degradation of, to pyruvate, 692, 692f, 693f 

in glycolate pathway, 787-789, 787f, 788f 

in lipid synthesis, 824, 826f 

phosphorylated, 224-226 
binding domains for, 446-449, 447f 
in plants, 460, 46lf 



properties of, 73t, 77 

in proteoglycans, 252-253, 253f 
serine dehydratase, 692 

reaction mechanism of, 693f 
serine hydroxymethyltransf erase, 
692, 864 

reaction mechanism of, 693f 
serotonin, 453s, 459s, 878 

receptor for, as ion channel, 410 
serpentine receptors, 424. See also G protein- 
coupled receptor (s) 
serum albumin, 649-650, 916 
serum amyloid A (SAA), 146-147 
seven transmembrane segment receptors. See G 

protein-coupled receptor(s) 
severe combined immunodeficiency, gene therapy 

for, 336-337 
sex hormones, 359, 359s, 459s, 906t, 908 

steroid, 359, 359s 

synthesis of, 844-845, 844f 
SH2 domain, 439, 443f, 444, 447-148, 447f 
Shafrir, Eleazar, 822 
Sharp, Phillip, 1035 
shellfish poisoning, 410 
Shine, John, 1088 

Shine-Dalgarno sequence, 1088-1089 
shivering thermogenesis, 919 
Shoemaker, James, 700b 
short tandem repeats (STRs), in DNA 

fingerprinting, 3l9b-321b 
shuttle vectors, 310 

sialic acid, 240, 240s, 258-259, 262, 354s 

in gangliosides, 354, 354s 
sialoadhesins, 262 
sickle cell disease, 168-169, 168f 
sickle cell trait, 168 
siglec-1, 261-262 
cr cycle, 1028, 1028f 

o-subunit, of RNA polymerase, 1025-1028, 

1025f, 1028f 
signal recognition particle (SRP), 1101, HOlf 
signal sequence(s), 1100 
of insulin, 906 

posttranslational removal of, 1096-1097 

in protein targeting, 1100-1106 
in bacteria, 1104-1106 
in eukaryotes, 1100-1101, 1104-1106 
signal transduction, 419, 422f , See also signaling 
signaling, 419-479 

acetylcholine receptor in, 453, 454f 

adaptor proteins in, 431, 446-449 

adhesion receptors in, 422, 422f 

adrenergic receptors in, 423-431 

autoinhibition in, 447, 447f 

autophosphorylation in, 439. See also 
autophosphorylation 

in bacteria, 457-458, 458f, 459t 

basic mechanisms of, 422, 422f 

/J adrenergic response in, termination of, 430 

blood in, 920 

carbohydrates in, 262f 

caveolae in, 385, 386f 

cooperativity in, 420, 420f 

cross talk in, 444 

eicosanoids in, 819 

enzyme cascades in, 420, 420f, 441, 441f 

in plants, 460, 460f, 46 If 
in epinephrine regulation, 423-431, 424f 
evolutionary aspects of, 443, 467^68 
fatty acids in, 819 

G protein-coupled receptors in, 422, 

422f, 423-431 
gated ion channels in, 410, 422, 422f, 450f, 

451^53, 451f 
in gene regulation, in eukaryotes, 1143-1144 
guanylyl cyclase in, 445-446, 461f 
in gustation, 462f, 463f, 467 
hormonal, 902, 902f, 905-906, 905f 

in hormonal cascade, 910-911 

signal amplification in, 911 

target tissues in, 909-910, 910f 
in hormonal cascade, 91 If 



in insulin regulation, 439-443, 440f-443f 
integration in, 420, 420f, 454 
lipids in, 357-359 
in mammals, 459t 

membrane polarization in, 430f, 450-453, 

450f, 451f 
membrane potential in, 450, 450f 
membrane rafts in, 449 
neuronal, 902, 902f 
in olfaction, 465^67, 466f 
overview of, 419—123 

phosphatidylinositol in, 826f, 827, 827f, 829f 
phosphorylation in, 440f-443f, 442^43 
in plants, 359, 458-460, 458f-461f, 459t, 
819, 906 

protein interactions in, 446-449, See also 

signaling proteins 
protein kinases in. See protein kinase (s) 
proteoglycans/oligosaccharides in, 252-255, 

257-262 
in proteolysis, 1109 

receptor desensitization in, 420, 420f, 421f, 

430^31, 441f 
receptor enzymes in, 422, 422f, 439—146 
receptor-ligand binding in, 420, 420f, 421b 
second messengers in, 431-438, 467-468. 

See also second messengers 
signal amplification in, 420, 420f, 429, 442, 

905, 91 If 
signal transduction in, 420f, 422f 
signal variety in, 420t 
signal-receptor affinity in, 420, 420f 
specificity in, 419, 420f 
steps in, 451f 

steroid hormones in, 359, 359s 
steroid receptors in, 422, 422f 
termination of, 442 
two-component systems in 

in bacteria, 457^58, 458f 

in plants, 457-460, 460f, 461f 
tyrosine kinases in, 439-444, 441f 
in vision, 462-465, 462f-464f 
signaling pathways 
jS-adrenergic, 423-131, 424f, 428f-430f 
conservation of, 467-468, 468f 
IP 3 /diacylglycerol, 432-433, 433f 
JAK-STAT, 443^44, 443f, 933, 933f 
phosphatidylinositol, 432-433, 443f 
PI-3K-PKB, 441-442, 442f 
signaling proteins, 446-449 
binding domains of, 446-449 

examples of, 448f 

IRS-1, 439, 441f, 442, 446^47 

multiple, 446-449, 448f 

PTB, 447 

SH2, 439, 446^47, 447f 
specificity of, 446, 447f 

conservation of, 467-468, 468f 

interaction of, 446-449 

multivalent, 446-149, 448f 
signaling systems 

common features of, 467-468, 468f 

self-inactivation of, 468-469 
signature sequences, 105, 105f 
sildenafil (Viagra), 446, 446s 
silent mutations, 993 
silk fibroin, 128, 128f 
simple diffusion, 389f, 390 
simple transposition, 1014, 1014f 
simple-sequence DNA, 952f, 953 
simvastatin (Zocor), 842b-843b 
SINEs (short interspersed elements), 952f 
single nucleotide polymorphisms (SNPs), 323 
single particle tracking, 384 
single-stranded DNA-binding protein (SSB), 
986, 986t, 989t 

in mismatch repair, 995, 996f 
siRNA (small interfering RNA), 1145 
sister chromatids, 962, 1004f, 1005, 1005f 
site-directed mutagenesis, 312 
site-specific recombination, 1004, 1010-1013, 
lOllf, 1012f 
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sitosterol dextrin, in cellulose synthesis, 795 

6-4 photoproduct, 290, 291f 

size -exclusion chromatography, 87, 87f, 88t 

skeletal muscle. See muscle 

skin cancer, in xeroderma pigmentosum, 1003b 

Skou, Jens, 398, 399f 

Slack, Rodger, 789 

slow-twitch muscle, 918 

Sly, William, 700b 

small G proteins, 440 

small interfering RNA (siRNA), 1145 

small intestine, fat absorption in, 648-649, 648f 

small nuclear RNA (snRNA), 1037, 1045, 

1061b, 1146 
small nucleolar RNA (snoRNA), 1043, 1045 
small RNA (sRNA), 1133 
small temporal RNA (stRNA), 1145 
smell, signaling in, 465-467, 466f 
Smith, Hamilton, 305 
S N 1 reaction, 213-216, 215f 
S N 2 reaction, 215f, 216 
snake venom, 411 
SNAP25, 388, 388f 
SNARE s, 388, 388f 

snoRNA (small nucleolar RNA), 1043, 1044f, 

1045, 1061b 
snoRNPs, 1043 

SNPs (single nucleotide polymorphisms), 323 
snRNA (small nuclear RNA), 1037, 1045, 

1061b, 1146 
sodium, blood levels of, 921 
sodium dodecyl sulfate (SDS), 89 
in electrophoresis, 89, 90f, 91f 
sodium ion(s), 517s 
dehydrogenase reactions and, 516-519, 51 7f, 

518f, 518t 
sodium ion channels, 409-410 
defective, diseases caused by, 412t 
in signaling, 451, 451f, 452, 452f, 453, 458 
sodium ion concentration, in cytosol vs. 

extracellular fluid, 451t 
sodium ion transport, Na + K + ATPase in, 

398-399, 398f 
sodium-glucose symporter, 403, 403 
solar energy, 742, 743f 
solenoidal supercoiling, 961-962, 962f 
solid tumor, glycolysis in, 539, 540-541 
solubility 

of amphipathic compounds, 48^9, 48f, 49f 
entropy and, 47, 49, 49f 
of gases, 47, 47t 

of hydrophilic compounds, 46—47, 47f 
of hydrophobic compounds, 47, 47t 
solutes 

concentration of, colligative properties and, 
51,51f 

membrane transport of. See membrane transport 
solutions 

buffered, 59-63, 60f, 61f 

colligative properties of, 51-52, 51f, 52f 

hypertonic, 52, 52f 

hypotonic, 52, 52f 

ionic interactions in, 46—47 

isotonic, 52, 52f 

osmolality of, 51f, 52 

pH of, 55-57, 56f , 56t 
solvation layer, 114 
solvents 

boiling points of, 44t 

heat of vaporization of, 44t 

melting points of, 44t 

organic, in lipid extraction, 363-364, 364f 

water as, 46, 47f See also water 
somatostatin, 923 
sorafenib, 476b 
sorbose, 237s, 238 
sos, 439, 441f, 442 

SOS response, 1001, I002t, 1130-1131, 1130f 

Southern blotting, 319, 319f 

Southern blotting, in DNA fingerprinting, 319b-320b 

Southern, Jeremy, 319f 

Spl, glutamine rich domains of, 1142f, 1143 



space-filling model, 15, 15f 

specific acid-base catalysis, 192 

specific linking difference (tr), 957 

specificity, enzyme, 191 

specificity constant (fc eat /iT m ), 199, 199t 

specificity factors, 1024, 1024f, 1117 

spectrin, 383f, 384f 

spectrophotometry, 76b 

sperm whales, fat storage in, 347, 348, 349 

spermaceti oil, 347, 348 

spermidine, 879 

biosynthesis of, 880f 
spermine, 879 

biosynthesis of, 880f 
sphinganine, 829, 83 If 
sphingolipids, 350f, 35 If, 352-354, 355f 

in blood groups, 354, 355f 

functions of, 354-355 

lysosomal degradation of, 355, 355f 

membrane, microdomains of, 384^386, 386f 

structure of, 354s 

synthesis of, 829, 83 If 

transport of, 830 
sphingomyelin, 350f, 353f, 354s, 355f, 829, 831f 

in cell regulation, 358 

membrane distribution of, 374, 377f 
sphingomyelinase, in Niemann-Pick disease, 356 
sphingomyelins, 354 
spliceosomalintrons, 1037, 1038f 
spliceosomes, 1037, 1038f 
splicing, 1034, 1034f, 1036-1039, 1036f-1308f 

alternative patterns of, I014f, 1040, 1042f 

guanosine in, 1036, 1036f, 1058 

internal guide sequence in, 1047, 1047f 
spongiform encephalopathies, 147f, 148-149, 148f 
squalene, in cholesterol synthesis, 832f, 833-834, 

834, 835f 
squalene 2,3-epoxide, 834, 835f 
squalene monooxygenase, 834, 835f 
Src, 439, 444 

SREBPs (sterol regulatory element-binding 

proteins), 592, 841-842, 841f 
sRNA, 1133 

SRP (signal recognition particle), 1101, HOlf 
SSB. See single-stranded DNA-binding 

protein (SSB) 
SSRs (simple sequence repeats), 952f 
stability genes, 477 
Stahl, Franklin, 977 

standard free-energy change 24, 186, 187f 

for acid anhydride, 493t 
additive, 494-495 
of amide, 493t 
for ATP synthesis, 725-726 
biochemical, 186 

concentration dependent, 493-494 
calculation of, 492 

related to equilibrium constant, 491-493, 
492t, 493t 

units of, 491t 
in electron transfer, 718-720 
equilibrium constant and, 188, 188t, 491^93, 

492t, 493t 
of peptides, 493t 
pH and, 493t 

vs. free-energy change, 493^94. See also 
free-energy change {AG) 
standard reduction potential (E°), 514 

of biologically important half-reactions, 515t 

to calculate free-energy change, 515-516 

measurement of, 514f, 515 
standard transformed constants, 491 
starch, 245-246, 245-248, 251t. See also 
carbohydrate(s); polysaccharide(s) 

in amyloplasts, 774, 775f 

biosynthesis of, 791-794, 791f 

degradation of, 544 

glucose storage in, 244, 246 

granular form of, 246 

hydrolysis of, 246 

structure of, 245-246, 245f, 247-248, 248f, 249f 
synthesis of, in plants, 783-784 



starch phosphorylase, 544 
starch synthase, 791 

starvation, overproduction of ketone bodies 

during, 667 
state transitions, 756 
statin drugs, 842b-843b, 844 
STATs, 443-444, 443f, See also 
JAK-STAT pathway 

leptin and, 933, 933f 
steady state, 195 

cellular maintenance of, 571 

dynamic, 20, 24 
steady-state assumption, 196 
steady-state enzyme kinetics, 195, 198 
steady-state reactions, 23-24 
stearate 

desaturation of, 815, 815f 

synthesis of, 815f 
stearic acid, 344t 

1-stearoyl, 2-linoleoyl,3-palimitoyl glycerol, 346s 
stereochemistry, 15-18, 15f-19f 

nomenclature for, 17 
stereoisomers, 15, 15f, 72-73, 72f, 74 

nomenclature for, 17 

optical activity of, 17, 73 
stereospecificity, 15, 18, I8f, 19f 
steroid hormone(s), 359, 359f, 359s, 906t, 908 

synthesis of, 736-737, 844-845, 844f 
cytochrome P-450 in, 736-737 
steroid hormone receptors, 422, 422f, 905, 

1143-1144, 1144f 
steroid receptors, 422, 422f f 905 
sterol(s), 350f, 355-357, 355f 

as lipid anchors, 379-380, 380f 

membrane, 372t 
microdomains of, 384-386, 386f 

steroid hormones from, 359, 359s 

structure of, 355-357, 355f 
sterol regulatory element-binding proteins 

(SREBPs), 592, 841-842, 841f 
sticky ends, 306, 306f, 307 
stimulatory G protein (G s ), 424^27, 424f 

adenylyl cyclase and, 426 

self-inactivation of, 426-427 
stomach ulcers, 260, 260f 
stop codons, 1069, 1069f See also codons 
storage lipids, 350f 

classification of, 350f 

fatty acids, 343-349 

waxes, 349, 349f 
Streptomyces lividans, potassium channel in, 

408-409, 408f 
streptomycin, 1098, 1098s 
stress, oxidative, role of mitochondria in, 

720-721, 721f 
stress response 

Cortisol in, 928t, 929 

epinephrine in, 928-929, 928t 
stringent factor, 1132, 1132f 
stringent response, 1132, 1132f 
stRNA (small temporal RNA), 1145 
stroma, 743 
strong acids, 57-58 
Strong, Frank, 519 

Structural Classification of Proteins (SCOP), 
136-138, 136f-137f 

substitution mutations, 993 

substrate, enzyme, 154, 186. See also enzyme- 
substrate complex 

substrate channeling, 619 

substrate cycle, 583 

succinate, 127s, 626 
conversion of succinyl-CoA to, 626-627, 626f 
in glyoxylate cycle, in plants, 798, 798f 
oxidation of, to fumarate, 628 

succinate dehydrogenase, 628, 713t, 715. See 
also Complex II 

succinic thiokinase, 626 

succinyl-CoA 
conversion of, to succinate, 626-627, 626f 
glucogenic amino acids in, 557t 
oxidation of a-ketoglutarate to, 625-626 
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succinyl-CoA synthase, 626 

reaction of, 626f 
sucrose, 235, 244, 244f, 792s 

in photosynthesis, 244, 783-784 

synthesis of, in germinating seeds, 798-799, 798f 
sucrose 6-phosphate, 792, 792s, 810f 
sucrose 6-phosphate phosphatase, 792, 810f 
sucrose 6-phosphate synthase, 792, 810f 
sucrose synthase, in cellulose synthesis, 795, 797f 
sugar(s), 235. See also carbohydrate (s); monosac- 
charides; oligosaccharides; polysaccharide (s) 

acidic, 240s 

amino, 240s 

complex, 235 

deoxy, 240s 

dolichols and, 362 

epimer, 237f, 238 

nonreducing, 243, 244 

nucleotide, 791 

phosphorylation of, 241 

reducing, 241, 241f 

simple, 235 
sugar code, 257-262, 262f 
sugar nucleotides, 596-601 

formation of, 598, 600f 

in glycogen synthesis, 596-601, 600f, 602f 
suicide inactivator, 203-204 
sulfation, in proteoglycans, 253-254, 254f, 255f 
6-sulfo-6-deoxy-a-D-glucopyranosyldiacylglycerol, 

352, 352f 
sulfolipids, 352, 352f 
sulfonylurea drugs, 925 
Sumner, James, 184, 184f 
sunitinib, 476b 

supercoiled DNA, See DNA, supercoiling of 
superhelical density, of DNA, 956 
superoxide dismutase, 720 
superoxide radical, 715 

mitochondrial production and disposal of, 72 If 
suppressor tRNA, 1094 
supramolecular complexes, 29 
Sutent, 476b 

Sutherland, Earl, 598-599, 603 

SWI/SNF, in chromatin remodeling, 1137t, 1138, 

1138, 1140 
switches, binary, in G protein(s), 425b-427b 
SWR1 family, in chromatin remodeling, 

1137t, 1138 
symbiont, 852 
symmetry 

helical, 139, 140, 140f 

rotational, 139-140, 139f, 140f 
symporters, 395, 395f 

Na + ~glucose, 403, 403f 
synaptic transmission, steps in, 451f 
syndecan, 253, 253f, 254 
syndrome X, 938-939 
synteny, 325, 325f 
synthase, 627b 
synthetase, 627b 
system, 20 

closed, 20 

isolated, 20 

open, 20 
systems biology, 26, 324 

t 

T 3 (triiodothyronine), 906t, 909 
T 4 (thyroxine), 906t, 909 
Tcell(s), 170, 170t, 921 

antigen binding by, 170, 171 

cytotoxic, 170, 170t 

functions of, 170 

helper, 170, 170t 
T DNA, in plant cloning, 330-332, 

330f, 33 If 
Tloop, 1055, 1055f 
TAG See triacylglycerol(s) 
tagatose, 237s 

tags, in affinity chromatography, 313-314, 313t 
talose, 237s 



Tamiflu, 259-260, 260s 
tamoxifen, 456 

tandem mass spectrometry (MS/MS), 99, 

lOOf, 263, 264f 
Tarceva, 476b 
Tarui's disease, 599t 
taste, signaling in, 467, 467f, 468f 
TATA box, 1030, 1030f 

TATA-binding protein (TBP), 1031, 1032t, 1140 

Tatum, Edward L , 624b, 947, 948f 

taurocholic acid, 357s 

Tay-Sachs disease, 356 

TBP (TATA binding protein, 1140, 1140 

TCA (tricarboxylic acid) cycle, 615. See also 

citric acid cycle 
TcC arm, of tRNA, 1080f, 1081 
T-cell receptors, 170 
telomerase, 1053-1055, 1054, 1055f 
telomeres, 953, 953t, 992, 1053-1055, 1055f 
Temin, Howard, 1051, 1051f 
temperature, absolute, units of, 491t 
template strand 
in replication 
DNA as, 977, 979, 980f 
RNAas, 1050, 1058 
in transcription, 1024, 1024f 
Ter sequences, 989-991, 990f 
terminal complexes, in cellulose, 795 
terminal tags, in affinity chromatography, 

313-314, 313t 
terminal transferase, 305t 
temunation codons, 1038f, 1069. See 

also codons 
termination factors, 1094-1095, 1095f 
terminators, transcriptional, 1029-1030, 1029f 
termites, cellulose digestion by, 246-247 
tertiary structure, 92f, 123, 123-138. See also 
protein folding 
of fibrous proteins, 123-128 See also 

fibrous proteins 
of globular proteins, 123, 129-138 See also 

globular proteins 
of a-keratin, 123-124, 124f 
Ramachandran plots for, 117, 119, 122f 
representations of, 130f 
testis, 910f 

testosterone, 359, 359s, 908 
tetracosanoic acid, 344t 
tetracyclines, 1098. See also antibiotics 
tetradecanoic acid, 344t 
tetrahydrobiopterin, 692 

in phenylalanine hydroxylase reaction, 
697-698, 697f 
tetrahydrocannabinol (THC), 442, 442s 
tetrahydrofolate (H 4 folate), 689, 689f 

substrate binding to, 189f 
Tetrahymena thermophila, splicing in, 1037, 

1047, 1047f, 1057 
tetraplex DNA, 282 
tetrodotoxin, 410, 412s 
tetroses, 236 
TFIIA, 1031, 1032t, 1140 
TFIIB, 1031, 1032t, 1140 
TFIIE, 1031, 1032t, 1140 
TFIIF, 1032, 1032t, 1140 
TFIIH, 1032, 1032t, 1140 
THC (tetrahydrocannabinol), 442, 442s 
thermodynamic factors, preventing dissipation of 

energy, 751-752, 752f 
thermodynamics 

bioenergetics and, 490^495 

biological energy transformations in, 
490-491, 491t 

first law of, 20 

free energy in See free energy (G) 

physical constants used in, 49 It 

second law of, 491 
thermogenesis, 917. See also brown 
adipose tissue 

mitochondrial, 736 

shivering, 919 
thermogenin, 736, 917, 932 



thiamine pyrophosphate, 549-550 

in Calvin cycle, 779, 781f 

in fermentation, 549-550, 550f, 551t 

reaction mechanism of, 550f 
thiazolidinediones, 808s, 824 
thick filaments, 176, 176f, 177f 

in muscle contraction, 178, 178f 
thin filaments, 176, 176f, 177f 

in muscle contraction, 178, 178f 
thin-layer chromatography, 364-365, 364f 
thioesters, 617 

free-energy hydrolysis of, 505, 505f 
thiolase, 654 

thioredoxin, 578f, 785, 888 
thioredoxin reductase, 888 
4-thiouridine, 274s 
30 nm fiber, 966, 968f 
3' end, of nucleic acids, 275, 275f 
3'— >5' exonuclease activity, in DNA polymerases, 
983-984 

threonine, 75s, 77, 695, 699, 865 

biosynthesis of, 865, 866f-867f 
conversion of, to succinyl-CoA, 699, 699f 
degradation of, to pyruvate, 692f, 695 
phosphorylated, 224-226 
binding domains for, 446-449 
in plants, 460, 461f 
properties of, 73t 
L-threonine dehydratase, feedback inhibition of, 

221-222, 221f 
threose, 237s 
thrombin, 227, 254f 

thromboxane(s), 358-359, 358f See also 
eicosanoid(s) 

synthesis of, 818f, 819 
thromboxane synthase, 818f, 819 
Thudichum, Johann, 353f, 354 
thylakoid(s), 743 

proton and electron circuits in, 749f 
thylakoid membrane 

galactolipids in, 352, 352f 

photosystems in, 748f 
thymidylate, biosynthesis of, 890-892, 891f 

as chemotherapy target, 895f 
thymidylate synthase, 890 

reaction mechanism of, 895f 
thymine, 9s, 272, 272t, 273f. See also 
pyrimidine bases 

evolutionary significance of, 289-290 

from 5-methylcytosine deamination, 289-290, 290f 
thyroglobulin, 909 
thyroid, 910f 

thyroid hormones, 906t, 909 
thyrotropin, 258 
thyrotropin-releasing hormone 

isolation and purification of, 902 

structure of, 904f 
thyroxine (T 4 ), 906t, 909 
Ti plasmid, 330-331 
tissue culture, 333 
tissue damage, assays for, 678b 
tissue fractionation, 8-9, 8f 
titins, 177 
titration curves 

for amino acids, 79-81, 79f, 81f 

Henderson-Hasselbalch equation for, 60-61 

of peptides, 82-83 

for weak acids, 58-59, 58f, 59f 
TLS polymerases, 1001-1002, 1003b 
TNF. See tumor necrosis factor (TNF) 
tobacco mosaic virus, icosahedral symmetry of, 

140, 140f 
tocopherols, 361-362, 362s 
topoisomerase(s), 958-961, 958f-961f, 968, 984, 
986, 986t 

reaction mechanism of bacterial topoisom- 
erase I, 985f 

in replication initiation, 986, 986t 
topoisomerase inhibitors, 960-961 
topoisomers, 957 
topology, 955 

of DNA, 955 
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topotecan, 960b, 960s 
toxicity, of ammonia, 681-682 
toxins 

ion channels and, 410-412 
translation inhibition by, 1098-1099 
trace elements, 1 If, 13 
trans configuration, of peptide bonds, 121f 
trans fatty acids, 347-348, 348f, 348t 
transaldolase, 560-561, 562s 
in Calvin cycle, 780f 
covalent interactions with, 562, 562f 
transaminase, 677 
transamination, 677 

enzyme -catalyzed, 677f 
transcript 
complex, 1040, 1042f 
primary, 1034 

splicing in, 1038f 
processing of. See RNA processing 
transcription, 29f, 945, 1021, 1022-1030. See also 
protein synthesis 
in bacteria, 970f, 1022-1030, 1023f 
regulation of, 1126-1136. See also 
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